Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide, three_prime_UTR
ACAGCTCCAACGGTCCAATCCATTTTTAAAATCCCAGGAGATCCCTTCGTTAACTCTGAAAACTAGCCGTTACTCTCACCAAAAGCCCCCTCTCTCTCCCCTTAAATATTTTCGCCGACTTTCCATTTCCGATCCACATTTGAATCTCTCAAATCTCTCTCCCTTCTATCCGAGATCTTCACTCGAGTACATTCATGGCGCTGCCGTCTAATAAGTCGTCTTCTCCGTCGATGGTCGCCGGAAGAACAAGCCCTAATTCCAGAAATTCTGAAATCAGTAACCCTATCCGCCGGAGCTTCTCCGGCAACCCGTTTTCGAAGCCGTCGATCGTCGCCAATCCGAGGAGCTTGAACCCTATCACTCCGGCGAACAGTCCCTCTGGTTTGTTTGAGTTCTCTACCTGGTTAATTTCTTTCTTGGGATTTTAGAGTGGTTTAAATCTGTTTGGTTGCTGAGAAAATGGCTGAGAAAATTTATTAGGGCTACAAAATCAAAAGAATCTCTATTCTCTACTAATAATTAGTATCTAAACTTTATTTTCGCTCTGATTTCGATGGTTAATTTTGATTCTTCCACCGATTGTTTTCCTTCTTGGGATGATTTTGAACAATCGGATTGTTTTTGAGTGGTTCGAATCTGTTTGGTTGTCTAGAAAGTAGCCGAGAAAATGCTTATATTCCGTTCCCTTTTTCTTCTGAATCTGAAAATGCTTCCATGTTCTTATTTGTTTCTTTGAATAACTTTTCAGATTATCCACGAAGGAGCAGAGAAAATTTATTTACTTCTCGTGACAATGAGGAGAAAGAAAATGGAAAAGATCAGAGTCCGAAACCTGTCCGAGTCCGTTCGCCGACGGTCGGAAAATCGACGAAGCACTTCATGTCGCCGACGATCTCCGCCGCCTCCAAGATCGCCGTGTCTCCGAAGAAGAAGATTCTGGGCGATCGGAATGAGCCAGTTCGGTCGTCTCTTTCATTTTCTGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATTCAGAGGCATCAACGGCACTTGAATCCGATACGAACCCTCAAATTGCTCCGATTTCGAATCCCAAATCATCAAAATCTGTGAGATTCGGTGGGGTTGAGGTCATTTCTGGTTCGTATGACGATTCAGAATTCACACACCGATACGATCCAGAAGTGGTAACTATGGCAGTTGAAACCGATACGAAGCCTGAAACCGCTCTGATTTCAGAATCTGCCATGGCAGCACCACCTCCCAAATCATCTGCAACTGTGAAATTCGGTGGCTTTGAGGTTATTTCTGATTCGTATGACGATTCCGAATCCACATACCGAAATGATACGAACTCAGATATTGTAACAATGGCAGTCGAAGCTGACACGATGCCTGAAATCGCTCCGATTTCCGCCATTGCAGCAGCACCGCCTAAAGCTTCAAAGACTGTGAGATTTGCTGATGTCGAGGTAATCTCTATCTCAAACAATGATTTAGTGTCTCCGGTTAAGAATAATTTTACTGAAGAATTGGATAGTGTCAATCTCGATCCAAGTTTTAAGCTCAGTCCTGTTTCTTCTCCAATGGAGGTAGCACCTCTTGATGCTGATCCATTAATACCTCCATATGATCCCAAAACCAATTACCTATCGCCAAGGCCACAGTTCCTCCATTACAGACCAAACCAAAGAATCAATCGATACGAACCAGACGGTAGACTTGAGGAACTCTTTGCCTCTGCCAATGTTTCCGAGTCCGAGTTCACAGAGGAGACTGACTCTGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAGTGAATCGCAAATGGAAGAAAAAGAAGAGGAGGAGGAAGAAGAAGAGATTATTAATGTTTCTGAACAAAGCCCCATTGAAGCTAAAAAGTCATCTAAGCTTCAGATTTCAAGAATATTCAAGATCAGTTCTCTTCTTTTGATTCTGTTTACTGCTTGCTTTTCAATTTGTGTTGTTAATGTCCATGATCC
AAATATCTTTGAAAGAGCAAGGTTGTTAACATTGGAGGATCCATCTGAAATTTATGAGTTCGCGATGACAAATTTCAATGTGTTGGTGGGGAAACTTGAGGTTTGGCATGCGAATTCCAAATCTTTCATTTCTGATATGGTTTTCAACATCAGAGGAGGGCGGACATTGATTTATCTTAACCAGACCGAATTCTTCAACAAGGATGTCATTGCGGTTGGACAGTGTCTTGTATTATCTCGTCAGACCTTGTGGGAAGAAGAAAACAATTTGAGTGTAATGGAAGAAGCTATGAAGGATACAGAAATTGACATTGTCGAAGAACATACTGAGAGAGAAGATCAGAATGAAGTACAAGAACGAGAAGAAGAATCGTTGCGAGAGATTGAAGCCATGAAGGAGAGAGAATTTGACATCGATCCTGTTGAAAGAGAAGCTCAGAGTGAAAAAGTAGAAGAAGAAGAACCGTTGGAAGAGACTGAAGCCATGAAGGAGAGAGAAATTGACATCGATCCTGTCGAAGGAGAAGCTCAGAAGGAAGAAGTAGAAGAAGAGTCGATTGAGGCTTCTGCAAAATCAGCCTATGAAATATCATTGCAAGAGATTGTGGAAGAGAAACTGGATGAACTTGTTGAAGTAGAAGAAGACGTCCAAGAGAAACAAACCGAAGAGAACTACGAAGTTTCTTCATCACCGTATTCTAAAATTCATGATCAAATTGAACCAGAAGCAGGAACAGGAAGAAACGAACGAACGATTCAACAGAGCAACACAGAATTTCAATATCAATCACCTCCAGTTTCTCCTCCTTCTGAACCTCAATCTGATGTTGAAGACAACAGCGGCAACATCGATCTCATCGGAGCAGCAACCGAAAACGGAATCTCAAGAGATTTCAAACAGAACACTGCGATTATAGTATCTGCAATACTGCTAGGTTTATCTCTAATTGCAGGTCTGATTTATGCAAGAAAATCAGGCTCGGGCTCAGGCTCAAGACCATCCACGGCGGGCGCGGCCATTGTTGAAAAGAGAGAGGAGCAGCCGCCATTGCTGAAAGAGAAGAGGACGAACCAGAGTCCGGCGGAAGAAGCAGAAGAAATTGGTGATGACGATCGTGATGATGATATGGCTGGAGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCGGCACAAAAGGAGGAGAAACAGAAGCAACGAAGAGATCGAGCGAAGAAGCTCGGAGCCATAGCCATGGGAGGAAGACGATGAGAAGGAATTCAAGAAGAGAATCAATGGCTTCTTCTTCTTTGGAAGAATATTCAGTGTCCACTTCGGCTTCTCCATCTTATGGAAGTTTCACAACTTATGAGAAAATCCCAATCAAACATGTGAGTATCTCAAAACTGAAACTTTTTTTTTTTCAAATCTAAAATAATTATTTTAGAAAATCTATTTTTACAAGATAATTTTGCTAAACAGTTTTCAAAATAAAGATGATAACTGAATCTTTTGTTATTTTTCAAGTTTTTTTTTTTTTAAATCTAAAGTATTAATTTATTTTTTTTTTTTTCCTTGGAAGGATTAAAAATTTTGAGACAGACAGACTTCTTTTTTGTCTCAAACATGACCTTTTTTTTTTTTTTTTTTCTATTTTTATTATTATTATTTTTTCCTTTTCATTTCAGGGAAGTGGAGATGAAGAGATCGTGACCCCGGTCAGACGCTCTAGTAGAATTAGAAAAGCAACACAACAATAGTTAATTTGATTTATGTATTAATTAGAATGTGATTCTACGATGGAAACTTAGAGTGTTTTTTGTGCATACGTTTGACTCTAGTGAAATTAGAAAGCAAGATGGCAGAAGTTAGAATTAGAAAGCAACGCAACAAATAGTTGCTATATTTTGTGTTGTTGAGCCTATGTTGGAACTTGGATTATTTTGTGTATATATTTGAGTTTCTTAAAAGGAAAAAGAAAAAAAAGCCATATAAGTCATTCTAACACATTTC
TTCCCA
mRNA sequence
ATGGCGCTGCCGTCTAATAAGTCGTCTTCTCCGTCGATGGTCGCCGGAAGAACAAGCCCTAATTCCAGAAATTCTGAAATCAGTAACCCTATCCGCCGGAGCTTCTCCGGCAACCCGTTTTCGAAGCCGTCGATCGTCGCCAATCCGAGGAGCTTGAACCCTATCACTCCGGCGAACAGTCCCTCTGATTATCCACGAAGGAGCAGAGAAAATTTATTTACTTCTCGTGACAATGAGGAGAAAGAAAATGGAAAAGATCAGAGTCCGAAACCTGTCCGAGTCCGTTCGCCGACGGTCGGAAAATCGACGAAGCACTTCATGTCGCCGACGATCTCCGCCGCCTCCAAGATCGCCGTGTCTCCGAAGAAGAAGATTCTGGGCGATCGGAATGAGCCAGTTCGGTCGTCTCTTTCATTTTCTGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATTCAGAGGCATCAACGGCACTTGAATCCGATACGAACCCTCAAATTGCTCCGATTTCGAATCCCAAATCATCAAAATCTGTGAGATTCGGTGGGGTTGAGGTCATTTCTGGTTCGTATGACGATTCAGAATTCACACACCGATACGATCCAGAAGTGGTAACTATGGCAGTTGAAACCGATACGAAGCCTGAAACCGCTCTGATTTCAGAATCTGCCATGGCAGCACCACCTCCCAAATCATCTGCAACTGTGAAATTCGGTGGCTTTGAGGTTATTTCTGATTCGTATGACGATTCCGAATCCACATACCGAAATGATACGAACTCAGATATTGTAACAATGGCAGTCGAAGCTGACACGATGCCTGAAATCGCTCCGATTTCCGCCATTGCAGCAGCACCGCCTAAAGCTTCAAAGACTGTGAGATTTGCTGATGTCGAGGTAATCTCTATCTCAAACAATGATTTAGTGTCTCCGGTTAAGAATAATTTTACTGAAGAATTGGATAGTGTCAATCTCGATCCAAGTTTTAAGCTCAGTCCTGTTTCTTCTCCAATGGAGGTAGCACCTCTTGATGCTGATCCATTAATACCTCCATATGATCCCAAAACCAATTACCTATCGCCAAGGCCACAGTTCCTCCATTACAGACCAAACCAAAGAATCAATCGATACGAACCAGACGGTAGACTTGAGGAACTCTTTGCCTCTGCCAATGTTTCCGAGTCCGAGTTCACAGAGGAGACTGACTCTGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAGTGAATCGCAAATGGAAGAAAAAGAAGAGGAGGAGGAAGAAGAAGAGATTATTAATGTTTCTGAACAAAGCCCCATTGAAGCTAAAAAGTCATCTAAGCTTCAGATTTCAAGAATATTCAAGATCAGTTCTCTTCTTTTGATTCTGTTTACTGCTTGCTTTTCAATTTGTGTTGTTAATGTCCATGATCCAAATATCTTTGAAAGAGCAAGGTTGTTAACATTGGAGGATCCATCTGAAATTTATGAGTTCGCGATGACAAATTTCAATGTGTTGGTGGGGAAACTTGAGGTTTGGCATGCGAATTCCAAATCTTTCATTTCTGATATGGTTTTCAACATCAGAGGAGGGCGGACATTGATTTATCTTAACCAGACCGAATTCTTCAACAAGGATGTCATTGCGGTTGGACAGTGTCTTGTATTATCTCGTCAGACCTTGTGGGAAGAAGAAAACAATTTGAGTGTAATGGAAGAAGCTATGAAGGATACAGAAATTGACATTGTCGAAGAACATACTGAGAGAGAAGATCAGAATGAAGTACAAGAACGAGAAGAAGAATCGTTGCGAGAGATTGAAGCCATGAAGGAGAGAGAATTTGACATCGATCCTGTTGAAAGAGAAGCTCAGAGTGAAAAAGTAGAAGAAGAAGAACCGTTGGAAGAGACTGAAGCCATGAAGGAGAGAGAAATTGACATCGATCCTGTCGAAGGAGAAGCTCAGAAGGAAGAAGTAGAAGAAGAGTCGATTGA
GGCTTCTGCAAAATCAGCCTATGAAATATCATTGCAAGAGATTGTGGAAGAGAAACTGGATGAACTTGTTGAAGTAGAAGAAGACGTCCAAGAGAAACAAACCGAAGAGAACTACGAAGTTTCTTCATCACCGTATTCTAAAATTCATGATCAAATTGAACCAGAAGCAGGAACAGGAAGAAACGAACGAACGATTCAACAGAGCAACACAGAATTTCAATATCAATCACCTCCAGTTTCTCCTCCTTCTGAACCTCAATCTGATGTTGAAGACAACAGCGGCAACATCGATCTCATCGGAGCAGCAACCGAAAACGGAATCTCAAGAGATTTCAAACAGAACACTGCGATTATAGTATCTGCAATACTGCTAGGTTTATCTCTAATTGCAGGTCTGATTTATGCAAGAAAATCAGGCTCGGGCTCAGGCTCAAGACCATCCACGGCGGGCGCGGCCATTGTTGAAAAGAGAGAGGAGCAGCCGCCATTGCTGAAAGAGAAGAGGACGAACCAGAGTCCGGCGGAAGAAGCAGAAGAAATTGGTGATGACGATCGTGATGATGATATGGCTGGAGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCGGCACAAAAGGAGGAGAAACAGAAGCAACGAAGAGATCGAGCGAAGAAGCTCGGAGCCATAGCCATGGGAGGAAGACGATGAGAAGGAATTCAAGAAGAGAATCAATGGCTTCTTCTTCTTTGGAAGAATATTCAGTGTCCACTTCGGCTTCTCCATCTTATGGAAGTTTCACAACTTATGAGAAAATCCCAATCAAACATGGAAGTGGAGATGAAGAGATCGTGACCCCGGTCAGACGCTCTAGTAGAATTAGAAAAGCAACACAACAATAG
Coding sequence (CDS)
ATGGCGCTGCCGTCTAATAAGTCGTCTTCTCCGTCGATGGTCGCCGGAAGAACAAGCCCTAATTCCAGAAATTCTGAAATCAGTAACCCTATCCGCCGGAGCTTCTCCGGCAACCCGTTTTCGAAGCCGTCGATCGTCGCCAATCCGAGGAGCTTGAACCCTATCACTCCGGCGAACAGTCCCTCTGATTATCCACGAAGGAGCAGAGAAAATTTATTTACTTCTCGTGACAATGAGGAGAAAGAAAATGGAAAAGATCAGAGTCCGAAACCTGTCCGAGTCCGTTCGCCGACGGTCGGAAAATCGACGAAGCACTTCATGTCGCCGACGATCTCCGCCGCCTCCAAGATCGCCGTGTCTCCGAAGAAGAAGATTCTGGGCGATCGGAATGAGCCAGTTCGGTCGTCTCTTTCATTTTCTGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATTCAGAGGCATCAACGGCACTTGAATCCGATACGAACCCTCAAATTGCTCCGATTTCGAATCCCAAATCATCAAAATCTGTGAGATTCGGTGGGGTTGAGGTCATTTCTGGTTCGTATGACGATTCAGAATTCACACACCGATACGATCCAGAAGTGGTAACTATGGCAGTTGAAACCGATACGAAGCCTGAAACCGCTCTGATTTCAGAATCTGCCATGGCAGCACCACCTCCCAAATCATCTGCAACTGTGAAATTCGGTGGCTTTGAGGTTATTTCTGATTCGTATGACGATTCCGAATCCACATACCGAAATGATACGAACTCAGATATTGTAACAATGGCAGTCGAAGCTGACACGATGCCTGAAATCGCTCCGATTTCCGCCATTGCAGCAGCACCGCCTAAAGCTTCAAAGACTGTGAGATTTGCTGATGTCGAGGTAATCTCTATCTCAAACAATGATTTAGTGTCTCCGGTTAAGAATAATTTTACTGAAGAATTGGATAGTGTCAATCTCGATCCAAGTTTTAAGCTCAGTCCTGTTTCTTCTCCAATGGAGGTAGCACCTCTTGATGCTGATCCATTAATACCTCCATATGATCCCAAAACCAATTACCTATCGCCAAGGCCACAGTTCCTCCATTACAGACCAAACCAAAGAATCAATCGATACGAACCAGACGGTAGACTTGAGGAACTCTTTGCCTCTGCCAATGTTTCCGAGTCCGAGTTCACAGAGGAGACTGACTCTGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAGTGAATCGCAAATGGAAGAAAAAGAAGAGGAGGAGGAAGAAGAAGAGATTATTAATGTTTCTGAACAAAGCCCCATTGAAGCTAAAAAGTCATCTAAGCTTCAGATTTCAAGAATATTCAAGATCAGTTCTCTTCTTTTGATTCTGTTTACTGCTTGCTTTTCAATTTGTGTTGTTAATGTCCATGATCCAAATATCTTTGAAAGAGCAAGGTTGTTAACATTGGAGGATCCATCTGAAATTTATGAGTTCGCGATGACAAATTTCAATGTGTTGGTGGGGAAACTTGAGGTTTGGCATGCGAATTCCAAATCTTTCATTTCTGATATGGTTTTCAACATCAGAGGAGGGCGGACATTGATTTATCTTAACCAGACCGAATTCTTCAACAAGGATGTCATTGCGGTTGGACAGTGTCTTGTATTATCTCGTCAGACCTTGTGGGAAGAAGAAAACAATTTGAGTGTAATGGAAGAAGCTATGAAGGATACAGAAATTGACATTGTCGAAGAACATACTGAGAGAGAAGATCAGAATGAAGTACAAGAACGAGAAGAAGAATCGTTGCGAGAGATTGAAGCCATGAAGGAGAGAGAATTTGACATCGATCCTGTTGAAAGAGAAGCTCAGAGTGAAAAAGTAGAAGAAGAAGAACCGTTGGAAGAGACTGAAGCCATGAAGGAGAGAGAAATTGACATCGATCCTGTCGAAGGAGAAGCTCAGAAGGAAGAAGTAGAAGAAGAGTCGATTGA
GGCTTCTGCAAAATCAGCCTATGAAATATCATTGCAAGAGATTGTGGAAGAGAAACTGGATGAACTTGTTGAAGTAGAAGAAGACGTCCAAGAGAAACAAACCGAAGAGAACTACGAAGTTTCTTCATCACCGTATTCTAAAATTCATGATCAAATTGAACCAGAAGCAGGAACAGGAAGAAACGAACGAACGATTCAACAGAGCAACACAGAATTTCAATATCAATCACCTCCAGTTTCTCCTCCTTCTGAACCTCAATCTGATGTTGAAGACAACAGCGGCAACATCGATCTCATCGGAGCAGCAACCGAAAACGGAATCTCAAGAGATTTCAAACAGAACACTGCGATTATAGTATCTGCAATACTGCTAGGTTTATCTCTAATTGCAGGTCTGATTTATGCAAGAAAATCAGGCTCGGGCTCAGGCTCAAGACCATCCACGGCGGGCGCGGCCATTGTTGAAAAGAGAGAGGAGCAGCCGCCATTGCTGAAAGAGAAGAGGACGAACCAGAGTCCGGCGGAAGAAGCAGAAGAAATTGGTGATGACGATCGTGATGATGATATGGCTGGAGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCGGCACAAAAGGAGGAGAAACAGAAGCAACGAAGAGATCGAGCGAAGAAGCTCGGAGCCATAGCCATGGGAGGAAGACGATGAGAAGGAATTCAAGAAGAGAATCAATGGCTTCTTCTTCTTTGGAAGAATATTCAGTGTCCACTTCGGCTTCTCCATCTTATGGAAGTTTCACAACTTATGAGAAAATCCCAATCAAACATGGAAGTGGAGATGAAGAGATCGTGACCCCGGTCAGACGCTCTAGTAGAATTAGAAAAGCAACACAACAATAG
Protein sequence
MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANSPSDYPRRSRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAASKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNPKSSKSVRFGGVEVISGSYDDSEFTHRYDPEVVTMAVETDTKPETALISESAMAAPPPKSSATVKFGGFEVISDSYDDSESTYRNDTNSDIVTMAVEADTMPEIAPISAIAAAPPKASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADPLIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQMESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTACFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVFNIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTEIDIVEEHTEREDQNEVQEREEESLREIEAMKEREFDIDPVEREAQSEKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIEASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTGRNERTIQQSNTEFQYQSPPVSPPSEPQSDVEDNSGNIDLIGAATENGISRDFKQNTAIIVSAILLGLSLIAGLIYARKSGSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMSSFQYSGTKGGETEATKRSSEEARSHSHGRKTMRRNSRRESMASSSLEEYSVSTSASPSYGSFTTYEKIPIKHGSGDEEIVTPVRRSSRIRKATQQ
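The coding sequence and the protein sequence above are related by translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code; the function name `translate` and the table construction are illustrative and not part of the database itself. It is applied only to the first and last few codons of the CDS listed above.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# `translate` is a hypothetical helper written for this illustration.
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGGCGCTGCCGTCTAATAAGTCG"))   # MALPSNKS
# Last nine codons of the CDS, ending in the TAG stop codon.
print(translate("AGAATTAGAAAAGCAACACAACAATAG"))  # RIRKATQQ
```

Both fragments agree with the start (MALPSNKS...) and end (...RIRKATQQ) of the protein sequence listed above. Ambiguous bases such as N are not handled in this sketch and would raise a KeyError.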
Homology
BLAST of Spg002379 vs. NCBI nr
Match:
XP_022984664.1 (uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima])
HSP 1 Score: 988.0 bits (2553), Expect = 5.6e-284
Identity = 664/1043 (63.66%), Positives = 761/1043 (72.96%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+HD+IE A T
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHDEIEQAAAT- 780
Query: 781 RNERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTA 840
E T +++NTEFQYQSPPV SPPSE QSDVE+ +G +DLI AT GISRDF QNTA
Sbjct: 781 --EETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTA 840
Query: 841 IIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPA 900
I+SAILLGL LI AGLIYARK SGSR +T+ AAI E+++E+ PLLK+K+TNQS
Sbjct: 841 AIISAILLGLFLIIPAGLIYARK----SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLV 900
Query: 901 EEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGGETEATKRSSE------------- 958
EE EE D DDD GE CSSE SS FQYS + GETEA KRSSE
Sbjct: 901 EEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSR 960
BLAST of Spg002379 vs. NCBI nr
Match:
XP_022984665.1 (uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima])
HSP 1 Score: 986.9 bits (2550), Expect = 1.3e-283
Identity = 664/1043 (63.66%), Positives = 760/1043 (72.87%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 RNERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTA 840
T +++NTEFQYQSPPV SPPSE QSDVE+ +G +DLI AT GISRDF QNTA
Sbjct: 781 ---ETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTA 840
Query: 841 IIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPA 900
I+SAILLGL LI AGLIYARK SGSR +T+ AAI E+++E+ PLLK+K+TNQS
Sbjct: 841 AIISAILLGLFLIIPAGLIYARK----SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLV 900
Query: 901 EEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGGETEATKRSSE------------- 958
EE EE D DDD GE CSSE SS FQYS + GETEA KRSSE
Sbjct: 901 EEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSR 960
BLAST of Spg002379 vs. NCBI nr
Match:
XP_022984662.1 (uncharacterized protein LOC111482876 isoform X2 [Cucurbita maxima])
HSP 1 Score: 980.7 bits (2534), Expect = 9.0e-282
Identity = 665/1077 (61.75%), Positives = 761/1077 (70.66%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PSDYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAASK 120
PSDYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAASK
Sbjct: 61 PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNPKS 180
IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNPKS
Sbjct: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNPKS 180
Query: 181 SKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPKSS 240
+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA KSS
Sbjct: 181 TKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSKSS 240
Query: 241 ATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPPKA 300
TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P+A
Sbjct: 241 KTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTPEA 300
Query: 301 SKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADPLI 360
SK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP+I
Sbjct: 301 SKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADPII 360
Query: 361 PPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQME 420
PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ E
Sbjct: 361 TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQKE 420
Query: 421 SDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTACF 480
SDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTAC
Sbjct: 421 SDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTACL 480
Query: 481 SICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVFNI 540
SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VFN
Sbjct: 481 SICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFNF 540
Query: 541 RGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE--------- 600
RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 RGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQEQ 600
Query: 601 --------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD------- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEESL 660
Query: 661 ---IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE-----A 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE + A
Sbjct: 661 QEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEAKA 720
Query: 721 SAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTGR- 780
+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 NDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATGET 780
Query: 781 -----------------------------------NERTIQQSNTEFQYQSPPV-SPPSE 840
E T +++NTEFQYQSPPV SPPSE
Sbjct: 781 HYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPPSE 840
Query: 841 PQSDVEDNSGN--IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKSGS 900
QSDVE+ +G +DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK
Sbjct: 841 HQSDVEEENGGKIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK--- 900
Query: 901 GSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMS 958
SGSR +T+ AAI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE S
Sbjct: 901 -SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETS 960
BLAST of Spg002379 vs. NCBI nr
Match:
XP_022984663.1 (uncharacterized protein LOC111482876 isoform X3 [Cucurbita maxima])
HSP 1 Score: 980.7 bits (2534), Expect = 9.0e-282
Identity = 665/1066 (62.38%), Positives = 761/1066 (71.39%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 R-----------------------NERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN 840
E T +++NTEFQYQSPPV SPPSE QSDVE+ +G
Sbjct: 781 ETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGG 840
Query: 841 --IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGA 900
+DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK SGSR +T+ A
Sbjct: 841 KIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK----SGSRRTTSTA 900
Query: 901 AIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGG 958
AI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE SS FQYS + G
Sbjct: 901 AIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREG 960
BLAST of Spg002379 vs. NCBI nr
Match:
XP_022984661.1 (uncharacterized protein LOC111482876 isoform X1 [Cucurbita maxima])
HSP 1 Score: 975.7 bits (2521), Expect = 2.9e-280
Identity = 665/1079 (61.63%), Positives = 761/1079 (70.53%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 R------------------------------------NERTIQQSNTEFQYQSPPV-SPP 840
E T +++NTEFQYQSPPV SPP
Sbjct: 781 ETHYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPP 840
Query: 841 SEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKS 900
SE QSDVE+ +G +DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK
Sbjct: 841 SEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK- 900
Query: 901 GSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSE 958
SGSR +T+ AAI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE
Sbjct: 901 ---SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSE 960
BLAST of Spg002379 vs. ExPASy TrEMBL
Match:
A0A6J1JB72 (uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 988.0 bits (2553), Expect = 2.7e-284
Identity = 664/1043 (63.66%), Positives = 761/1043 (72.96%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+HD+IE A T
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHDEIEQAAAT- 780
Query: 781 RNERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTA 840
E T +++NTEFQYQSPPV SPPSE QSDVE+ +G +DLI AT GISRDF QNTA
Sbjct: 781 --EETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTA 840
Query: 841 IIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPA 900
I+SAILLGL LI AGLIYARK SGSR +T+ AAI E+++E+ PLLK+K+TNQS
Sbjct: 841 AIISAILLGLFLIIPAGLIYARK----SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLV 900
Query: 901 EEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGGETEATKRSSE------------- 958
EE EE D DDD GE CSSE SS FQYS + GETEA KRSSE
Sbjct: 901 EEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSR 960
BLAST of Spg002379 vs. ExPASy TrEMBL
Match:
A0A6J1J980 (uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 986.9 bits (2550), Expect = 6.1e-284
Identity = 664/1043 (63.66%), Positives = 760/1043 (72.87%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 RNERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTA 840
T +++NTEFQYQSPPV SPPSE QSDVE+ +G +DLI AT GISRDF QNTA
Sbjct: 781 ---ETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTA 840
Query: 841 IIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPA 900
I+SAILLGL LI AGLIYARK SGSR +T+ AAI E+++E+ PLLK+K+TNQS
Sbjct: 841 AIISAILLGLFLIIPAGLIYARK----SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLV 900
Query: 901 EEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGGETEATKRSSE------------- 958
EE EE D DDD GE CSSE SS FQYS + GETEA KRSSE
Sbjct: 901 EEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSR 960
BLAST of Spg002379 vs. ExPASy TrEMBL
Match:
A0A6J1J2S7 (uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 980.7 bits (2534), Expect = 4.4e-282
Identity = 665/1077 (61.75%), Positives = 761/1077 (70.66%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PSDYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAASK 120
PSDYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAASK
Sbjct: 61 PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNPKS 180
IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNPKS
Sbjct: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNPKS 180
Query: 181 SKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPKSS 240
+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA KSS
Sbjct: 181 TKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSKSS 240
Query: 241 ATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPPKA 300
TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P+A
Sbjct: 241 KTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTPEA 300
Query: 301 SKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADPLI 360
SK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP+I
Sbjct: 301 SKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADPII 360
Query: 361 PPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQME 420
PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ E
Sbjct: 361 TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQKE 420
Query: 421 SDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTACF 480
SDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTAC
Sbjct: 421 SDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTACL 480
Query: 481 SICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVFNI 540
SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VFN
Sbjct: 481 SICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFNF 540
Query: 541 RGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE--------- 600
RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 RGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQEQ 600
Query: 601 --------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD------- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEESL 660
Query: 661 ---IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE-----A 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE + A
Sbjct: 661 QEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEAKA 720
Query: 721 SAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTGR- 780
+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 NDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATGET 780
Query: 781 -----------------------------------NERTIQQSNTEFQYQSPPV-SPPSE 840
E T +++NTEFQYQSPPV SPPSE
Sbjct: 781 HYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPPSE 840
Query: 841 PQSDVEDNSGN--IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKSGS 900
QSDVE+ +G +DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK
Sbjct: 841 HQSDVEEENGGKIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK--- 900
Query: 901 GSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMS 958
SGSR +T+ AAI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE S
Sbjct: 901 -SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETS 960
BLAST of Spg002379 vs. ExPASy TrEMBL
Match:
A0A6J1JB65 (uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 980.7 bits (2534), Expect = 4.4e-282
Identity = 665/1066 (62.38%), Positives = 761/1066 (71.39%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 R-----------------------NERTIQQSNTEFQYQSPPV-SPPSEPQSDVEDNSGN 840
E T +++NTEFQYQSPPV SPPSE QSDVE+ +G
Sbjct: 781 ETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGG 840
Query: 841 --IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKSGSGSGSRPSTAGA 900
+DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK SGSR +T+ A
Sbjct: 841 KIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK----SGSRRTTSTA 900
Query: 901 AIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMSS-FQYSGTKGG 958
AI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE SS FQYS + G
Sbjct: 901 AIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREG 960
BLAST of Spg002379 vs. ExPASy TrEMBL
Match:
A0A6J1J5X0 (uncharacterized protein LOC111482876 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 975.7 bits (2521), Expect = 1.4e-280
Identity = 665/1079 (61.63%), Positives = 761/1079 (70.53%), Query Frame = 0
Query: 1 MALPSNKSSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANS 60
MALPSN+SSSPSMV GRTSP SRNSEISNP+ RSFS NPFSKPSI + +SLNPITPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DYPRR----SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DYP + SRE LFTSRDNE+KENGKDQSPK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNP 180
SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNP EAS A ESDTNP + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 KSSKSVRFGGVEVISGSYDDSEFTHRY--DPEVVTMAVETDTKPETALISESAMAAPPPK 240
KS+K+VRFGGVEVISGSY+DSE +RY +PE+VT+A TD+K I++SA+AA K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSATVKFGGFEVISDSYDDSESTYR--NDTNSDIVTMAVEADTMPEIAPI--SAIAAAPP 300
SS TV FGGFEVISDSYDDSESTYR +D N + VT+AVEAD PEI PI S IAA P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASKTVRFADVEVISISNNDLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADP 360
+ASK +RF+D+E ++SNN L S V +NFTEE+D VNLDPSF +SPVSSPM +AP+DADP
Sbjct: 301 EASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADP 360
Query: 361 LIPPYDPKTNYLSPRPQFLHYRPNQRINRYEPDGRLEELFASANVSESEFTEETDSEDPQ 420
+I PYDPKTNYLSPRPQFLHY PN+RINR PDGR EELF++ +EETD EDPQ
Sbjct: 361 IITPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST--------SEETDCEDPQ 420
Query: 421 MESDEASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTA 480
ESDE SS+ESQM+E+E+EEE ++VSEQ P E KKSSK +SRIFKISSLLLILFTA
Sbjct: 421 KESDEVSSNESQMKEEEKEEE----VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
Query: 481 CFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHANSKSFISDMVF 540
C SICVVNVHDP IFER+ LLT+ D SEI+ A TNFNVLVGKLE+WHANS SFISD+VF
Sbjct: 481 CLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVF 540
Query: 541 NIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTE------- 600
N RGG LI+LNQTEFF DV QCLVLS Q +WEEENNL EAMKD E
Sbjct: 541 NFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDREGQNKEGQ 600
Query: 601 ----------IDIVE---EHTEREDQNEVQEREEESLREIEAM------KEREFD----- 660
I + E + ERE QNE E EE+S +EIEA E+E D
Sbjct: 601 EQEEDAQEEAIKVKEIGIQTVERESQNE--EVEEQSFQEIEARTNDSENSEKENDEASEE 660
Query: 661 -----IDPVEREAQS-EKVEEEEPLEETEAMKEREIDIDPVEGEAQKEEVEEESIE---- 720
I+ +E E Q+ E E++E ++TEAMKEREI I+ VE E+Q EEVEEE +
Sbjct: 661 SLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQKTEA 720
Query: 721 -ASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEVSSSPYSKIHDQIEPEAGTG 780
A+ + E E EE L E+VE EE VQEK T EN++ SSS K+H QIE A TG
Sbjct: 721 KANDQKDREEENDEASEESLLEIVE-EESVQEK-TVENFKASSSSDFKLHGQIEQAAATG 780
Query: 781 R------------------------------------NERTIQQSNTEFQYQSPPV-SPP 840
E T +++NTEFQYQSPPV SPP
Sbjct: 781 ETHYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPP 840
Query: 841 SEPQSDVEDNSGN--IDLIGAATENGISRDFKQNTAIIVSAILLGLSLI--AGLIYARKS 900
SE QSDVE+ +G +DLI AT GISRDF QNTA I+SAILLGL LI AGLIYARK
Sbjct: 841 SEHQSDVEEENGGKIVDLIRTAT--GISRDFTQNTAAIISAILLGLFLIIPAGLIYARK- 900
Query: 901 GSGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSE 958
SGSR +T+ AAI E+++E+ PLLK+K+TNQS EE EE D DDD GE CSSE
Sbjct: 901 ---SGSRRTTSTAAIAEEQQEE-PLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSE 960
BLAST of Spg002379 vs. TAIR 10
Match:
AT1G16630.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G16270.1); Has 10587 Blast hits to 5736 proteins in 617 species: Archae - 88; Bacteria - 963; Metazoa - 3686; Fungi - 820; Plants - 541; Viruses - 438; Other Eukaryotes - 4051 (source: NCBI BLink). )
HSP 1 Score: 107.5 bits (267), Expect = 6.3e-23
Identity = 263/995 (26.43%), Positives = 405/995 (40.70%), Query Frame = 0
Query: 8 SSSPSMVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPANSPSDYPRR 67
SSSPSM R +P RNSE + +RRSF GNPFS A+P N I R
Sbjct: 20 SSSPSM-PSRPNPKQRNSETGDLMRRSFRGNPFS-----ADPSRRNSI----------GR 79
Query: 68 SRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAASKIAVSPKKKILG 127
N D +E +N KDQ V+ PT K +KHFMSPTISA SKI SP+KKIL
Sbjct: 80 ECSNRVEIGD-KENQNDKDQIANV--VKGPT--KGSKHFMSPTISAVSKINPSPRKKILS 139
Query: 128 DRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNPKSSKSVRFGGVEV 187
D+NE RS S + V S SV F V
Sbjct: 140 DKNEVSRSF-------DKSHHQVQVKS------------------------SVSFSDVIS 199
Query: 188 ISGSYDDSEFTHRYDPEVVTMAVETDTKPETALISESAMAAPPPKSSATVKFGGFEVISD 247
I G D +V + ++ ET + E + S + F+ I +
Sbjct: 200 IIGE----------DKDVDQICID-----ETKQLRE--------EESHDITVSDFDEILE 259
Query: 248 SYDDSESTYRNDTNSDIVTMAVEADTMPEIAPISAIAAAPPKASKTVRFADVEVISISNN 307
+ S+++ I+ PP T
Sbjct: 260 RKSNDNSSFK-------------------------ISPLPPYVPCTF------------- 319
Query: 308 DLVSPVKNNFTEELDSVNLDPSFKLSPVSSPMEVAPLDADPLIPPYDPKTNYLSPRPQFL 367
PV EV DP++ PYDPK NYLSPRPQFL
Sbjct: 320 --------------------------PVFESHEV-----DPVVAPYDPKKNYLSPRPQFL 379
Query: 368 HYRPNQRI-NRYEPDGRLEELFAS-ANVSESEFTEETDSEDPQMESDEASSSESQMEEKE 427
HY+PN +I +R + +LEELF S ++ S+++ + E + E Q E + +EE+E
Sbjct: 380 HYKPNPKIEHRSDECKQLEELFISESSSSDTDLSAEREEEGQQEEEVASQEGVVAVEEQE 439
Query: 428 EEEEE-----EEIINVSEQSPIEAKKSSK-------------------LQISRIFKISSL 487
++ EE EEI++V + +EA +S + SR K S L
Sbjct: 440 DDGEERLEAAEEILDVDGEERLEAVESDDEEEEVVVGESIEEEETHQISKQSRFSKTSML 499
Query: 488 L---LILFTACFSICVVNVHDPNIFERARLLTLEDPSEIYEFAMTNFNVLVGKLEVWHAN 547
L L L A + + EI A NF L KL +W +
Sbjct: 500 LGWILALGVAYLLLVSSTTFSQQTITDSPFYQFNISPEIIMSASENFEQLGAKLRMWAES 559
Query: 548 SKSFISDMVFNIRGGRTLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMK 607
S ++ +V ++R + +F N V+ + L +++ + +++ +
Sbjct: 560 SFVYLDKLVSSLREEEGSV---PFQFHNLTVLLEDKRL---SDAVFQSTSVEIIVDGFIV 619
Query: 608 DT-EIDIVEEHTEREDQNEVQEREEESLREIEAMKEREFDIDPVEREAQSEKVEEEEPLE 667
D+ E+DI E + ++ E E E+ EI E D + VE+E + KV E ++
Sbjct: 620 DSLEVDIEEVNVGHQE----PEEESENSGEISLEAVYEEDDNEVEQENEEGKV-NLEIVD 679
Query: 668 ETEAMKEREI--DIDPVEGEAQKEEVEEESIEASAKSAYEISLQEIVEEKLDELVEVEED 727
E + E +I D + GE E + EE E +E E + + E E D
Sbjct: 680 ECDEQAEIKIATDTEVNGGERYSESLSEEGHGGQETDVVE-GQEEYEENDQNNMEEAESD 739
Query: 728 VQEKQTEENYEVSSSPYSKIH----DQIEPEAGTGR---NERTIQQSNTEFQYQSPPVSP 787
Q ++ +SS+ + + ++ E G G ++ + T+ ++ V
Sbjct: 740 AQLLDDVQSAAISSNQQEQTGVANVETVQEEEGVGEIAGGSLSVSEEATDVEHDGNEVE- 799
Query: 788 PSEPQSDVEDNSGNIDLIGAATENGISRDFKQNTAIIVSAILLGLSLI-AGLIYARKSG- 847
E+ SG +++ A I ++ ++ S +++ L+ + AG + A+K
Sbjct: 800 --------EEESGFGEVVNDAGSEDILLSGQKKVLVLFSTMMVILAAVAAGFLLAKKKTK 838
Query: 848 ----SGSGSRPSTAGAAIVEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESC 907
P+ A V + L++E+ ++ + EE EE+GDD +
Sbjct: 860 PVMLQHEDGEPTAISATKVVEHVPVENLIRERLSSLNFKEEEEEVGDDRK-------REV 838
Query: 908 SSEMSSFQYSGTKGGETEATKRSSEEARSHSHGRKTMRRNSRRESMASSSLEEYSVSTSA 958
SS S +S +K + ++ + H G + N ESMASS+ EYS+
Sbjct: 920 SSFPSEMSFSFSKNKPLHSCSNKKDDLKEHQSGGGGKKSNDSGESMASSA-SEYSI---G 838
BLAST of Spg002379 vs. TAIR 10
Match:
AT2G16270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16630.1); Has 1844 Blast hits to 1256 proteins in 271 species: Archae - 6; Bacteria - 283; Metazoa - 434; Fungi - 153; Plants - 91; Viruses - 52; Other Eukaryotes - 825 (source: NCBI BLink). )
HSP 1 Score: 104.8 bits (260), Expect = 4.1e-22
Identity = 260/977 (26.61%), Positives = 395/977 (40.43%), Query Frame = 0
Query: 1 MALPSNKSSSPS-MVAGRTSPNSRNSEISNPIRRSFSGNPFSKPSIVANPRSLNPITPAN 60
MA P+NK+ S S + R +P RNSE +P+RRSF GNPF S V N
Sbjct: 1 MASPTNKNPSFSPPIPNRPNPKPRNSEAGDPLRRSFGGNPFPANSKV------------N 60
Query: 61 SPSDYPRRSRENLFTSRDNEEKENGKDQSPKPVRVRSPTVGKSTKHFMSPTISAASKIAV 120
PSD RR N F +KEN KPV++ K +K+FMSPTISA SKI
Sbjct: 61 IPSDLTRR---NSF----GGDKEN----ETKPVQL----TPKGSKNFMSPTISAVSKINA 120
Query: 121 SPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPNSEASTALESDTNPQIAPISNPKSSKS 180
SP+K++L D+NE S SFS +K L N ++ ++
Sbjct: 121 SPRKRVLSDKNE---MSRSFSDVKGLILEDDNKR------------------NHHRAKSC 180
Query: 181 VRFGGVEVISGSYDDSEFTHRYDPEVVTMAVETDTKPETALISESAMAAPPPKSSATVKF 240
V F V D+ +F +D V
Sbjct: 181 VSFSDVLHTICIDDEKKFVESHDMTVTDF------------------------------- 240
Query: 241 GGFEVISDSYDDSESTYRNDTNSDIVTMAVEADTMPEIAPISAIAAAPPKASKTVRFADV 300
D + Y N K + ++
Sbjct: 241 -----------DEKEVYEN---------------------------------KGITYS-- 300
Query: 301 EVISISNNDLVSPVKNNFTEELDSVNLDPSFKLS-----PVSSPMEVAPLDADPLIPPYD 360
DP F++S P +SP E A + D L+PPYD
Sbjct: 301 ---------------------------DPRFRISPRPSVPYTSP-EFAACEVDTLLPPYD 360
Query: 361 PKTNYLSPRPQFLHYRPNQRI-NRYEPDGRLEELFASANVSESEFTEETDSEDPQMESDE 420
PK N+LSPRPQFLHY+PN RI R++ +LEELF S + S+ +SE+ + + E
Sbjct: 361 PKKNFLSPRPQFLHYKPNPRIEKRFDECKQLEELFISESSSDDTELSVEESEEQEKDGAE 420
Query: 421 ASSSESQMEEKEEEEEEEEIINVSEQSPIEAKKSSKLQISRIFKISSLLLILFTACFSIC 480
E + E+ E+ E E + V E + K SR FK L L A +
Sbjct: 421 EVVVEEETEDVEQSEAESDEEMVCESVEETTSQVPKQSGSRKFKFLGWFLAL--ALGYLL 480
Query: 481 VVNVHDPNIFERARLLTLEDPSEIYEFA-MTNFNVLVGKLEVWHANSKSFISDMVFNIRG 540
V P ++ P EI EFA N + L KL +S ++ ++ +
Sbjct: 481 VSATFSP--LMKSSFNEFHIPKEITEFAKANNLDQLSDKLWTLTESSLVYMDKLISRLGR 540
Query: 541 GR----TLIYLNQTEFFNKDVIAVGQCLVLSRQTLWEEENNLSVMEEAMKDTEIDIVEEH 600
G L + N T + C+ + ++ L +EN+ S E +++D ++ EE
Sbjct: 541 GNEEYSQLQFHNLTYTLEDSTVFKPTCVEIIQEPL--QENSRS--ENSLEDGSVN--EEE 600
Query: 601 TEREDQNEVQEREEESLREIEAMKEREFDIDPVEREAQSEKVEEEEPLEETEAMKEREID 660
+ E+ +EV + +E L E++ DI+ + E + + E+ E ++E E+
Sbjct: 601 SGAEENSEVVCQFDE-LAEVKP----STDIESNDGERNLKALFEDGLELNIEELRESEMS 660
Query: 661 -IDPVEGEAQKEEVEEESIEASAKSAYEISLQEIVEEKLDELVEVEEDVQEKQTEENYEV 720
+ +E E + EE E E+I + E + + +E E V E +EE+
Sbjct: 661 PEEKLETEKKLEETESEAIYINQPDV------EFAAINVHQHIESEILVAESGSEES--- 720
Query: 721 SSSPYSKIHDQIEPEAGTGRNERTIQQSNTEFQYQSPPVSPPSEPQSDVEDNSGNI---- 780
+ +I D + E G+ + + +S E+ G I
Sbjct: 721 ----FGEIGDLLHLEVGSYND------------------LAKGDAESGSEEGFGEIAAET 750
Query: 781 --DLIGAATENGISRDFKQNTAIIVSAILLGLSLIAGLIYARKSGSGSGSRPS-TAGAAI 840
DL + + + I++S+ +L L +A ++A+K+ + ++P+ + +
Sbjct: 781 SDDLHLKVRSSNKAYNDSTKLMIVLSSTVLVLLAVASFVFAKKTKLVAATKPAPESNMEL 750
Query: 841 VEKREEQPPLLKEKRTNQSPAEEAEEIGDDDRDDDMAGGESCSSEMSSFQYSGTKGGETE 900
+ L+KEK + + EE DD + SC E S KGG+
Sbjct: 841 NLSHVPEENLVKEKLFSLNFEEEV----DDKMSNSFQKKSSCHKEPQS------KGGKKN 750
Query: 901 ATKRSSEEARSHSHGRKTMRRNSRRESMASSSLEEYSVSTSASPSYGSFTTYEKIPIKHG 958
SS + RRESMASS+ EYS+ S SYGSFTTYEKIPIK G
Sbjct: 901 NNNSSSSKL--------------RRESMASSA-SEYSI---GSFSYGSFTTYEKIPIKSG 750
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022984664.1 | 5.6e-284 | 63.66 | uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima]
XP_022984665.1 | 1.3e-283 | 63.66 | uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima]
XP_022984662.1 | 9.0e-282 | 61.75 | uncharacterized protein LOC111482876 isoform X2 [Cucurbita maxima]
XP_022984663.1 | 9.0e-282 | 62.38 | uncharacterized protein LOC111482876 isoform X3 [Cucurbita maxima]
XP_022984661.1 | 2.9e-280 | 61.63 | uncharacterized protein LOC111482876 isoform X1 [Cucurbita maxima]
Match Name | E-value | Identity | Description
A0A6J1JB72 | 2.7e-284 | 63.66 | uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J980 | 6.1e-284 | 63.66 | uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J2S7 | 4.4e-282 | 61.75 | uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JB65 | 4.4e-282 | 62.38 | uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J5X0 | 1.4e-280 | 61.63 | uncharacterized protein LOC111482876 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
AT1G16630.1 | 6.3e-23 | 26.43 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT2G16270.1 | 4.1e-22 | 26.61 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
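The Identity column in the tables above is the rounded percentage taken from each hit's HSP summary line (for example, 664 identical positions over an alignment length of 1043 gives 63.66). As a minimal sketch, assuming the summary-line layout shown on this page, the Python below parses such a line and recomputes both percentages. The function name `parse_hsp_stats` and the returned field names are hypothetical and not part of any BLAST tool; the pattern also tolerates the misspelling "Postives".

```python
import re

# Pattern for an HSP summary line of the form used on this page, e.g.
#   "Identity = 664/1043 (63.66%), Positives = 760/1043 (72.87%), Query Frame = 0"
# "Posi?tives" accepts both "Positives" and the misspelling "Postives".
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper for illustration only. Returns a dict with the raw
    counts, the recomputed percentages (rounded to two decimals) and the
    percentages as reported in the line.
    """
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    return {
        "identities": int(id_n),
        "positives": int(pos_n),
        "alignment_length": int(id_len),
        "identity_pct": round(100 * int(id_n) / int(id_len), 2),
        "positive_pct": round(100 * int(pos_n) / int(pos_len), 2),
        "reported_identity_pct": float(id_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    example = ("Identity = 664/1043 (63.66%), "
               "Positives = 760/1043 (72.87%), Query Frame = 0")
    print(parse_hsp_stats(example))
```

Comparing the recomputed and reported values is a quick consistency check when copying figures from the alignments into a summary table.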