Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCGTTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGACTATGACCTTGTGAAGTCAAGGCATCACGTGTGCAGTGGTGAAGTTAGTGCAACTTTGAGTAAAGTTGATCAAGAGGAGAGTAATTCGACTGAATCAACCTCTTGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTCCTACTGAATCGAAGGATCATAAAAACGTAGAAGAAGTTGCATGTGAGGAGGTAATACATTGTTCTGTAAATTCGACAATAAACACGACATTGACATCCAGTGGGACTAATAACCAAGTAGGAACTAGCTCTTTAAGTTCTGATAACTGTTCATCATGCCTAAGTGAAGGAGACAGTAATACCCTCTGCTCGAACCATGGAAATCTGGAATCTTCATCCACATCAGACTCAGAAGATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCATGAGATAAGGATGGATAAAGTAATTGGAGGCGATACCATGGGGAGCATGATTCCTTCCGGTCTTTCACAAGATAACGAGGGATGTAAAGTTCTGGGAAATGCACCGAAAAAAGTTCCCCAGAACTTTGAAGCGGGATTCTCTGCTGTTAGTTTGGATTCCCCATGTCAAGTAACACTTCCTTCAATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAGGTTCCTCCATCAATGGGTTATTACCATCAAAATTCAGTTTCATGGCCAGCAGCTCATGCTAATGGGATAATGCCTTTCTCCTATTCAAATCATTGTCTATATACCAATCCTCTTGGGTATGGTTTAAATGGAAACCCACGCTTCTGCATGCAATATGGTCATTTGCATCATCTAGCGACTCCGTTTTCAACCCTAGCCCAGTTCCTATTTATCAGCCAGCGGCCAAAGCCAGCAGTGGTATATATGTCGAAGATAGAAATCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCAGATGTAGCTAATCCGGACATTGTCGTTGCTGCTGGACTCCCGAATGCACTCAGTTCACCACCAAGCGGAGATTGTAAGCAAAATGATACTTCTTCCAAATTGCAAAAGGATAGCCCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCATGCCTTCCAAGGAAGATGATATCGGGGATTTTCCGAGAAATAATGAAGCGGATGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTTTCATTCTTCTGA
mRNA sequence
ATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCGTTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGACTATGACCTTGTGAAGTCAAGGCATCACGTGTGCAGTGGTGAAGTTAGTGCAACTTTGAGTAAAGTTGATCAAGAGGAGAGTAATTCGACTGAATCAACCTCTTGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTCCTACTGAATCGAAGGATCATAAAAACGTAGAAGAAGTTGCATGTGAGGAGGTAATACATTGTTCTGTAAATTCGACAATAAACACGACATTGACATCCAGTGGGACTAATAACCAAGTAGGAACTAGCTCTTTAAGTTCTGATAACTGTTCATCATGCCTAAGTGAAGGAGACAGTAATACCCTCTGCTCGAACCATGGAAATCTGGAATCTTCATCCACATCAGACTCAGAAGATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCATGAGATAAGGATGGATAAAGTAATTGGAGGCGATACCATGGGGAGCATGATTCCTTCCGGTCTTTCACAAGATAACGAGGGATGTAAAGTTCTGGGAAATGCACCGAAAAAAGTTCCCCAGAACTTTGAAGCGGGATTCTCTGCTGTTAGTTTGGATTCCCCATGTCAAGTAACACTTCCTTCAATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAGGTTCCTCCATCAATGGGTTATTACCATCAAAATTCAGTTTCATGGCCAGCAGCTCATGCTAATGGGATAATGCCTTTCTCCTATTCAAATCATTGTCTATATACCAATCCTCTTGGCGACTCCGTTTTCAACCCTAGCCCAGTTCCTATTTATCAGCCAGCGGCCAAAGCCAGCAGTGGTATATATGTCGAAGATAGAAATCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCAGATGTAGCTAATCCGGACATTGTCGTTGCTGCTGGACTCCCGAATGCACTCAGTTCACCACCAAGCGGAGATTGTAAGCAAAATGATACTTCTTCCAAATTGCAAAAGGATAGCCCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCATGCCTTCCAAGGAAGATGATATCGGGGATTTTCCGAGAAATAATGAAGCGGATGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTTTCATTCTTCTGA
Coding sequence (CDS)
ATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCGTTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGACTATGACCTTGTGAAGTCAAGGCATCACGTGTGCAGTGGTGAAGTTAGTGCAACTTTGAGTAAAGTTGATCAAGAGGAGAGTAATTCGACTGAATCAACCTCTTGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTCCTACTGAATCGAAGGATCATAAAAACGTAGAAGAAGTTGCATGTGAGGAGGTAATACATTGTTCTGTAAATTCGACAATAAACACGACATTGACATCCAGTGGGACTAATAACCAAGTAGGAACTAGCTCTTTAAGTTCTGATAACTGTTCATCATGCCTAAGTGAAGGAGACAGTAATACCCTCTGCTCGAACCATGGAAATCTGGAATCTTCATCCACATCAGACTCAGAAGATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCATGAGATAAGGATGGATAAAGTAATTGGAGGCGATACCATGGGGAGCATGATTCCTTCCGGTCTTTCACAAGATAACGAGGGATGTAAAGTTCTGGGAAATGCACCGAAAAAAGTTCCCCAGAACTTTGAAGCGGGATTCTCTGCTGTTAGTTTGGATTCCCCATGTCAAGTAACACTTCCTTCAATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAGGTTCCTCCATCAATGGGTTATTACCATCAAAATTCAGTTTCATGGCCAGCAGCTCATGCTAATGGGATAATGCCTTTCTCCTATTCAAATCATTGTCTATATACCAATCCTCTTGGCGACTCCGTTTTCAACCCTAGCCCAGTTCCTATTTATCAGCCAGCGGCCAAAGCCAGCAGTGGTATATATGTCGAAGATAGAAATCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCAGATGTAGCTAATCCGGACATTGTCGTTGCTGCTGGACTCCCGAATGCACTCAGTTCACCACCAAGCGGAGATTGTAAGCAAAATGATACTTCTTCCAAATTGCAAAAGGATAGCCCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCATGCCTTCCAAGGAAGATGATATCGGGGATTTTCCGAGAAATAATGAAGCGGATGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTTTCATTCTTCTGA
Protein sequence
MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNSTESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSSLSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIRMDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANGIMPFSYSNHCLYTNPLGDSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVAAGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGDFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF
Homology
BLAST of Sgr012139 vs. NCBI nr
Match:
XP_022154911.1 (uncharacterized protein LOC111022059 [Momordica charantia])
HSP 1 Score: 734.2 bits (1894), Expect = 6.7e-208
Identity = 387/460 (84.13%), Postives = 402/460 (87.39%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSD NVA+KSSTFKFGVEPDYDL KSRH VCSGEVS KVDQEESNST
Sbjct: 818 MESQKKYPRSNSDPNVAMKSSTFKFGVEPDYDLAKSRHDVCSGEVSVASGKVDQEESNST 877
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESDEVFQNGLPTE KDHKNVEE ACEE CS+NSTIN+TL SSG NN VGTSS
Sbjct: 878 ESTSGIESDEVFQNGLPTEPKDHKNVEEDACEEATQCSINSTINSTLRSSGKNNHVGTSS 937
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQ-SEGKESSASIQNGFSEHHEI 180
LSSDNCSSCLSEGDSN +CSNHGNLESSSTSDSEDASHQ SEGKESSASIQNGFSE HEI
Sbjct: 938 LSSDNCSSCLSEGDSNXICSNHGNLESSSTSDSEDASHQSSEGKESSASIQNGFSERHEI 997
Query: 181 RMDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSI 240
RMDKV GG++MG+ I GL QDNEGCKVLGNAP VP NFEAGFSAVSLDSPCQVTLPSI
Sbjct: 998 RMDKVNGGESMGTRIHFGLPQDNEGCKVLGNAPMNVPHNFEAGFSAVSLDSPCQVTLPSI 1057
Query: 241 QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANGIMPFSYSNHCLYTNPLG---------- 300
QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANG+MPFSYSNHCLY NPLG
Sbjct: 1058 QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANGMMPFSYSNHCLYANPLGYGLDGNPRFC 1117
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
VFNPSPVPIYQPAAKAS+GIYVEDR+QVSK+GAIAESSDVANPD+VV
Sbjct: 1118 MQYGHLHHLATPVFNPSPVPIYQPAAKASNGIYVEDRSQVSKAGAIAESSDVANPDVVVT 1177
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGD 420
AGLP AL SPPSGDCKQNDT SKLQK S SFSLFHFGGPVALSTGGKLNLMPSKEDD G
Sbjct: 1178 AGLPYALGSPPSGDCKQNDT-SKLQKGSSSFSLFHFGGPVALSTGGKLNLMPSKEDDTGV 1237
Query: 421 FPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
FPRN+EADV+DNGHAFNKK+TAIEEYNLFAASNGMRFSFF
Sbjct: 1238 FPRNSEADVVDNGHAFNKKDTAIEEYNLFAASNGMRFSFF 1276
BLAST of Sgr012139 vs. NCBI nr
Match:
XP_023543532.1 (uncharacterized protein LOC111803390 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 704.5 bits (1817), Expect = 5.7e-199
Identity = 375/460 (81.52%), Postives = 393/460 (85.43%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRH CSGEVS VDQEESNST
Sbjct: 819 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNST 878
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESDEVFQNGLP E KDHKNVEE ACEEV CSVNST++ +TSSGT+NQ GTSS
Sbjct: 879 ESTSVIESDEVFQNGLPIELKDHKNVEEDACEEVTPCSVNSTVDMKMTSSGTSNQAGTSS 938
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 939 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 998
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD MGS SGLSQDNEGCKV GNAPK +PQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 999 MDKAIGGDAMGSTNCSGLSQDNEGCKVQGNAPKNIPQNFEAGFSAVNLDSPCHVTLPSVQ 1058
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNP------------ 300
NQN+HFPVFQVPPSMGYY+QNSVSWPAA HANGIMPFSYSNHCLY NP
Sbjct: 1059 NQNVHFPVFQVPPSMGYYNQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFC 1118
Query: 301 --------LGDSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
LG+ VFNPSPVPIYQPA KAS+GI+VEDR QVSKSGAI ESS VANPD+VV
Sbjct: 1119 MRYGHLHHLGNPVFNPSPVPIYQPATKASNGIFVEDRTQVSKSGAITESS-VANPDVVVT 1178
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGD 420
+GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALSTGGKLNLMPSKED
Sbjct: 1179 SGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLMPSKED---- 1238
Query: 421 FPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1239 ---NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1270
BLAST of Sgr012139 vs. NCBI nr
Match:
XP_022967698.1 (uncharacterized protein LOC111467149 [Cucurbita maxima])
HSP 1 Score: 703.0 bits (1813), Expect = 1.7e-198
Identity = 374/460 (81.30%), Postives = 392/460 (85.22%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDY+LVKSRH CSGEVS VDQEESNST
Sbjct: 826 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYELVKSRHECCSGEVSVASGTVDQEESNST 885
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESDEVFQNGLP ESKDHKNVE+ ACEEV CSVN T++ +TSSGT+NQ GTSS
Sbjct: 886 ESTSVIESDEVFQNGLPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSS 945
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 946 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 1005
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD +GS SGLSQDNEGCKV GNAPK VPQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 1006 MDKAIGGDALGSTNSSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQ 1065
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNPLG---------- 300
NQN+HFPVFQVPPSMGYYHQNSVSWPAA HANGIMPFSYSNHCLY NPLG
Sbjct: 1066 NQNVHFPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFC 1125
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
+ VFNPSPVPIYQPAAKAS+GI+VEDR QVSKSGAI ESS VANPD+VV
Sbjct: 1126 MRYGHLHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS-VANPDVVVT 1185
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGD 420
GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALSTGGKLN MPSKED
Sbjct: 1186 TGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNPMPSKED---- 1245
Query: 421 FPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1246 ---NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of Sgr012139 vs. NCBI nr
Match:
KAG6603257.1 (hypothetical protein SDJN03_03866, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 696.4 bits (1796), Expect = 1.5e-196
Identity = 373/461 (80.91%), Postives = 391/461 (84.82%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRH CSGEVS VDQEESNST
Sbjct: 814 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNST 873
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESD+VFQNGLP E KDHKNVEE ACEEV CSVNST++ +TS GT+NQ GTSS
Sbjct: 874 ESTSVIESDDVFQNGLPIELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSS 933
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 934 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 993
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD MGS SGLSQDNEGCKV G APK VPQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 994 MDKAIGGDAMGSTNCSGLSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQ 1053
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNPLG---------- 300
NQN+HFPVFQVPPSMGYYHQNSVSWPAA HANGIMPFSYSNHC+Y NPLG
Sbjct: 1054 NQNVHFPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFC 1113
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
+ VFNPSPVPIYQPAAKAS+GI+VEDR QVSKSGAI ESS ANPD+VV
Sbjct: 1114 MRYGHLHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS-AANPDVVVT 1173
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALST-GGKLNLMPSKEDDIG 420
+GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALST GGKLNLMPSKED
Sbjct: 1174 SGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED--- 1233
Query: 421 DFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1234 ----NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Sgr012139 vs. NCBI nr
Match:
XP_022928663.1 (uncharacterized protein LOC111435513 [Cucurbita moschata])
HSP 1 Score: 696.4 bits (1796), Expect = 1.5e-196
Identity = 373/461 (80.91%), Postives = 391/461 (84.82%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRH CSGEVS VDQEESNST
Sbjct: 814 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNST 873
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESD+VFQNGLP E KDHKNVEE ACEEV CSVNST++ +TS GT+NQ GTSS
Sbjct: 874 ESTSVIESDDVFQNGLPIELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSS 933
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 934 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 993
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD MGS SGLSQDNEGCKV G APK VPQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 994 MDKAIGGDAMGSTNCSGLSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQ 1053
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNPLG---------- 300
NQN+HFPVFQVPPSMGYYHQNSVSWPAA HANGIMPFSYSNHC+Y NPLG
Sbjct: 1054 NQNVHFPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFC 1113
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
+ VFNPSPVPIYQPAAKAS+GI+VEDR QVSKSGAI ESS ANPD+VV
Sbjct: 1114 MRYGHLHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS-AANPDVVVT 1173
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALST-GGKLNLMPSKEDDIG 420
+GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALST GGKLNLMPSKED
Sbjct: 1174 SGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED--- 1233
Query: 421 DFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1234 ----NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Sgr012139 vs. ExPASy TrEMBL
Match:
A0A6J1DQ45 (uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022059 PE=4 SV=1)
HSP 1 Score: 734.2 bits (1894), Expect = 3.2e-208
Identity = 387/460 (84.13%), Postives = 402/460 (87.39%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSD NVA+KSSTFKFGVEPDYDL KSRH VCSGEVS KVDQEESNST
Sbjct: 818 MESQKKYPRSNSDPNVAMKSSTFKFGVEPDYDLAKSRHDVCSGEVSVASGKVDQEESNST 877
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESDEVFQNGLPTE KDHKNVEE ACEE CS+NSTIN+TL SSG NN VGTSS
Sbjct: 878 ESTSGIESDEVFQNGLPTEPKDHKNVEEDACEEATQCSINSTINSTLRSSGKNNHVGTSS 937
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQ-SEGKESSASIQNGFSEHHEI 180
LSSDNCSSCLSEGDSN +CSNHGNLESSSTSDSEDASHQ SEGKESSASIQNGFSE HEI
Sbjct: 938 LSSDNCSSCLSEGDSNXICSNHGNLESSSTSDSEDASHQSSEGKESSASIQNGFSERHEI 997
Query: 181 RMDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSI 240
RMDKV GG++MG+ I GL QDNEGCKVLGNAP VP NFEAGFSAVSLDSPCQVTLPSI
Sbjct: 998 RMDKVNGGESMGTRIHFGLPQDNEGCKVLGNAPMNVPHNFEAGFSAVSLDSPCQVTLPSI 1057
Query: 241 QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANGIMPFSYSNHCLYTNPLG---------- 300
QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANG+MPFSYSNHCLY NPLG
Sbjct: 1058 QNQNIHFPVFQVPPSMGYYHQNSVSWPAAHANGMMPFSYSNHCLYANPLGYGLDGNPRFC 1117
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
VFNPSPVPIYQPAAKAS+GIYVEDR+QVSK+GAIAESSDVANPD+VV
Sbjct: 1118 MQYGHLHHLATPVFNPSPVPIYQPAAKASNGIYVEDRSQVSKAGAIAESSDVANPDVVVT 1177
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGD 420
AGLP AL SPPSGDCKQNDT SKLQK S SFSLFHFGGPVALSTGGKLNLMPSKEDD G
Sbjct: 1178 AGLPYALGSPPSGDCKQNDT-SKLQKGSSSFSLFHFGGPVALSTGGKLNLMPSKEDDTGV 1237
Query: 421 FPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
FPRN+EADV+DNGHAFNKK+TAIEEYNLFAASNGMRFSFF
Sbjct: 1238 FPRNSEADVVDNGHAFNKKDTAIEEYNLFAASNGMRFSFF 1276
BLAST of Sgr012139 vs. ExPASy TrEMBL
Match:
A0A6J1HVV4 (uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149 PE=4 SV=1)
HSP 1 Score: 703.0 bits (1813), Expect = 8.0e-199
Identity = 374/460 (81.30%), Postives = 392/460 (85.22%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDY+LVKSRH CSGEVS VDQEESNST
Sbjct: 826 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYELVKSRHECCSGEVSVASGTVDQEESNST 885
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESDEVFQNGLP ESKDHKNVE+ ACEEV CSVN T++ +TSSGT+NQ GTSS
Sbjct: 886 ESTSVIESDEVFQNGLPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSS 945
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 946 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 1005
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD +GS SGLSQDNEGCKV GNAPK VPQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 1006 MDKAIGGDALGSTNSSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQ 1065
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNPLG---------- 300
NQN+HFPVFQVPPSMGYYHQNSVSWPAA HANGIMPFSYSNHCLY NPLG
Sbjct: 1066 NQNVHFPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFC 1125
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
+ VFNPSPVPIYQPAAKAS+GI+VEDR QVSKSGAI ESS VANPD+VV
Sbjct: 1126 MRYGHLHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS-VANPDVVVT 1185
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGD 420
GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALSTGGKLN MPSKED
Sbjct: 1186 TGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNPMPSKED---- 1245
Query: 421 FPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1246 ---NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of Sgr012139 vs. ExPASy TrEMBL
Match:
A0A6J1EPP9 (uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC111435513 PE=4 SV=1)
HSP 1 Score: 696.4 bits (1796), Expect = 7.5e-197
Identity = 373/461 (80.91%), Postives = 391/461 (84.82%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSGEVSATLSKVDQEESNST 60
MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRH CSGEVS VDQEESNST
Sbjct: 814 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNST 873
Query: 61 ESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSS 120
ESTS IESD+VFQNGLP E KDHKNVEE ACEEV CSVNST++ +TS GT+NQ GTSS
Sbjct: 874 ESTSVIESDDVFQNGLPIELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSS 933
Query: 121 LSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIR 180
L+SDNCSSC SEGDSNT+CSNHGNLESSSTSDSE ASHQSEGKESSASIQ GFSEHHEIR
Sbjct: 934 LNSDNCSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIR 993
Query: 181 MDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTLPSIQ 240
MDK IGGD MGS SGLSQDNEGCKV G APK VPQNFEAGFSAV+LDSPC VTLPS+Q
Sbjct: 994 MDKAIGGDAMGSTNCSGLSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQ 1053
Query: 241 NQNIHFPVFQVPPSMGYYHQNSVSWPAA-HANGIMPFSYSNHCLYTNPLG---------- 300
NQN+HFPVFQVPPSMGYYHQNSVSWPAA HANGIMPFSYSNHC+Y NPLG
Sbjct: 1054 NQNVHFPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFC 1113
Query: 301 ----------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANPDIVVA 360
+ VFNPSPVPIYQPAAKAS+GI+VEDR QVSKSGAI ESS ANPD+VV
Sbjct: 1114 MRYGHLHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS-AANPDVVVT 1173
Query: 361 AGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALST-GGKLNLMPSKEDDIG 420
+GLP ALSSPPSGDCKQNDTSSKLQKDS SFSLFHFGGPVALST GGKLNLMPSKED
Sbjct: 1174 SGLPYALSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED--- 1233
Query: 421 DFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
NNE +V+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1234 ----NNEVEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Sgr012139 vs. ExPASy TrEMBL
Match:
A0A5A7SXH9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold403G00110 PE=4 SV=1)
HSP 1 Score: 661.8 bits (1706), Expect = 2.0e-186
Identity = 366/467 (78.37%), Postives = 383/467 (82.01%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHV-------CSGEVSATLSKVD 60
MESQKKYPRSNSDSNVALKSSTFKF EPDYD+VKSR V CSGEVS T VD
Sbjct: 307 MESQKKYPRSNSDSNVALKSSTFKFDAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVD 366
Query: 61 QEESNSTESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTN 120
QEESNSTESTS IESD+V QN ESKDHKNVEE C EV CS NS I+TTLTSSGT+
Sbjct: 367 QEESNSTESTSGIESDDVSQNENSIESKDHKNVEEDVC-EVKQCSANSAIDTTLTSSGTS 426
Query: 121 NQVGTSSLSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGF 180
NQVGTSSL+SDNCSSCLSEGDSNT+ SNHGNLESSSTSDSE ASHQSEGKESSASIQNGF
Sbjct: 427 NQVGTSSLNSDNCSSCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGF 486
Query: 181 SEHHEIRMDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQ 240
SEHHEIR+DK IGG+ GS SGL QDNEGC V NAPK VP NFEAGFSAVSLDSPCQ
Sbjct: 487 SEHHEIRIDKGIGGEARGSRSYSGLPQDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQ 546
Query: 241 VTLPSIQNQNIHFPVFQVPPSMGYYHQNSVSWP-AAHANGIMPFSYSNHCLYTNPLG--- 300
VTLPSIQNQNIHFPVFQVPPSM YYHQNSVSWP AAHANGIMPFSYSNHCLY NPLG
Sbjct: 547 VTLPSIQNQNIHFPVFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGL 606
Query: 301 -----------------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVA 360
+ VFNPSPVPIY PA+KAS+GIY EDR QVSKSGAI+ESS VA
Sbjct: 607 NGNPRFCMQYGHLHHLSNPVFNPSPVPIYHPASKASNGIYAEDRTQVSKSGAISESS-VA 666
Query: 361 NPDIVVAAGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPS 420
N D+ V G ALSSPPSGD KQNDT SKLQ+DS SFSLFHFGGPVALSTGGKLNL PS
Sbjct: 667 NSDVAVTTGHQYALSSPPSGDLKQNDT-SKLQQDSSSFSLFHFGGPVALSTGGKLNLTPS 726
Query: 421 KEDDIGDFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
KEDD+GDF RNNE +V+DNGHAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 727 KEDDVGDFSRNNEVEVVDNGHAFNMKETAIEEYNLFAASNGMRFSFF 770
BLAST of Sgr012139 vs. ExPASy TrEMBL
Match:
A0A1S3B599 (uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=4 SV=1)
HSP 1 Score: 661.8 bits (1706), Expect = 2.0e-186
Identity = 366/467 (78.37%), Postives = 383/467 (82.01%), Query Frame = 0
Query: 1 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHV-------CSGEVSATLSKVD 60
MESQKKYPRSNSDSNVALKSSTFKF EPDYD+VKSR V CSGEVS T VD
Sbjct: 816 MESQKKYPRSNSDSNVALKSSTFKFDAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVD 875
Query: 61 QEESNSTESTSCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTN 120
QEESNSTESTS IESD+V QN ESKDHKNVEE C EV CS NS I+TTLTSSGT+
Sbjct: 876 QEESNSTESTSGIESDDVSQNENSIESKDHKNVEEDVC-EVKQCSANSAIDTTLTSSGTS 935
Query: 121 NQVGTSSLSSDNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGF 180
NQVGTSSL+SDNCSSCLSEGDSNT+ SNHGNLESSSTSDSE ASHQSEGKESSASIQNGF
Sbjct: 936 NQVGTSSLNSDNCSSCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGF 995
Query: 181 SEHHEIRMDKVIGGDTMGSMIPSGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQ 240
SEHHEIR+DK IGG+ GS SGL QDNEGC V NAPK VP NFEAGFSAVSLDSPCQ
Sbjct: 996 SEHHEIRIDKGIGGEARGSRSYSGLPQDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQ 1055
Query: 241 VTLPSIQNQNIHFPVFQVPPSMGYYHQNSVSWP-AAHANGIMPFSYSNHCLYTNPLG--- 300
VTLPSIQNQNIHFPVFQVPPSM YYHQNSVSWP AAHANGIMPFSYSNHCLY NPLG
Sbjct: 1056 VTLPSIQNQNIHFPVFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGL 1115
Query: 301 -----------------DSVFNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVA 360
+ VFNPSPVPIY PA+KAS+GIY EDR QVSKSGAI+ESS VA
Sbjct: 1116 NGNPRFCMQYGHLHHLSNPVFNPSPVPIYHPASKASNGIYAEDRTQVSKSGAISESS-VA 1175
Query: 361 NPDIVVAAGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPS 420
N D+ V G ALSSPPSGD KQNDT SKLQ+DS SFSLFHFGGPVALSTGGKLNL PS
Sbjct: 1176 NSDVAVTTGHQYALSSPPSGDLKQNDT-SKLQQDSSSFSLFHFGGPVALSTGGKLNLTPS 1235
Query: 421 KEDDIGDFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
KEDD+GDF RNNE +V+DNGHAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 1236 KEDDVGDFSRNNEVEVVDNGHAFNMKETAIEEYNLFAASNGMRFSFF 1279
BLAST of Sgr012139 vs. TAIR 10
Match:
AT2G41960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G58050.1); Has 11991 Blast hits to 7260 proteins in 458 species: Archae - 17; Bacteria - 481; Metazoa - 5028; Fungi - 1325; Plants - 615; Viruses - 38; Other Eukaryotes - 4487 (source: NCBI BLink). )
HSP 1 Score: 161.8 bits (408), Expect = 1.3e-39
Identity = 124/341 (36.36%), Postives = 169/341 (49.56%), Query Frame = 0
Query: 135 SNTLCSNHGNLESSSTSDSEDASHQSEGKESSASIQNGFSEHHEIRMDKVI-----GGDT 194
S + SN+GN+ESSS SDSE AS QSEG+E+ QN + HE ++KV D
Sbjct: 890 SRSSSSNNGNIESSSMSDSEVASQQSEGRENLVDTQNDMPDCHEKMVEKVTEMSMDERDV 949
Query: 195 MGSMIPSGLSQDNEGCKVLGN---APKKVPQNFEAGFSAVS-LDSPCQVTLPSIQNQNIH 254
+ S L DN K+ G P + +N G + S L P + LP + NQ+I
Sbjct: 950 LKIKNISNLPADNGESKLSGTPFMVPSQNMENMVPGLNTGSYLSQPQNMILPQMLNQSIP 1009
Query: 255 FPVFQVPPSMGYYHQNSVSWPAAHANGIMPFSYSNHCLYTNPLGDSV------------- 314
PVFQ P +MGYYHQ VSW +A NG+M F + NH +YT PLG S+
Sbjct: 1010 LPVFQAPSTMGYYHQAPVSWSSASTNGLMQFPHPNHYVYTGPLGYSLNGESPLCMQYGTP 1069
Query: 315 --------FNPSPVPIYQPAAKASSGIYVEDRNQVS--KSGAIAESSDVANPDIVVAAGL 374
FN PVPI+ P A+ ++ V+ + + + E+++ ++
Sbjct: 1070 LNHSAAPFFNSGPVPIFHPFAETNTMNTVDQAQPLEPLEHSFLKEANERRFNEM------ 1129
Query: 375 PNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKEDDIGDFPR 434
L P C Q D+ +FSLFHFGGPVALSTG K N SK+ + DF
Sbjct: 1130 --PLMETPRKRCPQTDSDE-------NFSLFHFGGPVALSTGSKANPARSKDGILEDFSL 1189
Query: 435 NNEADVI---DNGHAFNKKETAI-EEYNLFAASNGMRFSFF 440
D + G++ +KE + EEYNLFA SN +RFS F
Sbjct: 1190 QFSGDHVFGDPTGNSKKEKENTVGEEYNLFATSNSLRFSIF 1215
BLAST of Sgr012139 vs. TAIR 10
Match:
AT3G58050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41960.1); Has 13384 Blast hits to 8116 proteins in 546 species: Archae - 41; Bacteria - 766; Metazoa - 5596; Fungi - 1431; Plants - 589; Viruses - 46; Other Eukaryotes - 4915 (source: NCBI BLink). )
HSP 1 Score: 161.8 bits (408), Expect = 1.3e-39
Identity = 155/465 (33.33%), Postives = 207/465 (44.52%), Query Frame = 0
Query: 5 KKYPRSNSDSNVALKSSTFKFGVEPDYDLVKSRHHVCSG-EVSATLSKVDQEESNSTEST 64
KKYPRSNS S V ++ STFK D + ++ + S +V+ L + ++ NS ES
Sbjct: 850 KKYPRSNSYSEVTVRCSTFKAEEIEDAIVAENSSDLLSQCKVTEKLDNIKLKDENSMES- 909
Query: 65 SCIESDEVFQNGLPTESKDHKNVEEVACEEVIHCSVNSTINTTLTSSGTNNQVGTSSLSS 124
E+K+ ++++ + +S+ SS
Sbjct: 910 --------------GETKNGWHLKD--------------------------PMMSSTSSS 969
Query: 125 DNCSSCLSEGDSNTLCSNHGNLESSSTSDSEDASHQSEGKES-SASIQNGFSEHHEIRMD 184
DNCSSCLSEG+SNT+ SN+GN ESSSTSDSEDAS QSEG+ES QN D
Sbjct: 970 DNCSSCLSEGESNTVSSNNGNTESSSTSDSEDASQQSEGRESIVVGTQN----------D 1029
Query: 185 KVIGGDTMGSMIP------SGLSQDNEGCKVLGNAPKKVPQNFEAGFSAVSLDSPCQVTL 244
+I T S IP +G + DN N G V P
Sbjct: 1030 ILIPDTTGKSKIPETPIVVTGNNMDNNS-----------NNNMVHGLVDV---QPQGGMF 1089
Query: 245 PSIQNQNIHFPVFQVPPSMGYYHQ-NSVSWPAAHANGIMPFSYSNHCLYTNPLGDSV--- 304
P + QN+ +PVFQ MGY+HQ VSWP ANG++PF + N LYT PLG S+
Sbjct: 1090 PHLLTQNLQYPVFQTASPMGYFHQAPPVSWPTGPANGLIPFPHPNPYLYTGPLGYSMNGD 1149
Query: 305 ------------------FNPSPVPIYQPAAKASSGIYVEDRNQVSKSGAIAESSDVANP 364
FNP PVP++ P +K ++ ED+ Q
Sbjct: 1150 PPLCLQYGSPLNHAATPFFNPGPVPVFHPFSKTNT----EDQAQ---------------- 1209
Query: 365 DIVVAAGLPNALSSPPSGDCKQNDTSSKLQKDSPSFSLFHFGGPVALSTGGKLNLMPSKE 424
L P +C + + +D SFSLFHF GPV LSTG K SK+
Sbjct: 1210 ----------NLEPPLELNCLAPPETQTVNED--SFSLFHFSGPVGLSTGSKSKPAHSKD 1209
Query: 425 DDIGDFPRNNEADVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 440
+ DV+ N + K+ +EEYNLFA NG+RFS F
Sbjct: 1270 GIL--------RDVVGNIYTKAKESKEVEEYNLFATGNGLRFSLF 1209
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022154911.1 | 6.7e-208 | 84.13 | uncharacterized protein LOC111022059 [Momordica charantia] | [more] |
XP_023543532.1 | 5.7e-199 | 81.52 | uncharacterized protein LOC111803390 [Cucurbita pepo subsp. pepo] | [more] |
XP_022967698.1 | 1.7e-198 | 81.30 | uncharacterized protein LOC111467149 [Cucurbita maxima] | [more] |
KAG6603257.1 | 1.5e-196 | 80.91 | hypothetical protein SDJN03_03866, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022928663.1 | 1.5e-196 | 80.91 | uncharacterized protein LOC111435513 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DQ45 | 3.2e-208 | 84.13 | uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022... | [more] |
A0A6J1HVV4 | 8.0e-199 | 81.30 | uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149... | [more] |
A0A6J1EPP9 | 7.5e-197 | 80.91 | uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC1114355... | [more] |
A0A5A7SXH9 | 2.0e-186 | 78.37 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
A0A1S3B599 | 2.0e-186 | 78.37 | uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT2G41960.1 | 1.3e-39 | 36.36 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT3G58050.1 | 1.3e-39 | 33.33 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |