Sgr022051 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022051
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: DUF1308 domain-containing protein
Location: tig00153874: 424131 .. 428388 (+)
RNA-Seq Expression: Sgr022051
Synteny: Sgr022051
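The Location field packs contig, start, end, and strand into one string. A minimal sketch of parsing it and computing the gene span follows; it assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "tig00153874: 424131 .. 428388 (+)".
LOCATION_RE = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end, strand = parse_location("tig00153874: 424131 .. 428388 (+)")
print(contig, strand, span_length(start, end))  # tig00153874 + 4258
```

Under that assumption the feature spans 4258 bp on the forward strand of contig tig00153874.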
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGAACCTGATGGAGTAGAATTGGCAAAGCAAAGATGCAGAGCGGTCATCGACAGAATCCAAAGGCTGCCTTCTTCCACCAACATCACCATTTCAAGTAGGCGAACTCTCCTCAAATTGGCCCTTCGCGAGCTCAATTTCCTCTCTCGCTGCTCCTTCTCCACCCAACTCAGGTTCCACTTCTCAAATCAATCATTTACAGTTTTGAATCCTCTGTATCTTCTCTGATATTCATGAATGATTTATATACAGCTTGAACATTGGGCACCTCGAGGCCATTGTTCACATTCTTCAGCAGCCTTCTGTCACCGGAATTTCCCGCGTCTGTAAGCCGATTCCGTTGCATGCTCGTCGTATCGAGAGTGCGAAGGAAACCAAATCTTCCTGTTCGAAGGTTGTTTATGTCGATATAATCTGCACTTTGAATAGGAACCCAGTGTGGGTAATTGTATCAGTTAGAAAACCCAAGTACATTTCTTGGTATAAGGGCCACAGAAATAAGGGCTTAAAATCTCGACTAGAGGAAGTGGTTGATGCAGCTCGCTCTTTGGAGGCCTTAGAACCTTGCTCGATCATCTTGTTCTTTTCACATGGACTTGATCATTTGATTCTGGAAAGACTCCGAGATGAATTTATGGCCACTGAGTACAATTTTAATTTCTCAGATTTCGATTTTGGTTTCTCTGAGATTGAAGGTGATTGGGTTAATGTGCTTCCAAGAAGCTACAAAGAAGCCTGTGTTCTTGAAATAAAAGTTAATGATAGGAATTGTGGGGTTACGAGTTCAAATTGCAACCGTAAAGTATGTTGTACTGATGTGGATGAGCCAGAGCTTTTGGACAAGTATCTGAAGAGAGATCTGGGGGATCCTTTCTGCTCCATCGTTATGGCAATGAAACCTAATCCTATGGGTATGGAAGACATGGAATCTGCAAGTTTGGAACATTTGTTGGGTGGTGATAGTGATCTAATCAATTTTGATACGACAGCATTAATTGCATTAGTATCAGGCATCAGTAATGGTTGTGTTGCTAAATTATTGGCTACCCCAGAGAGTGAATTGAGACAGAAGTACAAGAGTAATTATGATTTTGTTATTGGTCAGGTGAGTTTTGTTTGCAGATGAGCAGTATCCTGTACTATCTTTGTGTAGCCTTTCAGCCACTGTTATACTCACAAGTTACTGTGGTTTAAGTTTGGAACTGATGAGTATTAATTTAAAGCAACGAATTATGTAGTAGAATATAATACTCCACCTATTTTACTGAAATAAGATCTTATTTTCAGTATGATTTTTTTCTTGCTTTTGTATTTTAGTCATCTGTTTTGCTACTTTATATGTCTAATTTAATTTCAGAAGGTCTTGACTTAGTTAAAACTATTGAGTTATCTTGAACTTGCATGTAGACCGCTTTATGTTCGAGTTGAGAATGTTCCTTAGTATCTTTGTTTTTTTTTTTTTTTTTTTGAACAAGTTCCTTAGTATCTTGTCTTTGGTAGTATGGTATTTCTGCTGAACATATGTTGCTTAAATGTTAAGTGGCATTAATGGGGGTTATATACGAAAACAATATTTGGTGGAAAAACACGAAACACTTCTTTTACGAGCTACTCCTTATACTTATCATGATTTGGTATTGTTGTTGTATGGTGTATGCATCTATTGCTAAAGTCATCTGCTACAATTAGTCAGCCCAATGGAATTTCCTTCCTCTAAGATTCCAAGAGATGTAATTAAAACCTCTCTCCTTGTGCAACAAGATAAGATTGGGATACAGTAAGCCTTGGCAGAAACCTTTGTTTGAATTACACTGTGACGAAATATTCAAGCACACAGAGAGACTTGTTTAGTCTATTAACTTATTCTTTAAATTTATCATTATCTTTTTTTCTTCTTCAGAGCATATCATAACAAATGCTATTTTTGAATTTTTAGAATTGAAAATTGAGGTTGATTTACCAATCAATTATTTTGTTCATTTCTCTGCTAGTGAATG
ATCTTAACTTGCATTTACTAGTATTGTTATTTTTTTGGGCTGTAGCCAATGTAATTCTCTCTCTCAATCTTCTGCTAGGTGATGTCAGAAATTCAGAAGCCTATTCTTGTGGAGCTGAGTTCTCTTTTATCTGGAAAAAGAGGTATAATTTGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAGTAACAATGTGCGGAGGGCCTAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGTGAGTACATTAACAAGGTTTTATTGTTTTTTAAGCTTCTTAATTTCATTCTCATTCATTTGAGAACATATATATATATATATATATTTCATATTTTTATGACCTTTAGTTTCGTAAATCCAGAAAGGAAATAATTTTCTGGGTCTTTCCAGGATTACTAAGAATAATTAGGTATATTTGCTGGAAGTGATTATGTGGTCAAGTGTTGGAGTTTCTGCATTCCAATTATAATCTGGTTAAATGAAATGTCTAACTAATGTTCCCTTCAAGTATCACTCTAGTCTCCAGCTACATTTATATTATATGTTCAATTGAATGGTCAAAGATCATTGCTAGGCAGTTTACAAAAAAGGATGATATTGTCACATGAAATGGTATTTATGCCAATGTGTTAAACATTTAAAAGCCTCTGAGAATTCATCCAATGTCGGGGATCCATGTTAAACCATGCTCAAGCCGAGCGTTCTTCTCCCGAAGCTGCACTTTTGGTCTACAATCCACCGTAAACCGCAGAGCCTTCTCAAAAAACACACCTTCCTTTTAATAGAGCCAGGGACATCGAGGGAGGCTAGGACTAAACATTTAGCAAGAAGAGAAGAATTCTTCCTCTTTATTTTTTCCTCCTGATATTTGTGAGTGTCAGGTCAGCTTGCACACACCTTCGCTAATCTCACGAGTTAACAGCCAAATCCTACAACATTTTGTTGCCAAGAAAACTCATAAGATTTTAAATCTTAGGTACAGGCCACCATAGTTTGAACCCATTCTCTCCAAGCCCCTCCACTCTTGACCACTAGGGTAACCCAGGAAGGATAATCCTTCCTTCCTCTAGATGCTAAGACCCAACCCCCTTCATATCCCCAGATGACACCTTGCCTTCATATCCAGCCCCTCTCGAATTAAGTCCCTCGAATACTTCTCTATGAGCCCTGCTTGCTTACAGAAGTATCGAGGGCACATTCGATAAAATGGATTGGTTGAAGTTTAACTGTTCTCCTGAAGACAAGAGGTCTTCGTTACATGTTTTAGCTTAGCCACAATTTTATCCACCATTAAGTTCTATTGTTCTAGAACTCATCTAATCACTGAGGCACCAAGCAGCAGCTCAAGTCTTAACACCCTAGAAGCTTTTGTTACAGAAGAATTCAAAAGCAATAGAGCTTATCCACTTCACAGCTCTCCACCAGCTCAGCCTCAATGCCATGTCAATCAACTACTTATAGTACATCTGCTTCCAAGACAAAAATTAAGGCTCTACATATATGTTTTAGTCCCTAATATTTAGGTATTGTTTTTAATTTAATCCTTATTGTTTTGAAAGTTTCAATTTAGTTCCTTATGTTATAATATTTTTAAATTATTCTCTTAACCCATTTGGAGACCAAAGCTTGGTATGTATTTGAGGGAGACATAAAGTAGGTAGACATGCTAGATAACATAACAGTAGATCAAGTCTTCCCCCATTGGAGAAATCAGAGCCTTTTCACATGTCAAAGATTTTTTAAACTTTATTAGAAGCCCAATTTTCTTATTTTTATTAGAATATTACATCATAAGGGGTGAGAATTTGAACCTACGACCTCTAAAGAGGAAGTGAAGATGTCATAACCACTGAGTTATGCTCATGTTGGCCAGAAACCCAATTTTCTGGAGTCACCTCCCAACGACATACCGAGATAGCTAAGCAGCCACTTCTTTTGGGCAATCAAGATTCTCAAGTTATGATTCTATCATGTTATGGATAAGTTTATTTTT
GAGATATAGATTACATAGTAGTTTATTAATTGGCCTGAAAAATGAAACAGGACTGTTTCAGACACGGCATCGAAACGTATGACGTGTCTCCCGACCACCAGAAAGTTGGCTTTGAAGAACAAGGTTGTTTTTGGTACTGGTGACTACTGGAAGGCCGCAACCTTGACTGCTAATATGTCATTTGTTCGAGCAGTATCACAGACTGGAATGTCCCTCTTTACCTTTGAGCATAGGCCTCGAGCTTTAACGGGTGATTAG

mRNA sequence

ATGGAAGAACCTGATGGAGTAGAATTGGCAAAGCAAAGATGCAGAGCGGTCATCGACAGAATCCAAAGGCTGCCTTCTTCCACCAACATCACCATTTCAAGTAGGCGAACTCTCCTCAAATTGGCCCTTCGCGAGCTCAATTTCCTCTCTCGCTGCTCCTTCTCCACCCAACTCAGCTTGAACATTGGGCACCTCGAGGCCATTGTTCACATTCTTCAGCAGCCTTCTGTCACCGGAATTTCCCGCGTCTGTAAGCCGATTCCGTTGCATGCTCGTCGTATCGAGAGTGCGAAGGAAACCAAATCTTCCTGTTCGAAGGTTGTTTATGTCGATATAATCTGCACTTTGAATAGGAACCCAGTGTGGGTAATTGTATCAGTTAGAAAACCCAAGTACATTTCTTGGTATAAGGGCCACAGAAATAAGGGCTTAAAATCTCGACTAGAGGAAGTGGTTGATGCAGCTCGCTCTTTGGAGGCCTTAGAACCTTGCTCGATCATCTTGTTCTTTTCACATGGACTTGATCATTTGATTCTGGAAAGACTCCGAGATGAATTTATGGCCACTGAGTACAATTTTAATTTCTCAGATTTCGATTTTGGTTTCTCTGAGATTGAAGGTGATTGGGTTAATGTGCTTCCAAGAAGCTACAAAGAAGCCTGTGTTCTTGAAATAAAAGTTAATGATAGGAATTGTGGGGTTACGAGTTCAAATTGCAACCGTAAAGTATGTTGTACTGATGTGGATGAGCCAGAGCTTTTGGACAAGTATCTGAAGAGAGATCTGGGGGATCCTTTCTGCTCCATCGTTATGGCAATGAAACCTAATCCTATGGGTATGGAAGACATGGAATCTGCAAGTTTGGAACATTTGTTGGGTGGTGATAGTGATCTAATCAATTTTGATACGACAGCATTAATTGCATTAGTATCAGGCATCAGTAATGGTTGTGTTGCTAAATTATTGGCTACCCCAGAGAGTGAATTGAGACAGAAGTACAAGAGTAATTATGATTTTGTTATTGGTCAGGTGATGTCAGAAATTCAGAAGCCTATTCTTGTGGAGCTGAGTTCTCTTTTATCTGGAAAAAGAGGTATAATTTGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAGTAACAATGTGCGGAGGGCCTAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGACTGTTTCAGACACGGCATCGAAACGTATGACGTGTCTCCCGACCACCAGAAAGTTGGCTTTGAAGAACAAGGTTGTTTTTGGTACTGGTGACTACTGGAAGGCCGCAACCTTGACTGCTAATATGTCATTTGTTCGAGCAGTATCACAGACTGGAATGTCCCTCTTTACCTTTGAGCATAGGCCTCGAGCTTTAACGGGTGATTAG

Coding sequence (CDS)

ATGGAAGAACCTGATGGAGTAGAATTGGCAAAGCAAAGATGCAGAGCGGTCATCGACAGAATCCAAAGGCTGCCTTCTTCCACCAACATCACCATTTCAAGTAGGCGAACTCTCCTCAAATTGGCCCTTCGCGAGCTCAATTTCCTCTCTCGCTGCTCCTTCTCCACCCAACTCAGCTTGAACATTGGGCACCTCGAGGCCATTGTTCACATTCTTCAGCAGCCTTCTGTCACCGGAATTTCCCGCGTCTGTAAGCCGATTCCGTTGCATGCTCGTCGTATCGAGAGTGCGAAGGAAACCAAATCTTCCTGTTCGAAGGTTGTTTATGTCGATATAATCTGCACTTTGAATAGGAACCCAGTGTGGGTAATTGTATCAGTTAGAAAACCCAAGTACATTTCTTGGTATAAGGGCCACAGAAATAAGGGCTTAAAATCTCGACTAGAGGAAGTGGTTGATGCAGCTCGCTCTTTGGAGGCCTTAGAACCTTGCTCGATCATCTTGTTCTTTTCACATGGACTTGATCATTTGATTCTGGAAAGACTCCGAGATGAATTTATGGCCACTGAGTACAATTTTAATTTCTCAGATTTCGATTTTGGTTTCTCTGAGATTGAAGGTGATTGGGTTAATGTGCTTCCAAGAAGCTACAAAGAAGCCTGTGTTCTTGAAATAAAAGTTAATGATAGGAATTGTGGGGTTACGAGTTCAAATTGCAACCGTAAAGTATGTTGTACTGATGTGGATGAGCCAGAGCTTTTGGACAAGTATCTGAAGAGAGATCTGGGGGATCCTTTCTGCTCCATCGTTATGGCAATGAAACCTAATCCTATGGGTATGGAAGACATGGAATCTGCAAGTTTGGAACATTTGTTGGGTGGTGATAGTGATCTAATCAATTTTGATACGACAGCATTAATTGCATTAGTATCAGGCATCAGTAATGGTTGTGTTGCTAAATTATTGGCTACCCCAGAGAGTGAATTGAGACAGAAGTACAAGAGTAATTATGATTTTGTTATTGGTCAGGTGATGTCAGAAATTCAGAAGCCTATTCTTGTGGAGCTGAGTTCTCTTTTATCTGGAAAAAGAGGTATAATTTGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAGTAACAATGTGCGGAGGGCCTAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGACTGTTTCAGACACGGCATCGAAACGTATGACGTGTCTCCCGACCACCAGAAAGTTGGCTTTGAAGAACAAGGTTGTTTTTGGTACTGGTGACTACTGGAAGGCCGCAACCTTGACTGCTAATATGTCATTTGTTCGAGCAGTATCACAGACTGGAATGTCCCTCTTTACCTTTGAGCATAGGCCTCGAGCTTTAACGGGTGATTAG

Protein sequence

MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQLSLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLILERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSNCNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSDLINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKLALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
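The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the `translate` helper is illustrative rather than the database's own method.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_terminator=True):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_terminator:
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGAAGAACCTGATGGAGTAGAATTGGCA"))  # MEEPDGVELA
```

The final codons of the CDS, `GGTGATTAG`, translate to `GD` followed by a stop, which matches the `...ALTGD` end of the protein sequence.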
Homology
BLAST of Sgr022051 vs. NCBI nr
Match: XP_022152808.1 (UPF0415 protein C7orf25 homolog isoform X2 [Momordica charantia])

HSP 1 Score: 797.3 bits (2058), Expect = 6.8e-227
Identity = 403/468 (86.11%), Positives = 425/468 (90.81%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQLSL 60
           MEE DGVELAKQRCRAVIDRI+RLPSSTNIT+SSRRTLLKLALRELNFLSRCS ST LSL
Sbjct: 1   MEESDGVELAKQRCRAVIDRIERLPSSTNITLSSRRTLLKLALRELNFLSRCSSSTPLSL 60

Query: 61  NIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNP 120
           NIGHLEA+VHILQ PSVTGISRVCKPIPLH RR+ES        SKVV+VDIIC LNR P
Sbjct: 61  NIGHLEAVVHILQHPSVTGISRVCKPIPLHPRRVES--------SKVVHVDIICILNRKP 120

Query: 121 VWVIVSVRKPKYISWYKG-HRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLIL 180
           VWVIVS RKP+Y+SW  G HR+KGLKSRLEEVVDAARS  ALEPC IILFF+HGLDH IL
Sbjct: 121 VWVIVSDRKPRYVSWNDGRHRSKGLKSRLEEVVDAARSFPALEPCLIILFFAHGLDHFIL 180

Query: 181 ERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSNC 240
            RL+DEF A EY+F+FSDFDFGFSE EGDWVNVLPRSYKEACVLEIKV+D NCGV SSNC
Sbjct: 181 RRLQDEFRAAEYSFDFSDFDFGFSETEGDWVNVLPRSYKEACVLEIKVHDGNCGVKSSNC 240

Query: 241 NRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSDLI 300
           N + C T VDEPELL++ L RDLGDPFCSI+MAMKPNP+GMEDMES SLE+LLGGD DLI
Sbjct: 241 NSEACFTGVDEPELLERNLNRDLGDPFCSIIMAMKPNPVGMEDMESESLENLLGGDEDLI 300

Query: 301 NFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSL 360
           N DTTALIALVSGISNGCVAKLLATPESELR+KYKSNYDFVIGQVMSEIQKPILVELSSL
Sbjct: 301 NLDTTALIALVSGISNGCVAKLLATPESELRRKYKSNYDFVIGQVMSEIQKPILVELSSL 360

Query: 361 LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKLA 420
           LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIM V D ASKRMTCLPTTRKLA
Sbjct: 361 LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMVVPDRASKRMTCLPTTRKLA 420

Query: 421 LKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LKNKVVFGTGDYW A TL+ANMSFVRAVSQTGMSLFT+EHRPRALTGD
Sbjct: 421 LKNKVVFGTGDYWNAPTLSANMSFVRAVSQTGMSLFTYEHRPRALTGD 460

BLAST of Sgr022051 vs. NCBI nr
Match: XP_022152807.1 (UPF0415 protein C7orf25 homolog isoform X1 [Momordica charantia])

HSP 1 Score: 785.0 bits (2026), Expect = 3.5e-223
Identity = 403/489 (82.41%), Positives = 425/489 (86.91%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQLSL 60
           MEE DGVELAKQRCRAVIDRI+RLPSSTNIT+SSRRTLLKLALRELNFLSRCS ST LSL
Sbjct: 1   MEESDGVELAKQRCRAVIDRIERLPSSTNITLSSRRTLLKLALRELNFLSRCSSSTPLSL 60

Query: 61  NIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNP 120
           NIGHLEA+VHILQ PSVTGISRVCKPIPLH RR+ES        SKVV+VDIIC LNR P
Sbjct: 61  NIGHLEAVVHILQHPSVTGISRVCKPIPLHPRRVES--------SKVVHVDIICILNRKP 120

Query: 121 VWVIVSVRKPKYISWYKG-HRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLIL 180
           VWVIVS RKP+Y+SW  G HR+KGLKSRLEEVVDAARS  ALEPC IILFF+HGLDH IL
Sbjct: 121 VWVIVSDRKPRYVSWNDGRHRSKGLKSRLEEVVDAARSFPALEPCLIILFFAHGLDHFIL 180

Query: 181 ERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSNC 240
            RL+DEF A EY+F+FSDFDFGFSE EGDWVNVLPRSYKEACVLEIKV+D NCGV SSNC
Sbjct: 181 RRLQDEFRAAEYSFDFSDFDFGFSETEGDWVNVLPRSYKEACVLEIKVHDGNCGVKSSNC 240

Query: 241 NRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSDLI 300
           N + C T VDEPELL++ L RDLGDPFCSI+MAMKPNP+GMEDMES SLE+LLGGD DLI
Sbjct: 241 NSEACFTGVDEPELLERNLNRDLGDPFCSIIMAMKPNPVGMEDMESESLENLLGGDEDLI 300

Query: 301 NFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIG----------------- 360
           N DTTALIALVSGISNGCVAKLLATPESELR+KYKSNYDFVIG                 
Sbjct: 301 NLDTTALIALVSGISNGCVAKLLATPESELRRKYKSNYDFVIGQVNFVWRSFCSLSSTVI 360

Query: 361 ----QVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI 420
               QVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI
Sbjct: 361 FTSYQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI 420

Query: 421 MTVSDTASKRMTCLPTTRKLALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFE 468
           M V D ASKRMTCLPTTRKLALKNKVVFGTGDYW A TL+ANMSFVRAVSQTGMSLFT+E
Sbjct: 421 MVVPDRASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLSANMSFVRAVSQTGMSLFTYE 480

BLAST of Sgr022051 vs. NCBI nr
Match: XP_038906087.1 (UPF0415 protein C7orf25 homolog [Benincasa hispida])

HSP 1 Score: 780.0 bits (2013), Expect = 1.1e-221
Identity = 396/470 (84.26%), Positives = 421/470 (89.57%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRC--SFSTQL 60
           M EP+ +ELAKQRCRAVID I+ LPSSTNIT+SS RTL KLALRELNFLSRC  S ST L
Sbjct: 1   MAEPETLELAKQRCRAVIDIIETLPSSTNITVSSSRTLHKLALRELNFLSRCSSSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSVTGISRVCKPIP             SSCSK VYVDIICTL++
Sbjct: 61  SLNIGHLEAIVHILQHPSVTGISRVCKPIP-------------SSCSKPVYVDIICTLDK 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVWVIVS RKP+YISWYKGHR+KGLKSRLEEV+DAARSL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LE+LRDEF A E+NFNFSDFDFGFSEI+GDWVNVLPRSY+EA VLEIKVNDR CGVTS N
Sbjct: 181 LEKLRDEFKANEFNFNFSDFDFGFSEIDGDWVNVLPRSYEEALVLEIKVNDRKCGVTSLN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
            N   C T VD+PE+LD Y++RD+ DPFCS+VMAMKPNPM G+EDMESASLEH LGGD+D
Sbjct: 241 YNSTACSTGVDDPEILDNYVERDIWDPFCSVVMAMKPNPMIGIEDMESASLEHFLGGDND 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQ MSEI+KPILVELS
Sbjct: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIEKPILVELS 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           SLL+GKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI+ V D ASKRMTCLPTTRK
Sbjct: 361 SLLTGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHILVVPDMASKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNKVVFGTGDYW A TLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Sbjct: 421 LALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 457

BLAST of Sgr022051 vs. NCBI nr
Match: XP_022923546.1 (uncharacterized protein LOC111431203 isoform X2 [Cucurbita moschata])

HSP 1 Score: 767.3 bits (1980), Expect = 7.6e-218
Identity = 389/470 (82.77%), Positives = 415/470 (88.30%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRC--SFSTQL 60
           M EPD VELAKQRCRAV+D I+ LPSSTNIT+SS RTL KLALRELNFLSRC  S ST L
Sbjct: 1   MAEPDAVELAKQRCRAVMDMIEALPSSTNITLSSSRTLHKLALRELNFLSRCSSSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSV GISRVCKPIP             S CSK VYVDIICTLNR
Sbjct: 61  SLNIGHLEAIVHILQHPSVAGISRVCKPIP-------------SPCSKAVYVDIICTLNR 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVW+IVS RKP+YISW++GHR+KGLKSR+EEVVDAARSL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWIIVSDRKPRYISWHRGHRSKGLKSRIEEVVDAARSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LERLRDEF ATE+NFNFSD DF FSEI+ DWVNVLPR YKEACVLEIKVNDRNCG+TSSN
Sbjct: 181 LERLRDEFRATEFNFNFSDMDFDFSEIDDDWVNVLPRRYKEACVLEIKVNDRNCGITSSN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
           C  K+C T V+EPE+LDKY++RDLG PFCS+V AMKPNPM G+ED+ES SLEHLL GD+D
Sbjct: 241 CISKLCSTGVNEPEILDKYVERDLGVPFCSVVKAMKPNPMIGIEDLESTSLEHLLDGDTD 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGCVAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELS
Sbjct: 301 LINFDTTALIALVSGISNGCVAKLLATPEDELKQKYKSNYDFVIDQVMSEIQKPILVELS 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           S LSGKRGIICQSVHSEFKELVTMCGGP EKSR+N+LLKHIM V D ASKRMTCLPTTRK
Sbjct: 361 SFLSGKRGIICQSVHSEFKELVTMCGGPYEKSRSNYLLKHIMVVPDMASKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNK+VFGTGDYW A TLTANMSFVRAVSQTGMSL T EHRPRALTGD
Sbjct: 421 LALKNKIVFGTGDYWNAPTLTANMSFVRAVSQTGMSLLTLEHRPRALTGD 457

BLAST of Sgr022051 vs. NCBI nr
Match: XP_023553240.1 (uncharacterized protein LOC111810716 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 766.9 bits (1979), Expect = 9.9e-218
Identity = 389/470 (82.77%), Positives = 415/470 (88.30%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRC--SFSTQL 60
           M EPD VEL KQRCRAV+D I+ LPSSTNI++SS RTL KLALRELNFLSRC  S ST L
Sbjct: 1   MAEPDAVELPKQRCRAVMDMIEALPSSTNISVSSSRTLHKLALRELNFLSRCSSSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSV GISRVCKPIP             S CSK VYVDIICTLNR
Sbjct: 61  SLNIGHLEAIVHILQHPSVAGISRVCKPIP-------------SPCSKAVYVDIICTLNR 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVW+IVS RKP+YISW++GHR+KGLKSRLEEVVDAARSL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWIIVSDRKPRYISWHRGHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LERLRDEF ATE+NF+FSD DF FSEI+ DWVNVLPR YKEACVLEIKVNDRNCG+TSSN
Sbjct: 181 LERLRDEFRATEFNFSFSDIDFDFSEIDDDWVNVLPRRYKEACVLEIKVNDRNCGITSSN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
           C  K+C T VDEPE+LDKY++RDLG PFCS+V AMKPNPM G+ED+ES SLEHLL GD+D
Sbjct: 241 CISKLCSTGVDEPEILDKYVERDLGVPFCSVVKAMKPNPMIGIEDLESTSLEHLLDGDTD 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGCVAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELS
Sbjct: 301 LINFDTTALIALVSGISNGCVAKLLATPEDELKQKYKSNYDFVIDQVMSEIQKPILVELS 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           S LSGKRGIICQSVHSEFKELVTMCGGP EKSRAN+LLKHIM V D ASKRMTCLPTTRK
Sbjct: 361 SFLSGKRGIICQSVHSEFKELVTMCGGPYEKSRANYLLKHIMVVPDMASKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNK+VFGTGDYW A+TLTANMSFVRAVSQTGMSL T EHRPRALTGD
Sbjct: 421 LALKNKIVFGTGDYWNASTLTANMSFVRAVSQTGMSLLTLEHRPRALTGD 457

BLAST of Sgr022051 vs. ExPASy Swiss-Prot
Match: Q803H0 (UPF0415 protein C7orf25 homolog OS=Danio rerio OX=7955 GN=zgc:55781 PE=2 SV=1)

HSP 1 Score: 97.4 bits (241), Expect = 4.4e-19
Identity = 58/167 (34.73%), Positives = 83/167 (49.70%), Query Frame = 0

Query: 299 INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSS 358
           +N D T LI  VS +S+G                +      +  Q   E Q+ +L  L  
Sbjct: 264 VNLDITTLITYVSSLSHG-------------NCHFTFKEVVLTEQAAQERQEKVLPRLEE 323

Query: 359 LLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKL 418
            + GK    CQS   +F+ ++   GGP EKSRA  LL  +  V D  S+R   L  + K+
Sbjct: 324 FMKGKELFACQSAVEDFRVILDTLGGPGEKSRAEELLARLKVVPDQPSERTQRLVMSSKV 383

Query: 419 ALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALT 466
             ++ ++FGTGD  +A T+TAN  FVRA +  G+    F H+PRALT
Sbjct: 384 NRRSLMIFGTGDTLRAITMTANSGFVRAAANQGVRFSVFIHQPRALT 417

BLAST of Sgr022051 vs. ExPASy Swiss-Prot
Match: Q08AW5 (UPF0415 protein C7orf25 homolog OS=Xenopus laevis OX=8355 PE=2 SV=1)

HSP 1 Score: 93.2 bits (230), Expect = 8.3e-18
Identity = 59/167 (35.33%), Positives = 86/167 (51.50%), Query Frame = 0

Query: 299 INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSS 358
           +N D T LI  VS +S+G    L        ++K  +       Q   E Q+ +L  L S
Sbjct: 251 VNLDITTLITYVSALSHGGCHWL-------FKEKVLTE------QAAQERQEKVLPLLKS 310

Query: 359 LLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKL 418
            +  K    C+S   +F+ ++   GGP EK RA  L+K I  V D  S+R + L  + K+
Sbjct: 311 FMEAKELFACESAIKDFQSILETLGGPAEKERAALLVKSITVVPDQPSERASKLACSSKI 370

Query: 419 ALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALT 466
             ++  +FGTG+  KA T+TAN  FVRA +  G+    F H+PRALT
Sbjct: 371 NSRSISIFGTGETLKAITMTANSGFVRAAANQGVKFSVFIHQPRALT 404

BLAST of Sgr022051 vs. ExPASy Swiss-Prot
Match: Q91WD4 (UPF0415 protein C7orf25 homolog OS=Mus musculus OX=10090 PE=2 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 1.1e-17
Identity = 56/168 (33.33%), Positives = 84/168 (50.00%), Query Frame = 0

Query: 299 INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 358
           +N D T LI  VS +S  GC               +      +  Q   E ++ +L +L 
Sbjct: 248 VNLDITTLITYVSAMSYGGC--------------HFVFKEKVLTEQAEQERKERVLPQLE 307

Query: 359 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 418
           + +  K    C+S   +F+ ++   GGP E+ RA+ L+K I  V D  S+R   L  + K
Sbjct: 308 AFMKDKELFACESAVKDFQSILDTLGGPGERERADVLIKRISVVPDQPSERALRLVASSK 367

Query: 419 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALT 466
           +  ++  +FGTGD  KA T+TAN  FVRA +  G+    F H+PRALT
Sbjct: 368 INSRSLTIFGTGDTLKAITMTANSGFVRAANNQGVKFSVFIHQPRALT 401

BLAST of Sgr022051 vs. ExPASy Swiss-Prot
Match: Q5BKL1 (UPF0415 protein C7orf25 homolog OS=Xenopus tropicalis OX=8364 GN=TGas015c11.1 PE=2 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 1.1e-17
Identity = 58/167 (34.73%), Positives = 86/167 (51.50%), Query Frame = 0

Query: 299 INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSS 358
           +N D T LI  VS +S+G         E   ++K  +       Q   E Q+ +L  L+S
Sbjct: 251 VNLDITTLITYVSALSHGGC-------EWIFKEKVLTE------QAAQERQEKVLPLLNS 310

Query: 359 LLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKL 418
            +  K    C+    +F+ ++   GGP EK RA  L+K I  V D  S+R   L ++ K+
Sbjct: 311 FMEAKELFACECAVKDFQSILETLGGPAEKERAASLVKRITVVPDQPSERALQLASSSKI 370

Query: 419 ALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALT 466
             ++  +FGTG+  KA T+TAN  FVRA +  G+    F H+PRALT
Sbjct: 371 NSRSISIFGTGESLKAITMTANSGFVRAAANQGVKFSVFIHQPRALT 404

BLAST of Sgr022051 vs. ExPASy Swiss-Prot
Match: Q9BPX7 (UPF0415 protein C7orf25 OS=Homo sapiens OX=9606 GN=C7orf25 PE=1 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 1.4e-17
Identity = 56/168 (33.33%), Positives = 83/168 (49.40%), Query Frame = 0

Query: 299 INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 358
           +N D T LI  VS +S  GC               +      +  Q   E ++ +L +L 
Sbjct: 248 VNLDITTLITYVSALSYGGC--------------HFIFKEKVLTEQAEQERKEQVLPQLE 307

Query: 359 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 418
           + +  K    C+S   +F+ ++   GGP E+ RA  L+K I  V D  S+R   L  + K
Sbjct: 308 AFMKDKELFACESAVKDFQSILDTLGGPGERERATVLIKRINVVPDQPSERALRLVASSK 367

Query: 419 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALT 466
           +  ++  +FGTGD  KA T+TAN  FVRA +  G+    F H+PRALT
Sbjct: 368 INSRSLTIFGTGDTLKAITMTANSGFVRAANNQGVKFSVFIHQPRALT 401

BLAST of Sgr022051 vs. ExPASy TrEMBL
Match: A0A6J1DFW3 (UPF0415 protein C7orf25 homolog isoform X2 OS=Momordica charantia OX=3673 GN=LOC111020434 PE=4 SV=1)

HSP 1 Score: 797.3 bits (2058), Expect = 3.3e-227
Identity = 403/468 (86.11%), Positives = 425/468 (90.81%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQLSL 60
           MEE DGVELAKQRCRAVIDRI+RLPSSTNIT+SSRRTLLKLALRELNFLSRCS ST LSL
Sbjct: 1   MEESDGVELAKQRCRAVIDRIERLPSSTNITLSSRRTLLKLALRELNFLSRCSSSTPLSL 60

Query: 61  NIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNP 120
           NIGHLEA+VHILQ PSVTGISRVCKPIPLH RR+ES        SKVV+VDIIC LNR P
Sbjct: 61  NIGHLEAVVHILQHPSVTGISRVCKPIPLHPRRVES--------SKVVHVDIICILNRKP 120

Query: 121 VWVIVSVRKPKYISWYKG-HRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLIL 180
           VWVIVS RKP+Y+SW  G HR+KGLKSRLEEVVDAARS  ALEPC IILFF+HGLDH IL
Sbjct: 121 VWVIVSDRKPRYVSWNDGRHRSKGLKSRLEEVVDAARSFPALEPCLIILFFAHGLDHFIL 180

Query: 181 ERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSNC 240
            RL+DEF A EY+F+FSDFDFGFSE EGDWVNVLPRSYKEACVLEIKV+D NCGV SSNC
Sbjct: 181 RRLQDEFRAAEYSFDFSDFDFGFSETEGDWVNVLPRSYKEACVLEIKVHDGNCGVKSSNC 240

Query: 241 NRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSDLI 300
           N + C T VDEPELL++ L RDLGDPFCSI+MAMKPNP+GMEDMES SLE+LLGGD DLI
Sbjct: 241 NSEACFTGVDEPELLERNLNRDLGDPFCSIIMAMKPNPVGMEDMESESLENLLGGDEDLI 300

Query: 301 NFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSL 360
           N DTTALIALVSGISNGCVAKLLATPESELR+KYKSNYDFVIGQVMSEIQKPILVELSSL
Sbjct: 301 NLDTTALIALVSGISNGCVAKLLATPESELRRKYKSNYDFVIGQVMSEIQKPILVELSSL 360

Query: 361 LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRKLA 420
           LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIM V D ASKRMTCLPTTRKLA
Sbjct: 361 LSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMVVPDRASKRMTCLPTTRKLA 420

Query: 421 LKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LKNKVVFGTGDYW A TL+ANMSFVRAVSQTGMSLFT+EHRPRALTGD
Sbjct: 421 LKNKVVFGTGDYWNAPTLSANMSFVRAVSQTGMSLFTYEHRPRALTGD 460

BLAST of Sgr022051 vs. ExPASy TrEMBL
Match: A0A6J1DH98 (UPF0415 protein C7orf25 homolog isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020434 PE=4 SV=1)

HSP 1 Score: 785.0 bits (2026), Expect = 1.7e-223
Identity = 403/489 (82.41%), Positives = 425/489 (86.91%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQLSL 60
           MEE DGVELAKQRCRAVIDRI+RLPSSTNIT+SSRRTLLKLALRELNFLSRCS ST LSL
Sbjct: 1   MEESDGVELAKQRCRAVIDRIERLPSSTNITLSSRRTLLKLALRELNFLSRCSSSTPLSL 60

Query: 61  NIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNP 120
           NIGHLEA+VHILQ PSVTGISRVCKPIPLH RR+ES        SKVV+VDIIC LNR P
Sbjct: 61  NIGHLEAVVHILQHPSVTGISRVCKPIPLHPRRVES--------SKVVHVDIICILNRKP 120

Query: 121 VWVIVSVRKPKYISWYKG-HRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLIL 180
           VWVIVS RKP+Y+SW  G HR+KGLKSRLEEVVDAARS  ALEPC IILFF+HGLDH IL
Sbjct: 121 VWVIVSDRKPRYVSWNDGRHRSKGLKSRLEEVVDAARSFPALEPCLIILFFAHGLDHFIL 180

Query: 181 ERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSNC 240
            RL+DEF A EY+F+FSDFDFGFSE EGDWVNVLPRSYKEACVLEIKV+D NCGV SSNC
Sbjct: 181 RRLQDEFRAAEYSFDFSDFDFGFSETEGDWVNVLPRSYKEACVLEIKVHDGNCGVKSSNC 240

Query: 241 NRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSDLI 300
           N + C T VDEPELL++ L RDLGDPFCSI+MAMKPNP+GMEDMES SLE+LLGGD DLI
Sbjct: 241 NSEACFTGVDEPELLERNLNRDLGDPFCSIIMAMKPNPVGMEDMESESLENLLGGDEDLI 300

Query: 301 NFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIG----------------- 360
           N DTTALIALVSGISNGCVAKLLATPESELR+KYKSNYDFVIG                 
Sbjct: 301 NLDTTALIALVSGISNGCVAKLLATPESELRRKYKSNYDFVIGQVNFVWRSFCSLSSTVI 360

Query: 361 ----QVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI 420
               QVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI
Sbjct: 361 FTSYQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHI 420

Query: 421 MTVSDTASKRMTCLPTTRKLALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFE 468
           M V D ASKRMTCLPTTRKLALKNKVVFGTGDYW A TL+ANMSFVRAVSQTGMSLFT+E
Sbjct: 421 MVVPDRASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLSANMSFVRAVSQTGMSLFTYE 480

BLAST of Sgr022051 vs. ExPASy TrEMBL
Match: A0A6J1E731 (uncharacterized protein LOC111431203 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431203 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 3.7e-218
Identity = 389/470 (82.77%), Positives = 415/470 (88.30%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRC--SFSTQL 60
           M EPD VELAKQRCRAV+D I+ LPSSTNIT+SS RTL KLALRELNFLSRC  S ST L
Sbjct: 1   MAEPDAVELAKQRCRAVMDMIEALPSSTNITLSSSRTLHKLALRELNFLSRCSSSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSV GISRVCKPIP             S CSK VYVDIICTLNR
Sbjct: 61  SLNIGHLEAIVHILQHPSVAGISRVCKPIP-------------SPCSKAVYVDIICTLNR 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVW+IVS RKP+YISW++GHR+KGLKSR+EEVVDAARSL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWIIVSDRKPRYISWHRGHRSKGLKSRIEEVVDAARSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LERLRDEF ATE+NFNFSD DF FSEI+ DWVNVLPR YKEACVLEIKVNDRNCG+TSSN
Sbjct: 181 LERLRDEFRATEFNFNFSDMDFDFSEIDDDWVNVLPRRYKEACVLEIKVNDRNCGITSSN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
           C  K+C T V+EPE+LDKY++RDLG PFCS+V AMKPNPM G+ED+ES SLEHLL GD+D
Sbjct: 241 CISKLCSTGVNEPEILDKYVERDLGVPFCSVVKAMKPNPMIGIEDLESTSLEHLLDGDTD 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGCVAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELS
Sbjct: 301 LINFDTTALIALVSGISNGCVAKLLATPEDELKQKYKSNYDFVIDQVMSEIQKPILVELS 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           S LSGKRGIICQSVHSEFKELVTMCGGP EKSR+N+LLKHIM V D ASKRMTCLPTTRK
Sbjct: 361 SFLSGKRGIICQSVHSEFKELVTMCGGPYEKSRSNYLLKHIMVVPDMASKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNK+VFGTGDYW A TLTANMSFVRAVSQTGMSL T EHRPRALTGD
Sbjct: 421 LALKNKIVFGTGDYWNAPTLTANMSFVRAVSQTGMSLLTLEHRPRALTGD 457

BLAST of Sgr022051 vs. ExPASy TrEMBL
Match: A0A5D3D7K2 (UPF0415 protein C7orf25-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold443G00790 PE=4 SV=1)

HSP 1 Score: 765.0 bits (1974), Expect = 1.8e-217
Identity = 388/470 (82.55%), Positives = 417/470 (88.72%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSF--STQL 60
           M EP+ VELAKQRC+A++D I+ LPSSTNI++S  +TL KLALRELNFLSRCSF  ST L
Sbjct: 1   MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLQKLALRELNFLSRCSFSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSVTGISRVCKPIP             SS SK VYVDIICTLNR
Sbjct: 61  SLNIGHLEAIVHILQHPSVTGISRVCKPIP------------SSSSSKAVYVDIICTLNR 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVWVIVS RKP+YISWYKGHR+KGLKSRLEEV+DAA SL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAAGSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LERLRDEF ATE++FNFSDFDFGFSEI+GDW+NVL RSYKEACVLEIKV+DRNCG TSSN
Sbjct: 181 LERLRDEFKATEFHFNFSDFDFGFSEIDGDWINVLSRSYKEACVLEIKVSDRNCGATSSN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
            N KVC + VDEP++L+   + DLGD FCS+VMAMKPNPM G+EDMESA+LE LLGGDSD
Sbjct: 241 YNSKVCSSGVDEPDILNSNTEIDLGDSFCSVVMAMKPNPMNGIEDMESANLEQLLGGDSD 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGC AKLLATPE+EL+QKYKSNYDFVIGQ MSEI+KPILVEL 
Sbjct: 301 LINFDTTALIALVSGISNGCAAKLLATPENELKQKYKSNYDFVIGQAMSEIKKPILVELG 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           SLLSGKRGIICQSVHSEFKEL+TMCGGPNEKSRANHLLKHIM V D  SKRMTCLPTTRK
Sbjct: 361 SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNKVVFGTGDYW A TLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Sbjct: 421 LALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 458

BLAST of Sgr022051 vs. ExPASy TrEMBL
Match: A0A1S3BKD2 (UPF0415 protein C7orf25 homolog OS=Cucumis melo OX=3656 GN=LOC103490978 PE=4 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 5.3e-217
Identity = 387/470 (82.34%), Positives = 417/470 (88.72%), Query Frame = 0

Query: 1   MEEPDGVELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSF--STQL 60
           M EP+ VELAKQRC+A++D I+ LPSSTNI++S  +TL KLALRELNFLSRCSF  ST L
Sbjct: 1   MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLQKLALRELNFLSRCSFSSSTPL 60

Query: 61  SLNIGHLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNR 120
           SLNIGHLEAIVHILQ PSVTGISRVCKPIP             SS SK VYVDIICTLNR
Sbjct: 61  SLNIGHLEAIVHILQHPSVTGISRVCKPIP------------SSSSSKAVYVDIICTLNR 120

Query: 121 NPVWVIVSVRKPKYISWYKGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLI 180
           NPVWVIVS RKP+YISWYKGHR+KGLKSRLEEV+DAA SL+ALEPCSIILFFSHGLD  I
Sbjct: 121 NPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAAGSLQALEPCSIILFFSHGLDQFI 180

Query: 181 LERLRDEFMATEYNFNFSDFDFGFSEIEGDWVNVLPRSYKEACVLEIKVNDRNCGVTSSN 240
           LERLRDEF ATE++FNFSDFDFGFSEI+GDW+NVL RSY+EACVLEIKV+DRNCG TSSN
Sbjct: 181 LERLRDEFKATEFHFNFSDFDFGFSEIDGDWINVLSRSYEEACVLEIKVSDRNCGATSSN 240

Query: 241 CNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPM-GMEDMESASLEHLLGGDSD 300
            N KVC + VDEP++L+   + DLGD FCS+VMAMKPNPM G+EDMESA+LE LLGGDSD
Sbjct: 241 YNSKVCSSGVDEPDILNSNTEIDLGDSFCSVVMAMKPNPMNGIEDMESANLEQLLGGDSD 300

Query: 301 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 360
           LINFDTTALIALVSGISNGC AKLLATPE+EL+QKYKSNYDFVIGQ MSEI+KPILVEL 
Sbjct: 301 LINFDTTALIALVSGISNGCAAKLLATPENELKQKYKSNYDFVIGQAMSEIKKPILVELG 360

Query: 361 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 420
           SLLSGKRGIICQSVHSEFKEL+TMCGGPNEKSRANHLLKHIM V D  SKRMTCLPTTRK
Sbjct: 361 SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRK 420

Query: 421 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LALKNKVVFGTGDYW A TLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Sbjct: 421 LALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 458

BLAST of Sgr022051 vs. TAIR 10
Match: AT1G73380.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1308 (InterPro:IPR010733); Has 162 Blast hits to 160 proteins in 67 species: Archae - 0; Bacteria - 2; Metazoa - 120; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 417.9 bits (1073), Expect = 1.1e-116
Identity = 244/470 (51.91%), Positives = 312/470 (66.38%), Query Frame = 0

Query: 7   VELAKQRCRAVIDRIQRLPSSTNITISSRRTLLKLALRELNFLSRCSFSTQ---LSLNIG 66
           +E+AKQRC +VI  I+ LP ST IT S RRTLLKLA  EL+FLS  S       LS+NIG
Sbjct: 6   IEIAKQRCESVIRTIENLPLSTAITASCRRTLLKLASSELSFLSSLSSDPSPKPLSVNIG 65

Query: 67  HLEAIVHILQQPSVTGISRVCKPIPLHARRIESAKETKSSCSKVVYVDIICTLNRNPVWV 126
           H+E++V ILQ PS+TG+SRVCKPIPL                  V+VD++CTL + PVW+
Sbjct: 66  HIESVVRILQLPSITGVSRVCKPIPLPIGG--------------VHVDLVCTLGKVPVWI 125

Query: 127 IVSVRKPKYISWY-KGHRNKGLKSRLEEVVDAARSLEALEPCSIILFFSHGLDHLILERL 186
           IVS R P+YISW    H +KGL+SR+E+++ AA S   L+P S+ILFF++GL   + E+L
Sbjct: 126 IVSDRNPRYISWNGDRHGSKGLRSRIEQILAAANSTTTLKPSSVILFFANGLPSSVYEKL 185

Query: 187 RDEFMATEYNFNF-SDFDFGFS---EIEGDWVNVL-PRSYKEACVLEIKVNDRNCGVTSS 246
           +DEF A  ++F F SD D   S   + + +WVNV+  RSYKEA  +EIK+ D+   + S 
Sbjct: 186 KDEFGAVYFDFGFDSDSDSDISMLDDFDCEWVNVVRTRSYKEAVSIEIKLIDQCDSLASP 245

Query: 247 NCNRKVCCTDVDEPELLDKYLKRDLGDPFCSIVMAMKPNPMGMEDMESASLEHLLGGDSD 306
                V     +  EL  K       D F +++ +M+                LLG D  
Sbjct: 246 ETEVLV---QAEVTELSQK-------DAFSTVISSMR----------------LLGEDC- 305

Query: 307 LINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELS 366
           LINFDTTAL+ALVSGISNGC  +L+  PE EL +K+K N  FVI Q  SEI+KP LV++ 
Sbjct: 306 LINFDTTALVALVSGISNGCAERLVDMPEIELEEKFKGNTVFVIAQARSEIEKPGLVKVG 365

Query: 367 SLLSGKRGIICQSVHSEFKELVTMCGGPNEKSRANHLLKHIMTVSDTASKRMTCLPTTRK 426
           ++LSGKRGI+C+SV SEFKELV+M  GPNEK RA  LLK +M V+D  S+R+  LPTTRK
Sbjct: 366 TVLSGKRGIVCKSVFSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPTTRK 425

Query: 427 LALKNKVVFGTGDYWKAATLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 468
           LA+KNK VFGTGD W A TLTANM+FVRAV+Q+GMSL T +H PRALTGD
Sbjct: 426 LAMKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 434
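The percentages in the HSP statistics lines above are the match counts divided by the alignment length. A minimal Python sketch, assuming the line layout shown on this page, parses such a line and recomputes the percentages. The function names `parse_hsp_stats` and `percent` are hypothetical helpers and are not part of any BLAST tool.

```python
import re

# Pattern for an HSP statistics line as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" that some exports contain.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment length, reported percent) for identity and positives."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Statistics lines copied from the two alignments in this section.
lines = [
    "Identity = 387/470 (82.34%), Positives = 417/470 (88.72%), Query Frame = 0",
    "Identity = 244/470 (51.91%), Positives = 312/470 (66.38%), Query Frame = 0",
]
for line in lines:
    stats = parse_hsp_stats(line)
    for key, (matches, length, reported) in stats.items():
        print(key, matches, length, reported, percent(matches, length))
```

The denominator (470) is the alignment length including gap columns, which is why it exceeds the 468-residue query.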

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022152808.1  6.8e-227  86.11     UPF0415 protein C7orf25 homolog isoform X2 [Momordica charantia]
XP_022152807.1  3.5e-223  82.41     UPF0415 protein C7orf25 homolog isoform X1 [Momordica charantia]
XP_038906087.1  1.1e-221  84.26     UPF0415 protein C7orf25 homolog [Benincasa hispida]
XP_022923546.1  7.6e-218  82.77     uncharacterized protein LOC111431203 isoform X2 [Cucurbita moschata]
XP_023553240.1  9.9e-218  82.77     uncharacterized protein LOC111810716 [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity  Description
Q803H0          4.4e-19   34.73     UPF0415 protein C7orf25 homolog OS=Danio rerio OX=7955 GN=zgc:55781 PE=2 SV=1
Q08AW5          8.3e-18   35.33     UPF0415 protein C7orf25 homolog OS=Xenopus laevis OX=8355 PE=2 SV=1
Q91WD4          1.1e-17   33.33     UPF0415 protein C7orf25 homolog OS=Mus musculus OX=10090 PE=2 SV=1
Q5BKL1          1.1e-17   34.73     UPF0415 protein C7orf25 homolog OS=Xenopus tropicalis OX=8364 GN=TGas015c11.1 PE...
Q9BPX7          1.4e-17   33.33     UPF0415 protein C7orf25 OS=Homo sapiens OX=9606 GN=C7orf25 PE=1 SV=1

Match Name      E-value   Identity  Description
A0A6J1DFW3      3.3e-227  86.11     UPF0415 protein C7orf25 homolog isoform X2 OS=Momordica charantia OX=3673 GN=LOC...
A0A6J1DH98      1.7e-223  82.41     UPF0415 protein C7orf25 homolog isoform X1 OS=Momordica charantia OX=3673 GN=LOC...
A0A6J1E731      3.7e-218  82.77     uncharacterized protein LOC111431203 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A5D3D7K2      1.8e-217  82.55     UPF0415 protein C7orf25-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A1S3BKD2      5.3e-217  82.34     UPF0415 protein C7orf25 homolog OS=Cucumis melo OX=3656 GN=LOC103490978 PE=4 SV=...

Match Name      E-value   Identity  Description
AT1G73380.1     1.1e-116  51.91     unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1308...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source   Source Term  Source Description       Alignment
None       No IPR available                    COILS    Coil         Coil                     coord: 141..161
None       No IPR available                    PANTHER  PTHR13379    UNCHARACTERIZED DUF1308  coord: 7..467
IPR010733  Domain of unknown function DUF1308  PFAM     PF07000      DUF1308                  coord: 299..465; e-value: 1.3E-31; score: 109.8
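
The `coord` ranges in the InterPro annotations above can be turned into span lengths and coverage of the protein. The following Python sketch rests on two assumptions: the ranges are 1-based and inclusive, and the protein is 468 residues long, a figure taken from the last query position in the BLAST alignments rather than stated in the InterPro table. The name `span_length` is a hypothetical helper.

```python
# Assumption: protein length taken from the final query coordinate (468)
# in the BLAST alignments on this page.
PROTEIN_LENGTH = 468


def span_length(coord):
    """Length of an inclusive 'start..end' coordinate string (assumed 1-based)."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1


# Source, source term, and coordinates copied from the InterPro table.
hits = [
    ("COILS", "Coil", "141..161"),
    ("PANTHER", "PTHR13379", "7..467"),
    ("PFAM", "PF07000", "299..465"),
]

for source, term, coord in hits:
    n = span_length(coord)
    coverage = 100.0 * n / PROTEIN_LENGTH
    print(f"{source} {term}: {n} residues, {coverage:.1f}% of the protein")
```

Under these assumptions the Pfam DUF1308 hit lies in the C-terminal part of the protein, while the PANTHER family match spans nearly the full length.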

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr022051.1   Sgr022051.1  mRNA