CsGy7G000500 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy7G000500
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Retrotran_gag_3 domain-containing protein
Location: Gy14Chr7: 619538 .. 621735 (+)
RNA-Seq Expression: CsGy7G000500
Synteny: CsGy7G000500
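The Location field can be turned into a gene span length with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends (the usual convention for genome browser displays, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a location such as 'Gy14Chr7: 619538 .. 621735 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Gy14Chr7: 619538 .. 621735 (+)")
print(seqid, strand, span_length(start, end))  # Gy14Chr7 + 2198
```

Under that assumption the gene spans 2198 bases on the plus strand of Gy14Chr7.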
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAATCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCATGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGTTCTGATACTTTTGAACCTTCTTCCTATTTCTCTCTATCTAATCTTGTTTTTGTTCTTAATATCATTTCTAGTTTCCTTTTTGTTCATTAACTTTGTGTTAAAAATAATTGTCTTATCGCTTTTTGTTCCGAATAATTTTCAATTCAGGACAAGTTTTCGGGAAAATTTTTGTTCCAAAGGCCTAGCATTGATGATCTGATCACTTCTAAGGCTGTGGTTGCTTCTAGTTTAGCTTCCACCAGTCTGTTGTTCTACAGTTGCTAATGTTGCTGACAAGTCTTCTTTTTCTTATATTGCTGTTCTTCATAACTTCGTTGTTTCCTTGTTGCATTTGCCGTTTACTTCAAATAAAGTGAGTGTCAATATTGTCTACATGGAAAAAGTGAGGAATCATTCTTTTACTTGATCAAACAAGGTTTCTGTATATTTCTTTTAGAGCCTTTTCATATTGATGTCTGTGATTCTCTCACTAAAATTTCTATAAATGGTTTGAGTTGTTTAAGCTACCATATTGAATCAATATTCTTTCAATCACTGATATTTTGAAAAGGTTGAAATTGATTTTCAAGAAGCTAAAAGTATTACAAATTTAGGATTCAATTCGACACAAATTTCAATCTATCTAGTAAGTTAATTAAATTTTAATAGTGTTAATGTACGAAGTGTCTATCATTGACATGAAGTTGAAGTTCATCAACTTATTAGACATATCCAGGAAAATTTTTAACAAAAACAAAAGTTTTTTTGAAAGTTGAGAATTTATTAAACACTTTTAAGGTTGATCCAACTTATTAAAAAATAGTTTGAGAGTTTAAATCTATTGTATGTCACTAAAATAT
GTCATCCTTAAAACTTTGGCTATATTCTATTAAAACTGTTTTTACCTTATATGCTTTGAACCTACAACAATATTTTCCAACCATTTCTAGAACCTGATTTCTAAAATTGTTTTCTCCAAAACAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGA

mRNA sequence

ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAATCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCATGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGA

Coding sequence (CDS)

ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAATCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCATGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGA

Protein sequence

MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA*
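The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies, builds the codon table by hand rather than using a bioinformatics library, and translates just the first 30 and last 15 nucleotides of the CDS shown above (the names `CODON_TABLE` and `translate` are hypothetical).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGAGTTCCTCAACTACCTTGCCTTCTTCT"  # first 30 nt of the CDS
cds_end = "ACTAACCATGCATGA"                  # last 15 nt of the CDS

print(translate(cds_start))  # MSSSTTLPSS
print(translate(cds_end))    # TNHA*
```

Both fragments agree with the start (MSSSTTLPSS...) and end (...TNHA*) of the protein sequence listed above.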
Homology
BLAST of CsGy7G000500 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 3.4e-31
Identity = 105/372 (28.23%), Positives = 177/372 (47.58%), Query Frame = 0

Query: 33  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKD 92
           +L STN+++W  Q+ A+   ++L GF+DG+   P   P+T  T   P+ NP Y  W  +D
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP---PATIGTDAAPRVNPDYTRWKRQD 84

Query: 93  QALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDE 152
           + + + +   +S      V  +T++ Q+W+ L K+Y++ S  +V  L++ L+  + K  +
Sbjct: 85  KLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQ-WTKGTK 144

Query: 153 SIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELH 212
           +ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +  P T  E+H
Sbjct: 145 TIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIH 204

Query: 213 VLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGK-----NYG 272
             L   ES +   S         TV+  ++ ++     T  NN   GN + +     N  
Sbjct: 205 ERLLNHESKILAVSSA-------TVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNN 264

Query: 273 HGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNY--NFQGRHPPQQ 332
           + +    + T  H  + + KP       CQIC  +GH+A  C    ++  +   + PP  
Sbjct: 265 NSKPWQQSSTNFHPNNNQSKPY---LGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP 324

Query: 333 LAAMVASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPI 392
                   N A  S  +S++ L DSG   HITSD N +SL   Y G + V V +G T PI
Sbjct: 325 FTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPI 382

Query: 393 SHSGHFKLELAS 396
           SH+G   L   S
Sbjct: 385 SHTGSTSLSTKS 382

BLAST of CsGy7G000500 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 8.1e-25
Identity = 98/372 (26.34%), Positives = 168/372 (45.16%), Query Frame = 0

Query: 33  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKD 92
           +L STN+++W  Q+ A+   ++L GF+DG+ P P   P+T  T   P+ NP Y  W  +D
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMP---PATIGTDAVPRVNPDYTRWRRQD 84

Query: 93  QALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDE 152
           + + + I   +S      V  +T++ Q+W+ L K+Y++ S  +V  L+            
Sbjct: 85  KLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR------------ 144

Query: 153 SIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELH 212
               +I R     D+LA +   ++ ++ +   L  LP++Y      +  +  P +  E+H
Sbjct: 145 ----FITRF----DQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIH 204

Query: 213 VLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFS 272
             L   ES L   +  +        ++  + ++++   T  N      G  +NY +    
Sbjct: 205 ERLINRESKLLALNSAE--------VVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNR 264

Query: 273 FDA-QTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA 332
            ++ Q    G   + +        CQICS +GH+A  C     + FQ     QQ  +   
Sbjct: 265 SNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRC--PQLHQFQSTTNQQQSTSPFT 324

Query: 333 ----SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISH 392
                 N A  S  N+++ L DSG   HITSD N +S    Y G + V + +G T PI+H
Sbjct: 325 PWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITH 363

Query: 393 SGHFKLELASCS 398
           +G   L  +S S
Sbjct: 385 TGSASLPTSSRS 363

BLAST of CsGy7G000500 vs. NCBI nr
Match: KAE8645659.1 (hypothetical protein Csa_020439 [Cucumis sativus])

HSP 1 Score: 809 bits (2089), Expect = 2.89e-295
Identity = 409/410 (99.76%), Positives = 410/410 (100.00%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60

Query: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120
           GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW
Sbjct: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120

Query: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180
           DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL
Sbjct: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180

Query: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240
           IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS
Sbjct: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240

Query: 241 SQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300
           SQSL+SCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Sbjct: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300

Query: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360
           RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY
Sbjct: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360

Query: 361 VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410
           VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA
Sbjct: 361 VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410

BLAST of CsGy7G000500 vs. NCBI nr
Match: XP_008448007.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo])

HSP 1 Score: 764 bits (1972), Expect = 2.46e-277
Identity = 389/412 (94.42%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsGy7G000500 vs. NCBI nr
Match: XP_011658579.1 (uncharacterized protein LOC105436058 [Cucumis sativus])

HSP 1 Score: 763 bits (1970), Expect = 2.51e-277
Identity = 386/387 (99.74%), Positives = 387/387 (100.00%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60

Query: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120
           GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW
Sbjct: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120

Query: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180
           DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL
Sbjct: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180

Query: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240
           IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS
Sbjct: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240

Query: 241 SQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300
           SQSL+SCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Sbjct: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300

Query: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360
           RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY
Sbjct: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360

Query: 361 VSLAPEYNGEEQVGVGNGQTRPISHSG 387
           VSLAPEYNGEEQVGVGNGQTRPISHSG
Sbjct: 361 VSLAPEYNGEEQVGVGNGQTRPISHSG 387

BLAST of CsGy7G000500 vs. NCBI nr
Match: XP_016900446.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo])

HSP 1 Score: 764 bits (1972), Expect = 3.72e-277
Identity = 389/412 (94.42%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsGy7G000500 vs. NCBI nr
Match: XP_008448008.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo])

HSP 1 Score: 722 bits (1863), Expect = 5.46e-261
Identity = 368/389 (94.60%), Positives = 378/389 (97.17%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSG 387
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSG 389

BLAST of CsGy7G000500 vs. ExPASy TrEMBL
Match: A0A1S3BI58 (uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 764 bits (1972), Expect = 1.19e-277
Identity = 389/412 (94.42%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsGy7G000500 vs. ExPASy TrEMBL
Match: A0A1S4DWT9 (uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 764 bits (1972), Expect = 1.80e-277
Identity = 389/412 (94.42%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsGy7G000500 vs. ExPASy TrEMBL
Match: A0A1S3BIR3 (uncharacterized protein LOC103490319 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 722 bits (1863), Expect = 2.64e-261
Identity = 368/389 (94.60%), Positives = 378/389 (97.17%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSG 387
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSG 389

BLAST of CsGy7G000500 vs. ExPASy TrEMBL
Match: A0A5D3CLI6 (T4.5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold106G001120 PE=4 SV=1)

HSP 1 Score: 721 bits (1862), Expect = 6.13e-260
Identity = 371/403 (92.06%), Positives = 382/403 (94.79%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVA 401
           NYVSLAPEYNGEEQVG+GNGQTRP+SHS +      S  A VA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSVYLPPVCCSTVAYVA 403

BLAST of CsGy7G000500 vs. ExPASy TrEMBL
Match: A0A6J1D9L6 (uncharacterized protein LOC111018892 OS=Momordica charantia OX=3673 GN=LOC111018892 PE=4 SV=1)

HSP 1 Score: 432 bits (1110), Expect = 3.05e-146
Identity = 236/403 (58.56%), Positives = 293/403 (72.70%), Query Frame = 0

Query: 6   TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPC 65
           T  S++ +KD  SPIFLLSNICNL+S+RLDST+F+LWKFQLTAILKAHKLFGF+DG+   
Sbjct: 23  TSSSTNTKKDLHSPIFLLSNICNLVSIRLDSTDFILWKFQLTAILKAHKLFGFIDGSVSA 82

Query: 66  P----------QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTS 125
           P          ++ P+TT+++P   NP +EDWIAKDQALMT+INATLS EALAYVV S +
Sbjct: 83  PSQFLASSSETESQPTTTTSLP-VINPHFEDWIAKDQALMTLINATLSAEALAYVVRSGT 142

Query: 126 SKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFIN 185
           SKQVW+VL K YSS SR+NVVNLKSDLQ+I KK +ESIDAY+KRIKEIKDK ANVS  IN
Sbjct: 143 SKQVWEVLEKHYSSNSRTNVVNLKSDLQSIVKKTEESIDAYVKRIKEIKDKFANVSITIN 202

Query: 186 EEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPT 245
           +E LLIYALNGL  EYNT  TSMRTR+Q V+FEELHV +++EESA+ KQ K +D   QP 
Sbjct: 203 DEYLLIYALNGLSTEYNTLSTSMRTRAQSVSFEELHVFMKSEESAIEKQMKREDLVTQPN 262

Query: 246 VLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVH-----D 305
            L +SS    +    F+ N     G GKN G G+ +F       G  +           D
Sbjct: 263 ALFASSPQSQNRTSAFHPNQSHDRGRGKNNGRGKANFAPTFTNQGRGRSSGNFFTSFQAD 322

Query: 306 NHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLT---D 365
           N + CQIC + GHTALDC+NRMN++FQGRHPP QLAAMVA QNN++L++ NSS  T   D
Sbjct: 323 NRSPCQICGKLGHTALDCYNRMNFHFQGRHPPPQLAAMVAVQNNSYLAVGNSSPTTWLAD 382

Query: 366 SGCNTHITSDMNYVSLAP---EYNGEEQVGVGNGQTRPISHSG 387
           S CNTH+T+D++ +S+A    +YNGEE + VG+GQ+ PI+H G
Sbjct: 383 SECNTHMTADLSNLSIASIASDYNGEENISVGSGQSFPITHFG 424

BLAST of CsGy7G000500 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 65.9 bits (159), Expect = 8.9e-11
Identity = 67/278 (24.10%), Positives = 130/278 (46.76%), Query Frame = 0

Query: 20  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVP 79
           I+ +SNI + I + LD   +N+  W+        +  + G +DGT             +P
Sbjct: 10  IYGVSNIKSHIPVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGT------------LLP 69

Query: 80  PQTNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTSSKQVWDVLAKLYSSGSRSNVV 139
              N +  +W  +D  +   +  TL+P+      V S++S+ +W  +   + +   +  +
Sbjct: 70  TNANDV--NWQKRDGIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARAL 129

Query: 140 NLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRT 199
            L S+L+T     D  +  Y +++K++ D L NV   + + +L++Y LNGL  +++    
Sbjct: 130 RLDSELRT-KDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIIN 189

Query: 200 SMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSC--APTFNNN 259
            ++ R    +F++   +L+ EE  L +  K + ++    V  SSS ++++C  AP    N
Sbjct: 190 VIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTH----VDHSSSSTVLACSEAPPV-TN 249

Query: 260 FVRGNGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDN 292
           F R  G+   Y G GR +   + RG   S    P  ++
Sbjct: 250 FQRSGGNQMGYRGRGRGNNIFRGRGGRFSYYNMPTFNS 267

BLAST of CsGy7G000500 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 51.6 bits (122), Expect = 1.7e-06
Identity = 37/163 (22.70%), Positives = 79/163 (48.47%), Query Frame = 0

Query: 6   TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 65
           T+ S S   D  SP +L  +I      ++  +  D  N+V WK +  + L+  K FGF+D
Sbjct: 4   TIKSVSPTSDPDSPYYLPPDIHHPSDFSIQKLSKDEDNYVAWKIRFRSFLRVTKKFGFID 63

Query: 66  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 125
           GT P            P   +PLY+ W   +  +M  +  +++ + L  V+ + ++ ++W
Sbjct: 64  GTLP-----------KPDPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMW 123

Query: 126 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI 164
           + L +++       +  L+  L T+ ++  +S++ Y  ++ ++
Sbjct: 124 EDLRRVFVPCVDLKIYQLRRRLATL-RQGGDSVEEYFGKLSKV 154

The following BLAST results are available for this feature:
vs. ExPASy Swiss-Prot
Match Name  E-value  Identity (%)  Description
Q94HW2      3.4e-31  28.23         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94      8.1e-25  26.34         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

vs. NCBI nr
Match Name      E-value    Identity (%)  Description
KAE8645659.1    2.89e-295  99.76         hypothetical protein Csa_020439 [Cucumis sativus]
XP_008448007.1  2.46e-277  94.42         PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]
XP_011658579.1  2.51e-277  99.74         uncharacterized protein LOC105436058 [Cucumis sativus]
XP_016900446.1  3.72e-277  94.42         PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]
XP_008448008.1  5.46e-261  94.60         PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]

vs. ExPASy TrEMBL
Match Name  E-value    Identity (%)  Description
A0A1S3BI58  1.19e-277  94.42         uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4DWT9  1.80e-277  94.42         uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3BIR3  2.64e-261  94.60         uncharacterized protein LOC103490319 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3CLI6  6.13e-260  92.06         T4.5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold106G001120 PE=4 SV=...
A0A6J1D9L6  3.05e-146  58.56         uncharacterized protein LOC111018892 OS=Momordica charantia OX=3673 GN=LOC111018...

vs. TAIR 10
Match Name   E-value  Identity (%)  Description
AT1G34070.1  8.9e-11  24.10         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT1G21280.1  1.7e-06  22.70         CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source   Source Term     Source Description   Alignment
IPR029472  Retrotransposon Copia-like, N-terminal  PFAM     PF14244         Retrotran_gag_3      coord: 28..66; e-value: 2.9E-8; score: 33.4
None       No IPR available                        PFAM     PF14223         Retrotran_gag_2      coord: 87..222; e-value: 1.2E-24; score: 86.7
None       No IPR available                        PANTHER  PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 31..393
None       No IPR available                        PANTHER  PTHR34222       FAMILY NOT NAMED     coord: 31..393
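The alignment coordinates in the InterPro annotations can be mapped back onto the protein sequence. The sketch below assumes the `coord` values are 1-based, inclusive residue positions (a common convention that this page does not state explicitly). To stay short it carries only the first 66 residues of the protein, enough to cover the PF14244 hit; `parse_coord`, `domain_length` and `slice_domain` are hypothetical helper names.

```python
# First 66 residues of the CsGy7G000500 protein (N-terminal prefix only).
N_TERMINAL_66 = (
    "MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFV"
    "LWKFQLTAILKAHKLFGFVDGTNPCP"
)

# Coordinates copied from the annotation rows above.
HITS = {"PF14244": "28..66", "PF14223": "87..222", "PTHR34222": "31..393"}

def parse_coord(text):
    """Turn 'start..end' into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)

def domain_length(start, end):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

def slice_domain(protein, start, end):
    """Extract residues start..end (1-based, inclusive) from a protein string."""
    return protein[start - 1:end]

for accession, coord in HITS.items():
    start, end = parse_coord(coord)
    print(accession, domain_length(start, end))

print(slice_domain(N_TERMINAL_66, *parse_coord(HITS["PF14244"])))
```

Under that assumption the Retrotran_gag_3 hit covers 39 residues, the Retrotran_gag_2 hit 136 residues, and the PANTHER family match 363 residues.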

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy7G000500.2  CsGy7G000500.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005488      binding