CsaV3_7G000640.1 (mRNA) Cucumber (Chinese Long) v3

Name: CsaV3_7G000640.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: chr7 : 675454 .. 680156 (+)
Sequence length: 1233
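
The location field above can be read programmatically. The following is a minimal sketch in Python, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re

# Matches strings such as "chr7 : 675454 .. 680156 (+)"; whitespace is optional.
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr7 : 675454 .. 680156 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 4703 bp on the genome, while the reported sequence length of 1233 matches the 1233-nt coding sequence listed further down (introns and any untranslated regions are not counted in it).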
The following sequences are available for this feature:

Gene sequence (with intron)

GTTTCTCATGGTATCAGAGCCTAACATTTGTTTTTCTTCTTCACTCATCTTTTCTTTTTAAGAAGTGTTTGAACTTCTTTCTTTGTGAAACTGAAGTCTCAAGTCTCTCTTGATCGGAGAGCTTCTTTTGCCAAGTGTATCTGATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAGTCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCCTGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGTTCTGATACTTTTGAACCTTCTTCCTATTTCTCTCTATCTAATCTTGTTTTTGTTCTTAATATCATTTCTAGTTTCCTTTTTGTTCATTAACTTTGTGTTAAAAATAATTGTCTTATCGCTTTTTGTTCCGAATAATTTTCAATTCAGGACAAGTTTTCGGGAAAATTTTTGTTCCAAAGGCCTAGCATTGATGATCTGATCACTTCTAAGGCTGTGGTTGCTTCTAGTTTAGCTTCCACCAGTCTGTTGTTCTATAGTTGCTAATGTTGCTGACAAGTCTTCTTTTTCTTATATTGCTGTTCTTCATAACTTCGTTGTTTCCTTGTTGCATTTGCCGTTTACTTCAAATAAAGTGAGTGTCAATATTGTCTACATGGAAAAAGTGAGGAATCATTCTTTTACTTGATCAAACAAGGTTTCTGTATATTTCTTTTAGAGCCTTTTCATATTGATGTCTGTGATTCTCTCACTAAAATTTCTATAAATGGTTTGAGTTGTTTAAGCTACCATATTGAATCAATATTCTTTCAATCACTGATATTTTGAAAAGGTTGAAATTGATTTTCAAGAAGCTAAAAGTATTACAAATTTAGGATTCAATTCGACACAAATTTCAATCTATCTAGTAAGTTAATTAAATTTTAATAGTGTTAATGTACGAAGTGTCTATCATTGACATGAAGTTGAAGTTCATCA
ACTTATTAGACATATCCAGGAAAATTTTTAACAAAAACAAAAGTTTTTTTGAAAGTTGAGAATTTATTAAACACTTTTAAGGTTGATCCAACTTATTAAAAAATAGTTTGAGAGTTTAAATCTATTGTATGTCACTAAAATATGTCATCCTTAAAACTTTGGCTATATTCTATTAAAACTGTTTTTACCTTATATGCTTTGAACCTACAACAATATTTTCCAACCATTTCTAGAACCTGATTTCTAAAATTGTTTTCTCCAAAACAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGATGGAAAGTTAAACGATGGGATAGCCAAATAATCCACCTTCCTTGACCTAGCAAAGAATATGATATGGCTACAAGGTTAAGTAATATTTTTATGTACAGATTAAATTGTATGAAATGTGGCTATGTTATGTGGAAGCCACTATTAACTCATTGTCGAAGTGCAGTGATTGAGATTGTATATATCTGTTGGAAAGCACATTTCCACAGTTGTGCAAACTATAAATCCCACCACTTTCAATTAAATGTAGTAGAAAATAGTATCACTGCATCAGTCATTTATGCATTTCACATGGTATTGTACTTTAGTTAGTGTACAAAATTAGCTGCCACAGAATTGAATATATGGGGAGGAATCAAGAGTAAATTCTTTTGTTTAGGTTAGTGTTTTGTTGGTTCTGTTTTGGAATAAACATAATCAATATACTGTTAATCACAGTAATAAGTCTCTTAAATAGCAGTTAACTGAATCCAGAGTAATTGGTTATCCCAGTCTATTATACCTTGAAATAATTGTTATTTGACCTGCTTAGATCTTCATCTTCTGAGTAATTCCTATATAGCTACTGTAGAGAATAAATGGAAGAATAATAAAGAATTAAAATAGGAAAAGAAGAAGATTGTCCAACTAATATCATCTATTCAGTTTCGTGTTTCTGTCACGAAGAGACAGAAGTGTACAATATTGTAAAAGAATGAGTTTGTTTTCAAAAGTGTTGAAACTCTTCTCTGTCTACGTTCTACTATACTGTTCTATACCTTCCTTTGTACCGTCTATTTCTAAACTCTATCCTTATGACTCATGAGTCATTTAAACTATGAATAAGATTAACAAAAGTATCTTCATTGCAGGGAATTGTGATGCCACCCATTGCATAATCATATCCAAATTCTTCTTCAGCTTGACTCAACAAATCTTGAAACAAAGGATGGTTCAAGTAAGATAGCGAGATGACAAAACGCTTCTTCTGTTCTTCTCCGACATAGACCGTAAAGCATCCTTTCGGAACATCAAGAGACTTTGGAGTGGCTCTATTTCCTGACGATGTGGATCGTCGAAGACTTGGCTTAGCAAGAACAATGCTAGGCAAACGAAACCCCATGGTATAGTATTTTTCTTTTCAAGGAATTAGTGGAACTTTAACTTGAAGGATTGTGAAAGAAGATTTCTTGTGATGTGGGAATGTGGAAGGCTTGCTGTTTGTGAACCCAATTACATGATCCTTTATATAAATGGAGGGTGAGATTTATTGGAGATAGAAATATTGACAACGCAAGAAGGTAGCATTAAATGCTAGAAAGGAACCAGCTCAAAAACAATCACATGGATTTGTCTTTGAAGCCATAATGAGAACTTCAAGTTCATGAGTTAACGTGGGGCCATTTTGGATCCTTGCGTAGAAACATCATAGGTGGCCTACTTTTGTTTATGGACAATATTGAGAAGAGAATTCAATACAACTAAGCAAGTTCCAGTAGATTCATAGACATATTTGAGGCACTGAAGTTTTTGTTGTCCTCTTGGTGAGATAACAATTGACAGACTATATGTTGACAGAAAATTTAACTGCATCATATAGCCATGTCTGTCACAATTTTTCGTGGTATAAGAATTTTAACTAAAGTGGGTTTTTTGTCGGTTGTAAAGCTCGGAAAATTCATA
TGTACGGGTACCGTAGTACAACAGAAACTTGTAAGTGGAAGAAAATACTTCAAGTACAAGATGACAAAGAAAAAGAAACCCAATTGTTAGTTTAGCATTCATTATGAAGAGAACATTTCTTTTTCTGAGCAAGAAACGTAAATATTCAATGATGTAACGAAGAGAATAATACTGATAACAAACGCAATAACAATAAGACACTATAAATAATAGGTTAAAAAAAAAATTCTAAGAAGAAAGATCTATGAAAGTACCCCAATTAATATAAATATTCTGACGGGAAAATCTCATGATCAACTTGGATAAAGAACACCATGATGAAGCCTTCAAAGGAGCTGATTTGAAATGATCGACCTAAGGAAGAAGCTTGCCATGGAAAATTCTTAATTGAACCAGTAGCTCGATCTTGGCAACCAAGTTGTCCCACTGCTATTTATTGTTTACTATTTTTTTTTAATAGAATCAACAACTTTTGTTAAGAAAGAACGAAAGAATTCTGGGGGGCTAGTGGCATTCTTAGGAAGAAAAAAAATTCCCTCTTGAATATTGTACCAACTCAAAGACATGTTTACCTGTTCCTCAAAGGCATCTTTCCTTTTGACATCTTCATTTGTGTTTTTTATACCTTTCAATTCATCTTATTCAGTTTCAGTTCACAACAAATGCAAATCATCATTTATTGGTTACACTAATTATTATTTTG

mRNA sequence

ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAGTCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCCTGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGA

Coding sequence (CDS)

ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCTCGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAGTCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCTCTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCCTGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCGAGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGGCATTTCAAACTCGAGTTGGCCTCTTGCTCAGCAAGTGTCGCACAATCGTTTTGTGCCACTAACCATGCATGA

Protein sequence

MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA
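
The protein sequence follows from the CDS by ordinary codon translation. The sketch below is an illustration only, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical and are not taken from the database. It translates the first ten codons and the last four codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTTCCTCAACTACCTTGCCTTCTTCT"))  # first 10 codons of the CDS
print(translate("AACCATGCATGA"))                    # last 4 codons, including the stop
```

The 1233-nt CDS corresponds to 411 codons, that is 410 residues plus the terminal stop codon (TGA), which agrees with the 410-residue alignments reported in the BLAST results.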
BLAST of CsaV3_7G000640.1 vs. NCBI nr
Match: XP_011658578.1 (PREDICTED: uncharacterized protein LOC105436058 isoform X1 [Cucumis sativus])

HSP 1 Score: 811.6 bits (2095), Expect = 1.2e-231
Identity = 410/410 (100.00%), Positives = 410/410 (100.00%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60

Query: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120
           GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW
Sbjct: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120

Query: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180
           DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL
Sbjct: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180

Query: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240
           IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS
Sbjct: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240

Query: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300
           SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Sbjct: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300

Query: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360
           RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY
Sbjct: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360

Query: 361 VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 411
           VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA
Sbjct: 361 VSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 410

BLAST of CsaV3_7G000640.1 vs. NCBI nr
Match: XP_016900446.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo])

HSP 1 Score: 766.5 bits (1978), Expect = 4.5e-218
Identity = 390/412 (94.66%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 411
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412
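
The percentages in each HSP header are simply the identical (or positive-scoring) positions divided by the alignment length. As a hedged illustration, the Python sketch below re-derives them from a header line; the function name `hsp_percentages` and the regular expression are assumptions made for this example and are not part of any BLAST tool.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some report dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity_pct, positives_pct, consistent) for one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    # Compare with the percentages printed in the report, allowing for rounding.
    consistent = (
        abs(identity - float(ident_pct)) < 0.005
        and abs(positives - float(pos_pct)) < 0.005
    )
    return identity, positives, consistent


header = "Identity = 390/412 (94.66%), Positives = 399/412 (96.84%), Query Frame = 0"
print(hsp_percentages(header))
```

For this hit, 390 identical positions over 412 aligned columns give 94.66%, and 399 positive positions give 96.84%; the 412 columns include the two gap positions in the query.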

BLAST of CsaV3_7G000640.1 vs. NCBI nr
Match: XP_008448007.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo])

HSP 1 Score: 766.5 bits (1978), Expect = 4.5e-218
Identity = 390/412 (94.66%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 411
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsaV3_7G000640.1 vs. NCBI nr
Match: XP_011658579.1 (PREDICTED: uncharacterized protein LOC105436058 isoform X2 [Cucumis sativus])

HSP 1 Score: 765.8 bits (1976), Expect = 7.6e-218
Identity = 387/387 (100.00%), Positives = 387/387 (100.00%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60

Query: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120
           GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW
Sbjct: 61  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 120

Query: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180
           DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL
Sbjct: 121 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLL 180

Query: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240
           IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS
Sbjct: 181 IYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSS 240

Query: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300
           SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Sbjct: 241 SQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 300

Query: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360
           RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY
Sbjct: 301 RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNY 360

Query: 361 VSLAPEYNGEEQVGVGNGQTRPISHSG 388
           VSLAPEYNGEEQVGVGNGQTRPISHSG
Sbjct: 361 VSLAPEYNGEEQVGVGNGQTRPISHSG 387

BLAST of CsaV3_7G000640.1 vs. NCBI nr
Match: XP_008448008.1 (PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo])

HSP 1 Score: 724.5 bits (1869), Expect = 2.0e-205
Identity = 369/389 (94.86%), Positives = 378/389 (97.17%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSG 388
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSG 389

BLAST of CsaV3_7G000640.1 vs. TAIR10
Match: AT1G34070.1 (Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 66.6 bits (161), Expect = 4.0e-11
Identity = 68/278 (24.46%), Positives = 130/278 (46.76%), Query Frame = 0

Query: 20  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVP 79
           I+ +SNI + I + LD   +N+  W+        +  + G +DGT             +P
Sbjct: 10  IYGVSNIKSHIPVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGT------------LLP 69

Query: 80  PQTNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTSSKQVWDVLAKLYSSGSRSNVV 139
              N +  +W  +D  +   +  TL+P+      V S++S+ +W  +   + +   +  +
Sbjct: 70  TNANDV--NWQKRDGIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARAL 129

Query: 140 NLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRT 199
            L S+L+T     D  +  Y +++K++ D L NV   + + +L++Y LNGL  +++    
Sbjct: 130 RLDSELRT-KDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIIN 189

Query: 200 SMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSC--APTFNNN 259
            ++ R    +F++   +L+ EE  L +  K + ++    V  SSS ++L+C  AP    N
Sbjct: 190 VIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTH----VDHSSSSTVLACSEAPPV-TN 249

Query: 260 FVRGNGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDN 292
           F R  G+   Y G GR +   + RG   S    P  ++
Sbjct: 250 FQRSGGNQMGYRGRGRGNNIFRGRGGRFSYYNMPTFNS 267

BLAST of CsaV3_7G000640.1 vs. TAIR10
Match: AT1G21280.1 (Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 51.6 bits (122), Expect = 1.3e-06
Identity = 37/163 (22.70%), Positives = 79/163 (48.47%), Query Frame = 0

Query: 6   TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 65
           T+ S S   D  SP +L  +I      ++  +  D  N+V WK +  + L+  K FGF+D
Sbjct: 4   TIKSVSPTSDPDSPYYLPPDIHHPSDFSIQKLSKDEDNYVAWKIRFRSFLRVTKKFGFID 63

Query: 66  GTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVW 125
           GT P            P   +PLY+ W   +  +M  +  +++ + L  V+ + ++ ++W
Sbjct: 64  GTLP-----------KPDPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMW 123

Query: 126 DVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI 164
           + L +++       +  L+  L T+ ++  +S++ Y  ++ ++
Sbjct: 124 EDLRRVFVPCVDLKIYQLRRRLATL-RQGGDSVEEYFGKLSKV 154

BLAST of CsaV3_7G000640.1 vs. Swiss-Prot
Match: sp|Q94HW2|POLR1_ARATH (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 1.0e-24
Identity = 97/367 (26.43%), Positives = 158/367 (43.05%), Query Frame = 0

Query: 33  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKD 92
           +L STN+++W  Q+ A+   ++L GF+DG+   P   P+T  T   P+ NP Y  W  +D
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP---PATIGTDAAPRVNPDYTRWKRQD 84

Query: 93  QALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDE 152
           + + + +   +S      V  +T++ Q+W+ L K+Y++ S  +V  L++ L+  + K  +
Sbjct: 85  KLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQ-WTKGTK 144

Query: 153 SIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELH 212
           +ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +  P T  E+H
Sbjct: 145 TIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIH 204

Query: 213 VLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFS 272
             L   ES +   S    +    T    S                               
Sbjct: 205 ERLLNHESKILAVSSA--TVIPITANAVSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264

Query: 273 FDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNY--NFQGRHPPQQLAAMV 332
                         KP       CQIC  +GH+A  C    ++  +   + PP       
Sbjct: 265 XXXXXXXXXXXXXSKPY---LGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQ 324

Query: 333 ASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGH 392
              N A  S  +S++ L DSG   HITSD N +SL   Y G + V V +G T PISH+G 
Sbjct: 325 PRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGS 382

Query: 393 FKLELAS 396
             L   S
Sbjct: 385 TSLSTKS 382

BLAST of CsaV3_7G000640.1 vs. Swiss-Prot
Match: sp|Q9ZT94|POLR2_ARATH (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 2.9e-19
Identity = 93/371 (25.07%), Positives = 154/371 (41.51%), Query Frame = 0

Query: 33  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKD 92
           +L STN+++W  Q+ A+   ++L GF+DG+ P P   P+T  T   P+ NP Y  W  +D
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMP---PATIGTDAVPRVNPDYTRWRRQD 84

Query: 93  QALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDE 152
           + + + I   +S      V  +T++ Q+W+ L K+Y++ S  +V  L+            
Sbjct: 85  KLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR------------ 144

Query: 153 SIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELH 212
               +I R     D+LA +   ++ ++ +   L  LP++Y      +  +  P +  E+H
Sbjct: 145 ----FITRF----DQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIH 204

Query: 213 VLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFS 272
             L   ES L        + N   V+  ++  +                           
Sbjct: 205 ERLINRESKLL-------ALNSAEVVPITANVVTHRXXXXXXXXXXXXXXXXXXXXXXXX 264

Query: 273 FDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA- 332
                         +        CQICS +GH+A  C     + FQ     QQ  +    
Sbjct: 265 XXXXXXXXXXXXXXRQPKPYLGRCQICSVQGHSAKRC--PQLHQFQSTTNQQQSTSPFTP 324

Query: 333 ---SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHS 392
                N A  S  N+++ L DSG   HITSD N +S    Y G + V + +G T PI+H+
Sbjct: 325 WQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHT 363

Query: 393 GHFKLELASCS 398
           G   L  +S S
Sbjct: 385 GSASLPTSSRS 363

BLAST of CsaV3_7G000640.1 vs. TrEMBL
Match: tr|A0A1S3BI58|A0A1S3BI58_CUCME (uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 3.0e-218
Identity = 390/412 (94.66%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 411
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsaV3_7G000640.1 vs. TrEMBL
Match: tr|A0A1S4DWT9|A0A1S4DWT9_CUCME (uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 3.0e-218
Identity = 390/412 (94.66%), Positives = 399/412 (96.84%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELASCSASVAQSFCATNHA 411
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSGHFK ELASCSASVAQ FCATNHA
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSGHFKFELASCSASVAQLFCATNHA 412

BLAST of CsaV3_7G000640.1 vs. TrEMBL
Match: tr|A0A1S3BIR3|A0A1S3BIR3_CUCME (uncharacterized protein LOC103490319 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 1.3e-205
Identity = 369/389 (94.86%), Positives = 378/389 (97.17%), Query Frame = 0

Query: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVD 60
           MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+D
Sbjct: 1   MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFID 60

Query: 61  GTNPCP--QTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120
           GTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ
Sbjct: 61  GTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQ 120

Query: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180
           VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED
Sbjct: 121 VWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEED 180

Query: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLL 240
           LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLL
Sbjct: 181 LLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLL 240

Query: 241 SSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC 300
           SSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Sbjct: 241 SSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC 300

Query: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDM 360
           SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDM
Sbjct: 301 SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDM 360

Query: 361 NYVSLAPEYNGEEQVGVGNGQTRPISHSG 388
           NYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Sbjct: 361 NYVSLAPEYNGEEQVGIGNGQTRPMSHSG 389

BLAST of CsaV3_7G000640.1 vs. TrEMBL
Match: tr|A0A2N9HMR4|A0A2N9HMR4_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS43338 PE=4 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 3.0e-69
Identity = 160/398 (40.20%), Positives = 248/398 (62.31%), Query Frame = 0

Query: 9   SSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDG-TNPCPQ 68
           SS+    + SPI LLSNI NLIS +LDSTN+ LWK+QL +I + + L   +DG T P  +
Sbjct: 19  SSNISNITQSPILLLSNISNLISAKLDSTNYTLWKYQLLSIFECYSLLDHIDGSTQPPER 78

Query: 69  TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLY 128
                     PQ +  Y+ W  +DQAL T++NATLSP AL+ V+  ++++ VW+VL + Y
Sbjct: 79  YLQDANGQFTPQESIQYKQWKIRDQALKTLLNATLSPPALSLVIRQSTARGVWEVLERRY 138

Query: 129 SSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGL 188
           +S SR++V++LK +L  I KK +ES+  ++ R+KE++DKL+ V   I++E+LL   + GL
Sbjct: 139 TSLSRTHVLSLKGELDRIQKK-NESMSVFLDRVKELRDKLSTVGVEIDDEELLHVVMKGL 198

Query: 189 PNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLS-----SSQ 248
           P EY+ F ++MRT+ + ++ EELHV+L +EE +  K S+   S N P + ++     SS 
Sbjct: 199 PPEYDAFCSAMRTKDRSISCEELHVMLTSEEES-KKNSRGMSSDNIPHMAMAATADGSSP 258

Query: 249 SLLSCAPTFNNNFVRG-NGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 308
              +  P F+  + RG  G  +NY G GR ++ +   G   + +Q        TCQICS+
Sbjct: 259 VTNTPLPLFSPQWNRGRGGRSQNYRGRGRGNYGSSRGGFQQNMQQNSQAQTRPTCQICSK 318

Query: 309 RGHTALDCFNRMNYNFQGRHPPQQLAAMVA-SQNNAFLSIVNSSS--LTDSGCNTHITSD 368
            GH ALDCF+RMN+ +QGRHPP +LAA+ + + +NA  +  ++ S  ++D+G   H T D
Sbjct: 319 PGHVALDCFHRMNFAYQGRHPPAKLAAIASTNMSNAIGAPASNQSCWISDTGATDHFTPD 378

Query: 369 MNYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELAS 396
           + ++     Y G + V VGNGQ+ PI+HSG+ +L  +S
Sbjct: 379 ITHIPDCHAYTGNDFVTVGNGQSLPITHSGNSQLHASS 414

BLAST of CsaV3_7G000640.1 vs. TrEMBL
Match: tr|A0A2N9ER29|A0A2N9ER29_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4906 PE=4 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 3.0e-69
Identity = 160/398 (40.20%), Positives = 248/398 (62.31%), Query Frame = 0

Query: 9   SSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDG-TNPCPQ 68
           SS+    + SPI LLSNI NLIS +LDSTN+ LWK+QL +I + + L   +DG T P  +
Sbjct: 19  SSNISNITQSPILLLSNISNLISAKLDSTNYTLWKYQLLSIFECYSLLDHIDGSTQPPER 78

Query: 69  TSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLY 128
                     PQ +  Y+ W  +DQAL T++NATLSP AL+ V+  ++++ VW+VL + Y
Sbjct: 79  YLQDANGQFTPQESIQYKQWKIRDQALKTLLNATLSPPALSLVIRQSTARGVWEVLERRY 138

Query: 129 SSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGL 188
           +S SR++V++LK +L  I KK +ES+  ++ R+KE++DKL+ V   I++E+LL   + GL
Sbjct: 139 TSLSRTHVLSLKGELDRIQKK-NESMSVFLDRVKELRDKLSTVGVEIDDEELLHVVMKGL 198

Query: 189 PNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLS-----SSQ 248
           P EY+ F ++MRT+ + ++ EELHV+L +EE +  K S+   S N P + ++     SS 
Sbjct: 199 PPEYDAFCSAMRTKDRSISCEELHVMLTSEEES-KKNSRGMSSDNIPHMAMAATADGSSP 258

Query: 249 SLLSCAPTFNNNFVRG-NGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR 308
              +  P F+  + RG  G  +NY G GR ++ +   G   + +Q        TCQICS+
Sbjct: 259 VTNTPLPLFSPQWNRGRGGRSQNYRGRGRGNYGSSRGGFQQNMQQNSQAQTRPTCQICSK 318

Query: 309 RGHTALDCFNRMNYNFQGRHPPQQLAAMVA-SQNNAFLSIVNSSS--LTDSGCNTHITSD 368
            GH ALDCF+RMN+ +QGRHPP +LAA+ + + +NA  +  ++ S  ++D+G   H T D
Sbjct: 319 PGHVALDCFHRMNFAYQGRHPPAKLAAIASTNMSNAIGAPASNQSCWISDTGATDHFTPD 378

Query: 369 MNYVSLAPEYNGEEQVGVGNGQTRPISHSGHFKLELAS 396
           + ++     Y G + V VGNGQ+ PI+HSG+ +L  +S
Sbjct: 379 ITHIPDCHAYTGNDFVTVGNGQSLPITHSGNSQLHASS 414

The following BLAST results are available for this feature:

Match Name      E-value   Identity (%)  Description
XP_011658578.1  1.2e-231  100.00        PREDICTED: uncharacterized protein LOC105436058 isoform X1 [Cucumis sativus]
XP_016900446.1  4.5e-218  94.66         PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]
XP_008448007.1  4.5e-218  94.66         PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]
XP_011658579.1  7.6e-218  100.00        PREDICTED: uncharacterized protein LOC105436058 isoform X2 [Cucumis sativus]
XP_008448008.1  2.0e-205  94.86         PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]

Match Name   E-value  Identity (%)  Description
AT1G34070.1  4.0e-11  24.46         Retrotransposon gag protein (InterPro:IPR005162)
AT1G21280.1  1.3e-06  22.70         Retrotransposon gag protein (InterPro:IPR005162)

Match Name             E-value  Identity (%)  Description
sp|Q94HW2|POLR1_ARATH  1.0e-24  26.43         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
sp|Q9ZT94|POLR2_ARATH  2.9e-19  25.07         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

Match Name                      E-value   Identity (%)  Description
tr|A0A1S3BI58|A0A1S3BI58_CUCME  3.0e-218  94.66         uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
tr|A0A1S4DWT9|A0A1S4DWT9_CUCME  3.0e-218  94.66         uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
tr|A0A1S3BIR3|A0A1S3BIR3_CUCME  1.3e-205  94.86         uncharacterized protein LOC103490319 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
tr|A0A2N9HMR4|A0A2N9HMR4_FAGSY  3.0e-69   40.20         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS43338 PE=4 SV=1
tr|A0A2N9ER29|A0A2N9ER29_FAGSY  3.0e-69   40.20         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4906 PE=4 SV=1

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR029472  UBN2_3
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CsaV3_7G000640  CsaV3_7G000640  gene


The following exon feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CsaV3_7G000640.1.exon1  CsaV3_7G000640.1.exon1  exon
CsaV3_7G000640.1.exon2  CsaV3_7G000640.1.exon2  exon
CsaV3_7G000640.1.exon3  CsaV3_7G000640.1.exon3  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CsaV3_7G000640.1.cds1  CsaV3_7G000640.1.cds1  CDS
CsaV3_7G000640.1.cds2  CsaV3_7G000640.1.cds2  CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CsaV3_7G000640.1  CsaV3_7G000640.1-protein  polypeptide


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                    Source   Source Term      Source Description               Alignment
None       No IPR available                   PFAM     PF14223          Retrotran_gag_2                  coord: 87..221; e-value: 2.1E-25; score: 88.9
None       No IPR available                   PANTHER  PTHR11439:SF232  SUBFAMILY NOT NAMED              coord: 5..345
None       No IPR available                   PANTHER  PTHR11439        GAG-POL-RELATED RETROTRANSPOSON  coord: 5..345
IPR029472  Gag-polypeptide of LTR copia-type  PFAM     PF14244          Retrotran_gag_3                  coord: 28..66; e-value: 1.1E-8; score: 34.7