PI0021918 (gene) Melon (PI 482460) v1

Overview
Name: PI0021918
Type: gene
Organism: Cucumis metuliferus (Melon (PI 482460) v1)
Description: Reverse transcriptase
Location: chr04: 21623847 .. 21624293 (-)
RNA-Seq Expression: PI0021918
Synteny: PI0021918
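
As a quick consistency check on the Location field, the span length can be derived from the coordinates. The sketch below is illustrative only: the variable names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the gene is assumed to be a single exon whose span equals the CDS (the gene, mRNA, and CDS sequences on this page are identical, which is consistent with that).

```python
# Coordinates copied from the Location field (assumed 1-based, inclusive).
start, end, strand = 21623847, 21624293, "-"

span_nt = end - start + 1   # inclusive interval length in nucleotides
codons = span_nt // 3       # complete codons, including the stop codon
protein_aa = codons - 1     # residues once the stop codon is dropped

print(span_nt, codons, protein_aa)  # 447 149 148
```

The strand only determines which genomic strand the sequence is read from; it does not affect the length arithmetic.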
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTTCATATGTCAAATTTCTGAAAGATATTTTGGCAAAGAAGCGAAGGATGAATGACTGCGAAACAATGGTTCTAACGCAAGTAACGAGCGACGTTTTCAAGAATGAGGTACCTGAGAAGATGACAGACCCTGGCAGCTTCACGGTACCATGCTCAATAGGCGGCATGGATCTAGGTCGCGCACTATGCGATTTGGGAGCCAGTATCAACCTGATGCCCCTCTCCATCTTCAAGAAATTGGAGCTAGGGGAGGCGCATCCAACACTCATGAGGCTCCAATTTGCAGATAGATCCATCACCAAGCCTGAGGGGAGAATTGAAGACGTTTTAGTCAAGGTGGATAAATTCTTGTTCCCCGTCGACTTTGTCATTCTGAATTACGAAGCGGATCAGGAGGTACCAATTATCTTGGGGCAGTCATTCTGTCAACTGGTCGCGTCCTAA

mRNA sequence

ATGCCTTCATATGTCAAATTTCTGAAAGATATTTTGGCAAAGAAGCGAAGGATGAATGACTGCGAAACAATGGTTCTAACGCAAGTAACGAGCGACGTTTTCAAGAATGAGGTACCTGAGAAGATGACAGACCCTGGCAGCTTCACGGTACCATGCTCAATAGGCGGCATGGATCTAGGTCGCGCACTATGCGATTTGGGAGCCAGTATCAACCTGATGCCCCTCTCCATCTTCAAGAAATTGGAGCTAGGGGAGGCGCATCCAACACTCATGAGGCTCCAATTTGCAGATAGATCCATCACCAAGCCTGAGGGGAGAATTGAAGACGTTTTAGTCAAGGTGGATAAATTCTTGTTCCCCGTCGACTTTGTCATTCTGAATTACGAAGCGGATCAGGAGGTACCAATTATCTTGGGGCAGTCATTCTGTCAACTGGTCGCGTCCTAA

Coding sequence (CDS)

ATGCCTTCATATGTCAAATTTCTGAAAGATATTTTGGCAAAGAAGCGAAGGATGAATGACTGCGAAACAATGGTTCTAACGCAAGTAACGAGCGACGTTTTCAAGAATGAGGTACCTGAGAAGATGACAGACCCTGGCAGCTTCACGGTACCATGCTCAATAGGCGGCATGGATCTAGGTCGCGCACTATGCGATTTGGGAGCCAGTATCAACCTGATGCCCCTCTCCATCTTCAAGAAATTGGAGCTAGGGGAGGCGCATCCAACACTCATGAGGCTCCAATTTGCAGATAGATCCATCACCAAGCCTGAGGGGAGAATTGAAGACGTTTTAGTCAAGGTGGATAAATTCTTGTTCCCCGTCGACTTTGTCATTCTGAATTACGAAGCGGATCAGGAGGTACCAATTATCTTGGGGCAGTCATTCTGTCAACTGGTCGCGTCCTAA

Protein sequence

MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLGRALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFPVDFVILNYEADQEVPIILGQSFCQLVAS
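
The protein sequence can be reproduced from the CDS by frame-0 translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and the variable names are hypothetical and not part of the database's tooling. Only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed on this page.
cds_start = "ATGCCTTCATATGTCAAATTTCTGAAAGAT"
cds_end = "GTCGCGTCCTAA"
print(translate(cds_start))  # MPSYVKFLKD
print(translate(cds_end))    # VAS
```

Passing the full 447 nt CDS to the same function should give the 148-residue protein shown above.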
Homology
BLAST of PI0021918 vs. ExPASy TrEMBL
Match: A0A6J1DZC3 (uncharacterized protein LOC111024449 OS=Momordica charantia OX=3673 GN=LOC111024449 PE=4 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 3.2e-51
Identity = 98/142 (69.01%), Positives = 124/142 (87.32%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+Y KFLKDI+ +K+++ + ET+ LT+ +S+VFK+++  K+ DPGSFT+PCSIGG D+G
Sbjct: 367 MPTYAKFLKDIITRKKKLGEYETVALTECSSNVFKSKMSPKLKDPGSFTIPCSIGGKDVG 426

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDL ASINLMPLSIFKKLE+G+A PT + LQ ADRSITKPEG+IEDVLVKVDKF+FP
Sbjct: 427 RALCDLRASINLMPLSIFKKLEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFP 486

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+ILN EAD++VPIILG+ F
Sbjct: 487 ADFIILNCEADKDVPIILGRPF 508
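
The Identity and Positives figures in an HSP header can be recomputed from the alignment midline. The sketch below assumes the usual BLAST pairwise text convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and an ungapped alignment; `hsp_stats` is a hypothetical helper name.

```python
def hsp_stats(midline: str) -> dict:
    """Recompute identity and positive counts from a BLAST alignment midline.

    A letter on the midline marks an identical residue, '+' marks a
    conservative substitution, and a space marks a mismatch.
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


# First ten aligned columns of the HSP above (MPSYVKFLKD vs. MPTYAKFLKD).
print(hsp_stats("MP+Y KFLKD"))

# The header percentages follow directly from the reported counts.
print(round(100 * 98 / 142, 2), round(100 * 124 / 142, 2))  # 69.01 87.32
```

When feeding in lines copied from the page, strip the `Query:`/`Sbjct:` labels and coordinates first so that only the aligned columns are counted.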

BLAST of PI0021918 vs. ExPASy TrEMBL
Match: A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 4.2e-51
Identity = 96/142 (67.61%), Positives = 124/142 (87.32%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+Y KF+KDI+ +K+++ + ET+ LT+ +S+VFK+++P K+ DPGSFT+PC IGG D+G
Sbjct: 571 MPTYAKFIKDIITRKKKLGEYETVALTECSSNVFKSKMPPKLKDPGSFTIPCLIGGKDVG 630

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGASINLMPLSIFKK E+G+A PT + LQ ADRSITKPEG+IEDVLVKVDKF+FP
Sbjct: 631 RALCDLGASINLMPLSIFKKFEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFP 690

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+ EAD++VPIILG+ F
Sbjct: 691 TDFIILDCEADKDVPIILGRPF 712

BLAST of PI0021918 vs. ExPASy TrEMBL
Match: A0A5D3BDT2 (RT_RNaseH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00330 PE=4 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.6e-50
Identity = 105/142 (73.94%), Positives = 117/142 (82.39%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MPSY KFL DILAKKRR+ND +TM L Q TSD++KN V EKMT PGSF VPCSI GMDLG
Sbjct: 1   MPSYAKFLTDILAKKRRINDFQTMALRQATSDIYKNGVLEKMTYPGSFKVPCSIDGMDLG 60

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
             LCDLGASINLMPLSIFKKL++ E  PT MR Q ADRSIT P+G+IEDVLVKVDKFLF 
Sbjct: 61  HVLCDLGASINLMPLSIFKKLKIKEVQPTHMRFQLADRSITNPKGKIEDVLVKVDKFLFF 120

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+YEAD+EV IIL + F
Sbjct: 121 ADFIILDYEADEEVLIILRRLF 142

BLAST of PI0021918 vs. ExPASy TrEMBL
Match: A0A1S3BHX3 (uncharacterized protein LOC103490055 OS=Cucumis melo OX=3656 GN=LOC103490055 PE=4 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.6e-50
Identity = 105/142 (73.94%), Positives = 117/142 (82.39%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MPSY KFL DILAKKRR+ND +TM L Q TSD++KN V EKMT PGSF VPCSI GMDLG
Sbjct: 1   MPSYAKFLTDILAKKRRINDFQTMALRQATSDIYKNGVLEKMTYPGSFKVPCSIDGMDLG 60

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
             LCDLGASINLMPLSIFKKL++ E  PT MR Q ADRSIT P+G+IEDVLVKVDKFLF 
Sbjct: 61  HVLCDLGASINLMPLSIFKKLKIKEVQPTHMRFQLADRSITNPKGKIEDVLVKVDKFLFF 120

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+YEAD+EV IIL + F
Sbjct: 121 ADFIILDYEADEEVLIILRRLF 142

BLAST of PI0021918 vs. ExPASy TrEMBL
Match: A0A6J1CPJ3 (uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012947 PE=4 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 2.3e-49
Identity = 95/142 (66.90%), Positives = 121/142 (85.21%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+Y KFLKDI+ +K+++ + ET+ LT+ +S+VFK++ P K+ DPGSFT+ C IGG D+G
Sbjct: 569 MPTYAKFLKDIITRKKKLGEYETVALTECSSNVFKSKXPPKLKDPGSFTIXCLIGGKDVG 628

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGA INLMPLSIFKKLE+G+A PT + L  ADRSITKPEG+IEDVLVKVDKF+FP
Sbjct: 629 RALCDLGAXINLMPLSIFKKLEIGKAXPTTVTLXLADRSITKPEGKIEDVLVKVDKFIFP 688

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+ EAD++VPIILG+ F
Sbjct: 689 ADFIILDCEADKDVPIILGRPF 710

BLAST of PI0021918 vs. NCBI nr
Match: XP_022157836.1 (uncharacterized protein LOC111024449 [Momordica charantia])

HSP 1 Score: 211.1 bits (536), Expect = 6.6e-51
Identity = 98/142 (69.01%), Positives = 124/142 (87.32%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+Y KFLKDI+ +K+++ + ET+ LT+ +S+VFK+++  K+ DPGSFT+PCSIGG D+G
Sbjct: 367 MPTYAKFLKDIITRKKKLGEYETVALTECSSNVFKSKMSPKLKDPGSFTIPCSIGGKDVG 426

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDL ASINLMPLSIFKKLE+G+A PT + LQ ADRSITKPEG+IEDVLVKVDKF+FP
Sbjct: 427 RALCDLRASINLMPLSIFKKLEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFP 486

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+ILN EAD++VPIILG+ F
Sbjct: 487 ADFIILNCEADKDVPIILGRPF 508

BLAST of PI0021918 vs. NCBI nr
Match: XP_030509265.1 (uncharacterized protein LOC115723943 [Cannabis sativa])

HSP 1 Score: 210.7 bits (535), Expect = 8.7e-51
Identity = 98/142 (69.01%), Positives = 122/142 (85.92%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+YVKFLKDIL KKRR+ + ET+ LT+  S + K+++P K+ DPGSFT+PCSIGG D+G
Sbjct: 377 MPNYVKFLKDILTKKRRLGEFETVALTEGCSAMLKSKIPPKLKDPGSFTIPCSIGGRDVG 436

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGASINLMP+SIFKKL +GEA PT + LQ ADRS+  PEG+IEDVLV+VDKF+FP
Sbjct: 437 RALCDLGASINLMPMSIFKKLGIGEARPTTVTLQLADRSMAHPEGKIEDVLVQVDKFIFP 496

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+YEAD++VPIILG+ F
Sbjct: 497 ADFIILDYEADRDVPIILGRPF 518

BLAST of PI0021918 vs. NCBI nr
Match: XP_022159235.1 (uncharacterized protein LOC111025653 [Momordica charantia])

HSP 1 Score: 210.7 bits (535), Expect = 8.7e-51
Identity = 96/142 (67.61%), Positives = 124/142 (87.32%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+Y KF+KDI+ +K+++ + ET+ LT+ +S+VFK+++P K+ DPGSFT+PC IGG D+G
Sbjct: 571 MPTYAKFIKDIITRKKKLGEYETVALTECSSNVFKSKMPPKLKDPGSFTIPCLIGGKDVG 630

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGASINLMPLSIFKK E+G+A PT + LQ ADRSITKPEG+IEDVLVKVDKF+FP
Sbjct: 631 RALCDLGASINLMPLSIFKKFEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFP 690

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+ EAD++VPIILG+ F
Sbjct: 691 TDFIILDCEADKDVPIILGRPF 712

BLAST of PI0021918 vs. NCBI nr
Match: XP_030477929.1 (uncharacterized protein LOC115694963 [Cannabis sativa])

HSP 1 Score: 210.7 bits (535), Expect = 8.7e-51
Identity = 98/142 (69.01%), Positives = 122/142 (85.92%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+YVKFLKDIL KKRR+ + ET+ LT+  S + K+++P K+ DPGSFT+PCSIGG D+G
Sbjct: 29  MPNYVKFLKDILTKKRRLGEFETVALTEGCSAMLKSKIPPKLKDPGSFTIPCSIGGRDVG 88

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGASINLMP+SIFKKL +GEA PT + LQ ADRS+  PEG+IEDVLV+VDKF+FP
Sbjct: 89  RALCDLGASINLMPMSIFKKLGIGEARPTTVTLQLADRSMAHPEGKIEDVLVQVDKFIFP 148

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+YEAD++VPIILG+ F
Sbjct: 149 ADFIILDYEADRDVPIILGRPF 170

BLAST of PI0021918 vs. NCBI nr
Match: XP_030478287.1 (uncharacterized protein LOC115695357 [Cannabis sativa])

HSP 1 Score: 210.7 bits (535), Expect = 8.7e-51
Identity = 98/142 (69.01%), Positives = 122/142 (85.92%), Query Frame = 0

Query: 1   MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG 60
           MP+YVKFLKDIL KKRR+ + ET+ LT+  S + K+++P K+ DPGSFT+PCSIGG D+G
Sbjct: 50  MPNYVKFLKDILTKKRRLGEFETVALTEGCSAMLKSKIPPKLKDPGSFTIPCSIGGRDVG 109

Query: 61  RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP 120
           RALCDLGASINLMP+SIFKKL +GEA PT + LQ ADRS+  PEG+IEDVLV+VDKF+FP
Sbjct: 110 RALCDLGASINLMPMSIFKKLGIGEARPTTVTLQLADRSMAHPEGKIEDVLVQVDKFIFP 169

Query: 121 VDFVILNYEADQEVPIILGQSF 142
            DF+IL+YEAD++VPIILG+ F
Sbjct: 170 ADFIILDYEADRDVPIILGRPF 191

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1DZC3      3.2e-51   69.01         uncharacterized protein LOC111024449 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1DY39      4.2e-51   67.61         uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A5D3BDT2      1.6e-50   73.94         RT_RNaseH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3BHX3      1.6e-50   73.94         uncharacterized protein LOC103490055 OS=Cucumis melo OX=3656 GN=LOC103490055 PE=...
A0A6J1CPJ3      2.3e-49   66.90         uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022157836.1  6.6e-51   69.01         uncharacterized protein LOC111024449 [Momordica charantia]
XP_030509265.1  8.7e-51   69.01         uncharacterized protein LOC115723943 [Cannabis sativa]
XP_022159235.1  8.7e-51   67.61         uncharacterized protein LOC111025653 [Momordica charantia]
XP_030477929.1  8.7e-51   69.01         uncharacterized protein LOC115694963 [Cannabis sativa]
XP_030478287.1  8.7e-51   69.01         uncharacterized protein LOC115695357 [Cannabis sativa]

InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25

IPR Term   IPR Description                        Source   Source Term     Source Description   Alignment
None       No IPR available                       PFAM     PF13650         Asp_protease_2       coord: 50..142; e-value: 9.5E-6; score: 26.2
None       No IPR available                       PANTHER  PTHR33067:SF23  SUBFAMILY NOT NAMED  coord: 3..142
None       No IPR available                       PANTHER  PTHR33067       FAMILY NOT NAMED     coord: 3..142
None       No IPR available                       CDD      cd00303         retropepsin_like     coord: 50..144; e-value: 2.34317E-15; score: 65.0504
IPR021109  Aspartic peptidase domain superfamily  GENE3D   2.40.70.10      Acid Proteases       coord: 24..146; e-value: 7.9E-24; score: 85.9
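
The `coord` values in the Alignment column can be used to pull the matched region out of the protein. This is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions on the protein listed in the Protein sequence section; `domain_segment` is a hypothetical helper name.

```python
# Protein sequence copied from the "Protein sequence" section of this page.
PROTEIN = (
    "MPSYVKFLKDILAKKRRMNDCETMVLTQVTSDVFKNEVPEKMTDPGSFTVPCSIGGMDLG"
    "RALCDLGASINLMPLSIFKKLELGEAHPTLMRLQFADRSITKPEGRIEDVLVKVDKFLFP"
    "VDFVILNYEADQEVPIILGQSFCQLVAS"
)


def domain_segment(protein: str, start: int, end: int) -> str:
    """Return residues start..end, both assumed 1-based and inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


pfam = domain_segment(PROTEIN, 50, 142)     # PF13650 (Asp_protease_2)
gene3d = domain_segment(PROTEIN, 24, 146)   # GENE3D 2.40.70.10
print(len(PROTEIN), len(pfam), len(gene3d))  # 148 93 123
```

The bounds check is there because annotation coordinates and sequence versions can drift apart; failing loudly is safer than silently returning a truncated slice.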

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
PI0021918.1   PI0021918.1  mRNA