Cla019549.1 (mRNA) Watermelon (97103) v1

NameCla019549
TypemRNA
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionGag-protease polyprotein (AHRD V1 *-*- Q84KB1_CUCME); contains Interpro domain(s) IPR005162 Retrotransposon gag protein
LocationChr3 : 6825642 .. 6826210 (+)
Sequence length513
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTGAGGGTGAATCAAGTCATCCTCAAGCTGGAGTTAATGTTGAGGAACAGATCTTTACTAGAATTGCTGAAAGGTTGGCTGCAGGTATCAGATCGGTGCAGTCTGACCCAGAGAAGAAATATGGAACTGAAAGGTTGAAAGCTTTGGGTGCTACTACTTTCAAAGGCATCGCAGATCCTGTCAATGCAGAGGTATGGCTGAACCTGCTTGAAAAATGTTACCGTGTGATAAGATGCCCTAACGACAGAAAGGTAGAGTTGACAATGTTTTTACTCCAGAAGAGGGCGGAAGATTGGTGGAGAGTAATGAAGAGCAGAAGGGGAGGAACAGAAGGATAAGTTGGGACGAGTTTAAGAAAGCGTTCCAAGATAAGTATTGTACAAAAGCATTCCGTGATGCCAAACAGAATGAATTTCTTAAGTTAGTGCAAAGGTCAATGACAGTGGCAGAATACAAAAAGAAATATATTGAGTTATCCAAGTATGCCATCACCATTGTAGCTGATGAGGTCGATCGATACAAGAGATTTGAGAAGGATTGCTGGAGGAAATTCATACTCCAGTGA

mRNA sequence

ATGTCTGAGGGTGAATCAAGTCATCCTCAAGCTGGAGTTAATGTTGAGGAACAGATCTTTACTAGAATTGCTGAAAGGTTGGCTGCAGGTATCAGATCGGTGCAGTCTGACCCAGAGAAGAAATATGGAACTGAAAGGTTGAAAGCTTTGGGTGCTACTACTTTCAAAGGCATCGCAGATCCTGTCAATGCAGAGAAAGGTAGAGTTGACAATGTTTTTACTCCAGAAGAGGGCGGAAGATTGGTGGAGAGTAATGAAGAGCAGAAGGGGAGGAACAGAAGGATAAGTTGGGACGAGTTTAAGAAAGCGTTCCAAGATAAGTATTGTACAAAAGCATTCCGTGATGCCAAACAGAATGAATTTCTTAAGTTAGTGCAAAGGTCAATGACAGTGGCAGAATACAAAAAGAAATATATTGAGTTATCCAAGTATGCCATCACCATTGTAGCTGATGAGGTCGATCGATACAAGAGATTTGAGAAGGATTGCTGGAGGAAATTCATACTCCAGTGA

Coding sequence (CDS)

ATGTCTGAGGGTGAATCAAGTCATCCTCAAGCTGGAGTTAATGTTGAGGAACAGATCTTTACTAGAATTGCTGAAAGGTTGGCTGCAGGTATCAGATCGGTGCAGTCTGACCCAGAGAAGAAATATGGAACTGAAAGGTTGAAAGCTTTGGGTGCTACTACTTTCAAAGGCATCGCAGATCCTGTCAATGCAGAGAAAGGTAGAGTTGACAATGTTTTTACTCCAGAAGAGGGCGGAAGATTGGTGGAGAGTAATGAAGAGCAGAAGGGGAGGAACAGAAGGATAAGTTGGGACGAGTTTAAGAAAGCGTTCCAAGATAAGTATTGTACAAAAGCATTCCGTGATGCCAAACAGAATGAATTTCTTAAGTTAGTGCAAAGGTCAATGACAGTGGCAGAATACAAAAAGAAATATATTGAGTTATCCAAGTATGCCATCACCATTGTAGCTGATGAGGTCGATCGATACAAGAGATTTGAGAAGGATTGCTGGAGGAAATTCATACTCCAGTGA

Protein sequence

MSEGESSHPQAGVNVEEQIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADPVNAEKGRVDNVFTPEEGGRLVESNEEQKGRNRRISWDEFKKAFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFEKDCWRKFILQ
BLAST of Cla019549 vs. TrEMBL
Match: M5XSR7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb016975mg PE=4 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 2.9e-12
Identity = 56/180 (31.11%), Postives = 90/180 (50.00%), Query Frame = 1

Query: 6   SSHPQAGVNVEE--QIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADPVN 65
           S  P AG N  +  Q+ ++    +A  +R  +     +   +R+K LGA  F G  DP  
Sbjct: 36  SPSPHAGDNTLDMRQVLSQFTRTVATALRGRRGTESSEI--KRVKELGAKEFLGSTDPAE 95

Query: 66  AEKG--RVDNVFT-----PEEGGRLV------------ESNEEQKGRNRRISWDEFKKAF 125
           AE     V+ +F       E+  RL             ++ +        I+W+EF++ F
Sbjct: 96  AELWITDVERIFEVLECPAEDRVRLATFLLKGNAYHWWKAVKRGYENPAAINWEEFQRVF 155

Query: 126 QDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFEKDCW 165
            D +   +++ AK++EFL L Q SM+V EY+ K+ ELS++A  +VA E DR +RFE+  W
Sbjct: 156 SDPFYPPSYKQAKKSEFLYLKQGSMSVVEYEHKFNELSRFAPELVATEEDRCRRFEEGLW 213

BLAST of Cla019549 vs. TrEMBL
Match: M5VK25_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015000mg PE=4 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 8.3e-12
Identity = 55/164 (33.54%), Postives = 82/164 (50.00%), Query Frame = 1

Query: 20  FTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADPVNAEKG--RVDNVFT--- 79
           FTR       G R  +S   K     R+K LGA  F G  DP  AE     V+ +F    
Sbjct: 55  FTRTMATALRGRRGTESSEIK-----RVKELGAKEFVGSTDPAEAESWITDVERIFEVLE 114

Query: 80  --PEEGGRLV------------ESNEEQKGRNRRISWDEFKKAFQDKYCTKAFRDAKQNE 139
              E+  RL             ++ +        I+W+EF++ F +++   ++R AK++E
Sbjct: 115 CPAEDRVRLATFLLKGNAYHWWKAVKRGYENPAAINWEEFQRVFSEQFYPPSYRHAKKSE 174

Query: 140 FLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFEKDCW 165
           FL L Q SM+V EY+ K+ ELS++A  +VA E DR +RFE+  W
Sbjct: 175 FLYLKQGSMSVMEYEHKFNELSRFAPELVATEEDRCRRFEEGLW 213

BLAST of Cla019549 vs. TrEMBL
Match: M5Y283_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019381mg PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 2.7e-10
Identity = 50/157 (31.85%), Postives = 82/157 (52.23%), Query Frame = 1

Query: 32  RSVQSDPEKKYGTE-----RLKALGATTFKGIADPVNAEK-----GRVDNVFTPEEGGR- 91
           R+V +  +++  TE     R+K LGA  F G ADP  A+       R+  V    +  R 
Sbjct: 62  RTVSTALQRRRNTESSDIKRVKELGANEFHGSADPAEADACLTDVERIFEVLQCPDRDRV 121

Query: 92  -----LVESNEEQKGRNRR--------ISWDEFKKAFQDKYCTKAFRDAKQNEFLKLVQR 151
                L++ N     +  R        ++W+EF++ F D++   ++++ K++EFL L Q 
Sbjct: 122 RLAAFLLKGNAYHGWKAVRRGYANPAALTWEEFQRVFFDQFYPHSYKNEKKSEFLHLRQG 181

Query: 152 SMTVAEYKKKYIELSKYAITIVADEVDRYKRFEKDCW 165
           SM+V EY+ K+ ELS++A  +V  E DR  RFE+  W
Sbjct: 182 SMSVLEYEHKFNELSRFAPELVTTEEDRCTRFEEGLW 218

BLAST of Cla019549 vs. TrEMBL
Match: A0A0K9JUF1_9BURK (Uncharacterized protein OS=Candidatus Burkholderia humilis GN=BHUM_04587c PE=4 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 3.9e-09
Identity = 35/80 (43.75%), Postives = 52/80 (65.00%), Query Frame = 1

Query: 80  RLVESNEEQKGRNRRISWDEFKKAFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYI 139
           R++E   EQ G  R  +WD F   F++K+  +  ++ KQ EF+ L QR+MTVA+Y+KK+ 
Sbjct: 27  RIIEQKWEQIGTPR--TWDNFISVFRNKFIPEVVKERKQEEFIYLRQRTMTVAQYEKKFN 86

Query: 140 ELSKYAITIVADEVDRYKRF 160
            LSKYA  +V  ++ R KRF
Sbjct: 87  RLSKYAPDLVDTDIKRNKRF 104

BLAST of Cla019549 vs. TrEMBL
Match: M5XAA1_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppb013312mg PE=4 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 3.9e-09
Identity = 32/72 (44.44%), Postives = 52/72 (72.22%), Query Frame = 1

Query: 95  ISWDEFKKAFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVD 154
           I W+EF++ F +++   ++R AK++EFL L Q SM+V EY+ K+ +LS++A  +VA E D
Sbjct: 57  IKWEEFQRVFSEQFYPHSYRHAKKSEFLYLKQGSMSVVEYEHKFNKLSRFAPELVATEDD 116

Query: 155 RYKRFEKD-CWR 166
           R +RFE+  CW+
Sbjct: 117 RCRRFEEGLCWK 128

BLAST of Cla019549 vs. NCBI nr
Match: gi|659116534|ref|XP_008458118.1| (PREDICTED: uncharacterized protein LOC103497647 [Cucumis melo])

HSP 1 Score: 140.6 bits (353), Expect = 2.6e-30
Identity = 75/178 (42.13%), Postives = 107/178 (60.11%), Query Frame = 1

Query: 2   SEGESSHPQAGVNVEEQIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADP 61
           SEGESSH Q  + V+E++  +I E +  G+RS+ ++ EK +G ERLKALGATTF    DP
Sbjct: 29  SEGESSHQQGDMTVDEKLLNKITEGITTGMRSLNNNLEKTFGIERLKALGATTFARTTDP 88

Query: 62  VNAE-------------------KGRVDNVFTPEEGGRLVESNEEQKGRNRRISWDEFKK 121
            + E                   K  +      +       + E +      ISW+EFKK
Sbjct: 89  TDGEGWMNVLEKCFKVMRCLDDRKVELATFLLKKGADYWWRTFESRYHDTNEISWNEFKK 148

Query: 122 AFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFE 161
           AF +KY  ++++DAK++EF +LV  SMT+AEY+KKYI+LSKYA +++ DE+DR KRFE
Sbjct: 149 AFYEKYYPRSYKDAKRSEFFRLVLGSMTIAEYEKKYIKLSKYATSMIEDEIDRCKRFE 206

BLAST of Cla019549 vs. NCBI nr
Match: gi|659095113|ref|XP_008448403.1| (PREDICTED: uncharacterized protein LOC103490604 [Cucumis melo])

HSP 1 Score: 107.1 bits (266), Expect = 3.1e-20
Identity = 67/179 (37.43%), Postives = 92/179 (51.40%), Query Frame = 1

Query: 2   SEGESSHPQAGVNVEEQIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADP 61
           SE  SS P+       + F R A+ +    R   SDPEK YG ERLK LGAT F+G  DP
Sbjct: 24  SERGSSTPRGQNEAGSERFARSAQEIGRPERVGPSDPEKMYGIERLKKLGATVFEGSTDP 83

Query: 62  VNAE-------------------KGRVDNVFTPEEGGRLVESNEEQKGRNRRISWDEFKK 121
            NAE                   K R+      +E     +S   ++   R + W  F+ 
Sbjct: 84  ANAEVWLNMLEKCFDVMNCPQKRKVRLTTFLLQKEAEGWWKSIIARRNDARTLDWQTFRG 143

Query: 122 AFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFEK 162
            F++KY    + +AK++EF +L Q S++VAEY++KY ELS+Y   IVA E DR  RFE+
Sbjct: 144 IFEEKYYPTTYCEAKRDEFQELKQGSLSVAEYERKYTELSRYVEVIVAYESDRCCRFER 202

BLAST of Cla019549 vs. NCBI nr
Match: gi|659087174|ref|XP_008444312.1| (PREDICTED: uncharacterized protein LOC103487680 [Cucumis melo])

HSP 1 Score: 105.1 bits (261), Expect = 1.2e-19
Identity = 64/179 (35.75%), Postives = 95/179 (53.07%), Query Frame = 1

Query: 2   SEGESSHPQAGVNVEEQIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADP 61
           SE  SS P+    V  + F R A+ ++   R+  SD +K YG E+LK LGAT F+G  DP
Sbjct: 9   SERGSSTPRGQNEVGSERFARSAQEISRPERARPSDLDKMYGIEQLKKLGATVFQGSTDP 68

Query: 62  VNAE-------------------KGRVDNVFTPEEGGRLVESNEEQKGRNRRISWDEFKK 121
            +AE                   K R+      +E    ++S    +   R + W  F+ 
Sbjct: 69  ADAEVWLNMLEKCFDVMSCPQEQKVRLATFLLQKEAEGWLKSIIASRNDARTLYWQTFRG 128

Query: 122 AFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELSKYAITIVADEVDRYKRFEK 162
            F++KY    + +AK++EF++L Q S+ VAEY++KY ELS+Y   IVA E DR +RFE+
Sbjct: 129 IFEEKYYPTTYCEAKRDEFMELKQGSLLVAEYERKYTELSRYVEMIVASESDRCRRFER 187

BLAST of Cla019549 vs. NCBI nr
Match: gi|659072264|ref|XP_008464580.1| (PREDICTED: uncharacterized protein LOC103502419 [Cucumis melo])

HSP 1 Score: 84.7 bits (208), Expect = 1.7e-13
Identity = 41/64 (64.06%), Postives = 49/64 (76.56%), Query Frame = 1

Query: 2   SEGESSHPQAGVNVEEQIFTRIAERLAAGIRSVQSDPEKKYGTERLKALGATTFKGIADP 61
           S+ ESS P    N+EEQ+  R+A+RL +GIRS QSDPEKKYG ERLKALGATTF G  +P
Sbjct: 47  SDAESSRPHVEGNMEEQLLDRLAQRLISGIRSAQSDPEKKYGIERLKALGATTFVGTTNP 106

Query: 62  VNAE 66
            +AE
Sbjct: 107 ADAE 110

BLAST of Cla019549 vs. NCBI nr
Match: gi|659113911|ref|XP_008456815.1| (PREDICTED: uncharacterized protein LOC103496653 [Cucumis melo])

HSP 1 Score: 83.6 bits (205), Expect = 3.7e-13
Identity = 51/139 (36.69%), Postives = 74/139 (53.24%), Query Frame = 1

Query: 42  YGTERLKALGATTFKGIADPVNAE-------------------KGRVDNVFTPEEGGRLV 101
           YG ERLK LGAT F+G  D  +A+                   K R+      +E     
Sbjct: 2   YGIERLKKLGATVFEGSTDLADADAWLNMLEKCFDVMSCPQERKVRLATFLLQKEAEGWW 61

Query: 102 ESNEEQKGRNRRISWDEFKKAFQDKYCTKAFRDAKQNEFLKLVQRSMTVAEYKKKYIELS 161
           +S   +    R + W  F+  F++KY    + +AK++EFL+L Q S++VAEY+ KY ELS
Sbjct: 62  KSIIARCNDARTLDWQTFRGIFEEKYYPTTYCEAKRDEFLELKQGSLSVAEYEMKYTELS 121

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
M5XSR7_PRUPE2.9e-1231.11Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb016975mg PE=4 SV=1[more]
M5VK25_PRUPE8.3e-1233.54Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015000mg PE=4 SV=1[more]
M5Y283_PRUPE2.7e-1031.85Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019381mg PE=4 SV=1[more]
A0A0K9JUF1_9BURK3.9e-0943.75Uncharacterized protein OS=Candidatus Burkholderia humilis GN=BHUM_04587c PE=4 S... [more]
M5XAA1_PRUPE3.9e-0944.44Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppb013312mg PE=4 S... [more]
Match NameE-valueIdentityDescription
gi|659116534|ref|XP_008458118.1|2.6e-3042.13PREDICTED: uncharacterized protein LOC103497647 [Cucumis melo][more]
gi|659095113|ref|XP_008448403.1|3.1e-2037.43PREDICTED: uncharacterized protein LOC103490604 [Cucumis melo][more]
gi|659087174|ref|XP_008444312.1|1.2e-1935.75PREDICTED: uncharacterized protein LOC103487680 [Cucumis melo][more]
gi|659072264|ref|XP_008464580.1|1.7e-1364.06PREDICTED: uncharacterized protein LOC103502419 [Cucumis melo][more]
gi|659113911|ref|XP_008456815.1|3.7e-1336.69PREDICTED: uncharacterized protein LOC103496653 [Cucumis melo][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR005162Retrotrans_gag_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla019549Cla019549gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla019549Cla019549.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla019549.1.cds1Cla019549.1.cds1CDS
Cla019549.1.cds2Cla019549.1.cds2CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 88..152
score: 2.