Clc04G03865 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc04G03865
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Location: ClcChr04: 12866949 .. 12868076 (-)
RNA-Seq Expression: Clc04G03865
Synteny: Clc04G03865
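
As a quick consistency check on the Location field above, the length of the gene span can be derived from its coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), and the helper name `parse_location` and its regular expression are hypothetical, not part of the database's tooling.

```python
import re

def parse_location(location: str):
    """Split a string like 'ClcChr04: 12866949 .. 12868076 (-)' into its parts.

    Hypothetical helper for illustration; the pattern is an assumption about
    how the Location field is laid out on this page.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("ClcChr04: 12866949 .. 12868076 (-)")

# Assumption: 1-based, inclusive coordinates, so both end points count.
span = end - start + 1
print(seqid, strand, span)  # ClcChr04 - 1128
```

Under that assumption the span is 1128 bp, a multiple of three.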
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCACCTCGAGGCAGGCGAGCTAGGCAAGCAGTAGTCGGAACTCCTGAAGCTACTGGGAACCATGGAGATATGTCTGAGGGTGAATCTAGTCATCCTCAAGCTGAGGTCAATGTCGAGGAACAACTATTTACTAGAATTGCCCAGAGACTGGCTGCAAGTATCGGATCAGTGGAATCCGATCCAGAAAAGAAATATAGTGTTGAAAGATTAAAGGCTTTAGGTGCTACTACTTTTGAGGGCACCGTGGAACCTGTTGAGGCAGAAACATGGCTAAATTTGCTCGAGAAATGCTATCGTGTAATGAGATGCCCCGAAGACAGAAAAGTGGAGTTGGCAGTATTCTTGCTCCAAAAAGGGGCAGAAGATTGGTGGAAGATTGTGGAAAGCAGAAGAGGAGACACAGAAGGAATAGACTGGAATGAGTTTAAGAAGGTTTTCCAAGAAAAATATTGTCCAAGATCATTCCGAGATGCTAAACGGAATGAGTTTTTGAGGTTGATACAGGGATCTATGACAGTGGCAGAATATGAAAAGAAATATACAGAACTGTCTAAATATGCCACGACGGTTATTGCAGATGAGACTGATCGATGTAAGAGGTTTGAAGAAGGGTTACGGGAGGAAATACGAACTCCAGTAACCGCCAGTGTTGAATGGATGGATTTTTCTAGCCTAGTGGAGGCAGCTATGCGGGTAGAAAAGAGTTTGATGGAGAAGAAGATAGAGCGTGATTCATCTAAAAGTGAACGTGTAGTCCACTCTTCTGGCGTTACTCTGGGACAATCAGGAAGGAGATTTGTACCTGGCGTGTTTAAAGGAGGAAACTTCAAAACTAAATCGAGCGGACAGACTACTTTTAAGACCAGTACTAGTGGAGGTATACGGGGGCAAGGACCAAAGAACGTTGGTGGTCCAGCGCAGTCAGTAGAAGGTTCTCGAAGTGGACGACCTGGTGAGTCCACAGCTAGTTCCACACAGAAACCACTGTGTCCCACCTGCGGGAAGTATCATTGGGGACAGTGTAGGGTCAATGCATGTTATAACTGTGGACAGACTAGTCATTTCAAACGAGAATGCCCTCAATTGATGCAAGAAGATAAACCTGACCAGAGAACCAGTCCCTAG

mRNA sequence

ATGCCACCTCGAGGCAGGCGAGCTAGGCAAGCAGTAGTCGGAACTCCTGAAGCTACTGGGAACCATGGAGATATGTCTGAGGGTGAATCTAGTCATCCTCAAGCTGAGGTCAATGTCGAGGAACAACTATTTACTAGAATTGCCCAGAGACTGGCTGCAAGTATCGGATCAGTGGAATCCGATCCAGAAAAGAAATATAGTGTTGAAAGATTAAAGGCTTTAGGTGCTACTACTTTTGAGGGCACCGTGGAACCTGTTGAGGCAGAAACATGGCTAAATTTGCTCGAGAAATGCTATCGTGTAATGAGATGCCCCGAAGACAGAAAAGTGGAGTTGGCAGTATTCTTGCTCCAAAAAGGGGCAGAAGATTGGTGGAAGATTGTGGAAAGCAGAAGAGGAGACACAGAAGGAATAGACTGGAATGAGTTTAAGAAGGTTTTCCAAGAAAAATATTGTCCAAGATCATTCCGAGATGCTAAACGGAATGAGTTTTTGAGGTTGATACAGGGATCTATGACAGTGGCAGAATATGAAAAGAAATATACAGAACTGTCTAAATATGCCACGACGGTTATTGCAGATGAGACTGATCGATGTAAGAGGTTTGAAGAAGGGTTACGGGAGGAAATACGAACTCCAGTAACCGCCAGTGTTGAATGGATGGATTTTTCTAGCCTAGTGGAGGCAGCTATGCGGGTAGAAAAGAGTTTGATGGAGAAGAAGATAGAGCGTGATTCATCTAAAAGTGAACGTGTAGTCCACTCTTCTGGCGTTACTCTGGGACAATCAGGAAGGAGATTTGTACCTGGCGTGTTTAAAGGAGGAAACTTCAAAACTAAATCGAGCGGACAGACTACTTTTAAGACCAGTACTAGTGGAGGTATACGGGGGCAAGGACCAAAGAACGTTGGTGGTCCAGCGCAGTCAGTAGAAGGTTCTCGAAGTGGACGACCTGGTGAGTCCACAGCTAGTTCCACACAGAAACCACTGTGTCCCACCTGCGGGAAGTATCATTGGGGACAGTGTAGGGTCAATGCATGTTATAACTGTGGACAGACTAGTCATTTCAAACGAGAATGCCCTCAATTGATGCAAGAAGATAAACCTGACCAGAGAACCAGTCCCTAG

Coding sequence (CDS)

ATGCCACCTCGAGGCAGGCGAGCTAGGCAAGCAGTAGTCGGAACTCCTGAAGCTACTGGGAACCATGGAGATATGTCTGAGGGTGAATCTAGTCATCCTCAAGCTGAGGTCAATGTCGAGGAACAACTATTTACTAGAATTGCCCAGAGACTGGCTGCAAGTATCGGATCAGTGGAATCCGATCCAGAAAAGAAATATAGTGTTGAAAGATTAAAGGCTTTAGGTGCTACTACTTTTGAGGGCACCGTGGAACCTGTTGAGGCAGAAACATGGCTAAATTTGCTCGAGAAATGCTATCGTGTAATGAGATGCCCCGAAGACAGAAAAGTGGAGTTGGCAGTATTCTTGCTCCAAAAAGGGGCAGAAGATTGGTGGAAGATTGTGGAAAGCAGAAGAGGAGACACAGAAGGAATAGACTGGAATGAGTTTAAGAAGGTTTTCCAAGAAAAATATTGTCCAAGATCATTCCGAGATGCTAAACGGAATGAGTTTTTGAGGTTGATACAGGGATCTATGACAGTGGCAGAATATGAAAAGAAATATACAGAACTGTCTAAATATGCCACGACGGTTATTGCAGATGAGACTGATCGATGTAAGAGGTTTGAAGAAGGGTTACGGGAGGAAATACGAACTCCAGTAACCGCCAGTGTTGAATGGATGGATTTTTCTAGCCTAGTGGAGGCAGCTATGCGGGTAGAAAAGAGTTTGATGGAGAAGAAGATAGAGCGTGATTCATCTAAAAGTGAACGTGTAGTCCACTCTTCTGGCGTTACTCTGGGACAATCAGGAAGGAGATTTGTACCTGGCGTGTTTAAAGGAGGAAACTTCAAAACTAAATCGAGCGGACAGACTACTTTTAAGACCAGTACTAGTGGAGGTATACGGGGGCAAGGACCAAAGAACGTTGGTGGTCCAGCGCAGTCAGTAGAAGGTTCTCGAAGTGGACGACCTGGTGAGTCCACAGCTAGTTCCACACAGAAACCACTGTGTCCCACCTGCGGGAAGTATCATTGGGGACAGTGTAGGGTCAATGCATGTTATAACTGTGGACAGACTAGTCATTTCAAACGAGAATGCCCTCAATTGATGCAAGAAGATAAACCTGACCAGAGAACCAGTCCCTAG

Protein sequence

MPPRGRRARQAVVGTPEATGNHGDMSEGESSHPQAEVNVEEQLFTRIAQRLAASIGSVESDPEKKYSVERLKALGATTFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEGIDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETDRCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERVVHSSGVTLGQSGRRFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGSRSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLMQEDKPDQRTSP
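
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper `translate` is hypothetical, and only the first ten and last seven codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCCACCTCGAGGCAGGCGAGCTAGGCAA"  # first 10 codons of the CDS
cds_end = "GACCAGAGAACCAGTCCCTAG"            # last 7 codons, including the stop

print(translate(cds_start))  # MPPRGRRARQ
print(translate(cds_end))    # DQRTSP
```

Both fragments agree with the start and end of the protein sequence listed above.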
Homology
BLAST of Clc04G03865 vs. NCBI nr
Match: KAA0060484.1 (Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 342.0 bits (876), Expect = 6.3e-90
Identity = 192/351 (54.70%), Positives = 229/351 (65.24%), Query Frame = 0

Query: 20  GNHGDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGAT 79
           G  G M +G    HP AE        +  A+  A   G S +SDPEKKY +ERLKALGAT
Sbjct: 15  GAKGTMPQGRPRKHPDAEA-------SNAAREAAMGSGESTQSDPEKKYGIERLKALGAT 74

Query: 80  TFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEG 139
           TF GT  P +AE WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  
Sbjct: 75  TFAGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGD 134

Query: 140 IDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETD 199
           I WNEFKK F +K+ PRSFRDAKRNEFLRL QGSMT+AEYEKKYTELS YAT VI DE +
Sbjct: 135 ISWNEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTIAEYEKKYTELSMYATRVIEDEVE 194

Query: 200 RCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERVVHSSG 259
           RCKRFEEGLREEIRTPVTA  +W DFS LVEAA+RVEKSL E+K ER++SK+     SS 
Sbjct: 195 RCKRFEEGLREEIRTPVTACADWNDFSKLVEAALRVEKSLNERKQERETSKNV-CTFSSS 254

Query: 260 VTLGQSGR----RFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGS 319
           +   + G+    RFVPGV   GNFK++ +G +  K+ +SGG   Q       P  S  GS
Sbjct: 255 MHRNRQGKERSGRFVPGVSSRGNFKSQYNGSSFSKSGSSGG--AQRSSGSSHPISSTGGS 314

Query: 320 RSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
              R     + S++                 + CYNCGQ  H++R+CP L+
Sbjct: 315 HIARSDRVVSESSKS----------------SVCYNCGQPGHYRRDCPHLI 339
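
The identity and positive-match percentages in the HSP header above are the match counts divided by the alignment length (351 columns, gaps included). A small sketch of that arithmetic follows; the function name `percent` is illustrative.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the first HSP against KAA0060484.1.
identity = percent(192, 351)
positives = percent(229, 351)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
# Identity = 54.70%, Positives = 65.24%
```

The same calculation applies to the other HSPs on this page, each with its own counts and alignment length.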

BLAST of Clc04G03865 vs. NCBI nr
Match: TYJ95881.1 (retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa])

HSP 1 Score: 339.7 bits (870), Expect = 3.1e-89
Identity = 194/366 (53.01%), Positives = 236/366 (64.48%), Query Frame = 0

Query: 3   PRGRRARQAVVGTPEATGNHGDMSEGESSHPQAEVNVEEQLFTRIAQRLAASIGSVESDP 62
           PR     +A     EA    G+ S+ ESS P+ E NVEEQL  R+AQRL + I S +SDP
Sbjct: 6   PRKHPDAEASNAAKEAAMGSGE-SDAESSRPRVEENVEEQLLDRLAQRLVSGIRSAQSDP 65

Query: 63  EKKYSVERLKALGATTFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAE 122
           EKKY  ERLKALGATTF GT  P + E WL L+EKC+RV R  EDRKVELA FLLQ  AE
Sbjct: 66  EKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLEDRKVELAAFLLQNDAE 125

Query: 123 DWWKIVESRRGDTEGIDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYT 182
           DWW++ ESRR  T  + W+EFKK F +K+ PRSFRDAK NEF+RL QG+MTVAEYEKKYT
Sbjct: 126 DWWRMEESRRRTTGDMSWDEFKKAFFDKFYPRSFRDAKHNEFVRLTQGTMTVAEYEKKYT 185

Query: 183 ELSKYATTVIADETDRCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKI 242
           ELSKYAT VI DE +RCKRFEEGLREEIRTPVTA  +W DFS LVE A+RVEKSL E+K 
Sbjct: 186 ELSKYATRVIVDEGERCKRFEEGLREEIRTPVTACADWNDFSKLVEVALRVEKSLNERKR 245

Query: 243 ERDSSKSERVVHSSGVTLGQSGR----RFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQ 302
           ER++SK+ R   SS +   + G+    RFVP V   G+FK++ SG +  K+ + GG   Q
Sbjct: 246 EREASKNLR-TFSSSMHRNRPGKERSGRFVPRVSSRGSFKSQYSGSSFSKSRSGGG--AQ 305

Query: 303 GPKNVGGPAQSVEGSRSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKR 362
              +      S  GS   R     + S +                 + CYNC Q  H++R
Sbjct: 306 RSSDSSHTISSTGGSHVARSNRVVSESGKS----------------SVCYNCCQLGHYRR 351

Query: 363 ECPQLM 365
           +CP L+
Sbjct: 366 DCPHLI 351

BLAST of Clc04G03865 vs. NCBI nr
Match: TYK15233.1 (uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa])

HSP 1 Score: 330.5 bits (846), Expect = 1.9e-86
Identity = 191/351 (54.42%), Positives = 227/351 (64.67%), Query Frame = 0

Query: 20  GNHGDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGAT 79
           G  G M  G    HP AE        +  A+  A   G S +SDP+KKY +ERLKALGAT
Sbjct: 145 GAKGTMPRGRLRRHPDAEA-------SNAAREAAMGSGESAQSDPKKKYGIERLKALGAT 204

Query: 80  TFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEG 139
           TF GT  P + E WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  
Sbjct: 205 TFAGTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGD 264

Query: 140 IDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETD 199
           I W+EFKK F +K+ PRSFRDAKRNEFLRL QGSMTVAEYEKKYTELSKYAT VI DE +
Sbjct: 265 ISWDEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTVAEYEKKYTELSKYATRVIEDEVE 324

Query: 200 RCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERV----V 259
           R KRFEEGLREEIRT VTA  +W DFS LVEAA+RV KSL E+K ER++SK+ R     +
Sbjct: 325 RYKRFEEGLREEIRTSVTACADWNDFSKLVEAALRVGKSLNERKRERETSKNVRTFSSSM 384

Query: 260 HSSGVTLGQSGRRFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGS 319
           H + +   +SG RFVPGV   GNFK++ +G + F  S SGG   Q       P  S+ GS
Sbjct: 385 HRNRLGKERSG-RFVPGVPSRGNFKSQYNG-SYFSNSGSGG-EAQRSSGSSHPISSIGGS 444

Query: 320 RSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
              R     + S                C+ + CYNCGQ  H++R+CP L+
Sbjct: 445 HIARSDRVVSES----------------CKSSVCYNCGQPGHYRRDCPHLI 469

BLAST of Clc04G03865 vs. NCBI nr
Match: KAA0039476.1 (uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa])

HSP 1 Score: 330.1 bits (845), Expect = 2.5e-86
Identity = 190/348 (54.60%), Positives = 226/348 (64.94%), Query Frame = 0

Query: 23  GDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGATTFE 82
           G M  G    HP AE        +  A+  A   G S +SDP+KKY +ERLKALGATTF 
Sbjct: 112 GTMPRGRLRRHPDAEA-------SNAAREAAMGSGESAQSDPKKKYGIERLKALGATTFA 171

Query: 83  GTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEGIDW 142
           GT  P + E WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  I W
Sbjct: 172 GTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGDISW 231

Query: 143 NEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETDRCK 202
           +EFKK F +K+ PRSFRDAKRNEFLRL QGSMTVAEYEKKYTELSKYAT VI DE +R K
Sbjct: 232 DEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTVAEYEKKYTELSKYATRVIEDEVERYK 291

Query: 203 RFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERV----VHSS 262
           RFEEGLREEIRT VTA  +W DFS LVEAA+RV KSL E+K ER++SK+ R     +H +
Sbjct: 292 RFEEGLREEIRTSVTACADWNDFSKLVEAALRVGKSLNERKRERETSKNVRTFSSSMHRN 351

Query: 263 GVTLGQSGRRFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGSRSG 322
            +   +SG RFVPGV   GNFK++ +G + F  S SGG   Q       P  S+ GS   
Sbjct: 352 RLGKERSG-RFVPGVPSRGNFKSQYNG-SYFSNSGSGG-EAQRSSGSSHPISSIGGSHIA 411

Query: 323 RPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
           R     + S                C+ + CYNCGQ  H++R+CP L+
Sbjct: 412 RSDRVVSES----------------CKSSVCYNCGQPGHYRRDCPHLI 433

BLAST of Clc04G03865 vs. NCBI nr
Match: KAA0066849.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 313.5 bits (802), Expect = 2.4e-81
Identity = 179/376 (47.61%), Positives = 235/376 (62.50%), Query Frame = 0

Query: 1   MPPR-GRRARQAVVGTPEATGNHGDMSEGESSHPQAEVNVEEQLFTRIAQRLAASIGSVE 60
           MPPR GRR RQ   G    T      S GESS          + F R  Q +  +  +  
Sbjct: 39  MPPRTGRRRRQNQDGMQGPTQG---PSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEP 98

Query: 61  SDPEKKYSVERLKALGATTFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQK 120
           SDPEK Y +ERLK LGAT FEG+ +P +AE WLN+LEKC+ VM CPE+RKV LA FLLQK
Sbjct: 99  SDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQK 158

Query: 121 GAEDWWKIVESRRGDTEGIDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEK 180
            AE WWK + +RR D   +DW  F+ +F++KY P ++ +AKR+EFL L QGS++VAEYE+
Sbjct: 159 EAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYER 218

Query: 181 KYTELSKYATTVIADETDRCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLME 240
           KYTELS+YA  +IA E+DRC+RFE GLR EIRTPVTA  +W +FS LVE A+RVE+S+ E
Sbjct: 219 KYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITE 278

Query: 241 KKIERDSSKSERVVHSSGVTLGQSGRRFVPG--VFKGGNFKTKSSGQTTFKTSTSGGIRG 300
           +K   + S+      +SG   G+  RRF PG  +    +FK +S GQ +   S     + 
Sbjct: 279 EKSAVELSRGTST--ASGFR-GREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQR 338

Query: 301 QGPKNVGGPAQSVEGSRSGRPGESTASSTQKPLCPTCGKYHWGQCRVNA--CYNCGQTSH 360
           Q  +    P +S   S+ G+  ES AS+ ++  C +CG+ H GQC V A  CY CGQ  H
Sbjct: 339 QSQRIPSQPIRSTVRSQPGQ--ESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGH 398

Query: 361 FKRECPQLMQEDKPDQ 372
           FK++CPQL    + DQ
Sbjct: 399 FKKDCPQLNMTVQRDQ 406

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match: A0A5A7UZM6 (Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00750 PE=4 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 3.1e-90
Identity = 192/351 (54.70%), Positives = 229/351 (65.24%), Query Frame = 0

Query: 20  GNHGDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGAT 79
           G  G M +G    HP AE        +  A+  A   G S +SDPEKKY +ERLKALGAT
Sbjct: 15  GAKGTMPQGRPRKHPDAEA-------SNAAREAAMGSGESTQSDPEKKYGIERLKALGAT 74

Query: 80  TFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEG 139
           TF GT  P +AE WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  
Sbjct: 75  TFAGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGD 134

Query: 140 IDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETD 199
           I WNEFKK F +K+ PRSFRDAKRNEFLRL QGSMT+AEYEKKYTELS YAT VI DE +
Sbjct: 135 ISWNEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTIAEYEKKYTELSMYATRVIEDEVE 194

Query: 200 RCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERVVHSSG 259
           RCKRFEEGLREEIRTPVTA  +W DFS LVEAA+RVEKSL E+K ER++SK+     SS 
Sbjct: 195 RCKRFEEGLREEIRTPVTACADWNDFSKLVEAALRVEKSLNERKQERETSKNV-CTFSSS 254

Query: 260 VTLGQSGR----RFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGS 319
           +   + G+    RFVPGV   GNFK++ +G +  K+ +SGG   Q       P  S  GS
Sbjct: 255 MHRNRQGKERSGRFVPGVSSRGNFKSQYNGSSFSKSGSSGG--AQRSSGSSHPISSTGGS 314

Query: 320 RSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
              R     + S++                 + CYNCGQ  H++R+CP L+
Sbjct: 315 HIARSDRVVSESSKS----------------SVCYNCGQPGHYRRDCPHLI 339

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match: A0A5D3BB91 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G001760 PE=4 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 1.5e-89
Identity = 194/366 (53.01%), Positives = 236/366 (64.48%), Query Frame = 0

Query: 3   PRGRRARQAVVGTPEATGNHGDMSEGESSHPQAEVNVEEQLFTRIAQRLAASIGSVESDP 62
           PR     +A     EA    G+ S+ ESS P+ E NVEEQL  R+AQRL + I S +SDP
Sbjct: 6   PRKHPDAEASNAAKEAAMGSGE-SDAESSRPRVEENVEEQLLDRLAQRLVSGIRSAQSDP 65

Query: 63  EKKYSVERLKALGATTFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAE 122
           EKKY  ERLKALGATTF GT  P + E WL L+EKC+RV R  EDRKVELA FLLQ  AE
Sbjct: 66  EKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLEDRKVELAAFLLQNDAE 125

Query: 123 DWWKIVESRRGDTEGIDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYT 182
           DWW++ ESRR  T  + W+EFKK F +K+ PRSFRDAK NEF+RL QG+MTVAEYEKKYT
Sbjct: 126 DWWRMEESRRRTTGDMSWDEFKKAFFDKFYPRSFRDAKHNEFVRLTQGTMTVAEYEKKYT 185

Query: 183 ELSKYATTVIADETDRCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKI 242
           ELSKYAT VI DE +RCKRFEEGLREEIRTPVTA  +W DFS LVE A+RVEKSL E+K 
Sbjct: 186 ELSKYATRVIVDEGERCKRFEEGLREEIRTPVTACADWNDFSKLVEVALRVEKSLNERKR 245

Query: 243 ERDSSKSERVVHSSGVTLGQSGR----RFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQ 302
           ER++SK+ R   SS +   + G+    RFVP V   G+FK++ SG +  K+ + GG   Q
Sbjct: 246 EREASKNLR-TFSSSMHRNRPGKERSGRFVPRVSSRGSFKSQYSGSSFSKSRSGGG--AQ 305

Query: 303 GPKNVGGPAQSVEGSRSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKR 362
              +      S  GS   R     + S +                 + CYNC Q  H++R
Sbjct: 306 RSSDSSHTISSTGGSHVARSNRVVSESGKS----------------SVCYNCCQLGHYRR 351

Query: 363 ECPQLM 365
           +CP L+
Sbjct: 366 DCPHLI 351

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match: A0A5D3CTK6 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00030 PE=4 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 9.2e-87
Identity = 191/351 (54.42%), Positives = 227/351 (64.67%), Query Frame = 0

Query: 20  GNHGDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGAT 79
           G  G M  G    HP AE        +  A+  A   G S +SDP+KKY +ERLKALGAT
Sbjct: 145 GAKGTMPRGRLRRHPDAEA-------SNAAREAAMGSGESAQSDPKKKYGIERLKALGAT 204

Query: 80  TFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEG 139
           TF GT  P + E WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  
Sbjct: 205 TFAGTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGD 264

Query: 140 IDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETD 199
           I W+EFKK F +K+ PRSFRDAKRNEFLRL QGSMTVAEYEKKYTELSKYAT VI DE +
Sbjct: 265 ISWDEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTVAEYEKKYTELSKYATRVIEDEVE 324

Query: 200 RCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERV----V 259
           R KRFEEGLREEIRT VTA  +W DFS LVEAA+RV KSL E+K ER++SK+ R     +
Sbjct: 325 RYKRFEEGLREEIRTSVTACADWNDFSKLVEAALRVGKSLNERKRERETSKNVRTFSSSM 384

Query: 260 HSSGVTLGQSGRRFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGS 319
           H + +   +SG RFVPGV   GNFK++ +G + F  S SGG   Q       P  S+ GS
Sbjct: 385 HRNRLGKERSG-RFVPGVPSRGNFKSQYNG-SYFSNSGSGG-EAQRSSGSSHPISSIGGS 444

Query: 320 RSGRPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
              R     + S                C+ + CYNCGQ  H++R+CP L+
Sbjct: 445 HIARSDRVVSES----------------CKSSVCYNCGQPGHYRRDCPHLI 469

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match: A0A5A7TBS0 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G002900 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 1.2e-86
Identity = 190/348 (54.60%), Positives = 226/348 (64.94%), Query Frame = 0

Query: 23  GDMSEGE-SSHPQAEVNVEEQLFTRIAQRLAASIG-SVESDPEKKYSVERLKALGATTFE 82
           G M  G    HP AE        +  A+  A   G S +SDP+KKY +ERLKALGATTF 
Sbjct: 112 GTMPRGRLRRHPDAEA-------SNAAREAAMGSGESAQSDPKKKYGIERLKALGATTFA 171

Query: 83  GTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQKGAEDWWKIVESRRGDTEGIDW 142
           GT  P + E WL L+EKC+RV RCPEDRKVELA FLLQ GAEDWW++ ESRR  T  I W
Sbjct: 172 GTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELAAFLLQNGAEDWWRMEESRRRTTGDISW 231

Query: 143 NEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEKKYTELSKYATTVIADETDRCK 202
           +EFKK F +K+ PRSFRDAKRNEFLRL QGSMTVAEYEKKYTELSKYAT VI DE +R K
Sbjct: 232 DEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMTVAEYEKKYTELSKYATRVIEDEVERYK 291

Query: 203 RFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLMEKKIERDSSKSERV----VHSS 262
           RFEEGLREEIRT VTA  +W DFS LVEAA+RV KSL E+K ER++SK+ R     +H +
Sbjct: 292 RFEEGLREEIRTSVTACADWNDFSKLVEAALRVGKSLNERKRERETSKNVRTFSSSMHRN 351

Query: 263 GVTLGQSGRRFVPGVFKGGNFKTKSSGQTTFKTSTSGGIRGQGPKNVGGPAQSVEGSRSG 322
            +   +SG RFVPGV   GNFK++ +G + F  S SGG   Q       P  S+ GS   
Sbjct: 352 RLGKERSG-RFVPGVPSRGNFKSQYNG-SYFSNSGSGG-EAQRSSGSSHPISSIGGSHIA 411

Query: 323 RPGESTASSTQKPLCPTCGKYHWGQCRVNACYNCGQTSHFKRECPQLM 365
           R     + S                C+ + CYNCGQ  H++R+CP L+
Sbjct: 412 RSDRVVSES----------------CKSSVCYNCGQPGHYRRDCPHLI 433

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match: A0A5A7U2V7 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00630 PE=4 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 1.2e-81
Identity = 179/376 (47.61%), Positives = 235/376 (62.50%), Query Frame = 0

Query: 1   MPPR-GRRARQAVVGTPEATGNHGDMSEGESSHPQAEVNVEEQLFTRIAQRLAASIGSVE 60
           MPPR GRR RQ   G    T      S GESS          + F R  Q +  +  +  
Sbjct: 1   MPPRTGRRRRQNQDGMQGPTQG---PSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEP 60

Query: 61  SDPEKKYSVERLKALGATTFEGTVEPVEAETWLNLLEKCYRVMRCPEDRKVELAVFLLQK 120
           SDPEK Y +ERLK LGAT FEG+ +P +AE WLN+LEKC+ VM CPE+RKV LA FLLQK
Sbjct: 61  SDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQK 120

Query: 121 GAEDWWKIVESRRGDTEGIDWNEFKKVFQEKYCPRSFRDAKRNEFLRLIQGSMTVAEYEK 180
            AE WWK + +RR D   +DW  F+ +F++KY P ++ +AKR+EFL L QGS++VAEYE+
Sbjct: 121 EAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYER 180

Query: 181 KYTELSKYATTVIADETDRCKRFEEGLREEIRTPVTASVEWMDFSSLVEAAMRVEKSLME 240
           KYTELS+YA  +IA E+DRC+RFE GLR EIRTPVTA  +W +FS LVE A+RVE+S+ E
Sbjct: 181 KYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITE 240

Query: 241 KKIERDSSKSERVVHSSGVTLGQSGRRFVPG--VFKGGNFKTKSSGQTTFKTSTSGGIRG 300
           +K   + S+      +SG   G+  RRF PG  +    +FK +S GQ +   S     + 
Sbjct: 241 EKSAVELSRGTST--ASGFR-GREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQR 300

Query: 301 QGPKNVGGPAQSVEGSRSGRPGESTASSTQKPLCPTCGKYHWGQCRVNA--CYNCGQTSH 360
           Q  +    P +S   S+ G+  ES AS+ ++  C +CG+ H GQC V A  CY CGQ  H
Sbjct: 301 QSQRIPSQPIRSTVRSQPGQ--ESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGH 360

Query: 361 FKRECPQLMQEDKPDQ 372
           FK++CPQL    + DQ
Sbjct: 361 FKKDCPQLNMTVQRDQ 368

The following BLAST results are available for this feature:
BLAST of Clc04G03865 vs. NCBI nr
Match Name	E-value	Identity	Description
KAA0060484.1	6.3e-90	54.70	Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag...
TYJ95881.1	3.1e-89	53.01	retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]
TYK15233.1	1.9e-86	54.42	uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]
KAA0039476.1	2.5e-86	54.60	uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]
KAA0066849.1	2.4e-81	47.61	DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]

BLAST of Clc04G03865 vs. ExPASy TrEMBL
Match Name	E-value	Identity	Description
A0A5A7UZM6	3.1e-90	54.70	Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A5D3BB91	1.5e-89	53.01	Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...
A0A5D3CTK6	9.2e-87	54.42	CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A5A7TBS0	1.2e-86	54.60	CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A5A7U2V7	1.2e-81	47.61	Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold37...
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001878	Zinc finger, CCHC-type	SMART	SM00343	c2hcfinal6	coord: 346..362; e-value: 1.0E-4; score: 31.7
IPR001878	Zinc finger, CCHC-type	PFAM	PF00098	zf-CCHC	coord: 346..362; e-value: 9.6E-6; score: 25.5
IPR001878	Zinc finger, CCHC-type	PROSITE	PS50158	ZF_CCHC	coord: 347..362; score: 10.460376
IPR005162	Retrotransposon gag domain	PFAM	PF03732	Retrotrans_gag	coord: 112..208; e-value: 8.6E-15; score: 54.8
None	No IPR available	GENE3D	4.10.60.10	-	coord: 338..373; e-value: 1.4E-5; score: 26.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 314..328
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..36
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 277..297
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 271..328
None	No IPR available	PANTHER	PTHR34482	DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE	coord: 80..365
None	No IPR available	PANTHER	PTHR34482:SF4	POLYMERASES SUPERFAMILY PROTEIN, PUTATIVE ISOFORM 1-RELATED	coord: 80..365
IPR036875	Zinc finger, CCHC-type superfamily	SUPERFAMILY	57756	Retrovirus zinc finger-like domains	coord: 345..364

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Clc04G03865.1	Clc04G03865.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0090304	nucleic acid metabolic process
molecular_function	GO:0016787	hydrolase activity
molecular_function	GO:0003676	nucleic acid binding
molecular_function	GO:0016740	transferase activity
molecular_function	GO:0008270	zinc ion binding