Tan0019656 (gene) Snake gourd v1

Overview
Name: Tan0019656
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG05: 26696569 .. 26698422 (+)
RNA-Seq Expression: Tan0019656
Synteny: Tan0019656
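
The Location field can be split into its parts and turned into a feature length. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a 'SEQID: START .. END (STRAND)' string into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG05: 26696569 .. 26698422 (+)")
print(seqid, strand, span_length(start, end))
```

Under the stated assumption the feature spans 1854 bp on the plus strand of LG05.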
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCGTTCTATGATGAGCTATGTTCAATTGTCTGCCTCGTTTTGGAGATACGCAGTAGAGACTACAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGAAACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACATGTGCTAGTGACAAACCCAAAGAAACTGGAACCTCATTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGCTCCACAAGAAAACAAGGTGATTATATCGACAAACGCCACTTTCATGGAAGAAGATCACATGAGGAACCATAAACCGCGTAGTAAATTAGTGTTAAATGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGAACTTCATCAAGAGTTGATGGAAGAGCCAGCACCTCAAGTCAGTCTCATCCTTCTCAATCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGTTACTTGGGTTTAGCTGAAACTCAAGTTATCATATCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGCAATGTATGACGTAGACAAGGACCAATGGATCAAAGCCATAGACCTTGAAATGGAGTCAATAGACTTCAATTCAGTATGGGAACTTGTAGACCAACTTGAAGGGGTTAGACTCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAATCGGAAAGGTACAGGCCTTTAAAGCTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCTTGTTGGTATGCTGAAGTCCATAAGGAAACTCTTGTCCATAGTCGTGTTTTATGATTATGAAATCTGGCAAATGGACGTCAAGACTGCCTTTTTGAATGGTAATCTTGACGAGAGCATCTATATGTCTTAGCCCGAAGGATTCATAACCCAAGGTCAGGAGCAAAAAGTTTGGAAGCTCAATCGATCCATTCATGGGTTGAAACAAGCATTTAGATCTTGGAATATAAGATTTGATACTGCGATCAAGTCTTATGGCTTTGACCAAAACGTTGATGACCCTTGTGTTTACAAGAGGATCATCAACAACAAAGTAGCTTTCTTAGTACTTTATGTGAATGATATCCTACTCATTGGGAATGATGTAGGATACCTAACTGACATAAAGAATTGGCTGGCGACCCAATTCCAAATGAAAGATTTGGGAGAGGCGCAATATGTTCTTGGGATTCAGATCTTCAGAATCGCAAAGAACAAAACGCTAGCTCTGTCTCAAGCCTCTTATATCGACAAAATGTTGTCCCGATATTCGATGCAGAATTCCAAGAGGGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAAGGAACAGTGTCCTGAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCTACAATAGGTAGCTTAATGTATGCTATGTTGTGCACGAGGCCAGACATTTGCTATGCAGTGGGACAAGTCAGTAGGTACTAATCCAATCCAGGGTTAGACCACTAGACAACAGTTAAAAATATCTTCAAGTATCTTAGGAGAACGAGGGACTATACGCTTGTGTATGGGACTAAGGATTTGATCCTTATAGGATACACTGATTCTGATTTTCAGACCGATAAGGATTCTCGTAAATCCACATCGGGATCAGTTTTCACCCTTAACGGGGGAGCTATAGTATGGCGAATCATCAAGCAAAGATGCATCGCTGACTCCACAATGGAGGCAGAGTATGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTTTGA

mRNA sequence

ATGGTTCGTTCTATGATGAGCTATGTTCAATTGTCTGCCTCGTTTTGGAGATACGCAGTAGAGACTACAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGAAACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACATGTGCTAGTGACAAACCCAAAGAAACTGGAACCTCATTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGCTCCACAAGAAAACAAGGTGATTATATCGACAAACGCCACTTTCATGGAAGAAGATCACATGAGGAACCATAAACCGCGTAGTAAATTAGTGTTAAATGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGAACTTCATCAAGAGTTGATGGAAGAGCCAGCACCTCAAGTCAGTCTCATCCTTCTCAATCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGTTACTTGGGTTTAGCTGAAACTCAAGTTATCATATCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGCAATGTATGACGTAGACAAGGACCAATGGATCAAAGCCATAGACCTTGAAATGGAGTCAATAGACTTCAATTCAGTATGGGAACTTGTAGACCAACTTGAAGGGGTTAGACTCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAATCGGAAAGGTACAGGCCTTTAAAGCTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCTTGTTGGTATGCTGAAGTCCATAAGGAAACTCTTGTCCATAGTCGTGTTTTATGATTATGAAATCTGGCAAATGGACGTCAAGACTGCCTTTTTGAATGGATACCTAACTGACATAAAGAATTGGCTGGCGACCCAATTCCAAATGAAAGATTTGGGAGAGGCGCAATATGTTCTTGGGATTCAGATCTTCAGAATCGCAAAGAACAAAACGCTAGCTCTGTCTCAAGCCTCTTATATCGACAAAATGTTGTCCCGATATTCGATGCAGAATTCCAAGAGGGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAAGGAACAGTGTCCTGAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCTACAATAGGTAGCTTAATGAGAACGAGGGACTATACGCTTGTGTATGGGACTAAGGATTTGATCCTTATAGGATACACTGATTCTGATTTTCAGACCGATAAGGATTCTCGTAAATCCACATCGGGATCAGTTTTCACCCTTAACGGGGGAGCTATAGTATGGCGAATCATCAAGCAAAGATGCATCGCTGACTCCACAATGGAGGCAGAGTATGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTTTGA

Coding sequence (CDS)

ATGGTTCGTTCTATGATGAGCTATGTTCAATTGTCTGCCTCGTTTTGGAGATACGCAGTAGAGACTACAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGAAACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACATGTGCTAGTGACAAACCCAAAGAAACTGGAACCTCATTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGCTCCACAAGAAAACAAGGTGATTATATCGACAAACGCCACTTTCATGGAAGAAGATCACATGAGGAACCATAAACCGCGTAGTAAATTAGTGTTAAATGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGAACTTCATCAAGAGTTGATGGAAGAGCCAGCACCTCAAGTCAGTCTCATCCTTCTCAATCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGTTACTTGGGTTTAGCTGAAACTCAAGTTATCATATCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGCAATGTATGACGTAGACAAGGACCAATGGATCAAAGCCATAGACCTTGAAATGGAGTCAATAGACTTCAATTCAGTATGGGAACTTGTAGACCAACTTGAAGGGGTTAGACTCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAATCGGAAAGGTACAGGCCTTTAAAGCTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCTTGTTGGTATGCTGAAGTCCATAAGGAAACTCTTGTCCATAGTCGTGTTTTATGATTATGAAATCTGGCAAATGGACGTCAAGACTGCCTTTTTGAATGGATACCTAACTGACATAAAGAATTGGCTGGCGACCCAATTCCAAATGAAAGATTTGGGAGAGGCGCAATATGTTCTTGGGATTCAGATCTTCAGAATCGCAAAGAACAAAACGCTAGCTCTGTCTCAAGCCTCTTATATCGACAAAATGTTGTCCCGATATTCGATGCAGAATTCCAAGAGGGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAAGGAACAGTGTCCTGAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCTACAATAGGTAGCTTAATGAGAACGAGGGACTATACGCTTGTGTATGGGACTAAGGATTTGATCCTTATAGGATACACTGATTCTGATTTTCAGACCGATAAGGATTCTCGTAAATCCACATCGGGATCAGTTTTCACCCTTAACGGGGGAGCTATAGTATGGCGAATCATCAAGCAAAGATGCATCGCTGACTCCACAATGGAGGCAGAGTATGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTTTGA

Protein sequence

MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNGYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSLMRTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEVV
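
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below illustrates that relationship, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first eight codons and the last five codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above
print(translate("ATGGTTCGTTCTATGATGAGCTAT"))  # MVRSMMSY
# Last five codons of the CDS, ending in the TGA stop codon
print(translate("AAAGAGGTTGTTTGA"))  # KEVV
```

Both outputs match the start and end of the protein sequence shown on this page.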
Homology
BLAST of Tan0019656 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.6e-67
Identity = 198/647 (30.60%), Positives = 292/647 (45.13%), Query Frame = 0

Query: 2    VRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVS-ETPFELWKGRKPSLQHFRIWGCP- 61
            VRSM+   +L  SFW  AV+T   ++N  PS  ++ E P  +W  ++ S  H +++GC  
Sbjct: 596  VRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRA 655

Query: 62   -AHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKP 121
             AHV      KL+  S  C F+GY  E  G   + P + KVI S +  F E + +R    
Sbjct: 656  FAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESE-VRTAAD 715

Query: 122  RSKLVLN--------------------EATDEPTRVVDQAG----TSSRVDGRASTSSQS 181
             S+ V N                      TDE +   +Q G       ++D         
Sbjct: 716  MSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLD--EGVEEVE 775

Query: 182  HPSQSLGMP---RRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIK 241
            HP+Q        RRS R   +  RY   +   V+ISDD   +P S K+ +   +K+Q +K
Sbjct: 776  HPTQGEEQHQPLRRSERPRVESRRY--PSTEYVLISDD--REPESLKEVLSHPEKNQLMK 835

Query: 242  AIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREG 301
            A+  EMES+  N  ++LV+  +G R + CKW++K K+D   K+  +KARLV KGF Q++G
Sbjct: 836  AMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKG 895

Query: 302  VDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNG------------------ 361
            +D++E FS V  + SIR +LS+    D E+ Q+DVKTAFL+G                  
Sbjct: 896  IDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAG 955

Query: 362  -----------------------------------------------------------Y 421
                                                                       Y
Sbjct: 956  KKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLY 1015

Query: 422  LTD-------------IKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDK 481
            + D             +K  L+  F MKDLG AQ +LG++I R   ++ L LSQ  YI++
Sbjct: 1016 VDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIER 1075

Query: 482  MLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSLM--------- 489
            +L R++M+N+K    P    + LSK+ CP T +E  +M ++PY+S +GSLM         
Sbjct: 1076 VLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPD 1135
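
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below reproduces the figures for the P10978 hit above; the function name `percent` is hypothetical, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP header for P10978: 198 identical and
# 292 positive (similar) positions over 647 aligned columns.
print(f"{percent(198, 647):.2f}")  # 30.60
print(f"{percent(292, 647):.2f}")  # 45.13
```

The alignment length includes gap columns, which is why it can exceed the length of the aligned query segment.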

BLAST of Tan0019656 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 160.6 bits (405), Expect = 4.4e-38
Identity = 170/720 (23.61%), Positives = 294/720 (40.83%), Query Frame = 0

Query: 3    RSMMSYVQLSASFWRYAVETTVQILNTVPSKSV---SETPFELWKGRKPSLQHFRIWGCP 62
            R+M+S  +L  SFW  AV T   ++N +PS+++   S+TP+E+W  +KP L+H R++G  
Sbjct: 597  RTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGAT 656

Query: 63   AHVLVTNPK-KLEPHSRLCQFVGY------------------------------------ 122
             +V + N + K +  S    FVGY                                    
Sbjct: 657  VYVHIKNKQGKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKF 716

Query: 123  -------PKETRGGLFYAPQENKVIIST----------NATFMEE--------------- 182
                    KE+    F  P +++ II T          N  F+++               
Sbjct: 717  ETVFLKDSKESENKNF--PNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRK 776

Query: 183  ----------------DHMRNHKPRSKLVLNEA--TDEPTRVVDQAGTSSRVDGRASTSS 242
                              +++ K  +K  LNE+        + +  G+ +  + R S ++
Sbjct: 777  IIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETA 836

Query: 243  Q-------SHPSQSLGMP---RRSGRVVSQPDRYLGLAE---TQVIISDDGV--EDPLSY 302
            +        +P+++ G+    RRS R+ ++P       +    +V+++   +  + P S+
Sbjct: 837  EHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSF 896

Query: 303  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAF 362
             +  Y  DK  W +AI+ E+ +   N+ W +  + E   ++  +W++  K + +G    +
Sbjct: 897  DEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRY 956

Query: 363  KARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNG---- 422
            KARLVA+GFTQ+  +DYEETF+ V  + S R +LS+V+ Y+ ++ QMDVKTAFLNG    
Sbjct: 957  KARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKE 1016

Query: 423  ------------------------------------------------------------ 482
                                                                        
Sbjct: 1017 EIYMRLPQGISCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILD 1076

Query: 483  ------------YLTDI-------------KNWLATQFQMKDLGEAQYVLGIQIFRIAKN 489
                        Y+ D+             K +L  +F+M DL E ++ +GI+I  + ++
Sbjct: 1077 KGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI-EMQED 1136

BLAST of Tan0019656 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.9e-20
Identity = 109/457 (23.85%), Positives = 174/457 (38.07%), Query Frame = 0

Query: 157  SQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLE 216
            + S+G   ++G +   P       +  + +S     +P +  QA+ D   ++W  A+  E
Sbjct: 926  THSMGTRAKAGIIKPNP-------KYSLAVSLAAESEPRTAIQALKD---ERWRNAMGSE 985

Query: 217  MESIDFNSVWELVDQLEG-VRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYE 276
            + +   N  W+LV      V ++GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY 
Sbjct: 986  INAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYA 1045

Query: 277  ETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNGYLTD------------------ 336
            ETFS V    SIR +L + V   + I Q+DV  AFL G LTD                  
Sbjct: 1046 ETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNY 1105

Query: 337  ---------------------IKNWLAT-------------------------------- 396
                                 ++N+L T                                
Sbjct: 1106 VCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDIL 1165

Query: 397  ------------------QFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRY 456
                              +F +KD  E  Y LGI+  R+     L LSQ  YI  +L+R 
Sbjct: 1166 ITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTG--LHLSQRRYILDLLART 1225

Query: 457  SMQNSK--------------------------RGLLPFRHGIHLSKEQCPETPQEVEDMR 488
            +M  +K                          RG++     +  ++         +    
Sbjct: 1226 NMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFM 1285

BLAST of Tan0019656 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 2.3e-18
Identity = 54/126 (42.86%), Positives = 77/126 (61.11%), Query Frame = 0

Query: 193  DPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELV-DQLEGVRLIGCKWIYKRKRDAI 252
            +P +  QAM D   D+W +A+  E+ +   N  W+LV      V ++GC+WI+ +K ++ 
Sbjct: 938  EPRTAIQAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSD 997

Query: 253  GKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFL 312
            G +  +KARLVAKG+ QR G+DY ETFS V    SIR +L + V   + I Q+DV  AFL
Sbjct: 998  GSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFL 1057

Query: 313  NGYLTD 318
             G LTD
Sbjct: 1058 QGTLTD 1060

BLAST of Tan0019656 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 1.2e-11
Identity = 34/96 (35.42%), Positives = 56/96 (58.33%), Query Frame = 0

Query: 198 KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAF 257
           K  ++ +    W +A+  E++++  N  W LV       ++GCKW++K K  + G +   
Sbjct: 29  KSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRL 88

Query: 258 KARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI 294
           KARLVAKGF Q EG+ + ET+S V    +IR +L++
Sbjct: 89  KARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124

BLAST of Tan0019656 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 782.3 bits (2019), Expect = 2.4e-222
Identity = 416/617 (67.42%), Positives = 441/617 (71.47%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 521  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 580

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V +STNATF+EEDHMRNHKPRS
Sbjct: 581  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS 640

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 641  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 700

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 701  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 760

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 761  KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 820

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 821  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 880

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 881  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 940

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 941  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 1000

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 1001 TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYL 1060

BLAST of Tan0019656 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 782.3 bits (2019), Expect = 2.4e-222
Identity = 416/617 (67.42%), Positives = 441/617 (71.47%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 395  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 454

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V +STNATF+EEDHMRNHKPRS
Sbjct: 455  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS 514

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 515  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 574

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 575  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 634

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 635  KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 694

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 695  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 754

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 755  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 814

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 815  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 874

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 875  TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYL 934

BLAST of Tan0019656 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 771.2 bits (1990), Expect = 5.5e-219
Identity = 410/617 (66.45%), Positives = 439/617 (71.15%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 521  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 580

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ P+EN+V +STNATF+EEDHMRNHKPRS
Sbjct: 581  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRS 640

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 641  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 700

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 701  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 760

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+T++EGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 761  KWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYE 820

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 821  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 880

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 881  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 940

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GE QYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 941  GEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 1000

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 1001 TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYL 1060

BLAST of Tan0019656 vs. NCBI nr
Match: KAA0033121.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 756.5 bits (1952), Expect = 1.4e-214
Identity = 403/617 (65.32%), Positives = 436/617 (70.66%), Query Frame = 0

Query: 1   MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
           MVRSMMSY QL +SFW YAVET V ILN V SKSVSETPFELW+GRKPSL HF+I GCPA
Sbjct: 131 MVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPA 190

Query: 61  HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
           HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQ+N+V++STNATF+EEDHMR+HKP++
Sbjct: 191 HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQN 250

Query: 121 KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
           KLVLNEA DE TRVVD+ G SSRV+   +TS QSHPSQSL MPRRSGR+VSQP+RYLGL 
Sbjct: 251 KLVLNEAIDESTRVVDEVGPSSRVN-ETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLT 310

Query: 181 ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
           ETQV+I DDGVEDPLSY QAM DVDKDQW+KA+DLEMES+ FN +WELVD  EGV+ IGC
Sbjct: 311 ETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGC 370

Query: 241 KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
           KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 371 KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 430

Query: 301 IWQMDVKTAFLN------------------------------------------------ 360
           IW+MDV TAFLN                                                
Sbjct: 431 IWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 490

Query: 361 -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                    GYLTD+K WLA QFQMKDL
Sbjct: 491 IKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 550

Query: 421 GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
           GEAQYVLGIQI R  KNKTLALSQA+YIDKML RYSMQNSK+GLLPFRHG+HLSKEQCP+
Sbjct: 551 GEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPK 610

Query: 481 TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
           TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 611 TPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYL 670

BLAST of Tan0019656 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 710.3 bits (1832), Expect = 1.1e-200
Identity = 378/617 (61.26%), Positives = 420/617 (68.07%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL  SFW YA+ET + ILN VPSKSV ETP+ELWKGRK SL++FRIWGCPA
Sbjct: 615  MVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPA 674

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLV NPKKLEP S+LC FVGYPKE+RGGLFY PQENKV +STNATF+EEDH RNH+PRS
Sbjct: 675  HVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRS 734

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            K+VL E     T   D+  +S++V  +A+ S QSH SQ L +PRRSGRVV QP+RYLGL 
Sbjct: 735  KIVLKEMFKNAT---DKPSSSTKVVDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLV 794

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQ+II DDGVEDPL+YKQAM DVD+DQWIKA++LEMES+ FNSVW LVD    V+ IGC
Sbjct: 795  ETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGC 854

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD  GKVQ FKARLVAKG+TQ+EGVDYEETFS V MLKSIR LLSI  FY+YE
Sbjct: 855  KWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYE 914

Query: 301  IWQMDVKTAFLNG----------------------------------------------- 360
            IWQMDVKTAFLNG                                               
Sbjct: 915  IWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTA 974

Query: 361  ------------------------------------------YLTDIKNWLATQFQMKDL 420
                                                      YLTD+K WL TQFQMKDL
Sbjct: 975  IKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDL 1034

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQY+LGIQI R  KNKTLA+SQASYIDK+LSRY MQNSK+G LPFRHGIHLSKEQCP+
Sbjct: 1035 GEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPK 1094

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMR IPY+S +GSLM                                       
Sbjct: 1095 TPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYL 1154

BLAST of Tan0019656 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 782.3 bits (2019), Expect = 1.2e-222
Identity = 416/617 (67.42%), Positives = 441/617 (71.47%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 521  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 580

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V +STNATF+EEDHMRNHKPRS
Sbjct: 581  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS 640

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 641  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 700

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 701  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 760

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 761  KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 820

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 821  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 880

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 881  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 940

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 941  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 1000

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 1001 TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYL 1060

BLAST of Tan0019656 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 782.3 bits (2019), Expect = 1.2e-222
Identity = 416/617 (67.42%), Positives = 441/617 (71.47%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 395  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 454

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V +STNATF+EEDHMRNHKPRS
Sbjct: 455  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS 514

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 515  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 574

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 575  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 634

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 635  KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 694

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 695  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 754

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 755  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 814

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 815  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 874

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 875  TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYL 934

BLAST of Tan0019656 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 771.2 bits (1990), Expect = 2.7e-219
Identity = 410/617 (66.45%), Positives = 439/617 (71.15%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPA
Sbjct: 521  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPA 580

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ P+EN+V +STNATF+EEDHMRNHKPRS
Sbjct: 581  HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRS 640

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            KLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL 
Sbjct: 641  KLVLSEATDESTRVVDEVGPSSRVD-ETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT 700

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQV+I DDGVEDPLSYKQAM DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGC
Sbjct: 701  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGC 760

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD+ GKVQ FKARLVAKG+T++EGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 761  KWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYE 820

Query: 301  IWQMDVKTAFLN------------------------------------------------ 360
            IWQMDVKTAFLN                                                
Sbjct: 821  IWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 880

Query: 361  -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                     GYLTD+K WLA QFQMKDL
Sbjct: 881  IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 940

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GE QYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+
Sbjct: 941  GEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK 1000

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 1001 TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYL 1060

BLAST of Tan0019656 vs. ExPASy TrEMBL
Match: A0A5D3CZY3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G00460 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 6.8e-215
Identity = 403/617 (65.32%), Positives = 436/617 (70.66%), Query Frame = 0

Query: 1   MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
           MVRSMMSY QL +SFW YAVET V ILN V SKSVSETPFELW+GRKPSL HF+I GCPA
Sbjct: 131 MVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPA 190

Query: 61  HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
           HVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQ+N+V++STNATF+EEDHMR+HKP++
Sbjct: 191 HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQN 250

Query: 121 KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
           KLVLNEA DE TRVVD+ G SSRV+   +TS QSHPSQSL MPRRSGR+VSQP+RYLGL 
Sbjct: 251 KLVLNEAIDESTRVVDEVGPSSRVN-ETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLT 310

Query: 181 ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
           ETQV+I DDGVEDPLSY QAM DVDKDQW+KA+DLEMES+ FN +WELVD  EGV+ IGC
Sbjct: 311 ETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGC 370

Query: 241 KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
           KWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Sbjct: 371 KWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 430

Query: 301 IWQMDVKTAFLN------------------------------------------------ 360
           IW+MDV TAFLN                                                
Sbjct: 431 IWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTA 490

Query: 361 -----------------------------------------GYLTDIKNWLATQFQMKDL 420
                                                    GYLTD+K WLA QFQMKDL
Sbjct: 491 IKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL 550

Query: 421 GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
           GEAQYVLGIQI R  KNKTLALSQA+YIDKML RYSMQNSK+GLLPFRHG+HLSKEQCP+
Sbjct: 551 GEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPK 610

Query: 481 TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
           TPQEVEDMRRIPYAS +GSLM                                       
Sbjct: 611 TPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYL 670

BLAST of Tan0019656 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 5.6e-201
Identity = 378/617 (61.26%), Positives = 420/617 (68.07%), Query Frame = 0

Query: 1    MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPA 60
            MVRSMMSY QL  SFW YA+ET + ILN VPSKSV ETP+ELWKGRK SL++FRIWGCPA
Sbjct: 615  MVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPA 674

Query: 61   HVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEEDHMRNHKPRS 120
            HVLV NPKKLEP S+LC FVGYPKE+RGGLFY PQENKV +STNATF+EEDH RNH+PRS
Sbjct: 675  HVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRS 734

Query: 121  KLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLA 180
            K+VL E     T   D+  +S++V  +A+ S QSH SQ L +PRRSGRVV QP+RYLGL 
Sbjct: 735  KIVLKEMFKNAT---DKPSSSTKVVDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLV 794

Query: 181  ETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGC 240
            ETQ+II DDGVEDPL+YKQAM DVD+DQWIKA++LEMES+ FNSVW LVD    V+ IGC
Sbjct: 795  ETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGC 854

Query: 241  KWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE 300
            KWIYKRKRD  GKVQ FKARLVAKG+TQ+EGVDYEETFS V MLKSIR LLSI  FY+YE
Sbjct: 855  KWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYE 914

Query: 301  IWQMDVKTAFLNG----------------------------------------------- 360
            IWQMDVKTAFLNG                                               
Sbjct: 915  IWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTA 974

Query: 361  ------------------------------------------YLTDIKNWLATQFQMKDL 420
                                                      YLTD+K WL TQFQMKDL
Sbjct: 975  IKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDL 1034

Query: 421  GEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE 480
            GEAQY+LGIQI R  KNKTLA+SQASYIDK+LSRY MQNSK+G LPFRHGIHLSKEQCP+
Sbjct: 1035 GEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPK 1094

Query: 481  TPQEVEDMRRIPYASTIGSLM--------------------------------------- 489
            TPQEVEDMR IPY+S +GSLM                                       
Sbjct: 1095 TPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYL 1154

BLAST of Tan0019656 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 107.5 bits (267), Expect = 3.2e-23
Identity = 97/424 (22.88%), Positives = 168/424 (39.62%), Query Frame = 0

Query: 192 EDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAI 251
           ++P +Y +A   +    W  A+D E+ +++    WE+       + IGCKW+YK K ++ 
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 252 GKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFL 311
           G ++ +KARLVAKG+TQ+EG+D+ ETFS V  L S++ +L+I   Y++ + Q+D+  AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 312 NG---------------------------------------------------------- 371
           NG                                                          
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 372 ----------------------YLTDI-------------KNWLATQFQMKDLGEAQYVL 431
                                 Y+ DI             K+ L + F+++DLG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 432 GIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE------- 489
           G++I R A    + + Q  Y   +L    +   K   +P    +  S     +       
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAY 383

BLAST of Tan0019656 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 72.8 bits (177), Expect = 8.7e-13
Identity = 34/96 (35.42%), Positives = 56/96 (58.33%), Query Frame = 0

Query: 198 KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAF 257
           K  ++ +    W +A+  E++++  N  W LV       ++GCKW++K K  + G +   
Sbjct: 29  KSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRL 88

Query: 258 KARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI 294
           KARLVAKGF Q EG+ + ET+S V    +IR +L++
Sbjct: 89  KARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124
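
The percentages in each HSP summary line follow directly from the counts: for example, 410 identical positions over an alignment length of 617 gives 66.45%. A minimal Python sketch of that arithmetic is shown here. The names `parse_hsp_summary` and `percent` are hypothetical helpers introduced only for illustration and are not part of any BLAST tool or of this database.

```python
import re

# Hypothetical pattern for the HSP summary lines shown in this record.
# "Pos\w*" is deliberately loose so that spelling variants of the
# "Positives" label are accepted.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    hsp = parse_hsp_summary(
        "Identity = 34/96 (35.42%), Positives = 56/96 (58.33%), Query Frame = 0"
    )
    # Recompute the percentages from the raw counts and compare them
    # with the values reported in the summary line.
    print(percent(hsp["identities"], hsp["length"]), hsp["identity_pct"])
    print(percent(hsp["positives"], hsp["length"]), hsp["positives_pct"])
```

Recomputing the values this way is a quick consistency check when summary lines are copied between the alignment blocks and the hit tables.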

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
P10978	1.6e-67	30.60	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146	4.4e-38	23.61	Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2	1.9e-20	23.85	Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94	2.3e-18	42.86	Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P92520	1.2e-11	35.42	Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ...
Match Name	E-value	Identity	Description
KAA0025945.1	2.4e-222	67.42	gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi...
KAA0059226.1	2.4e-222	67.42	gag/pol protein [Cucumis melo var. makuwa]
KAA0035907.1	5.5e-219	66.45	gag/pol protein [Cucumis melo var. makuwa]
KAA0033121.1	1.4e-214	65.32	gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis ...
ADJ18449.1	1.1e-200	61.26	gag/pol protein, partial [Bryonia dioica]
Match Name	E-value	Identity	Description
A0A5A7TZD0	1.2e-222	67.42	Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000...
A0A5A7UYE8	1.2e-222	67.42	Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015...
A0A5A7T2V9	2.7e-219	66.45	Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760...
A0A5D3CZY3	6.8e-215	65.32	Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G004...
E2GK51	5.6e-201	61.26	Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1
Match Name	E-value	Identity	Description
AT4G23160.1	3.2e-23	22.88	cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1	8.7e-13	35.42	Reverse transcriptase (RNA-dependent DNA polymerase)
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR036397	Ribonuclease H superfamily	GENE3D	3.30.420.10	-	coord: 1..55; e-value: 8.9E-6; score: 27.1
IPR013103	Reverse transcriptase, RNA-dependent DNA polymerase	PFAM	PF07727	RVT_2	coord: 223..315; e-value: 7.3E-25; score: 88.0
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 112..126
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 112..169
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 134..165
None	No IPR available	PANTHER	PTHR45895	FAMILY NOT NAMED	coord: 2..315
None	No IPR available	CDD	cd09272	RNase_HI_RT_Ty1	coord: 429..488; e-value: 1.17448E-25; score: 100.235
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	53098	Ribonuclease H-like	coord: 4..55

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0019656.1	Tan0019656.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0043170	macromolecule metabolic process
biological_process	GO:0006807	nitrogen compound metabolic process
biological_process	GO:0044238	primary metabolic process
molecular_function	GO:0003676	nucleic acid binding