CmoCh04G019810 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G019810
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Transposon Ty1-BL Gag-Pol polyprotein
Location: Cmo_Chr04 : 10220333 .. 10222786 (+)
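The Location field above packs a sequence ID, a start, an end and a strand into one string. A minimal sketch of how it could be parsed and turned into a span length follows. It assumes the coordinates are 1-based and inclusive, which is common for this kind of record but is not stated on the page; the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval; start and end are ASSUMED to be 1-based, inclusive."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cmo_Chr04 : 10220333 .. 10222786 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cmo_Chr04 : 10220333 .. 10222786 (+)")
print(loc.seqid, loc.strand, loc.length)  # Cmo_Chr04 + 2454
```

Under the inclusive-coordinate assumption the gene spans 2454 bases; with half-open coordinates the figure would be one lower.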
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAATTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATATGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAAACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAAAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTAAATATGATTTACTCGTTGTAGAAATGTGTTTAGTAGAGTATGACAACTCAACTTGGATACTAGATTCAGGGGCGACTAATCATATTTGTTCTTTTTACCAGGAAACTAGCTCCTGGAGAATGCTTGCGGACGGCGAGATAACACTCAGGGTTGGAACAGGAGAGGTTGTCTCAGCAAGATCAGTGGGAAATTTAAAGTTGTTTTTTGGAGATAGATTCATTATATTAGATAATGTACTTTTTGTTCCAGGAATGAAAAGAAATCTAATATCCATCTCTTGTTTATTAGAACAGTTGTATAAAGTATCTTTTGAAATTAATGAAGTGTTCATTTGCAAAAGAGGTATTCATATTTGTTCTGCAAAACTAGAAAACAACTTATATATGTTAAAACCGAGCAAAACAAAAGCTATTTTAAATACTGAGATGTTTAAAACAGCTGAAACTCAAAATAAACGACAAAAGATTTCTCCTAATATCTTTCTTTGGCATTTAAGACTAGGCCACATTAATCTCAATAGGATTGAGAGATTGGTTAAAAGGGGACTTCTAAATAAGTTAGAAGACAATTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATGACTAAACGATCATTTTGCGAAAAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGTGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGG
TAGTTTACGTCACTTCAGAATTTGGGGATGTCCAGCACATGTGTTGTTGCAAAATCCCAAGAAATTAGAACGTCGTTCAAAATTATGTCTATTCGTAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTAGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATGA

mRNA sequence

ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAATTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATATGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAAACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAAAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGTGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTAGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATGA

Coding sequence (CDS)

ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAATTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATATGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAAACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAAAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGTGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTAGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATGA
BLAST of CmoCh04G019810 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 3.6e-36
Identity = 70/167 (41.92%), Positives = 110/167 (65.87%), Query Frame = 1

Query: 289 RAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTE 348
           R    L+LV++D+CGPM +++ G  +YF++FIDD SR  ++Y++  K +  + F+++   
Sbjct: 476 RKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHAL 535

Query: 349 VQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDM 408
           V+   G+ +K LRSD GGEY    F++Y   HGIR + + PG PQ NGV+ER NRT+++ 
Sbjct: 536 VERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEK 595

Query: 409 VRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYELWKGRK 455
           VRSM+  A+LP  FWG AV+TA Y++N  P+  +  E P  +W  ++
Sbjct: 596 VRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKE 642
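The percentages on each HSP summary line are the matched count divided by the alignment length. The sketch below recomputes them from such a line; `hsp_fractions` is a hypothetical helper written for this illustration, and the regular expression deliberately tolerates a missing "i" in "Positives", since some exports spell the label that way.

```python
import re

# Matches "Identity = 70/167" and "Positives = 110/167" (or "Postives").
_FRACTION = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_fractions(line: str) -> dict:
    """Return {label: (matches, alignment_length, percent)} for one HSP line."""
    result = {}
    for label, num, den in _FRACTION.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matches, length = int(num), int(den)
        result[key] = (matches, length, round(100 * matches / length, 2))
    return result


line = "Identity = 70/167 (41.92%), Positives = 110/167 (65.87%), Query Frame = 1"
print(hsp_fractions(line))
# {'identity': (70, 167, 41.92), 'positives': (110, 167, 65.87)}
```

Both recomputed values agree with the percentages reported for the POLX_TOBAC hit: 70 of 167 aligned positions are identical, and 110 of 167 are identical or conservative substitutions.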

BLAST of CmoCh04G019810 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 127.1 bits (318), Expect = 6.2e-28
Identity = 61/174 (35.06%), Positives = 101/174 (58.05%), Query Frame = 1

Query: 291 KETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQ 350
           K  L +VH+D+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + +
Sbjct: 478 KRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSE 537

Query: 351 NLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVR 410
                 +  L  D G EY+    + + ++ GI   L+ P  PQ NGVSER  RT+ +  R
Sbjct: 538 AHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKAR 597

Query: 411 SMMSFAQLPDPFWGYAVETATYILNMVPTKSVLE---TPYELWKGRKGYPKETK 462
           +M+S A+L   FWG AV TATY++N +P++++++   TPYE+W  +K Y K  +
Sbjct: 598 TMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLR 651

BLAST of CmoCh04G019810 vs. Swiss-Prot
Match: YD23B_YEAST (Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-DR3 PE=3 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 2.1e-15
Identity = 52/172 (30.23%), Positives = 86/172 (50.00%), Query Frame = 1

Query: 272 CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGVYEYFISFIDDYSR 331
           CP  L  K  +    +G R K     E  + +HTD+ GP++   +    YFISF D+ +R
Sbjct: 633 CPDCLIGKSTKHRHIKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTR 692

Query: 332 YGYLYLMHHKSEA--LEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR 391
           + ++Y +H + E   L  F      ++N     +  ++ DRG EY +     +    GI 
Sbjct: 693 FQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGIT 752

Query: 392 SQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN 436
           +  +     + +GV+ER NRTLL+  R+++  + LP+  W  AVE +T I N
Sbjct: 753 ACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRN 804

BLAST of CmoCh04G019810 vs. Swiss-Prot
Match: YB11B_YEAST (Transposon Ty1-BL Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-BL PE=1 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 2.1e-15
Identity = 50/172 (29.07%), Positives = 89/172 (51.74%), Query Frame = 1

Query: 272 CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGVYEYFISFIDDYSR 331
           CP  L  K  +    +G R K     E  + +HTD+ GP++   +    YFISF D+ ++
Sbjct: 637 CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 332 YGYLYLMHHKSE--ALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR 391
           + ++Y +H + E   L+ F      ++N    ++  ++ DRG EY +     ++ ++GI 
Sbjct: 697 FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 392 SQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN 436
              +     + +GV+ER NRTLLD  R+ +  + LP+  W  A+E +T + N
Sbjct: 757 PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRN 808

BLAST of CmoCh04G019810 vs. Swiss-Prot
Match: YB12B_YEAST (Transposon Ty1-BR Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-BR PE=3 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 2.1e-15
Identity = 50/172 (29.07%), Positives = 89/172 (51.74%), Query Frame = 1

Query: 272 CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGVYEYFISFIDDYSR 331
           CP  L  K  +    +G R K     E  + +HTD+ GP++   +    YFISF D+ ++
Sbjct: 637 CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 332 YGYLYLMHHKSE--ALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR 391
           + ++Y +H + E   L+ F      ++N    ++  ++ DRG EY +     ++ ++GI 
Sbjct: 697 FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 392 SQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN 436
              +     + +GV+ER NRTLLD  R+ +  + LP+  W  A+E +T + N
Sbjct: 757 PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRN 808

BLAST of CmoCh04G019810 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.6e-110
Identity = 232/407 (57.00%), Positives = 278/407 (68.30%), Query Frame = 1

Query: 205 ANVAISKKLLRGSSSQNKSGPSTSKSVLMKKK------GKGKNKIPTNRKNKVQKADKGK 264
           ANV ++ ++ R   +QNK    +S + L   +       + +  + +   N+++      
Sbjct: 417 ANVVLNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPP 476

Query: 265 CFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGVYEYFIS 324
           C  C E    KR+            T +G RAK  LELVH+DLCGPMNVKARG YEYFIS
Sbjct: 477 CESCLEGKMTKRSF-----------TGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFIS 536

Query: 325 FIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMI 384
           FIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+I
Sbjct: 537 FIDDFSRYGHVYLLHHKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLI 596

Query: 385 EHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVP 444
           E GI+SQLSAP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP
Sbjct: 597 EFGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVP 656

Query: 445 T--------------KSVL--------------ETPYELWKGRK-----GYPKETKGGLF 504
           +              KS L              + P +L    K     GYPKE++GGLF
Sbjct: 657 SKSVLETPYELWKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLF 716

Query: 505 YDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDG 564
           Y PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK        S ST+VVD 
Sbjct: 717 YHPQENKVFVSTNATFLEEDHXRNHQPRSKIVLKEMFKNATDK-------PSSSTKVVDK 776

Query: 565 ADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
           A+ S QSH SQELR+PRRSGRV+ QP+RYLGL ETQ+IIPDDGVEDP
Sbjct: 777 ANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVETQIIIPDDGVEDP 805

BLAST of CmoCh04G019810 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 5.9e-78
Identity = 175/340 (51.47%), Positives = 215/340 (63.24%), Query Frame = 1

Query: 272 CPKYLAEKKAEK--TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYL 331
           C   L  K  ++  T +G RA + LEL+HTD+CGPM+VKARG Y+YF+SF DD SRYGY+
Sbjct: 484 CESCLEGKMTKRPFTGKGLRASDLLELIHTDVCGPMSVKARGGYQYFLSFTDDLSRYGYV 543

Query: 332 YLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP 391
           YL+ HKSE+ EKF+E++ EV+N +GK IKTLRSDRGGEYM   F D++ E GI SQLSAP
Sbjct: 544 YLLKHKSESFEKFKEFQAEVENEIGKKIKTLRSDRGGEYMSSEFGDHLREFGIVSQLSAP 603

Query: 392 GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYEL 451
           G PQ NGVSERRNRTLLDMVRSMMS+A LPD FWGYA E    ILN VP+KSV ETPYEL
Sbjct: 604 GTPQCNGVSERRNRTLLDMVRSMMSYADLPDSFWGYARERERAILNRVPSKSVEETPYEL 663

Query: 452 WKGRK------------GYPKETKGGLFYDPQENRVFVST-------------------- 511
           W GRK             + K+ +        E  +FV                      
Sbjct: 664 WYGRKSSLSFLKIWGCPAHVKKLQPKKLEPRSEKCLFVGYPKETRGYYFYHPQENKVFVA 723

Query: 512 -NATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTR-VVDGADTS-GQSH- 571
            N  FLE++ +  HQP SK+VL  + +          D+ S ST+ VVD A+ +  QSH 
Sbjct: 724 TNEAFLEKEFLSRHQPGSKIVLKAVVEPLIPLDG--TDKPSSSTKVVVDKAEVNDDQSHT 783

Query: 572 -PSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
              QELR+PRRSGR    P+RYLGL ETQ++I D+G EDP
Sbjct: 784 PDQQELRVPRRSGRSRRAPNRYLGLVETQIMILDNGEEDP 821

BLAST of CmoCh04G019810 vs. TrEMBL
Match: W9RZ97_9ROSA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis GN=L484_018680 PE=4 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 8.0e-75
Identity = 155/290 (53.45%), Positives = 197/290 (67.93%), Query Frame = 1

Query: 284 TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFR 343
           T +G RA E L+L+H+D+CG +NV+ RG YEY+ +FIDDYSRYGY+YLM  KSE   KFR
Sbjct: 7   TAKGVRATEPLQLIHSDVCGLLNVQVRGAYEYYATFIDDYSRYGYVYLMQRKSETFGKFR 66

Query: 344 EYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNR 403
           E++ EV+  LGK IKT RSDR GEYMD  F+D++IE G+ SQL+APG PQQNGV+ERRNR
Sbjct: 67  EFRAEVEKQLGKPIKTPRSDREGEYMDQEFRDFLIEEGVVSQLTAPGTPQQNGVAERRNR 126

Query: 404 TLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGG 463
           TLLDM+RSM+S++ LP  FWG+A++TA     +      LE   E++    GYP+ T+GG
Sbjct: 127 TLLDMIRSMLSYSSLPTSFWGHALKTAIPAHVLRQKTGKLEPRSEVYI-FVGYPQGTRGG 186

Query: 464 LFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSE-ISKEATDKTTRVVDQASPSTRV 523
           LFY   + +VFVSTNATFLE D++ + +PRSK+VL E +S E   + TRVV      T V
Sbjct: 187 LFYSQADQKVFVSTNATFLEHDYMIDFKPRSKIVLEELLSDEIRPQPTRVVGPLRQKTIV 246

Query: 524 VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
                      P Q L  PRRS RV   PDRY G  E Q++  DDG EDP
Sbjct: 247 -----------PDQTLMAPRRSRRVSRLPDRYTG--EAQIVTADDGKEDP 282

BLAST of CmoCh04G019810 vs. TrEMBL
Match: A5BQ76_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_019364 PE=4 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 9.2e-71
Identity = 139/266 (52.26%), Positives = 186/266 (69.92%), Query Frame = 1

Query: 286  QGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREY 345
            +G RA + LEL+H+DLCGPM+V+ARG +EYF++F DDYSRYGY+YL+  KSE  EKF+ +
Sbjct: 812  KGNRANDVLELIHSDLCGPMSVQARGGFEYFVTFTDDYSRYGYIYLLCRKSECFEKFKAF 871

Query: 346  KTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTL 405
            K E++   GK IKTLRSD GGEY+   F  ++ E GI SQLSAPGMPQQNGV+ERRNRTL
Sbjct: 872  KAEMEQRHGKYIKTLRSDHGGEYISREFITFLSEQGITSQLSAPGMPQQNGVAERRNRTL 931

Query: 406  LDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGGLF 465
            ++MVRSMMS++ LP   WG+A+ETA YILN+VP+KSV +TP ELW GR         GLF
Sbjct: 932  MEMVRSMMSYSDLPISLWGHAIETAAYILNLVPSKSVPKTPTELWTGR---------GLF 991

Query: 466  YDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDG 525
            Y P+  ++ VSTNA +LEE+++RNH P+S+L L+E+  +      R+          + G
Sbjct: 992  YSPKYKKIIVSTNAHYLEENYIRNHIPKSQLALNELRGDTI--PARIFPSEHEPEPFMVG 1051

Query: 526  ADTSGQSHPSQELRMPRRSGRVITQP 552
            AD          + +P+RSGR ++ P
Sbjct: 1052 AD----------IPLPQRSGRNVSGP 1056

BLAST of CmoCh04G019810 vs. TrEMBL
Match: A0A151QLI3_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_048877 PE=4 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 5.6e-68
Identity = 186/546 (34.07%), Positives = 282/546 (51.65%), Query Frame = 1

Query: 2   TNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDR 61
           T S+  +LA+ KL G N+  W   L  +L+ + L  V+ +   P P +N +  A +AY +
Sbjct: 3   TISLCSILANGKLVGSNFDDWYRTLRIVLMHEKLIDVIDKPVVPQP-ANGDEQATNAYKK 62

Query: 62  WIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCR 121
           +++     +  +LAS+S  L ++H+ M    +I+  LK M+G  S + R +  K ++   
Sbjct: 63  YLEDYMSTKCLLLASMSSELQRQHEDMDLV-DIINHLKKMYGGQSRTARFQLSKTLFRST 122

Query: 122 MKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYN 181
           +     V  HVL M+                     ++  L KS  +    +  + I  +
Sbjct: 123 LTANAEVGPHVLKMIS--------------------LIEQLEKSGCKLGKELSQDLILQS 182

Query: 182 LTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQNKSGPSTSKSVL----MKKKG 241
           L    ++     ++     +T       + K   G +  N+   +  KS +     K+KG
Sbjct: 183 LPGTFSQFIVNFNMNKMDCKTKPKKKPFAAK--GGVTKPNRKKVTVDKSDVECFYCKQKG 242

Query: 242 KGKNK----IPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKT-QQGYRAKE 301
             K      I + ++NK  + +K   +HC   GH        + EK+  K  ++GY  + 
Sbjct: 243 HWKRNCKKYIDSLKENK--QVNKTYLWHCRL-GH--------IGEKRINKLHKEGYLDQY 302

Query: 302 TLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNL 361
             E   T +CGPM ++A+G Y YFI+F DD SRYG++YLM HKSE+ E F+ +++EV+  
Sbjct: 303 DYESYTTYVCGPMKIQAKGGYSYFITFTDDMSRYGFVYLMKHKSESFEMFKRFRSEVEKQ 362

Query: 362 LGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSM 421
            GK +K LRSDRGGEY+   F D++ E+GI SQ   PG PQ NGVSERRNRTLLDMV+SM
Sbjct: 363 TGKNVKVLRSDRGGEYLSNDFLDHLKENGILSQWKPPGTPQHNGVSERRNRTLLDMVKSM 422

Query: 422 MSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------ 481
           M F  LP   WGYA+E++ Y+LN VPTKS+  TPYE+WKGRK                  
Sbjct: 423 MGFTDLPINLWGYALESSAYLLNKVPTKSLSTTPYEIWKGRKPNLKHIKVWGCHALLRNK 482

Query: 482 -----------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISK 510
                      GYPKET G  F+ P +++VFV+  ATFLE + +       ++ L EI +
Sbjct: 483 LEARSQKCRFIGYPKETMGYYFFHPSDHKVFVARGATFLEREFLAEGCHGKEIDLDEI-Q 512

BLAST of CmoCh04G019810 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 407.5 bits (1046), Expect = 3.8e-110
Identity = 232/407 (57.00%), Positives = 278/407 (68.30%), Query Frame = 1

Query: 205 ANVAISKKLLRGSSSQNKSGPSTSKSVLMKKK------GKGKNKIPTNRKNKVQKADKGK 264
           ANV ++ ++ R   +QNK    +S + L   +       + +  + +   N+++      
Sbjct: 417 ANVVLNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPP 476

Query: 265 CFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGVYEYFIS 324
           C  C E    KR+            T +G RAK  LELVH+DLCGPMNVKARG YEYFIS
Sbjct: 477 CESCLEGKMTKRSF-----------TGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFIS 536

Query: 325 FIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMI 384
           FIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+I
Sbjct: 537 FIDDFSRYGHVYLLHHKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLI 596

Query: 385 EHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVP 444
           E GI+SQLSAP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP
Sbjct: 597 EFGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVP 656

Query: 445 T--------------KSVL--------------ETPYELWKGRK-----GYPKETKGGLF 504
           +              KS L              + P +L    K     GYPKE++GGLF
Sbjct: 657 SKSVLETPYELWKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLF 716

Query: 505 YDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDG 564
           Y PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK        S ST+VVD 
Sbjct: 717 YHPQENKVFVSTNATFLEEDHXRNHQPRSKIVLKEMFKNATDK-------PSSSTKVVDK 776

Query: 565 ADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
           A+ S QSH SQELR+PRRSGRV+ QP+RYLGL ETQ+IIPDDGVEDP
Sbjct: 777 ANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVETQIIIPDDGVEDP 805

BLAST of CmoCh04G019810 vs. NCBI nr
Match: gi|659113933|ref|XP_008456826.1| (PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo])

HSP 1 Score: 327.8 bits (839), Expect = 3.8e-86
Identity = 173/291 (59.45%), Positives = 225/291 (77.32%), Query Frame = 1

Query: 1   MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYD 60
           MT++ + +L ++K NG+NY +WK+ +NT+L+IDDL+FVL E+CP    +NA RT R+AY+
Sbjct: 1   MTSATLNMLVADKFNGNNYASWKNTINTVLIIDDLRFVLVEKCPQVSAANATRTVREAYE 60

Query: 61  RWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNC 120
           RW KAN+KAR Y+LAS+S+VLAKK++ M TA+EIM+SL+ MFGQ S+ ++H+A+KYIYN 
Sbjct: 61  RWAKANEKARAYLLASLSEVLAKKNESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121 RMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEY 180
           RM +G  VREHVL+MMV+FNVAE N AVIDE +QVSFI+ SL +SF QFR+N+VMNKI Y
Sbjct: 121 RMNDGALVREHVLNMMVYFNVAEMNGAVIDEANQVSFILESLLESFLQFRSNVVMNKIAY 180

Query: 181 NLTALLNELQTYQSLLTNKGQTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--G 240
            LT LLNELQT++SL+  KGQ GEANVA S +K  RGS+S  K  PS+S +   KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKYMPSSSGNKKWKKKKGG 240

Query: 241 KG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG 288
           +G K  +   + +K  K  KG CFHCN+ GHWKRNCPKYLAEKK  K +QG
Sbjct: 241 QGNKANLAATKTSKKAKVAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQG 289

BLAST of CmoCh04G019810 vs. NCBI nr
Match: gi|1019597807|gb|AMY96445.1| (gag/pol protein [Momordica dioica])

HSP 1 Score: 300.1 bits (767), Expect = 8.5e-78
Identity = 175/340 (51.47%), Positives = 215/340 (63.24%), Query Frame = 1

Query: 272 CPKYLAEKKAEK--TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYL 331
           C   L  K  ++  T +G RA + LEL+HTD+CGPM+VKARG Y+YF+SF DD SRYGY+
Sbjct: 484 CESCLEGKMTKRPFTGKGLRASDLLELIHTDVCGPMSVKARGGYQYFLSFTDDLSRYGYV 543

Query: 332 YLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP 391
           YL+ HKSE+ EKF+E++ EV+N +GK IKTLRSDRGGEYM   F D++ E GI SQLSAP
Sbjct: 544 YLLKHKSESFEKFKEFQAEVENEIGKKIKTLRSDRGGEYMSSEFGDHLREFGIVSQLSAP 603

Query: 392 GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYEL 451
           G PQ NGVSERRNRTLLDMVRSMMS+A LPD FWGYA E    ILN VP+KSV ETPYEL
Sbjct: 604 GTPQCNGVSERRNRTLLDMVRSMMSYADLPDSFWGYARERERAILNRVPSKSVEETPYEL 663

Query: 452 WKGRK------------GYPKETKGGLFYDPQENRVFVST-------------------- 511
           W GRK             + K+ +        E  +FV                      
Sbjct: 664 WYGRKSSLSFLKIWGCPAHVKKLQPKKLEPRSEKCLFVGYPKETRGYYFYHPQENKVFVA 723

Query: 512 -NATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTR-VVDGADTS-GQSH- 571
            N  FLE++ +  HQP SK+VL  + +          D+ S ST+ VVD A+ +  QSH 
Sbjct: 724 TNEAFLEKEFLSRHQPGSKIVLKAVVEPLIPLDG--TDKPSSSTKVVVDKAEVNDDQSHT 783

Query: 572 -PSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
              QELR+PRRSGR    P+RYLGL ETQ++I D+G EDP
Sbjct: 784 PDQQELRVPRRSGRSRRAPNRYLGLVETQIMILDNGEEDP 821

BLAST of CmoCh04G019810 vs. NCBI nr
Match: gi|703144320|ref|XP_010108259.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis])

HSP 1 Score: 289.7 bits (740), Expect = 1.1e-74
Identity = 155/290 (53.45%), Positives = 197/290 (67.93%), Query Frame = 1

Query: 284 TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFR 343
           T +G RA E L+L+H+D+CG +NV+ RG YEY+ +FIDDYSRYGY+YLM  KSE   KFR
Sbjct: 7   TAKGVRATEPLQLIHSDVCGLLNVQVRGAYEYYATFIDDYSRYGYVYLMQRKSETFGKFR 66

Query: 344 EYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNR 403
           E++ EV+  LGK IKT RSDR GEYMD  F+D++IE G+ SQL+APG PQQNGV+ERRNR
Sbjct: 67  EFRAEVEKQLGKPIKTPRSDREGEYMDQEFRDFLIEEGVVSQLTAPGTPQQNGVAERRNR 126

Query: 404 TLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGG 463
           TLLDM+RSM+S++ LP  FWG+A++TA     +      LE   E++    GYP+ T+GG
Sbjct: 127 TLLDMIRSMLSYSSLPTSFWGHALKTAIPAHVLRQKTGKLEPRSEVYI-FVGYPQGTRGG 186

Query: 464 LFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSE-ISKEATDKTTRVVDQASPSTRV 523
           LFY   + +VFVSTNATFLE D++ + +PRSK+VL E +S E   + TRVV      T V
Sbjct: 187 LFYSQADQKVFVSTNATFLEHDYMIDFKPRSKIVLEELLSDEIRPQPTRVVGPLRQKTIV 246

Query: 524 VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDP 573
                      P Q L  PRRS RV   PDRY G  E Q++  DDG EDP
Sbjct: 247 -----------PDQTLMAPRRSRRVSRLPDRYTG--EAQIVTADDGKEDP 282

BLAST of CmoCh04G019810 vs. NCBI nr
Match: gi|147815925|emb|CAN77149.1| (hypothetical protein VITISV_019364 [Vitis vinifera])

HSP 1 Score: 276.2 bits (705), Expect = 1.3e-70
Identity = 139/266 (52.26%), Positives = 186/266 (69.92%), Query Frame = 1

Query: 286  QGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREY 345
            +G RA + LEL+H+DLCGPM+V+ARG +EYF++F DDYSRYGY+YL+  KSE  EKF+ +
Sbjct: 812  KGNRANDVLELIHSDLCGPMSVQARGGFEYFVTFTDDYSRYGYIYLLCRKSECFEKFKAF 871

Query: 346  KTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTL 405
            K E++   GK IKTLRSD GGEY+   F  ++ E GI SQLSAPGMPQQNGV+ERRNRTL
Sbjct: 872  KAEMEQRHGKYIKTLRSDHGGEYISREFITFLSEQGITSQLSAPGMPQQNGVAERRNRTL 931

Query: 406  LDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGGLF 465
            ++MVRSMMS++ LP   WG+A+ETA YILN+VP+KSV +TP ELW GR         GLF
Sbjct: 932  MEMVRSMMSYSDLPISLWGHAIETAAYILNLVPSKSVPKTPTELWTGR---------GLF 991

Query: 466  YDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDG 525
            Y P+  ++ VSTNA +LEE+++RNH P+S+L L+E+  +      R+          + G
Sbjct: 992  YSPKYKKIIVSTNAHYLEENYIRNHIPKSQLALNELRGDTI--PARIFPSEHEPEPFMVG 1051

Query: 526  ADTSGQSHPSQELRMPRRSGRVITQP 552
            AD          + +P+RSGR ++ P
Sbjct: 1052 AD----------IPLPQRSGRNVSGP 1056

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
POLX_TOBAC   3.6e-36  41.92         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME  6.2e-28  35.06         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
YD23B_YEAST  2.1e-15  30.23         Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
YB11B_YEAST  2.1e-15  29.07         Transposon Ty1-BL Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 2...
YB12B_YEAST  2.1e-15  29.07         Transposon Ty1-BR Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 2...

Match Name        E-value   Identity (%)  Description
E2GK51_BRYDI      2.6e-110  57.00         Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
A0A165U314_9ROSI  5.9e-78   51.47         Gag/pol protein OS=Momordica dioica PE=4 SV=1
W9RZ97_9ROSA      8.0e-75   53.45         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis G...
A5BQ76_VITVI      9.2e-71   52.26         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_019364 PE=4 SV=1
A0A151QLI3_CAJCA  5.6e-68   34.07         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...

Match Name                        E-value   Identity (%)  Description
gi|299474487|gb|ADJ18449.1|       3.8e-110  57.00         gag/pol protein [Bryonia dioica]
gi|659113933|ref|XP_008456826.1|  3.8e-86   59.45         PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo]
gi|1019597807|gb|AMY96445.1|      8.5e-78   51.47         gag/pol protein [Momordica dioica]
gi|703144320|ref|XP_010108259.1|  1.1e-74   53.45         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis]
gi|147815925|emb|CAN77149.1|      1.3e-70   52.26         hypothetical protein VITISV_019364 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001584  Integrase_cat-core
IPR001878  Znf_CCHC
IPR012337  RNaseH-like_sf
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO:0008270  zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G019810.1  CmoCh04G019810.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description             Source   Source Term        Source Description                   Alignment
IPR001584  Integrase, catalytic core   PFAM     PF00665            rve                                  coord: 293..408, score: 4.3
IPR001584  Integrase, catalytic core   PROFILE  PS50994            INTEGRASE                            coord: 289..454, score: 22
IPR001878  Zinc finger, CCHC-type      GENE3D   G3DSA:4.10.60.10   -                                    coord: 253..275, score: 2.
IPR001878  Zinc finger, CCHC-type      PFAM     PF00098            zf-CCHC                              coord: 257..274, score: 3.
IPR001878  Zinc finger, CCHC-type      SMART    SM00343            c2hcfinal6                           coord: 258..274, score: 0.
IPR001878  Zinc finger, CCHC-type      PROFILE  PS50158            ZF_CCHC                              coord: 258..274, score: 9
IPR001878  Zinc finger, CCHC-type      unknown  SSF57756           Retrovirus zinc finger-like domains  coord: 247..278, score: 1.4
IPR012337  Ribonuclease H-like domain  GENE3D   G3DSA:3.30.420.10  -                                    coord: 286..447, score: 3.0
IPR012337  Ribonuclease H-like domain  unknown  SSF53098           Ribonuclease H-like                  coord: 289..448, score: 4.68
None       No IPR available            unknown  Coil               Coil                                 coord: 335..355, scor
None       No IPR available            PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON      coord: 12..554, score: 7.4E
None       No IPR available            PANTHER  PTHR11439:SF192    SUBFAMILY NOT NAMED                  coord: 12..554, score: 7.4E
None       No IPR available            PFAM     PF14223            Retrotran_gag_2                      coord: 62..193, score: 7.0
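The domain coordinates in the InterPro annotations are positions along the protein, so locating a domain in the coding sequence needs a conversion to nucleotide positions. The sketch below shows that arithmetic. It assumes the coordinates are 1-based, inclusive amino-acid positions counted from the first codon of the CDS, which the page does not state explicitly; `aa_range_to_cds` is a hypothetical helper.

```python
def aa_range_to_cds(aa_start: int, aa_end: int) -> tuple:
    """Map an inclusive, 1-based amino-acid range to the inclusive,
    1-based nucleotide range of the codons that encode it."""
    if aa_start < 1 or aa_end < aa_start:
        raise ValueError("expected 1 <= aa_start <= aa_end")
    nt_start = 3 * (aa_start - 1) + 1  # first base of the first codon
    nt_end = 3 * aa_end                # last base of the last codon
    return nt_start, nt_end


# CCHC zinc finger (PS50158, coord: 258..274)
print(aa_range_to_cds(258, 274))  # (772, 822)

# Integrase catalytic core (PS50994, coord: 289..454)
print(aa_range_to_cds(289, 454))  # (865, 1362)
```

These positions refer to the CDS, not to the gene sequence with intron; mapping further onto the genomic coordinates would also require the exon boundaries, which are not listed here.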

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None