CmoCh04G019820 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G019820
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Transposon Ty1-H Gag-Pol polyprotein
Location: Cmo_Chr04 : 10222837 .. 10233255 (+)
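
The location field can be read as a sequence ID, an inclusive 1-based start and end, and a strand. The sketch below is only an illustration of that reading; the function name parse_location and the assumption that the coordinates are 1-based and inclusive are not stated on this page.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates, so the span length is
    end - start + 1. This convention is an assumption, not something
    the record itself states.
    """
    match = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return seqid, start, end, strand, end - start + 1

# Under the inclusive-coordinate assumption the gene spans 10419 bp.
print(parse_location("Cmo_Chr04 : 10222837 .. 10233255 (+)"))
```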
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGACAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGAAGGAGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAGAAGTAACTTCAGGGGTAAAACGGTAAATTGACCCAGCTCTAGTTACGAACAACCTGTGAAGAATCAACTTACTAATCATGGTTATATCAGATGGACAGAAATATATCTATAGTGAGGGGAGTGCAACTACTAGGCTATAGTGGAGTGTCCTAGTAGTTAACGAATGTTGGTTAACCAGGTTAATGAGTTTAATCGGTTAATCTTGGATCGTTGGAGCCCATGATCTATAGGTCCATTAGGTCCCTCTGCTAGCTCATATCGGACTAAACCATAGAACAATGTGACGAGTGAGTTTGAAGTGTTCGAATTCAAATTAGGGAATATGCGTTACATATATACGATATATTTAACCGCAATTTTGTTTTATTTACGAATTAAACGAAAAGAGAGAATGTGAGATATTTAAATAAGATTTAAATATCTTAATCAATTATGTGTCGGAATTGATTGGGATTGGCATTTGAATTAATATTAAATATTAATTAAATAGTTTAATTAATCATTTAATGTTACTTTAATTTGAAATTCATTATTCAAATTAATTGTTGAATTAAAATGAAGAAAGTCAAAATGTTGACTTTCATCTAATGGAAAAATCCATGTTAGTGGAATTTTGAGGTGTCAAAATTTGGTTTAACCAAACTTGCATGTTTGCCAAATAAACCCCACAAGTTTGAGTGGAATTTTGAGTGTCAAAATTTGGTTAACTCAAACATGCATGCTTGCCACATAATCCCAAACTCGGAATTTTAAGTGTCAAAATTTGGTGATCTCAAGTTTGCATGAACATGCAAACTTGACTCTTTAAATAGAGTGATCATGATACTTGGAGGCATTTTTTTTTTCTTCATCTTGATGAAAAACCTTGAGAAAAACTTCTCACAAACACTCTCTTTCACATATACAAAATCCCTCCCAAGTTTAGTCACTCACCGTATTCCACAATCCCTTTCTAAGGCCGGAGGATAGTGGAGAAGACACTAGTGGTGGTTCGAAACCGATTCGTGAGGCAAAAGGAGTTTGAACTACAAAAGGTTAGTATTTAAACTCATTAAGTTTTAATACTTATGAACATGCTAGTAATTTACAATAATTGATGTTCTAAAAGTGCCTAAGATCCAAATTGCTTCCGCATGTGTTATATTAACACCAACAGCAATGTCATCCTATTCCCTGTTGATCCAAATCCACTCTAAAATCCAAACCTTTTCTTGAGTTGCATATCTCTTCTCGGACTCTAAGGCATTAGAACCTTTTTTTTTTCTTTTGCCCGACTGCATTAGTAGTAATTCAAGTTCTAACGGTTATCGTTTACCACTTCTTCCAAGAATATTAGGGTTTGACATCTATGACCATATAACACCTCAATGGTGCCATCTATATGGTCGCTACGTAACTATTATTATAAGCAAACTCTACCAAGT
GTAAATGGGTATCCTAACTACCAATAAAGTCTAACAAATATGTCACTACCCGATTCTTTAGGCTTCGAATCTTGGACCATAACGAAGAAGAAAAAGATGAAAGAAAATAACATATAATAAAGTACGGTAAAATTTTATGCAATTTAAAATCAGCAACACCACAATTAAAAACAAAACGAATCCAATTACAAGCCAGTGTTGTTCGAGTTTATTACAAATGAAAAATAGGTTTGGATACAACTTCAAAACGAAATACAAAAGATACAAAATGCACAGATGACTCACCACTCAAAACAACCCTCTCTATGACCGCATAGCTCCTGAACACTCCCTAACCTCAGCCGACAATCAGCAACACTTGGAAAAGAGATGAGGAATGAGGTGAGTATAAACATACTCGGTAAGCAACTTAGTTGTAGACTCATCATATCCATATCGTAGTAAGTTCACATAGACTTTTCTCTAGACACTAAGAGTTCGGATTTAGTTCTACAACTATTTGAACTTTTAAGAATGCCTCAAAATTTCGAGTGGTTCTCATATCCATTACAAGTCCACTCTAAATACGAAGGAGGTTCATCCATTTCTAGCACCATGGCTCTAATTATAATACCTCAAGCCTAAAGAAATGTTTAAGGGCACATAAGGAAGTTTATAGGATCCAAATTTGGCGTGCAGGACAAAAATGTCATTTTTTTTAATTTTTAAGGAACTTTCCACCCTTTTAATTCCTTATAGAAAAGAAATTTTAAGGTACTTAAGGAAATAAAAAGTTCAGAAAATGAAAAATTACTAAAATACCTATGATGTTCAAGTTAATTACAAAAATACCATTAGATCATAAAATTAGATACGCTACCGTAGTTGCTTACACAAAATCGATGAGATCTTTTCGTCCAACCATTGGATCTTCACGGCTAAAATTTAAAATATAGGTCGACAAGCCTTTAATTGAAGAAGATATTTTCAGTAGAATATCTAAAACATTCAACCTCAAATTCATGACGATTTTGGAGGTTTCAATGTTGGTTTAAACTTCCTCTATAAAGTATGAAGCAGGAGCTTTCATTTGAAACAAGAATCAAGCATCGACAAGCTCACAAACTTGAGTTATAAGCGTTTCACTTTTAGGTATTTCTTTTAAATTTGAGAGTTATAGTCTACTATTTCCCTTTAAACTTGTTTCAACACCTCCTGACTAAGATTACCCACAAGATTGGTAAAGAAAGACAATGTTGAAAGTCTCCCTAACCTATACGCTTAATTCAATTCAATATAGAACTCAAGAAAATCTTGGAGAGCCAATCCAAGACAACTGTGAGATCCCACATTGGTTGGAGAGGGAAATGAAGCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGTGTTTTGAAAACTTTGAGAGGAAACCTGAAAGGGAAAGCCCAAAGAGAACATCTACTAGTGGTGGGCTTGGAGTGTTACAAATGGTATCAGAGCAAGACACCAGGTTGTGTGCTAACGAGAATGCCAGGCTCCCAAGGGAGTAGATTGTAAGATCCTCACATCAATTGGAGGGAGGAAGACACCAGGCGATGTGCTAACGAGGCTCCCAAGGGAGTAGATGGTAAGATCTCATATCAATTGGAAAGAGGAATGAAACATTCTTTTTACGGGTGTAAAAACTGCTCCCTAACAGAGCCTATTAAAACCATGAGGGATAGTACCAAAGGAAAAACCCAAAAAAAGCCCAAAGAGGATAATATTGCGGTGGACTTGGGCTGTTACAGCGGTAATAAATTTGTAGTGTGATGCCGATTCCATTTGGCTTCTACACTCTCTAAGAACACTTTTGCAGAAGGTGAAATTTCAAAATATTCAAGAAAACTCGAGGAATTTGTGACTCCAAACCCGTGCCAAACACTTCCTTGGATGATGAGTATTTTTCATAAATACCACGAACATTGTTGCAATCCAAACCTATGTCAAGCACTTCCTAAGATCGAGGGTAGTT
GAAGAAATCGTAATCGGGTCCATTTTAAACAAGATTGAGGATAATCTAAACATATCATCTTACTAGATAAAACAAATATCATTAGCTTCGTAGAGATGAGAGGAGAAGAATTCATGATGGTATCGAGTCTTGAGGAGGTCCCTATCGTGTGACCAATAAAAGCTAATTTTGCAAAAATAAAAAAGATAGAAAGAAAAAAGTAAGGGAGCCGAGAGTTTGGATTTCGAAATTGGAGACACCCATGTAAATTCATTCATCCAAAATGAATTCTTAAGCTTGAGAGAAAGATAAGGAAGAGATCTCGTGAAGGAATCAAGACTTGAAAGTGTCTTTTGTCCCAACCAATAAGACTTTTTCTAGGCTTATCTCCGACGAGAAACATACTTTCGAGCAAAGTTTCGAGAGCCATAATTTAAGTTATAGTTCATCATGATATGAAATTAGTACCTAGCTCTTCTTTGCAATGTCATCTTCAAGTTTCATAAAACAAACAAGGCCATTCTTAAGAAATTGTTGATAGTTCAAAAAACTAGACTTAAAGATAGATCACTACAAAAATTCATAGTAATTCATATTCCAAAAAACTCGAGAGCGATGGTCCCAAACGTCAACATTGAAGTAGCCCTCTTCAATTTTTTTTTTTTAAGAAATACTTGTAAGAATCACTTGGGTTTCAATATTAACCTTAAGTTAAGGATCTAAACGTATTTTGCTAAAAAATCAAAAAACGTTAAAATAACCCTTTAAACGAACTCTGCTATCCATGAAAAATTACATTTTTGCCCTTTTTTTCTTTTAGTTTCTTTTGAGCATTAGAGTCTCATTAAAAAACTAACCATTATACAAGTAAAAATGTTAAAAGATGGGTTAATCTCAGGTGTTAGAGCCAACAAGTCCTCCATCTCAAGTTTGGTGAGATTCTTTCCTTCTTGAAATTCTTTGCTTCTATGTCTCTTTTATGGAGAAGAACACTGAGATATTTTATTAAACTTATCTTTGCTTAAACTACAATATAAATAATTTATAACTATAACATTAAAAACACGTTCTTTTTCCATATCATTGTCATCATATTCCTATTCTAACTTACATCACTATCATCAACCATTTACTCCGAAATATAATAACTTTTTAACTACATTTAACCATAAATAATAGTCAACAAATTGAAAACAAATTTTTATTTATAACCTATGACACAACTTGTAGTGGCCGTCAAATTTGACCTACTCTACTCTCTAAATACTTCTTAAAAATTTAAATAAAATTTTTTCGACTATTTATATGAATTAAAGTCTAACCGCTACACAGTTCTTTTTACAGGTTCAAAAGTAAATCTGATCCATTTCACAGGTCGTGTGTTTGAACATAAAATTTATAGTTCGGAGAAAATAAGGATTAAAAAGTAGAAAAGTTGTTTGGAAAATTATGGGAAAAATATGAATTAAAAATAGAATCTAAATTTAAAAAGAGAAATAAGGGTGGATATTGAAAGGAATAGTATATATAATATTTTTTTTTAAAAAGAAAATTAATTAATATAATAATAATTAATTATTATATATTTCTTTATTTAATAATAACCAAAAACATTGGCAGTGACTCAGCCATTTAGAATTTGTTGTTCAACTTGCTCTCATTTAAGTGATTTTGAAATTATTATTTTGACTTTTATGGACACATAAAAATACTAATAATATTATTATCATCAAAATATTAGTTAATTAATTAATTAAAATAATTTAAGGTTGAACATCGATATTCTGTTAAACATATTTTAAAGTTTTATTTTAAATGTCTGAAAATAAAATGTTTGAGAATCAAATTAATGTGTTTGATTACTAGAAATGTTATCAAAATTTCTTTCAATTATTTTATTATGTTATTTAATTTTAAATAACTAAAGTAGATAATTATTTTATTTTTATTAAACTAATTTTAACTAATTAAAATTTAATTTATTTTCGACCGGTAAATTTTAGTTTACATTGAATAATTTA
TATTTATATTGATTTTAGTAAAATTATATAAACTTTTACTTATACTACATATTTAAAAAATGAAGAATAAATTATATTTAATTAGATATACTTATAAATCGTATACTATTGTAATAATTTATTAATAAAACAATAATATGCCTCTTAACTCTTAATTAAATTTTTATGCATTAATAAGTATATTATAAATAACATAATTAAATCTTACCTTTTTTTTATAGTTTGTTTAGCATCAATATTAATAATTATTCTCGAGTAAATTATTTTAATAAAATTATACCAAAAAAATTAATTTTAAATAATTAGTAGAGTCATCTAATTGAATAAAAAAATAAAAAAATACCATAAAACTCATGTCCGTTCCCTTCGAGGCATCATAATAGTCCTCATCTATTCTCATTTAATAGGTTGAAAATGATGACTTCGTTTATTGTCAACGAGAATTAATTTTTGGAGTATATAGGGTGTCGTTTAATGGTGCCACCAGTATTGAGGAGTTAGGCCTAGGGCGATTAGGGTAGTGCGCCAAGGCAACTAGTCGCCTAACTTTGAGGAAGAGTCGTTGTAGTAAATCTGCCAAGAGATGGGGGACATGGAGTCAAATAGGGTAATTCAGTAGCTGGCAAGCAGAGGGGGTTACTCTTTCAATCTTAGGTAACTATATGGTCCTATGGGAATGGTTCACCAACTAATCTCGCATGGATGACGAGCAAGGTAGCACACACCATTTTGTCACGAGAGAGTTTTCGGAGGTAGCACCTAGACCCATGACACTGGTTTTACACCACTTAAGTAAAGCAAGCTATCTACGTCTAAGTACAGAAAAGGGTGAACCACCCCCCTCATTCCTAAAGTGGGTCAGTGAATAACTAAAGCACCGCTCAGAACTTCGAGGCAATCTAGATGCTCAGTTAGCAGTAGAGACTAGGTCTGAACTTCCTAGAGCTTAGAGTAAAATTCGTGGATATCTACTAGGTTAAGGGTTCAATGAGAGCCTACAAGTAGGTTTCATACTAAGTATTTTATAATTACCCATTTTTCCTTTCTTTTTCAGGTGATTTCAGGCTGCCAATCAAGGTGGAGGAGCGTTTGATAGCTGTGATAGTCACAGAAAGGGTCATTCTGGAGCGTCAATTCGTTTTTATATTTTGAGTCATTATGCAAGGGAGCCCACCGAGAAGCTAAGTGCCCACAAAAGGCGACACACTATGCCTTCTAAGTAAAATTTGGAATGATCTAAAAAAATATCGTTCACTCCGCGTGGGGAGTTAAAGCACTTGTTCACTTTCCATAAGAGAGTGGAAGAGATAGTGGAAAATGGTATTGTGTATGTCGAGACCTAAGTTAACCATATAGTGCCCAGGAACACCATTGTCGACTTTGGAGCCACCCACAATTTCATGATGGTGACTAAAGCAAAACGCTTGAACATCCCTTGCTATTGAGACAGAGGAATGATGAAAGTCGTCAATTCAGCAGCTTTACCACTTATAAGTTCTAAAAAGAACTCTAGTTAAGATGGGAACTTGAAATGAACGAACTAGCTTTGTAATAGTCAAGATGGATGACTTCAACATCGTGTCAGGATGGATTTCCTGCTAGAACATAAAGTTATTCTCAAGCCCCTAGCAAAATGCTTAGTAGTAACGGGGTCTAACCCGACATTCATTCAGACAATTACCCGTCAACCAAAAGGGGTGAAAATGATGATGACACTATAGTTGAGAAGGGATCTTAGCCATGATGAACCTATGCTCGTTGTCGGTGTCATAGTTTATAGGAGAGCTTGAGTGAGCTAACTCCAATAGAAACCTCGTGATCGCTGCACAAACACTGTGATGCAAGACAAGAAAGTCAGACAAAATATTTATCTCCACCTGGGCAAATAGATCACGAGATTGATTTTATGTCACTGGAGAGATCACTTGTCATAGATGGATTTTTTTTTTTTTTGTTTTATTTCCTATATTTTTTTTATAAAAATAATAACAAACAAATAA
AAAGGATAAAAAAAAAAAAAGTAGTTTTTTAAAAACTTAGTGGCTAGTCCGTACATGCCGACTTCTTCGTTCTTAAGAAACTTGTTGTACAGTGTGAAAGAAGAGCTCTGTTGCGTTCCTTAACGAGGCTGCAAAGTGAGGCCTCGGAAAATGACGTCTTCTTTCAATTTGAAGATGTTAAAGCATCGGAAAAGATCATCAATGAAGTTCTGTGTACGGAAAAGGCGGAAAAAGAAGGCGTAAGTGGACCCATATGGTACTATGGAGGTCATTTTTTAATCTCTCAATGGCTTTTCCAAGTCAATTTTTAGTCTACTTTGTTTTCTTCATCAATGCGTTTGCTCTTATTTCGTCCTCACTTGTTCCCATTGTTTCTGATTCTATTAGAAGAATTCGAAGTGAGCGTTAAGAGGGTGTGA

mRNA sequence

ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGACAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATTGTGAAAGAAGAGCTCTGTTGCGTTCCTTAACGAGGCTGCAAAGTGAGGCCTCGGAAAATGACGTCTTCTTTCAATTTGAAGATGTTAAAGCATCGGAAAAGATCATCAATGAAGTTCTGTGTACGGAAAAGGCGGAAAAAGAAGGCGTAAGTGGACCCATATGGTACTATGGAGAAGAATTCGAAGTGAGCGTTAAGAGGGTGTGA

Coding sequence (CDS)

ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGACAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATTGTGAAAGAAGAGCTCTGTTGCGTTCCTTAACGAGGCTGCAAAGTGAGGCCTCGGAAAATGACGTCTTCTTTCAATTTGAAGATGTTAAAGCATCGGAAAAGATCATCAATGAAGTTCTGTGTACGGAAAAGGCGGAAAAAGAAGGCGTAAGTGGACCCATATGGTACTATGGAGAAGAATTCGAAGTGAGCGTTAAGAGGGTGTGA
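
As a quick consistency check, the start of the CDS can be translated with the standard genetic code and compared with the query protein shown in the BLAST alignments. This is a minimal sketch: the names translate and CODON_TABLE are illustrative, and the use of the standard nuclear code (translation table 1) is an assumption.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a nucleotide string codon by codon.

    Codons containing anything other than A, C, G, T (for example N)
    become 'X'; a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[pos:pos + 3], "X")
                   for pos in range(0, usable, 3))

# First 30 nt of the CDS above; the BLAST query protein starts MNLEMESMYF.
cds_start = "ATGAATCTTGAAATGGAGTCAATGTACTTC"
print(translate(cds_start))  # MNLEMESMYF
```

The last six bases of the CDS (GTGTGA) translate to V followed by a stop codon, which is consistent with a complete open reading frame.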
BLAST of CmoCh04G019820 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 5.0e-99
Identity = 178/361 (49.31%), Positives = 248/361 (68.70%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            M  EMES+  N  ++LV+ P G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+
Sbjct: 830  MQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGI 889

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF     
Sbjct: 890  DFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGK 949

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYV 180
            +  VCKL +S+YGLKQA R W +KFD+ +KS  + +   +PCVY KR   +    L+LYV
Sbjct: 950  KHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYV 1009

Query: 181  DDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKM 240
            DD+L++G D G++  +K  L+  F MKDLG AQ +LG++I+R R ++ L LSQ  YI+++
Sbjct: 1010 DDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERV 1069

Query: 241  LIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI 300
            L R+ M+++K    P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI
Sbjct: 1070 LERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDI 1129

Query: 301  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCER 360
             +AVG+VSR+  NPG+ HW AVK IL+YLR T    L +G  D IL GYTD+D   D + 
Sbjct: 1130 AHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDN 1189

BLAST of CmoCh04G019820 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 238.4 bits (607), Expect = 1.4e-61
Identity = 129/359 (35.93%), Positives = 209/359 (58.22%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            +N E+ +   N+ W +  +P+    +  +W++  K ++ G    +KARLVA+G+TQ+  +
Sbjct: 910  INTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQI 969

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G     +
Sbjct: 970  DYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCN 1029

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVAFLVL 180
               VCKL ++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +N  + +++L
Sbjct: 1030 SDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENI-YVLL 1089

Query: 181  YVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYID 240
            YVDD+++   D+  + + K +L  +F+M DL E +  +GI+I    +   + LSQ++Y+ 
Sbjct: 1090 YVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVK 1149

Query: 241  KMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRP 300
            K+L ++ M++      P    ++     S       ++  + P  S +G LMY MLCTRP
Sbjct: 1150 KILSKFNMENCNAVSTPLPSKINYELLNS-------DEDCNTPCRSLIGCLMYIMLCTRP 1209

Query: 301  DICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYG---AKDLILTGYTDSDF 354
            D+  AV I+SRY S      W  +K +L+YL+ T D  L++    A +  + GY DSD+
Sbjct: 1210 DLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDW 1256

BLAST of CmoCh04G019820 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 139.8 bits (351), Expect = 6.8e-32
Identity = 88/261 (33.72%), Positives = 129/261 (49.43%), Query Frame = 1

Query: 92  MDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKS 151
           MDV TAFLN  ++E IY+ QP GF+ + +   V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 152 YGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEA 211
            GF ++  E  +Y R  +    ++ +YVDD+L+      I   +K  L   + MKDLG+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 212 QFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQ 271
              LG+  I    N  + LS   YI K     ++   K    P  +   L +  SP    
Sbjct: 121 DKFLGLN-IHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSP---- 180

Query: 272 EVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTT 331
            ++D+   PY S VG L++     RPDI Y V ++SR+   P   H  + + +L+YL TT
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 332 RDYMLMY-GAKDLILTGYTDS 352
           R   L Y     L LT Y D+
Sbjct: 241 RSMCLKYRSGSQLALTVYCDA 254

BLAST of CmoCh04G019820 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 3.3e-18
Identity = 82/330 (24.85%), Positives = 152/330 (46.06%), Query Frame = 1

Query: 45   FKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE 104
            +KAR+V +G TQ     Y    +       I+I L IA   +  +  +D+  AFL   LE
Sbjct: 1336 YKARIVCRGDTQSPDT-YSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLE 1395

Query: 105  ESIYMAQPEGFIEQDHEQR-VCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCV 164
            E IY+  P       H++R V KL +++YGLKQ+ + WN      +   G K N   P +
Sbjct: 1396 EEIYIPHP-------HDRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGL 1455

Query: 165  YKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEA------QFVLGI 224
            Y+         + +YVDD ++  ++   L +  + L + F++K  G          +LG+
Sbjct: 1456 YQ--TEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGM 1515

Query: 225  QIIRNRKNKTLALSQASYIDKMLIRY--KMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 284
             ++ N++  T+ L+  S+I++M  +Y  +++  +K  +P      +  ++      E E 
Sbjct: 1516 DLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEF 1575

Query: 285  MKHI-PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY 344
             + +      +G L Y     R DI +AV  V+R  + P    +  +  I++YL   +D 
Sbjct: 1576 RQGVLKLQQLLGELNYVRHKCRYDINFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDI 1635

Query: 345  MLMYGA---KDLILTGYTDSDFQTDCERRA 362
             + Y     KD  +   TD+   ++ + ++
Sbjct: 1636 GIHYDRDCNKDKKVIAITDASVGSEYDAQS 1655

BLAST of CmoCh04G019820 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 94.0 bits (232), Expect = 4.3e-18
Identity = 82/330 (24.85%), Positives = 152/330 (46.06%), Query Frame = 1

Query: 45   FKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE 104
            +KAR+V +G TQ     Y    +       I+I L IA   +  +  +D+  AFL   LE
Sbjct: 1337 YKARIVCRGDTQSPDT-YSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLE 1396

Query: 105  ESIYMAQPEGFIEQDHEQR-VCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCV 164
            E IY+  P       H++R V KL +++YGLKQ+ + WN      +   G K N   P +
Sbjct: 1397 EEIYIPHP-------HDRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGL 1456

Query: 165  YKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEA------QFVLGI 224
            Y+         + +YVDD ++  ++   L +  + L + F++K  G          +LG+
Sbjct: 1457 YQ--TEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGM 1516

Query: 225  QIIRNRKNKTLALSQASYIDKMLIRY--KMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 284
             ++ N++  T+ L+  S+I++M  +Y  +++  +K  +P      +  ++      E E 
Sbjct: 1517 DLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEF 1576

Query: 285  MKHI-PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY 344
             + +      +G L Y     R DI +AV  V+R  + P    +  +  I++YL   +D 
Sbjct: 1577 RQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDI 1636

Query: 345  MLMYGA---KDLILTGYTDSDFQTDCERRA 362
             + Y     KD  +   TD+   ++ + ++
Sbjct: 1637 GIHYDRDCNKDKKVIAITDASVGSEYDAQS 1656

BLAST of CmoCh04G019820 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 1.2e-184
Identity = 315/360 (87.50%), Positives = 340/360 (94.44%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            MNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGV
Sbjct: 824  MNLEMESMYFNSVWTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGV 883

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            DYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QD 
Sbjct: 884  DYEETFSPVAMLKSIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQ 943

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
            EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL+LYVD
Sbjct: 944  EQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVD 1003

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGNDV  LTD+K WL TQFQMKDLGEAQ++LGIQI+RNRKNKTLA+SQASYIDK+L
Sbjct: 1004 DILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVL 1063

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVEDM++IPY+SAVGSLMYAMLCTRPDIC
Sbjct: 1064 SRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDIC 1123

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
            Y+VGIVSRYQSNPGR HWTAVKNILKYLR TR+YML+YGAKDLILTGYTDSDFQ+D + R
Sbjct: 1124 YSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDAR 1183

BLAST of CmoCh04G019820 vs. TrEMBL
Match: A5AUE7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 9.6e-158
Identity = 268/360 (74.44%), Positives = 315/360 (87.50%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M LE+ESMY NSVW+LVD P+G+KPIGCKWIYK KR   GKV+TFKARLVAKG+TQ+EGV
Sbjct: 164 MKLEIESMYSNSVWKLVDLPEGIKPIGCKWIYKXKRGPNGKVETFKARLVAKGFTQKEGV 223

Query: 61  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
           DYE+TFSPV MLKSIRILLSI  +YDYEIWQMDVKT FLNG+LEE+IYM QPEGF+ +D 
Sbjct: 224 DYEDTFSPVXMLKSIRILLSIXAYYDYEIWQMDVKTXFLNGHLEETIYMVQPEGFVVKDQ 283

Query: 121 EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
           EQ+VCKL+RSIYGLKQASRSWNI F+ AIKSYGF+QN+ EPCVYK+I    V FLVLYVD
Sbjct: 284 EQKVCKLQRSIYGLKQASRSWNIIFNEAIKSYGFEQNLGEPCVYKQIGGDKVVFLVLYVD 343

Query: 181 DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
           DILLIGNDV  L+ +K+WLA+QFQMKDLGEA ++LGIQ+ R+RKN+ LALSQA+YIDK+L
Sbjct: 344 DILLIGNDVESLSKVKNWLASQFQMKDLGEASYILGIQMTRDRKNRLLALSQAAYIDKVL 403

Query: 241 IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
           +++ M++SKKG LP RHGVHLSKEQ PKTPQ+ E M+ +PYASAVGSLMYAMLCTRPDIC
Sbjct: 404 VKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEKMRRVPYASAVGSLMYAMLCTRPDIC 463

Query: 301 YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
           +AVG+VSRYQSNPG  HW AVK+ILKYLR TR+YML+Y  ++LI  GYTDSDFQ+D + R
Sbjct: 464 FAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYMLVYSGRELIPIGYTDSDFQSDRDSR 523

BLAST of CmoCh04G019820 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 1.6e-157
Identity = 270/357 (75.63%), Positives = 306/357 (85.71%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            MN EMESMY N VW LVD P  VKPIGCKWIYK+KRDQ   V  FKARLVAKG+T+   +
Sbjct: 840  MNSEMESMYDNKVWTLVDLPSDVKPIGCKWIYKKKRDQDSNVTVFKARLVAKGFTRSLSL 899

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
             YEETFSPVAMLKSIRI+L+IA F+DYEIWQMDVKTAFLNGNLEESIYM QPEGF+ QD 
Sbjct: 900  SYEETFSPVAMLKSIRIILAIAAFFDYEIWQMDVKTAFLNGNLEESIYMIQPEGFVAQDQ 959

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
            EQ+ CKL+ SIYGLKQASRSWNI+FD  IK++GF QNVDE CVYK+I  S VAFL+LYVD
Sbjct: 960  EQKACKLQGSIYGLKQASRSWNIRFDEVIKAFGFIQNVDESCVYKKISGSVVAFLILYVD 1019

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGNDV  L D+K WL T F MKDLGEAQ++LGI+I R+R NKT+ +SQ++YIDK+L
Sbjct: 1020 DILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYILGIRIYRDRSNKTIGMSQSTYIDKVL 1079

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             R+KMQDSKKGLLPFRHG+HLSKEQ PKTPQEVEDM++IPY+SA+GSLMYAMLCTRPD+C
Sbjct: 1080 SRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVC 1139

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGA-KDLILTGYTDSDFQTD 357
            YA+ IVSRYQSNPGR HWTAVKNILKYLR TR+  L+YG  KDL + GYTDS FQTD
Sbjct: 1140 YALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVYGGDKDLAVKGYTDSSFQTD 1196

BLAST of CmoCh04G019820 vs. TrEMBL
Match: A5C065_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_044406 PE=4 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 2.4e-145
Identity = 251/360 (69.72%), Positives = 295/360 (81.94%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M  EMESMY N VWELV+ P GVKPIGCKWIYK+KR   GK +T+KA LVAKGYTQ+EG+
Sbjct: 129 MKSEMESMYSNQVWELVEPPKGVKPIGCKWIYKKKRGIDGKXZTYKAXLVAKGYTQKEGI 188

Query: 61  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
           DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNG+L+E IYM Q EGFI    
Sbjct: 189 DYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGSLDECIYMKQXEGFIXNGQ 248

Query: 121 EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
           E  +CKL RSIYGLKQASRSWN  FD  IK++GF Q  DE CVYK+     V FLVLYVD
Sbjct: 249 EHLLCKLNRSIYGLKQASRSWNTCFDQTIKTFGFDQCHDESCVYKKWNGKKVVFLVLYVD 308

Query: 181 DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
           DILLIGN +G+LT +K WL+ +F MKDLGEA  +LGI+++RNRK K + LSQA YID +L
Sbjct: 309 DILLIGNCIGMLTSVKDWLSQRFDMKDLGEAAHILGIKLMRNRKKKMIGLSQALYIDTIL 368

Query: 241 IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
            R+ MQ SKKG LPFRHG+ LSK+QSPKTP+E+E MK +PYASAVGSLMYAMLCTRPDIC
Sbjct: 369 NRFNMQGSKKGFLPFRHGIXLSKDQSPKTPEEIESMKAVPYASAVGSLMYAMLCTRPDIC 428

Query: 301 YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
           +AVG+VSR+QSN GR HW AVK+I+KYL+ TRDYML++ +++L   GYTDSDFQ+D + R
Sbjct: 429 FAVGMVSRFQSNXGREHWXAVKHIIKYLKRTRDYMLVFQSENLXPIGYTDSDFQSDQDSR 488

BLAST of CmoCh04G019820 vs. TrEMBL
Match: O23864_9ORYZ (Polyprotein OS=Oryza australiensis PE=4 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 1.1e-132
Identity = 223/357 (62.46%), Positives = 281/357 (78.71%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            M  E+ESMY N VW LVD PDGVK I CKW++K+K D  G V  +KARLVAKG+ Q +GV
Sbjct: 814  MKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQGV 873

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            DY+ETFSPVAMLKSIRI+L+IA ++DYEIWQMDVKTAFLNGNL E +YM QP+GF++ + 
Sbjct: 874  DYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFVDPES 933

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
              ++CKL++SIYGLKQASRSWNI+FD  IK +GF +N +E CVYK++  S + FL+LYVD
Sbjct: 934  PGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLILYVD 993

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGND+ +L  +K  L   F MKDLGEA ++LGI+I R+R  + + LSQ++YIDK+L
Sbjct: 994  DILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVL 1053

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             R+ M DSKKG LP  HG++LSK Q P+T  E   M  +PYASA+GS+MYAMLCTRPD+ 
Sbjct: 1054 KRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLCTRPDVS 1113

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGA-KDLILTGYTDSDFQTD 357
            YA+   SRYQS+PG  HWTAVKNILKYLR T+D  L+YG  +DL+++GYTD+ FQTD
Sbjct: 1114 YALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTDASFQTD 1170

BLAST of CmoCh04G019820 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 232.6 bits (592), Expect = 4.4e-61
Identity = 129/360 (35.83%), Positives = 206/360 (57.22%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M+ E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161

Query: 61  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
           D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221

Query: 121 E----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLV 180
           +      VC LK+SIYGLKQASR W +KF   +  +GF Q+  +   + +I  +    ++
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 181 LYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYI 240
           +YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R+     + + Q  Y 
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAG--INICQRKYA 341

Query: 241 DKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTR 300
             +L    +   K   +P    V  S         +  D K   Y   +G LMY  + TR
Sbjct: 342 LDLLDETGLLGCKPSSVPMDPSVTFSAHSG----GDFVDAK--AYRRLIGRLMYLQI-TR 401

Query: 301 PDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAK-DLILTGYTDSDFQT 356
            DI +AV  +S++   P  AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+
Sbjct: 402 LDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS 452

BLAST of CmoCh04G019820 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 81.3 bits (199), Expect = 1.6e-15
Identity = 62/181 (34.25%), Positives = 94/181 (51.93%), Query Frame = 1

Query: 174 FLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQA 233
           +L+LYVDDILL G+   +L  +   L++ F MKDLG   + LGIQI  +     L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 234 SYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAML 293
            Y +++L    M D K    P    ++ S   + K P   +      + S VG+L Y  L
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSD------FRSIVGALQYLTL 121

Query: 294 CTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSD 353
            TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSD 172

BLAST of CmoCh04G019820 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 75.1 bits (183), Expect = 1.2e-13
Identity = 34/82 (41.46%), Positives = 50/82 (60.98%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q EG+
Sbjct: 44  MQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGI 103

Query: 61  DYEETFSPVAMLKSIRILLSIA 83
            + ET+SPV    +IR +L++A
Sbjct: 104 YFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh04G019820 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 654.1 bits (1686), Expect = 1.7e-184
Identity = 315/360 (87.50%), Positives = 340/360 (94.44%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            MNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGV
Sbjct: 824  MNLEMESMYFNSVWTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGV 883

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            DYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QD 
Sbjct: 884  DYEETFSPVAMLKSIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQ 943

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
            EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL+LYVD
Sbjct: 944  EQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVD 1003

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGNDV  LTD+K WL TQFQMKDLGEAQ++LGIQI+RNRKNKTLA+SQASYIDK+L
Sbjct: 1004 DILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVL 1063

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVEDM++IPY+SAVGSLMYAMLCTRPDIC
Sbjct: 1064 SRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDIC 1123

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
            Y+VGIVSRYQSNPGR HWTAVKNILKYLR TR+YML+YGAKDLILTGYTDSDFQ+D + R
Sbjct: 1124 YSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDAR 1183

BLAST of CmoCh04G019820 vs. NCBI nr
Match: gi|147768021|emb|CAN69397.1| (hypothetical protein VITISV_021035 [Vitis vinifera])

HSP 1 Score: 564.7 bits (1454), Expect = 1.4e-157
Identity = 268/360 (74.44%), Positives = 315/360 (87.50%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M LE+ESMY NSVW+LVD P+G+KPIGCKWIYK KR   GKV+TFKARLVAKG+TQ+EGV
Sbjct: 164 MKLEIESMYSNSVWKLVDLPEGIKPIGCKWIYKXKRGPNGKVETFKARLVAKGFTQKEGV 223

Query: 61  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
           DYE+TFSPV MLKSIRILLSI  +YDYEIWQMDVKT FLNG+LEE+IYM QPEGF+ +D 
Sbjct: 224 DYEDTFSPVXMLKSIRILLSIXAYYDYEIWQMDVKTXFLNGHLEETIYMVQPEGFVVKDQ 283

Query: 121 EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
           EQ+VCKL+RSIYGLKQASRSWNI F+ AIKSYGF+QN+ EPCVYK+I    V FLVLYVD
Sbjct: 284 EQKVCKLQRSIYGLKQASRSWNIIFNEAIKSYGFEQNLGEPCVYKQIGGDKVVFLVLYVD 343

Query: 181 DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
           DILLIGNDV  L+ +K+WLA+QFQMKDLGEA ++LGIQ+ R+RKN+ LALSQA+YIDK+L
Sbjct: 344 DILLIGNDVESLSKVKNWLASQFQMKDLGEASYILGIQMTRDRKNRLLALSQAAYIDKVL 403

Query: 241 IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
           +++ M++SKKG LP RHGVHLSKEQ PKTPQ+ E M+ +PYASAVGSLMYAMLCTRPDIC
Sbjct: 404 VKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEKMRRVPYASAVGSLMYAMLCTRPDIC 463

Query: 301 YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
           +AVG+VSRYQSNPG  HW AVK+ILKYLR TR+YML+Y  ++LI  GYTDSDFQ+D + R
Sbjct: 464 FAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYMLVYSGRELIPIGYTDSDFQSDRDSR 523

BLAST of CmoCh04G019820 vs. NCBI nr
Match: gi|1019597807|gb|AMY96445.1| (gag/pol protein [Momordica dioica])

HSP 1 Score: 563.9 bits (1452), Expect = 2.3e-157
Identity = 270/357 (75.63%), Positives = 306/357 (85.71%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            MN EMESMY N VW LVD P  VKPIGCKWIYK+KRDQ   V  FKARLVAKG+T+   +
Sbjct: 840  MNSEMESMYDNKVWTLVDLPSDVKPIGCKWIYKKKRDQDSNVTVFKARLVAKGFTRSLSL 899

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
             YEETFSPVAMLKSIRI+L+IA F+DYEIWQMDVKTAFLNGNLEESIYM QPEGF+ QD 
Sbjct: 900  SYEETFSPVAMLKSIRIILAIAAFFDYEIWQMDVKTAFLNGNLEESIYMIQPEGFVAQDQ 959

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
            EQ+ CKL+ SIYGLKQASRSWNI+FD  IK++GF QNVDE CVYK+I  S VAFL+LYVD
Sbjct: 960  EQKACKLQGSIYGLKQASRSWNIRFDEVIKAFGFIQNVDESCVYKKISGSVVAFLILYVD 1019

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGNDV  L D+K WL T F MKDLGEAQ++LGI+I R+R NKT+ +SQ++YIDK+L
Sbjct: 1020 DILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYILGIRIYRDRSNKTIGMSQSTYIDKVL 1079

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             R+KMQDSKKGLLPFRHG+HLSKEQ PKTPQEVEDM++IPY+SA+GSLMYAMLCTRPD+C
Sbjct: 1080 SRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVC 1139

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGA-KDLILTGYTDSDFQTD 357
            YA+ IVSRYQSNPGR HWTAVKNILKYLR TR+  L+YG  KDL + GYTDS FQTD
Sbjct: 1140 YALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVYGGDKDLAVKGYTDSSFQTD 1196

BLAST of CmoCh04G019820 vs. NCBI nr
Match: gi|147822228|emb|CAN64055.1| (hypothetical protein VITISV_044406 [Vitis vinifera])

HSP 1 Score: 523.5 bits (1347), Expect = 3.5e-145
Identity = 251/360 (69.72%), Positives = 295/360 (81.94%), Query Frame = 1

Query: 1   MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
           M  EMESMY N VWELV+ P GVKPIGCKWIYK+KR   GK +T+KA LVAKGYTQ+EG+
Sbjct: 129 MKSEMESMYSNQVWELVEPPKGVKPIGCKWIYKKKRGIDGKXZTYKAXLVAKGYTQKEGI 188

Query: 61  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
           DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNG+L+E IYM Q EGFI    
Sbjct: 189 DYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGSLDECIYMKQXEGFIXNGQ 248

Query: 121 EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
           E  +CKL RSIYGLKQASRSWN  FD  IK++GF Q  DE CVYK+     V FLVLYVD
Sbjct: 249 EHLLCKLNRSIYGLKQASRSWNTCFDQTIKTFGFDQCHDESCVYKKWNGKKVVFLVLYVD 308

Query: 181 DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
           DILLIGN +G+LT +K WL+ +F MKDLGEA  +LGI+++RNRK K + LSQA YID +L
Sbjct: 309 DILLIGNCIGMLTSVKDWLSQRFDMKDLGEAAHILGIKLMRNRKKKMIGLSQALYIDTIL 368

Query: 241 IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
            R+ MQ SKKG LPFRHG+ LSK+QSPKTP+E+E MK +PYASAVGSLMYAMLCTRPDIC
Sbjct: 369 NRFNMQGSKKGFLPFRHGIXLSKDQSPKTPEEIESMKAVPYASAVGSLMYAMLCTRPDIC 428

Query: 301 YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR 360
           +AVG+VSR+QSN GR HW AVK+I+KYL+ TRDYML++ +++L   GYTDSDFQ+D + R
Sbjct: 429 FAVGMVSRFQSNXGREHWXAVKHIIKYLKRTRDYMLVFQSENLXPIGYTDSDFQSDQDSR 488

BLAST of CmoCh04G019820 vs. NCBI nr
Match: gi|2443320|dbj|BAA22288.1| (polyprotein [Oryza australiensis])

HSP 1 Score: 481.5 bits (1238), Expect = 1.5e-132
Identity = 223/357 (62.46%), Positives = 281/357 (78.71%), Query Frame = 1

Query: 1    MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGV 60
            M  E+ESMY N VW LVD PDGVK I CKW++K+K D  G V  +KARLVAKG+ Q +GV
Sbjct: 814  MKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQGV 873

Query: 61   DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDH 120
            DY+ETFSPVAMLKSIRI+L+IA ++DYEIWQMDVKTAFLNGNL E +YM QP+GF++ + 
Sbjct: 874  DYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFVDPES 933

Query: 121  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVD 180
              ++CKL++SIYGLKQASRSWNI+FD  IK +GF +N +E CVYK++  S + FL+LYVD
Sbjct: 934  PGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLILYVD 993

Query: 181  DILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKML 240
            DILLIGND+ +L  +K  L   F MKDLGEA ++LGI+I R+R  + + LSQ++YIDK+L
Sbjct: 994  DILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVL 1053

Query: 241  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC 300
             R+ M DSKKG LP  HG++LSK Q P+T  E   M  +PYASA+GS+MYAMLCTRPD+ 
Sbjct: 1054 KRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLCTRPDVS 1113

Query: 301  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGA-KDLILTGYTDSDFQTD 357
            YA+   SRYQS+PG  HWTAVKNILKYLR T+D  L+YG  +DL+++GYTD+ FQTD
Sbjct: 1114 YALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTDASFQTD 1170

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
POLX_TOBAC    5.0e-99   49.31     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME   1.4e-61   35.93     Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
YCH4_YEAST    6.8e-32   33.72     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YH41B_YEAST   3.3e-18   24.85     Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
YJ41B_YEAST   4.3e-18   24.85     Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name         E-value    Identity  Description
E2GK51_BRYDI       1.2e-184   87.50     Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
A5AUE7_VITVI       9.6e-158   74.44     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1
A0A165U314_9ROSI   1.6e-157   75.63     Gag/pol protein OS=Momordica dioica PE=4 SV=1
A5C065_VITVI       2.4e-145   69.72     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_044406 PE=4 SV=1
O23864_9ORYZ       1.1e-132   62.46     Polyprotein OS=Oryza australiensis PE=4 SV=1

Match Name    E-value   Identity  Description
AT4G23160.1   4.4e-61   35.83     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1   1.6e-15   34.25     ATMG00810.1 DNA/RNA polymerases superfamily protein
ATMG00820.1   1.2e-13   41.46     ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)

Match Name                     E-value    Identity  Description
gi|299474487|gb|ADJ18449.1|    1.7e-184   87.50     gag/pol protein [Bryonia dioica]
gi|147768021|emb|CAN69397.1|   1.4e-157   74.44     hypothetical protein VITISV_021035 [Vitis vinifera]
gi|1019597807|gb|AMY96445.1|   2.3e-157   75.63     gag/pol protein [Momordica dioica]
gi|147822228|emb|CAN64055.1|   3.5e-145   69.72     hypothetical protein VITISV_044406 [Vitis vinifera]
gi|2443320|dbj|BAA22288.1|     1.5e-132   62.46     polyprotein [Oryza australiensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013103  RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0006310      DNA recombination
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G019820.1  CmoCh04G019820.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                      Source   Source Term  Source Description               Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727      RVT_2                            coord: 11..254, score: 8.8
None       No IPR available                                     PANTHER  PTHR11439    GAG-POL-RELATED RETROTRANSPOSON  coord: 1..356, score: 5.9E
None       No IPR available                                     unknown  SSF56672     DNA/RNA polymerases              coord: 11..235, score: 6.23E-15; coord: 265..346, score: 6.23

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CmoCh04G019820  Cp4.1LG01g16700  Cucurbita pepo (Zucchini)  cmocpeB673
The following gene(s) are paralogous to this gene:

None