CmoCh04G021440 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G021440
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
LocationCmo_Chr04 : 14906602 .. 14914959 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAAACTCAATAGTACAATTACTTGCTTCTGAGAAATTAAACGGCAACAATTACACAACTTGGAAATCAAACCTAAATACAATACTTGTAATTGATGATTTGAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCTAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAGGAAAATGACAAAGCCCGAGTATACATTTTAGCCAGCATATCTGATGTTTTAGCTAAGAAACACAATGTCATGGGTACTGCTAAAGAGATTATGGAATCTCTGAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGATATGACGCCATAAAATACATTTACAACTACCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGACAAAAAAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTGCTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGTAAAGGGAAAAATAAGATTCTTTCTGACTGCAAACACAAGGTTCAAAAAGCACATAAAGAAAAATGTTTCCATTGCAACAAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTACAGAGAAGAAAGCCGAAAAGACACAACAAGATAAATATGATTTACTCGTTGTAGAAATATGTTTAGTAGAGTATGATAACTCAACTTGGATACTAGATTCAGGGGCCACTAATCATATTTGTTCTTTTTTCCAGGAAACTAGTTCCTGGAGAATGCTTGCGGACGACGAGATAACACTCAGGGTTGGAACAGGAGAGGTTGTCTAAGCAAGATCAGTAGGAAATTTAAAGTTGTTTTTTGGAGATAGATTCATTATATTAGATAATGTACTTTTTGTTCCTCGAATGAAAGGAAATCTAATATCTATCTCTTGTTTATTAGAACAATTGTATAAAGTATCTTTTGAAATTAATGAAGTGTTCATTTGCAAAAGAGGTATTCATATTTGTTCTGCAAAACTAGAAAACAACTTATATATGTTAAAACCGAGCGAAACAAAAGCTATTTTAAATATCGAGATGTTTAAAACAGCTGAAACTCAAAATAAACGACAAAAGATTTCTTCTAATACCTATCTTTGGCACTTAAGACTAGGCCATATTAATCTCAATAGAATTGAGAGATTGGTTAAAAGAGGACTTCTAAATGAGCTAGAAGACAACTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATGACTAAACAACCGTTTTTTGAAAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGTCTCAATAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCTGATCCGTTTTGAGGATGTGCAGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTAGTTTACGTCACTTCAGGATTTGGGGATGTCTAGCACATGTGTTGTTGCAAAACCCCAAGAAATTAGAACGTCGTTCAAAATTATGTCTATTCGTAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGACCCTCAAGAAAATAGAGTGCTTGTATCAACAAATGCTACATTCTGAGAGGAAGACCACGTAAGAAATCATCAACCTCGCAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTAATGATAAAACAACAAGAGTTGTTGATCAAACTGGTCCTTCAACAAGAATTGTTGATGGAGCTGGCACTTCTGGTCAATCACATCCTTCTCACGAGTTGAGAATGCCTCAACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTAAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAAGATAGAGATCAGTGGATTAAAGACATGAATCTTGAAATGGAGTCAATTACTTCAATTCAGTTTGGGAACTTGTAGATNNNNNNNNNNATGGTCCATTTCAATGTGACAAAAAAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTGCTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGTAAAGGGAAAAATAAGATTCTTTCTGACTGCAAACACAAGGTTCAAAAAGCACATAAAGAAAAATGTTTCCATTGCAACAAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTACAGAGAAGAAAGCCGAAAAGACACAACAAGATAAATATGATTTACTCGTTGTAGAAATATGTTTAGTAGAGTATGATAACTCAACTTGGATACTAGATTCAGGGGCCACTAATCATATTTGTTCTTTTTTCCAGGAAACTAGTTCCTGGAGAATGCTTGCGGACGACGAGATAACACTCAGGGTTGGAACAGGAGAGGTTGTCTAAGCAAGATCAGTAGGAAATTTAAAGTTGTTTTTTGGAGATAGATTCATTATATTAGATAATGTACTTTTTGTTCCTCGAATGAAAGGAAATCTAATATCTATCTCTTGTTTATTAGAACAATTGTATAAAGTATCTTTTGAAATTAATGAAGTGTTCATTTGCAAAAGAGGTATTCATATTTGTTCTGCAAAACTAGAAAACAACTTATATATGTTAAAACCGAGCGAAACAAAAGCTATTTTAAATATCGAGATGTTTAAAACAGCTGAAACTCAAAATAAACGACAAAAGATTTCTTCTAATACCTATCTTTGGCACTTAAGACTAGGCCATATTAATCTCAATAGAATTGAGAGATTGGTTAAAAGAGGACTTCTAAATGAGCTAGAAGACAACTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATGACTAAACAACCGTTTTTTGAAAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGTCTCAATAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCTGATCCGTTTTGAGGATGTGCAGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTAGTTTACGTCACTTCAGGATTTGGGGATGTCTAGCACATGTGTTGTTGCAAAACCCCAAGAAATTAGAACGTCGTTCAAAATTATGTCTATTCGTAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGACCCTCAAGAAAATAGAGTGCTTGTATCAACAAATGCTACATTCTGAGAGGAAGACCACGTAAGAAATCATCAACCTCGCAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTAATGATAAAACAACAAGAGTTGTTGATCAAACTGGTCCTTCAACAAGAATTGTTGATGGAGCTGGCACTTCTGGTCAATCACATCCTTCTCACGAGTTGAGAATGCCTCAACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTAAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAAGATAGAGATCAGTGGATTAAAGACATGAATCTTGAAATGGAGTCAATTACTTCAATTCAGTTTGGGAACTTGTAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACTTGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATGATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAAAGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTACGTTGACGATATCCTACTCATTCGGAATGATGTGGGATTTTTGACTGACATTAAGCATTGGCTAGCGACACAATTCTAAATGAAAGATTTGGGAGAGGCTCGGTTTGTTCTTAGAATCCAAATTGTACGGAATCGCAAGAATAAAACACTAGCATTGTCTCAGGCATCATACATTGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTANNNNNNNNNNTTTTTTGGAGATAGATTCATTATATTAGATAATGTACTTTTTGTTCCTCGAATGAAAGGAAATCTAATATCTATCTCTTGTTTATTAGAACAATTGTATAAAGTATCTTTTGAAATTAATGAAGTGTTCATTTGCAAAAGAGGTATTCATATTTGTTCTGCAAAACTAGAAAACAACTTATATATGTTAAAACCGAGCGAAACAAAAGCTATTTTAAATATCGAGATGTTTAAAACAGCTGAAACTCAAAATAAACGACAAAAGATTTCTTCTAATACCTATCTTTGGCACTTAAGACTAGGCCATATTAATCTCAATAGAATTGAGAGATTGGTTAAAAGAGGACTTCTAAATGAGCTAGAAGACAACTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATGACTAAACAACCGTTTTTTGAAAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGTCTCAATAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCTGATCCGTTTTGAGGATGTGCAGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTAGTTTACGTCACTTCAGGATTTGGGGATGTCTAGCACATGTGTTGTTGCAAAACCCCAAGAAATTAGAACGTCGTTCAAAATTATGTCTATTCGTAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGACCCTCAAGAAAATAGAGTGCTTGTATCAACAAATGCTACATTCTGAGAGGAAGACCACGTAAGAAATCATCAACCTCGCAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTAATGATAAAACAACAAGAGTTGTTGATCAAACTGGTCCTTCAACAAGAATTGTTGATGGAGCTGGCACTTCTGGTCAATCACATCCTTCTCACGAGTTGAGAATGCCTCAACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTAAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAAGATAGAGATCAGTGGATTAAAGACATGAATCTTGAAATGGAGTCAATTACTTCAATTCAGTTTGGGAACTTGTAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACTTGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATGATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAAAGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTACGTTGACGATATCCTACTCATTCGGAATGATGTGGGATTTTTGACTGACATTAAGCATTGGCTAGCGACACAATTCTAAATGAAAGATTTGGGAGAGGCTCGGTTTGTTCTTAGAATCCAAATTGTACGGAATCGCAAGAATAAAACACTAGCATTGTCTCAGGCATCATACATTGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTACCCAACCCGACATATGCTATGCTATTGGAATTGTCAGCAGATATCAGTCCAATTCGGGGCATGCTCATTGGACTGCTGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCAAGGAAATCGACATCAGAATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTAATTCCACCATGGAGGCTGAGTATGTTGCTGTTTATGAAGCAGCGAAAGAATCTGTATGGCATAGGAAGTTCTTAACTCATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAATGA

mRNA sequence

ATGACAAACTCAATAGTACAATTACTTGCTTCTGAGAAATTAAACGGCAACAATTACACAACTTGGAAATCAAACCTAAATACAATACTTGTAATTGATGATTTGAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCTAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAGGAAAATGACAAAGCCCGAGTATACATTTTAGCCAGCATATCTGATGTTTTAGCTAAGAAACACAATGTCATGGGTACTGCTAAAGAGATTATGGAATCTCTGAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGATATGACGCCATAAAATACATTTACAACTACCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGACAAAAAAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTGCTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAAAATAAGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACTTGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATGATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAAAGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTACCCAACCCGACATATGCTATGCTATTGGAATTGTCAGCAGATATCAGTCCAATTCGGGGCATGCTCATTGGACTGCTGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCAAGGAAATCGACATCAGAATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTAATTCCACCATGGAGGCTGAGTATGTTGCTGTTTATGAAGCAGCGAAAGAATCTGTATGGCATAGGAAGTTCTTAACTCATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAATGA

Coding sequence (CDS)

ATGACAAACTCAATAGTACAATTACTTGCTTCTGAGAAATTAAACGGCAACAATTACACAACTTGGAAATCAAACCTAAATACAATACTTGTAATTGATGATTTGAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCTAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAGGAAAATGACAAAGCCCGAGTATACATTTTAGCCAGCATATCTGATGTTTTAGCTAAGAAACACAATGTCATGGGTACTGCTAAAGAGATTATGGAATCTCTGAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGATATGACGCCATAAAATACATTTACAACTACCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGACAAAAAAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTGCTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAAAATAAGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACTTGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATGATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAAAGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTACCCAACCCGACATATGCTATGCTATTGGAATTGTCAGCAGATATCAGTCCAATTCGGGGCATGCTCATTGGACTGCTGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCAAGGAAATCGACATCAGAATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTAATTCCACCATGGAGGCTGAGTATGTTGCTGTTTATGAAGCAGCGAAAGAATCTGTATGGCATAGGAAGTTCTTAACTCATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAATGA
BLAST of CmoCh04G021440 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 8.0e-49
Identity = 99/200 (49.50%), Postives = 134/200 (67.00%), Query Frame = 1

Query: 736  LSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTA 795
            LSK+  P T +E  +M  +PY+SAVGSLMYAM+CT+PDI +A+G+VSR+  N G  HW A
Sbjct: 1091 LSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEA 1150

Query: 796  VKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSI 855
            VK IL+YLR T    L +G  D IL GYTD+D   D+D+RKS++  +FT +GGAI W+S 
Sbjct: 1151 VKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSK 1210

Query: 856  KQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEP 915
             Q C+A ST EAEY+A  E  KE +W ++FL  L +    ++   +YCD+  A+  SK  
Sbjct: 1211 LQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYV---VYCDSQSAIDLSKNS 1270

Query: 916  RSHKRGKHIERKYHLIREIV 936
              H R KHI+ +YH IRE+V
Sbjct: 1271 MYHARTKHIDVRYHWIREMV 1287

BLAST of CmoCh04G021440 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 136.3 bits (342), Expect = 1.7e-30
Identity = 72/186 (38.71%), Postives = 111/186 (59.68%), Query Frame = 1

Query: 755  PYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYG 814
            P  S +G LMY MLCT+PD+  A+ I+SRY S +    W  +K +L+YL+ T D  L++ 
Sbjct: 1180 PCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFK 1239

Query: 815  ---AKDLILTGYTDSDFQTDVDSRKSTSESVFTL-NGGAIIWRSIKQGCIANSTMEAEYV 874
               A +  + GY DSD+      RKST+  +F + +   I W + +Q  +A S+ EAEY+
Sbjct: 1240 KNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYM 1299

Query: 875  AVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHL 934
            A++EA +E++W +  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH 
Sbjct: 1300 ALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHF 1359

Query: 935  IREIVQ 937
             RE VQ
Sbjct: 1360 AREQVQ 1363

BLAST of CmoCh04G021440 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 67.0 bits (162), Expect = 1.2e-09
Identity = 50/170 (29.41%), Postives = 84/170 (49.41%), Query Frame = 1

Query: 520 MDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKS 579
           MDV TAFLN  ++E IY+ Q  GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 580 YGFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEKFREYKTEVENL--LGKT 639
            GF ++  E  +Y R  +    ++ +Y D L +     +  ++ ++  T++ ++  LGK 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 640 IKTL-----RSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVP 682
            K L     +S  G   + L  QDY+ +    S+++    T T + N  P
Sbjct: 121 DKFLGLNIHQSSNGD--ITLSLQDYIAKAASESEINTFKLTQTPLCNSKP 168

BLAST of CmoCh04G021440 vs. Swiss-Prot
Match: YD22B_YEAST (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 55.5 bits (132), Expect = 3.7e-06
Identity = 30/104 (28.85%), Postives = 50/104 (48.08%), Query Frame = 1

Query: 314 KGYRAK-----ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSE--A 373
           KG R K     E  + +HTD+ GP++   +    YFISF D+ +R+ ++Y +H + E   
Sbjct: 648 KGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESI 707

Query: 374 LEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGI 411
           L  F      ++N     +  ++ DRG EY +     +    GI
Sbjct: 708 LNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGI 751

BLAST of CmoCh04G021440 vs. Swiss-Prot
Match: YD23B_YEAST (Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-DR3 PE=3 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 3.7e-06
Identity = 30/104 (28.85%), Postives = 50/104 (48.08%), Query Frame = 1

Query: 314 KGYRAK-----ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSE--A 373
           KG R K     E  + +HTD+ GP++   +    YFISF D+ +R+ ++Y +H + E   
Sbjct: 648 KGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESI 707

Query: 374 LEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGI 411
           L  F      ++N     +  ++ DRG EY +     +    GI
Sbjct: 708 LNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGI 751

BLAST of CmoCh04G021440 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 1.7e-151
Identity = 295/494 (59.72%), Postives = 352/494 (71.26%), Query Frame = 1

Query: 446  DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRI 505
            D P  VKPIGCKWIYKRKRDQ GKVQ FKARLVAK YTQ+E VDYEETFSP+AMLKSIRI
Sbjct: 841  DLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRI 900

Query: 506  LLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQA 565
            LLSIATFY+YEIW+MDVKTAFLN NLEESIYM Q EGFI QD+EQ+VCKL++        
Sbjct: 901  LLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQK-------- 960

Query: 566  SRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREY 625
                          YG KQ             ++ ++ + +D     +   + +++   Y
Sbjct: 961  ------------SIYGLKQ-------------ASRSWNIRFDTAIKSYGFEQNVDEPCVY 1020

Query: 626  KTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSV 685
            K  V +++              ++ L   D ++   I + +    +   ++      K +
Sbjct: 1021 KKIVNSVVA-------------FLILYVDDILL---IGNDVEYLTDVKKWLNTQFQMKDL 1080

Query: 686  SETPY----ELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQS 745
             E  Y    ++ + RK++   +      +I K       ++SKKG L FRHG+HLSKEQ 
Sbjct: 1081 GEAQYILGIQIVRNRKNKTLAMSQ--ASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQC 1140

Query: 746  PKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILK 805
            PKTPQEVEDMR+IPY+SAVGSLMYAMLCT+PDICY++GIVSRYQSN G  HWTAVKNILK
Sbjct: 1141 PKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILK 1200

Query: 806  YLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIA 865
            YLRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTS SVFTLNGGA++WRS+KQ CIA
Sbjct: 1201 YLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTCIA 1260

Query: 866  NSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRG 925
            +STMEAEYVA  EAAKE+VW RKFLT LEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRG
Sbjct: 1261 DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG 1283

Query: 926  KHIERKYHLIREIV 936
            KHIERKYHLIREIV
Sbjct: 1321 KHIERKYHLIREIV 1283

BLAST of CmoCh04G021440 vs. TrEMBL
Match: Q2QUJ4_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g16060 PE=4 SV=2)

HSP 1 Score: 411.0 bits (1055), Expect = 3.9e-111
Identity = 276/735 (37.55%), Postives = 393/735 (53.47%), Query Frame = 1

Query: 253  TIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP-------------------------- 312
            TIK LRSDRGGEY+ L F +++ E GI  QL+ P                          
Sbjct: 354  TIKYLRSDRGGEYLSLEFGNHLKECGIVPQLTPPGTPQWNGVSERRNRTLLDIVRSMMSQ 413

Query: 313  -----------VETATYILNMVPSKSVSETPYELWKGR-------KGYRAKETLELVHTD 372
                       +E A + LN VPSKSV++TPYE+W G+       K +  +  ++ + +D
Sbjct: 414  TDLPLSFWAYALEKAAFTLNKVPSKSVNKTPYEIWTGKRPSLSFLKIWGCEVYVKRLQSD 473

Query: 373  LCGPMNIKAQGGCEYFISFIDD------YSRYDYLYLMHHKSEALEKFREYKTEVENLLG 432
               P + K      +F+ +  +      Y+R +    +      LEK    +    ++L 
Sbjct: 474  KLTPKSDKC-----FFVGYPKETKGYYFYNREEGKVFVARHGVFLEKEFILRKNSGSMLW 533

Query: 433  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWK 492
            K    L     G  + LR  D    +          E A   L  + S+  S    ++W 
Sbjct: 534  K--HQLHEGPKGYGVHLRENDEPTTY----------EEAMVGLGAMKSEIESMHVNQVWN 593

Query: 493  GRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKS 552
               D PDGVK I  KW++K+K D  G V I+KARLVAK + Q + VDY+ETFS +AMLKS
Sbjct: 594  -LVDPPDGVKAIEWKWVFKKKTDVDGNVHIYKARLVAKGFRQIQGVDYDETFSLVAMLKS 653

Query: 553  IRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGL 612
            I+I+L+I  ++DYEIW+MDVKTAF+N N++E +YMTQL+GF++    +++CKL++ IYGL
Sbjct: 654  IQIVLAITAYFDYEIWQMDVKTAFVNGNIDEDVYMTQLKGFVDPQSAKKICKLQKSIYGL 713

Query: 613  KQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEK 672
            KQASRSWNI+FD  +K++               ++ T   LVLY D + L+ +    LE 
Sbjct: 714  KQASRSWNIRFDEVVKAWA--------------LSKTKKSLVLYVDDILLIGNDIPMLE- 773

Query: 673  FREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVP 732
                KT ++N           D G     L  + Y      RS+    +  +TYI  ++ 
Sbjct: 774  --SVKTSLKNSFS------MKDLGEAAYILGIRIYR----DRSKRLIGLSQSTYIDKVLK 833

Query: 733  SKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQS 792
              ++ ++     KG      G+       + K +  QT  +  K                
Sbjct: 834  RFNMQDSK----KGFLPMSHGIN------LGKNQCPQTTDERNK---------------- 893

Query: 793  PKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILK 852
                     M  IPYASA+GS+MYAMLCT PD+ YA+   SRYQS+ G +HW AVKNILK
Sbjct: 894  ---------MSVIPYASAIGSIMYAMLCTHPDVSYALSATSRYQSDPGESHWIAVKNILK 953

Query: 853  YLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCI 912
            YLRRT+D  L+YG  ++L++ GYTD+ FQTD D  +S S  VF LNGGA+ W+S KQ  +
Sbjct: 954  YLRRTKDMFLVYGGQEELVVKGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1007

Query: 913  ANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR 936
             +S  EAEY+A  EAAKE+VW +KF++ L V+ +   P+ LYCDNSGA+A +KEPRSH++
Sbjct: 1014 VDSITEAEYIAASEAAKEAVWIKKFVSQLGVMTSASSPMDLYCDNSGAIAQAKEPRSHQK 1007

BLAST of CmoCh04G021440 vs. TrEMBL
Match: Q53K55_ORYSJ (Retrotransposon protein, putative, Ty1-copia sub-class OS=Oryza sativa subsp. japonica GN=LOC_Os11g23750 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 4.9e-106
Identity = 247/616 (40.10%), Postives = 337/616 (54.71%), Query Frame = 1

Query: 317 RAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTE 376
           +A E L LVHTD+CGPM+  A+GG  YFI+F +D+SRY Y+YLM HKS++ EKF+E++ E
Sbjct: 376 KASELLGLVHTDVCGPMSSTARGGFGYFITFTNDFSRYGYVYLMRHKSDSFEKFKEFQNE 435

Query: 377 VENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVE----------------- 436
           V+N LGKTIK LRSDRGGEY+ L F +++ E GI  QL+ P                   
Sbjct: 436 VQNHLGKTIKYLRSDRGGEYLSLEFGNHLKECGIVPQLTPPGTPQWNGVSERRNRTLLDM 495

Query: 437 --------------------TATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWI 496
                               TA + LN VPSKSV +TPYE+W G++     +K  GC+  
Sbjct: 496 VRSMMSQTDLPLSFWGYTLETAAFTLNRVPSKSVDKTPYEIWTGKRPSLSFLKIWGCEET 555

Query: 497 YKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIA--------- 556
            +     T   Q  +         Q E+V  E      A  +S RI  + A         
Sbjct: 556 PENASTSTQPQQAEQ-----DVVQQVEQVVVEPVVEAPASRRSERIRRTPASCGSTGFTK 615

Query: 557 ----TFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQAS 616
               T Y  +IW+MDVKTAFLN NL+E +YMTQ +GF++    +++CKL++ IYGLKQAS
Sbjct: 616 VRKDTTYTCKIWQMDVKTAFLNGNLDEDVYMTQPKGFVDPQSAKKICKLQKSIYGLKQAS 675

Query: 617 RSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEKFREY 676
           RSWNI FD  +K+ GF +N  EPCVYK+I  S + FL+LY D + L+ +    LE     
Sbjct: 676 RSWNIHFDEIVKALGFVKNEQEPCVYKKISGSALVFLILYVDDILLIENDIPMLE---SV 735

Query: 677 KTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSV 736
           KT ++N           D G     L  + Y      RS+    +  +TYI  ++   ++
Sbjct: 736 KTSLKNSFS------MKDLGEAAYILGIRIYK----DRSKRLIGLSQSTYIDKVLKRFNM 795

Query: 737 SETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTP 796
            ++     KG      G+       + K +  QT  +  K                    
Sbjct: 796 QDSK----KGFLPMSHGIN------LGKNQCPQTTNERNK-------------------- 855

Query: 797 QEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRR 856
                M  IPYASA+GS+MYAMLCT+PD+ YA+   S+YQS+ G +HW A+KNILKYLRR
Sbjct: 856 -----MSVIPYASAIGSIMYAMLCTRPDVSYALSATSQYQSDPGESHWIALKNILKYLRR 915

Query: 857 TRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANST 881
           T+D  L+YG  ++L++ GYTD+ FQ D D  +S S  VF LNGGA+ W+S KQ  +A+ST
Sbjct: 916 TKDMFLVYGGQEELVVNGYTDASFQIDKDDFRSQSGFVFYLNGGAVSWKSSKQDTVADST 938

BLAST of CmoCh04G021440 vs. TrEMBL
Match: Q5R1J2_ORYSJ (Integrase core domain, putative OS=Oryza sativa subsp. japonica GN=LOC_Os11g18580 PE=4 SV=1)

HSP 1 Score: 379.0 bits (972), Expect = 1.6e-101
Identity = 218/477 (45.70%), Postives = 295/477 (61.84%), Query Frame = 1

Query: 461  KRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEM 520
            K+K D  G V I+KARLVAK + Q + VDY+ETFSP+AMLK I I+L+IA ++DYEIW+M
Sbjct: 717  KKKTDVDGNVHIYKARLVAKGFRQIQGVDYDETFSPVAMLKFIWIVLAIAAYFDYEIWQM 776

Query: 521  DVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSY 580
             VKTAFLN NL+E +YMTQ +GF +    +++CKL + IYGLKQASRSWNI+FD  +K+ 
Sbjct: 777  YVKTAFLNGNLDEDVYMTQPKGFDDPQSAKKICKLHQSIYGLKQASRSWNIRFDEVVKAL 836

Query: 581  GFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKT 640
            GF +N +EPCVYK+I  S + FL+LY D + L+ +    LE     KT ++N        
Sbjct: 837  GFVRNEEEPCVYKKISGSALVFLILYVDDILLIGNDISMLE---SVKTSLKNSFS----- 896

Query: 641  LRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQ 700
               D G     L  + Y      RS+    +  +TYI  ++   ++ ++     KG    
Sbjct: 897  -MKDLGEAAYILGIRIYR----DRSKRLIGLSQSTYIDKVLKRFNMQDSK----KGFLPM 956

Query: 701  PDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASA 760
              G+       + K +  QT  +  K                         M  IPYASA
Sbjct: 957  SHGIN------LGKNQCPQTTDERNK-------------------------MSVIPYASA 1016

Query: 761  VGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDL 820
            +GS+MYAMLCT+PD+ YA+   SRYQS+ G +HW AVKNILKYLRRT+D  L YG  ++L
Sbjct: 1017 IGSIMYAMLCTRPDVSYALSATSRYQSDPGESHWIAVKNILKYLRRTKDMFLAYGGQEEL 1076

Query: 821  ILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKE 880
            ++ GYTD+ FQ D D  +S S  VF LNGGA+ W+S KQ  + +ST EAEY+    AA E
Sbjct: 1077 VVNGYTDASFQIDKDDFRSQSGFVFCLNGGAVSWKSSKQDIVVDSTTEAEYI----AASE 1136

Query: 881  SVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV 936
            +VW +KF++ L V+ +    + LYCDNSGA+A +KEPRSH++ KHI R+YHLIREIV
Sbjct: 1137 AVWIKKFVSQLGVMTSASSSMDLYCDNSGAIAQAKEPRSHQKSKHILRQYHLIREIV 1141

BLAST of CmoCh04G021440 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 1.8e-92
Identity = 171/217 (78.80%), Postives = 189/217 (87.10%), Query Frame = 1

Query: 721  KDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGI 780
            +DSKKGLL FRHG+HLSKEQ PKTPQEVEDMR+IPY+SA+GSLMYAMLCT+PD+CYA+ I
Sbjct: 1085 QDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVCYALSI 1144

Query: 781  VSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTS 840
            VSRYQSN G  HWTAVKNILKYLRRTR+  L+YG  KDL + GYTDS FQTD D  KS S
Sbjct: 1145 VSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVYGGDKDLAVKGYTDSSFQTDKDDSKSQS 1204

Query: 841  ESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPV 900
              VFTLNGGA+ WRS KQ C+A+ST EAEYVA  EAAKE+VW RKFLT L VVPNMHLP+
Sbjct: 1205 -GVFTLNGGAVSWRSSKQTCVADSTCEAEYVAACEAAKEAVWIRKFLTDLGVVPNMHLPI 1264

Query: 901  TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 937
            TLYCDNSGAVAN+KEPRSHKRGKHIERKYHLIREIV+
Sbjct: 1265 TLYCDNSGAVANAKEPRSHKRGKHIERKYHLIREIVE 1300

BLAST of CmoCh04G021440 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 139.8 bits (351), Expect = 8.5e-33
Identity = 69/181 (38.12%), Postives = 112/181 (61.88%), Query Frame = 1

Query: 448 PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILL 507
           P   KPIGCKW+YK K +  G ++ +KARLVAK YTQ+E +D+ ETFSP+  L S++++L
Sbjct: 121 PPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLIL 180

Query: 508 SIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDRE----QRVCKLKRFIYGLK 567
           +I+  Y++ + ++D+  AFLN +L+E IYM    G+  +  +      VC LK+ IYGLK
Sbjct: 181 AISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLK 240

Query: 568 QASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST-VAFLVLYDYLYLMHHKSEALEKF 624
           QASR W +KF   +  +GF Q+  +   + +I  +  +  LV  D + +  +   A+++ 
Sbjct: 241 QASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDEL 300

BLAST of CmoCh04G021440 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 55.5 bits (132), Expect = 2.1e-07
Identity = 24/57 (42.11%), Postives = 37/57 (64.91%), Query Frame = 1

Query: 454 IGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIA 511
           +GCKW++K K    G +   KARLVAK + Q E + + ET+SP+    +IR +L++A
Sbjct: 69  LGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh04G021440 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 545.0 bits (1403), Expect = 2.5e-151
Identity = 295/494 (59.72%), Postives = 352/494 (71.26%), Query Frame = 1

Query: 446  DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRI 505
            D P  VKPIGCKWIYKRKRDQ GKVQ FKARLVAK YTQ+E VDYEETFSP+AMLKSIRI
Sbjct: 841  DLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRI 900

Query: 506  LLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQA 565
            LLSIATFY+YEIW+MDVKTAFLN NLEESIYM Q EGFI QD+EQ+VCKL++        
Sbjct: 901  LLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQK-------- 960

Query: 566  SRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREY 625
                          YG KQ             ++ ++ + +D     +   + +++   Y
Sbjct: 961  ------------SIYGLKQ-------------ASRSWNIRFDTAIKSYGFEQNVDEPCVY 1020

Query: 626  KTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSV 685
            K  V +++              ++ L   D ++   I + +    +   ++      K +
Sbjct: 1021 KKIVNSVVA-------------FLILYVDDILL---IGNDVEYLTDVKKWLNTQFQMKDL 1080

Query: 686  SETPY----ELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQS 745
             E  Y    ++ + RK++   +      +I K       ++SKKG L FRHG+HLSKEQ 
Sbjct: 1081 GEAQYILGIQIVRNRKNKTLAMSQ--ASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQC 1140

Query: 746  PKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILK 805
            PKTPQEVEDMR+IPY+SAVGSLMYAMLCT+PDICY++GIVSRYQSN G  HWTAVKNILK
Sbjct: 1141 PKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILK 1200

Query: 806  YLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIA 865
            YLRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTS SVFTLNGGA++WRS+KQ CIA
Sbjct: 1201 YLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTCIA 1260

Query: 866  NSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRG 925
            +STMEAEYVA  EAAKE+VW RKFLT LEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRG
Sbjct: 1261 DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG 1283

Query: 926  KHIERKYHLIREIV 936
            KHIERKYHLIREIV
Sbjct: 1321 KHIERKYHLIREIV 1283

BLAST of CmoCh04G021440 vs. NCBI nr
Match: gi|108862437|gb|ABA97342.2| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 411.0 bits (1055), Expect = 5.6e-111
Identity = 276/735 (37.55%), Postives = 393/735 (53.47%), Query Frame = 1

Query: 253  TIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP-------------------------- 312
            TIK LRSDRGGEY+ L F +++ E GI  QL+ P                          
Sbjct: 354  TIKYLRSDRGGEYLSLEFGNHLKECGIVPQLTPPGTPQWNGVSERRNRTLLDIVRSMMSQ 413

Query: 313  -----------VETATYILNMVPSKSVSETPYELWKGR-------KGYRAKETLELVHTD 372
                       +E A + LN VPSKSV++TPYE+W G+       K +  +  ++ + +D
Sbjct: 414  TDLPLSFWAYALEKAAFTLNKVPSKSVNKTPYEIWTGKRPSLSFLKIWGCEVYVKRLQSD 473

Query: 373  LCGPMNIKAQGGCEYFISFIDD------YSRYDYLYLMHHKSEALEKFREYKTEVENLLG 432
               P + K      +F+ +  +      Y+R +    +      LEK    +    ++L 
Sbjct: 474  KLTPKSDKC-----FFVGYPKETKGYYFYNREEGKVFVARHGVFLEKEFILRKNSGSMLW 533

Query: 433  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWK 492
            K    L     G  + LR  D    +          E A   L  + S+  S    ++W 
Sbjct: 534  K--HQLHEGPKGYGVHLRENDEPTTY----------EEAMVGLGAMKSEIESMHVNQVWN 593

Query: 493  GRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKS 552
               D PDGVK I  KW++K+K D  G V I+KARLVAK + Q + VDY+ETFS +AMLKS
Sbjct: 594  -LVDPPDGVKAIEWKWVFKKKTDVDGNVHIYKARLVAKGFRQIQGVDYDETFSLVAMLKS 653

Query: 553  IRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGL 612
            I+I+L+I  ++DYEIW+MDVKTAF+N N++E +YMTQL+GF++    +++CKL++ IYGL
Sbjct: 654  IQIVLAITAYFDYEIWQMDVKTAFVNGNIDEDVYMTQLKGFVDPQSAKKICKLQKSIYGL 713

Query: 613  KQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEK 672
            KQASRSWNI+FD  +K++               ++ T   LVLY D + L+ +    LE 
Sbjct: 714  KQASRSWNIRFDEVVKAWA--------------LSKTKKSLVLYVDDILLIGNDIPMLE- 773

Query: 673  FREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVP 732
                KT ++N           D G     L  + Y      RS+    +  +TYI  ++ 
Sbjct: 774  --SVKTSLKNSFS------MKDLGEAAYILGIRIYR----DRSKRLIGLSQSTYIDKVLK 833

Query: 733  SKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQS 792
              ++ ++     KG      G+       + K +  QT  +  K                
Sbjct: 834  RFNMQDSK----KGFLPMSHGIN------LGKNQCPQTTDERNK---------------- 893

Query: 793  PKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILK 852
                     M  IPYASA+GS+MYAMLCT PD+ YA+   SRYQS+ G +HW AVKNILK
Sbjct: 894  ---------MSVIPYASAIGSIMYAMLCTHPDVSYALSATSRYQSDPGESHWIAVKNILK 953

Query: 853  YLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCI 912
            YLRRT+D  L+YG  ++L++ GYTD+ FQTD D  +S S  VF LNGGA+ W+S KQ  +
Sbjct: 954  YLRRTKDMFLVYGGQEELVVKGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1007

Query: 913  ANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR 936
             +S  EAEY+A  EAAKE+VW +KF++ L V+ +   P+ LYCDNSGA+A +KEPRSH++
Sbjct: 1014 VDSITEAEYIAASEAAKEAVWIKKFVSQLGVMTSASSPMDLYCDNSGAIAQAKEPRSHQK 1007

BLAST of CmoCh04G021440 vs. NCBI nr
Match: gi|62732694|gb|AAX94813.1| (retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group])

HSP 1 Score: 394.0 bits (1011), Expect = 7.1e-106
Identity = 247/616 (40.10%), Postives = 337/616 (54.71%), Query Frame = 1

Query: 317 RAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTE 376
           +A E L LVHTD+CGPM+  A+GG  YFI+F +D+SRY Y+YLM HKS++ EKF+E++ E
Sbjct: 376 KASELLGLVHTDVCGPMSSTARGGFGYFITFTNDFSRYGYVYLMRHKSDSFEKFKEFQNE 435

Query: 377 VENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVE----------------- 436
           V+N LGKTIK LRSDRGGEY+ L F +++ E GI  QL+ P                   
Sbjct: 436 VQNHLGKTIKYLRSDRGGEYLSLEFGNHLKECGIVPQLTPPGTPQWNGVSERRNRTLLDM 495

Query: 437 --------------------TATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWI 496
                               TA + LN VPSKSV +TPYE+W G++     +K  GC+  
Sbjct: 496 VRSMMSQTDLPLSFWGYTLETAAFTLNRVPSKSVDKTPYEIWTGKRPSLSFLKIWGCEET 555

Query: 497 YKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIA--------- 556
            +     T   Q  +         Q E+V  E      A  +S RI  + A         
Sbjct: 556 PENASTSTQPQQAEQ-----DVVQQVEQVVVEPVVEAPASRRSERIRRTPASCGSTGFTK 615

Query: 557 ----TFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQAS 616
               T Y  +IW+MDVKTAFLN NL+E +YMTQ +GF++    +++CKL++ IYGLKQAS
Sbjct: 616 VRKDTTYTCKIWQMDVKTAFLNGNLDEDVYMTQPKGFVDPQSAKKICKLQKSIYGLKQAS 675

Query: 617 RSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEKFREY 676
           RSWNI FD  +K+ GF +N  EPCVYK+I  S + FL+LY D + L+ +    LE     
Sbjct: 676 RSWNIHFDEIVKALGFVKNEQEPCVYKKISGSALVFLILYVDDILLIENDIPMLE---SV 735

Query: 677 KTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSV 736
           KT ++N           D G     L  + Y      RS+    +  +TYI  ++   ++
Sbjct: 736 KTSLKNSFS------MKDLGEAAYILGIRIYK----DRSKRLIGLSQSTYIDKVLKRFNM 795

Query: 737 SETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTP 796
            ++     KG      G+       + K +  QT  +  K                    
Sbjct: 796 QDSK----KGFLPMSHGIN------LGKNQCPQTTNERNK-------------------- 855

Query: 797 QEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRR 856
                M  IPYASA+GS+MYAMLCT+PD+ YA+   S+YQS+ G +HW A+KNILKYLRR
Sbjct: 856 -----MSVIPYASAIGSIMYAMLCTRPDVSYALSATSQYQSDPGESHWIALKNILKYLRR 915

Query: 857 TRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANST 881
           T+D  L+YG  ++L++ GYTD+ FQ D D  +S S  VF LNGGA+ W+S KQ  +A+ST
Sbjct: 916 TKDMFLVYGGQEELVVNGYTDASFQIDKDDFRSQSGFVFYLNGGAVSWKSSKQDTVADST 938

BLAST of CmoCh04G021440 vs. NCBI nr
Match: gi|56382058|gb|AAV85747.1| (Integrase core domain, putative [Oryza sativa Japonica Group])

HSP 1 Score: 379.0 bits (972), Expect = 2.4e-101
Identity = 218/477 (45.70%), Postives = 295/477 (61.84%), Query Frame = 1

Query: 461  KRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEM 520
            K+K D  G V I+KARLVAK + Q + VDY+ETFSP+AMLK I I+L+IA ++DYEIW+M
Sbjct: 717  KKKTDVDGNVHIYKARLVAKGFRQIQGVDYDETFSPVAMLKFIWIVLAIAAYFDYEIWQM 776

Query: 521  DVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSY 580
             VKTAFLN NL+E +YMTQ +GF +    +++CKL + IYGLKQASRSWNI+FD  +K+ 
Sbjct: 777  YVKTAFLNGNLDEDVYMTQPKGFDDPQSAKKICKLHQSIYGLKQASRSWNIRFDEVVKAL 836

Query: 581  GFKQNVDEPCVYKRIVNSTVAFLVLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKT 640
            GF +N +EPCVYK+I  S + FL+LY D + L+ +    LE     KT ++N        
Sbjct: 837  GFVRNEEEPCVYKKISGSALVFLILYVDDILLIGNDISMLE---SVKTSLKNSFS----- 896

Query: 641  LRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQ 700
               D G     L  + Y      RS+    +  +TYI  ++   ++ ++     KG    
Sbjct: 897  -MKDLGEAAYILGIRIYR----DRSKRLIGLSQSTYIDKVLKRFNMQDSK----KGFLPM 956

Query: 701  PDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASA 760
              G+       + K +  QT  +  K                         M  IPYASA
Sbjct: 957  SHGIN------LGKNQCPQTTDERNK-------------------------MSVIPYASA 1016

Query: 761  VGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDL 820
            +GS+MYAMLCT+PD+ YA+   SRYQS+ G +HW AVKNILKYLRRT+D  L YG  ++L
Sbjct: 1017 IGSIMYAMLCTRPDVSYALSATSRYQSDPGESHWIAVKNILKYLRRTKDMFLAYGGQEEL 1076

Query: 821  ILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKE 880
            ++ GYTD+ FQ D D  +S S  VF LNGGA+ W+S KQ  + +ST EAEY+    AA E
Sbjct: 1077 VVNGYTDASFQIDKDDFRSQSGFVFCLNGGAVSWKSSKQDIVVDSTTEAEYI----AASE 1136

Query: 881  SVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV 936
            +VW +KF++ L V+ +    + LYCDNSGA+A +KEPRSH++ KHI R+YHLIREIV
Sbjct: 1137 AVWIKKFVSQLGVMTSASSSMDLYCDNSGAIAQAKEPRSHQKSKHILRQYHLIREIV 1141

BLAST of CmoCh04G021440 vs. NCBI nr
Match: gi|1019597807|gb|AMY96445.1| (gag/pol protein [Momordica dioica])

HSP 1 Score: 349.0 bits (894), Expect = 2.6e-92
Identity = 171/217 (78.80%), Postives = 189/217 (87.10%), Query Frame = 1

Query: 721  KDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGI 780
            +DSKKGLL FRHG+HLSKEQ PKTPQEVEDMR+IPY+SA+GSLMYAMLCT+PD+CYA+ I
Sbjct: 1085 QDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVCYALSI 1144

Query: 781  VSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTS 840
            VSRYQSN G  HWTAVKNILKYLRRTR+  L+YG  KDL + GYTDS FQTD D  KS S
Sbjct: 1145 VSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVYGGDKDLAVKGYTDSSFQTDKDDSKSQS 1204

Query: 841  ESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPV 900
              VFTLNGGA+ WRS KQ C+A+ST EAEYVA  EAAKE+VW RKFLT L VVPNMHLP+
Sbjct: 1205 -GVFTLNGGAVSWRSSKQTCVADSTCEAEYVAACEAAKEAVWIRKFLTDLGVVPNMHLPI 1264

Query: 901  TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 937
            TLYCDNSGAVAN+KEPRSHKRGKHIERKYHLIREIV+
Sbjct: 1265 TLYCDNSGAVANAKEPRSHKRGKHIERKYHLIREIVE 1300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC8.0e-4949.50Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME1.7e-3038.71Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YCH4_YEAST1.2e-0929.41Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YD22B_YEAST3.7e-0628.85Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
YD23B_YEAST3.7e-0628.85Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
E2GK51_BRYDI1.7e-15159.72Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1[more]
Q2QUJ4_ORYSJ3.9e-11137.55Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
Q53K55_ORYSJ4.9e-10640.10Retrotransposon protein, putative, Ty1-copia sub-class OS=Oryza sativa subsp. ja... [more]
Q5R1J2_ORYSJ1.6e-10145.70Integrase core domain, putative OS=Oryza sativa subsp. japonica GN=LOC_Os11g1858... [more]
A0A165U314_9ROSI1.8e-9278.80Gag/pol protein OS=Momordica dioica PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.18.5e-3338.12 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00820.12.1e-0742.11ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
Match NameE-valueIdentityDescription
gi|299474487|gb|ADJ18449.1|2.5e-15159.72gag/pol protein [Bryonia dioica][more]
gi|108862437|gb|ABA97342.2|5.6e-11137.55retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
gi|62732694|gb|AAX94813.1|7.1e-10640.10retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Gr... [more]
gi|56382058|gb|AAV85747.1|2.4e-10145.70Integrase core domain, putative [Oryza sativa Japonica Group][more]
gi|1019597807|gb|AMY96445.1|2.6e-9278.80gag/pol protein [Momordica dioica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR013103RVT_2
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G021440.1CmoCh04G021440.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 321..417
score: 1.3
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 302..417
score: 11
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 316..417
score: 1.1
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 317..439
score: 4.7E-20coord: 607..697
score: 6.07E-12coord: 224..315
score: 5.12
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 446..624
score: 2.9
NoneNo IPR availableunknownCoilCoilcoord: 363..383
score: -coord: 232..252
score: -coord: 615..635
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 232..247
score: 1.3E-195coord: 740..899
score: 1.3E-195coord: 295..630
score: 1.3E-195coord: 12..208
score: 1.3E
NoneNo IPR availablePANTHERPTHR11439:SF192SUBFAMILY NOT NAMEDcoord: 232..247
score: 1.3E-195coord: 740..899
score: 1.3E-195coord: 295..630
score: 1.3E-195coord: 12..208
score: 1.3E
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 62..191
score: 1.5

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None