CmoCh05G011780 (gene) Cucurbita moschata (Rifu)

NameCmoCh05G011780
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionTransposon Ty1-OL Gag-Pol polyprotein
LocationCmo_Chr05 : 9267906 .. 9271860 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTACCATGTCTTGTTCCTCCTCTCCATCCACCATCTCCAACACCGTCACTCGTGCTCGCACTACTCAGATTCGTGTTGAGCTCGCCACGTCCAAGAAACGAGATCAATCTGCTGCAAATTATTTTTGCAAGATCAAAGGGCTAGCCACCGAGCTGGCCGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTTCTCGCTGGTCTTGGCCCAGACTATGATCCCTTCGTCACCTCAATGACTACCAAGAGTGAAGCCCTCACGCTTGATGATGTGTTTGCACATCTAATGATGTATGAAGCTCACCAACTACAACACCAGGCTGAACTTCAGTTAAATTCGGGATCTTCGGCCAATTATACTGGTCGTGCTGAACTTCAGTTAAATCTTGGATCTTCTGCCAATTATGCTAGTCGTGGTGGTCAGCAAAAGAATCGTGGGCGTAGGGATCGTGGTCGTGGTCGTTCTCAAGGTTATGCACCCTCTCGTCCTGCTGGTGATCGTCGTGGCCCTTCTGCTCGTCCTTCCTGCCAGATCTGCGGCAAAGTGGGGCATACTGCTATACGCTGCTGGCATAGGATGGATGAGTCCTATCAAGATGAACCTTCTTCTGCTCCTCCTACGGCACTGGCGGCTACTTCCTCTTACAAGATTGACCCAAATTGGTACAGCGACACAGGCGCTACGGACCATATCACCAGTGACCTGGATCGTCTCGCTGTGCATGAACATTATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAAGTTTGCGTGTTTTGCATACTGGTCATTCTCTAATTAATACTGCTACTCGTTCTCTTGCGTTGCGTAATATTTTGCATGTGATTGAAATTTCTAAACATCTTCTTTCCGTTCATAAATTTTCTCGTGATAATGACGTATTTTTTGAATTCGATCCTTGGCATTTTTCTATAAAGGATCGACAGTCGAGGAAAAGTCTCCTAAATGGGAGGTGTGAATCTGGTCTTTATCCTATTAAGCCATCCGATGTCGATAATCTCAAGCACGTCTTGGTGAGTAGATCTACTACTCACGCCCAATGGCATGCACGTCTTGGACATCCTTCATCTCAAGTAGTAAAATCCATTTTGCGTCTAAATAATATTTCGTGTGCTAGTGAGTCATCTTTGTCCGTTTGTAATGCGTGTTAGTTAGCAAAGAGTCATTAATTACCATATACTAGTTCTTCCCATAGGTCTTCGTCACCTTTGGAACTTATTTTTTCGGATGTTTGGGGCCCTGCACCTCCATCTGTTGGGGGTTTTAAATATTATATTAGTTTCATTGATGATTTCAGTAAATTTTCGTGGATCGATTTGATGCATGATCGTACAGAAGCTCCTCGTATATTTTTGCAATTCCAAGCTCATGTTGAGTGTCTCATAGATACTAAAATCAAGTGGGTCCAATCTAATTGGGGTGGGGAATATCAGAAAATTCATAACACATTTTTTCGTTCCCTTGGAATTGGTCATCGTGTTTCGTGCCCTCACACACATCAACAAAATGGGTCTGCTGAACGCAAGCACCGTCATATTGTTGAAACTGGCCTAGCCCTTCTAGCTCATGCTCATGTACCTATTAAATTTTGGGATGACGTGTTTCTTATAGCCACATATCTCATCAATCGTCTTCCTACTCGTGTCATCGATAAAAAATGCCCCTTAGAGCTTCTTTTTCATACCCCACCAAATTACTCCTTATTAAAAAATTTTGGGTGTGCTTGTTGGCCTCATCTTCGTCCTTATAACAAACAAAAGCTCTCTTTTCGATCAAAGGAATGTGTCTTTCTAGGCTACAGTTCTTCACATAAGGGGTATAAGTGTCTTGACACCGATTCTGGTCGTGTCTATATGTCTAGAGATGTCATATTTGATGAAAATGTTTTTCCATTCAAGAGAGCCCGACCTAATTCTTCCCCAACCATGCAGTCGACGCATAATGCCCTTGATTTGTGCACCTTGCATTTGGGTAATAGCAGTACTAATTTGGAGAATGATCACATGCATATGTCTGGGCCTACTAACTCTTTGGATGCAGAAAATTTGGTGTCTACATCAGCTTCGGAATTGCCGCAACAATCCTCCGCGTCGCTGCCATGCGAATCGGCGTTGGTTGTTCCGCCAATGATTGAGGCCTCGGCTCCTCCGCCAGCAGATGATATTGCACAATGCCCGGTCGAATCCTCGGCTGCTGGTCAACCAACTGTTGTAGCATCGGTTGCTCCCCTCGCAACGACTGATACGGCCATCCCCTCAATGTGGATCCTGCACCTACTACTCATCCGTATGGTACTCGATTGAAGCACAATATCAAGAAACCCAAGGTGCGTACAGATGGAACAGTAACATATCTTGTAGCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTATTGCTATGGAGCATCCCCTCTGGCGTCAGGCAATGAATGATGAATTTCAGGCACTTCAAAAAAATAAGACATGGCACTTAGTTTCTCCTCGTGCTGGTCTCAACGTTATTGATTGCAAATGGGTTTTCAAACTCAAGCAAAAGCCAGATGGCTCTATTGATCGCTACAAAGCACGCCTGGTTGCTAAAGGTTTTAAACAGCAGGCGTTGATTATGATGATACCTTTAGTCCAATTGTTAAGCCCACTACCATTCGACTCTTATTATCTCTTGCTATTTCTCGTGGTTGGGCTATTCGGCAGATTGATATTCAAAATGCTTTTCTTCATGGCTTTCTTAATGAAGATGTTTATATGAAGCAGCCCCCTGGATTTGTGGATTCTCAACACCCTGGTTATCTCTGCAAGCTGGATAAGTCGCTTTATGGCCTTAAACAAGCTCCGCGTGCCTGGTTTTCTCGCCTTAGCTCCAAACAATTACAGCTGGATTTTACACCTTCAAAGGCTGATGTCTCTCTTTTCATTTTTAACAAAACGGGCATTCAGATGTATATCCTCATCTACGTTGATGACATTATTATCATCAGCTCATCTTCTACGGCTATTGAGAAACTTCTTACACAACTTCAGGATGATTTTGTCGTCAAGGATCTTGGTCTTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCCAGTGGACTTATTCTGACACAACATAAATACATTCGAGATTTATTAGCCAGAACCGATATGCTCACCTCCAAAGGTGTGCCCACACCTATGCTTCCCAGTGAGAAGTTGTTATTGAATGGTGGTAAAAAGCTCTCACCTGAGGATACTACTCGCTATCGAAGTGTCGTTGGTGCTCTCCAATATTTGTCTCTGACACGTCCTGATATATCCTTTTGTGTCAACAGAGTGTGTCAGTTCATGTCCTCTCCGACTTCTATACATTGGGCGACAGTCAAACGAATTCTCCGTTATCTACATGACACTATTGATATGAGTTTGTGTCTTACAAAGTCCAGCACTGATTTGTTGAGTGCCTTTTCAGATGCTGATTGGGCTGGGAATCCTGATGATCGTCGAAGCACTGGAGGCTATGTGATCTTCTTTGGTGGCAATCTTATCTCTTGGAGTTCGAGGAAACAATCGACAGTATCTCGTTCTAGTATGGAAGCCGAATATAAGGCGGTTGCTGATGCCACTGCCGAATTAATTTGGATCCAAGTCCTCTTGCGTGAGCTCGGGATCTCGCAAGCGCGAGCGCGTAGCCTATGGTGTGACAACATTGGTGCCACCTACCTATCCGCCAATCCAATCTTTCATCGACGGACGAAGCATGTTGAGGTTGATTATCACTTCGTTCGTGAACGAGTATCGACTCGTCAGCTTGATGTTCGAGTCATATCTTCCAAGGATCAGCTCGCCGATATCATGACAAAGCCACTGCCAGCTCCTTCTTTTAGCTATTTTAGGCGCCATCTGAACTTAGTAGTACATCGTCCAGATTGA

mRNA sequence

ATGTCTACCATGTCTTGTTCCTCCTCTCCATCCACCATCTCCAACACCGTCACTCGTGCTCGCACTACTCAGATTCGTGTTGAGCTCGCCACGTCCAAGAAACGAGATCAATCTGCTGCAAATTATTTTTGCAAGATCAAAGGGCTAGCCACCGAGCTGGCCGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTTCTCGCTGGTCTTGGCCCAGACTATGATCCCTTCGTCACCTCAATGACTACCAAGAGTGAAGCCCTCACGCTTGATGATGTGTTTGCACATCTAATGATGTATGAAGCTCACCAACTACAACACCAGGCTGAACTTCAGTTAAATTCGGGATCTTCGGCCAATTATACTGGTCGTGCTGAACTTCAGTTAAATCTTGGATCTTCTGCCAATTATGCTAGTCGTGGTGGTCAGCAAAAGAATCGTGGGCGTAGGGATCGTGGTCGTGGTCGTTCTCAAGGTTATGCACCCTCTCGTCCTGCTGGTGATCGTCGTGGCCCTTCTGCTCGTCCTTCCTGCCAGATCTGCGGCAAAGTGGGGCATACTGCTATACGCTGCTGGCATAGGATGGATGAGTCCTATCAAGATGAACCTTCTTCTGCTCCTCCTACGGCACTGGCGGCTACTTCCTCTTACAAGATTGACCCAAATTGGTACAGCGACACAGGCGCTACGGACCATATCACCAGTGACCTGGATCGTCTCGCTGTGCATGAACATTATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAAGTTTGCGTGTTTTGCATACTGGTCATTCTCTAATTAATACTGCTACTCGTTCTCTTGCGTTGCGTAATATTTTGCATGTGATTGAAATTTCTAAACATCTTCTTTCCGTTCATAAATTTTCTCGTGATAATGACGTATTTTTTGAATTCGATCCTTGGCATTTTTCTATAAAGGATCGACAGTCGAGGAAAAGTCTCCTAAATGGGAGGTGTGAATCTGGTCTTTATCCTATTAAGCCATCCGATGTCGATAATCTCAAGCACGTCTTGGTGAGTAGATCTACTACTCACGCCCAATGGCATGCACGTCTTGGACATCCTTCATCTCAAGTAGTAAAATCCATTTTGCGTCTAAATAATATTTCGTGTGCTAGCTACAGTTCTTCACATAAGGGGTATAAGTGTCTTGACACCGATTCTGGTCGTGTCTATATGTCTAGAGATGTCATATTTGATGAAAATGTTTTTCCATTCAAGAGAGCCCGACCTAATTCTTCCCCAACCATGCAGTCGACGCATAATGCCCTTGATTTGTGCACCTTGCATTTGGGTAATAGCAGTACTAATTTGGAGAATGATCACATGCATATGTCTGGGCCTACTAACTCTTTGGATGCAGAAAATTTGGTGTCTACATCAGCTTCGGAATTGCCGCAACAATCCTCCGCGTCGCTGCCATGCGAATCGGCGTTGGTTGTTCCGCCAATGATTGAGGCCTCGGCTCCTCCGCCAGCAGATGATATTGCACAATGCCCGGTCGAATCCTCGGCTGCTGGTCAACCAACTGTTGTAGCATCGGTTGCTCCCCTCGCAACGACTGATACGGCCATCCCCTCAATGTGGATCCTGCACCTACTACTCATCCCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTATTGCTATGGAGCATCCCCTCTGGCGTCAGGCAATGAATGATGAATTTCAGGCACTTCAAAAAAATAAGACATGGCACTTAGTTTCTCCTCGTGCTGGTCTCAACGTTATTGATTGCAAATGGGTTTTCAAACTCAAGCAAAAGCCAGATGGCTCTATTGATCGCTACAAAGCACGCCTGATTGATATTCAAAATGCTTTTCTTCATGGCTTTCTTAATGAAGATGTTTATATGAAGCAGCCCCCTGGATTTGTGGATTCTCAACACCCTGGTTATCTCTGCAAGCTGGATAAGTCGCTTTATGGCCTTAAACAAGCTCCGCGTGCCTGGTTTTCTCGCCTTAGCTCCAAACAATTACAGCTGGATTTTACACCTTCAAAGGCTGATGTCTCTCTTTTCATTTTTAACAAAACGGGCATTCAGATGTATATCCTCATCTACGTTGATGACATTATTATCATCAGCTCATCTTCTACGGCTATTGAGAAACTTCTTACACAACTTCAGGATGATTTTGTCGTCAAGGATCTTGGTCTTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCCAGTGGACTTATTCTGACACAACATAAATACATTCGAGATTTATTAGCCAGAACCGATATGCTCACCTCCAAAGGTGTGCCCACACCTATGCTTCCCAGTGAGAAGTTGTTATTGAATGGTGGTAAAAAGCTCTCACCTGAGGATACTACTCGCTATCGAAGTGTCGTTGGTGCTCTCCAATATTTGTCTCTGACACGTCCTGATATATCCTTTTGTGTCAACAGAGTGTGTCAGTTCATGTCCTCTCCGACTTCTATACATTGGGCGACAGTCAAACGAATTCTCCGTTATCTACATGACACTATTGATATGAGTTTGTGTCTTACAAAGTCCAGCACTGATTTGTTGAGTGCCTTTTCAGATGCTGATTGGGCTGGGAATCCTGATGATCGTCGAAGCACTGGAGGCTATGTGATCTTCTTTGGTGGCAATCTTATCTCTTGGAGTTCGAGGAAACAATCGACAGTATCTCGTTCTAGTATGGAAGCCGAATATAAGGCGGTTGCTGATGCCACTGCCGAATTAATTTGGATCCAAGTCCTCTTGCGTGAGCTCGGGATCTCGCAAGCGCGAGCGCGTAGCCTATGGTGTGACAACATTGGTGCCACCTACCTATCCGCCAATCCAATCTTTCATCGACGGACGAAGCATGTTGAGGTTGATTATCACTTCGTTCGTGAACGAGTATCGACTCGTCAGCTTGATGTTCGAGTCATATCTTCCAAGGATCAGCTCGCCGATATCATGACAAAGCCACTGCCAGCTCCTTCTTTTAGCTATTTTAGGCGCCATCTGAACTTAGTAGTACATCGTCCAGATTGA

Coding sequence (CDS)

ATGTCTACCATGTCTTGTTCCTCCTCTCCATCCACCATCTCCAACACCGTCACTCGTGCTCGCACTACTCAGATTCGTGTTGAGCTCGCCACGTCCAAGAAACGAGATCAATCTGCTGCAAATTATTTTTGCAAGATCAAAGGGCTAGCCACCGAGCTGGCCGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTTCTCGCTGGTCTTGGCCCAGACTATGATCCCTTCGTCACCTCAATGACTACCAAGAGTGAAGCCCTCACGCTTGATGATGTGTTTGCACATCTAATGATGTATGAAGCTCACCAACTACAACACCAGGCTGAACTTCAGTTAAATTCGGGATCTTCGGCCAATTATACTGGTCGTGCTGAACTTCAGTTAAATCTTGGATCTTCTGCCAATTATGCTAGTCGTGGTGGTCAGCAAAAGAATCGTGGGCGTAGGGATCGTGGTCGTGGTCGTTCTCAAGGTTATGCACCCTCTCGTCCTGCTGGTGATCGTCGTGGCCCTTCTGCTCGTCCTTCCTGCCAGATCTGCGGCAAAGTGGGGCATACTGCTATACGCTGCTGGCATAGGATGGATGAGTCCTATCAAGATGAACCTTCTTCTGCTCCTCCTACGGCACTGGCGGCTACTTCCTCTTACAAGATTGACCCAAATTGGTACAGCGACACAGGCGCTACGGACCATATCACCAGTGACCTGGATCGTCTCGCTGTGCATGAACATTATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAAGTTTGCGTGTTTTGCATACTGGTCATTCTCTAATTAATACTGCTACTCGTTCTCTTGCGTTGCGTAATATTTTGCATGTGATTGAAATTTCTAAACATCTTCTTTCCGTTCATAAATTTTCTCGTGATAATGACGTATTTTTTGAATTCGATCCTTGGCATTTTTCTATAAAGGATCGACAGTCGAGGAAAAGTCTCCTAAATGGGAGGTGTGAATCTGGTCTTTATCCTATTAAGCCATCCGATGTCGATAATCTCAAGCACGTCTTGGTGAGTAGATCTACTACTCACGCCCAATGGCATGCACGTCTTGGACATCCTTCATCTCAAGTAGTAAAATCCATTTTGCGTCTAAATAATATTTCGTGTGCTAGCTACAGTTCTTCACATAAGGGGTATAAGTGTCTTGACACCGATTCTGGTCGTGTCTATATGTCTAGAGATGTCATATTTGATGAAAATGTTTTTCCATTCAAGAGAGCCCGACCTAATTCTTCCCCAACCATGCAGTCGACGCATAATGCCCTTGATTTGTGCACCTTGCATTTGGGTAATAGCAGTACTAATTTGGAGAATGATCACATGCATATGTCTGGGCCTACTAACTCTTTGGATGCAGAAAATTTGGTGTCTACATCAGCTTCGGAATTGCCGCAACAATCCTCCGCGTCGCTGCCATGCGAATCGGCGTTGGTTGTTCCGCCAATGATTGAGGCCTCGGCTCCTCCGCCAGCAGATGATATTGCACAATGCCCGGTCGAATCCTCGGCTGCTGGTCAACCAACTGTTGTAGCATCGGTTGCTCCCCTCGCAACGACTGATACGGCCATCCCCTCAATGTGGATCCTGCACCTACTACTCATCCCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTATTGCTATGGAGCATCCCCTCTGGCGTCAGGCAATGAATGATGAATTTCAGGCACTTCAAAAAAATAAGACATGGCACTTAGTTTCTCCTCGTGCTGGTCTCAACGTTATTGATTGCAAATGGGTTTTCAAACTCAAGCAAAAGCCAGATGGCTCTATTGATCGCTACAAAGCACGCCTGATTGATATTCAAAATGCTTTTCTTCATGGCTTTCTTAATGAAGATGTTTATATGAAGCAGCCCCCTGGATTTGTGGATTCTCAACACCCTGGTTATCTCTGCAAGCTGGATAAGTCGCTTTATGGCCTTAAACAAGCTCCGCGTGCCTGGTTTTCTCGCCTTAGCTCCAAACAATTACAGCTGGATTTTACACCTTCAAAGGCTGATGTCTCTCTTTTCATTTTTAACAAAACGGGCATTCAGATGTATATCCTCATCTACGTTGATGACATTATTATCATCAGCTCATCTTCTACGGCTATTGAGAAACTTCTTACACAACTTCAGGATGATTTTGTCGTCAAGGATCTTGGTCTTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCCAGTGGACTTATTCTGACACAACATAAATACATTCGAGATTTATTAGCCAGAACCGATATGCTCACCTCCAAAGGTGTGCCCACACCTATGCTTCCCAGTGAGAAGTTGTTATTGAATGGTGGTAAAAAGCTCTCACCTGAGGATACTACTCGCTATCGAAGTGTCGTTGGTGCTCTCCAATATTTGTCTCTGACACGTCCTGATATATCCTTTTGTGTCAACAGAGTGTGTCAGTTCATGTCCTCTCCGACTTCTATACATTGGGCGACAGTCAAACGAATTCTCCGTTATCTACATGACACTATTGATATGAGTTTGTGTCTTACAAAGTCCAGCACTGATTTGTTGAGTGCCTTTTCAGATGCTGATTGGGCTGGGAATCCTGATGATCGTCGAAGCACTGGAGGCTATGTGATCTTCTTTGGTGGCAATCTTATCTCTTGGAGTTCGAGGAAACAATCGACAGTATCTCGTTCTAGTATGGAAGCCGAATATAAGGCGGTTGCTGATGCCACTGCCGAATTAATTTGGATCCAAGTCCTCTTGCGTGAGCTCGGGATCTCGCAAGCGCGAGCGCGTAGCCTATGGTGTGACAACATTGGTGCCACCTACCTATCCGCCAATCCAATCTTTCATCGACGGACGAAGCATGTTGAGGTTGATTATCACTTCGTTCGTGAACGAGTATCGACTCGTCAGCTTGATGTTCGAGTCATATCTTCCAAGGATCAGCTCGCCGATATCATGACAAAGCCACTGCCAGCTCCTTCTTTTAGCTATTTTAGGCGCCATCTGAACTTAGTAGTACATCGTCCAGATTGA
BLAST of CmoCh05G011780 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 2.1e-71
Identity = 181/518 (34.94%), Postives = 262/518 (50.58%), Query Frame = 1

Query: 552  SSASEPTSHIIAMEHPLWRQ---AMNDEFQALQKNKTWHLVSPRAGLNVIDC-------- 611
            S   EP S    + HP   Q   AM +E ++LQKN T+ LV    G   + C        
Sbjct: 806  SDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKK 865

Query: 612  -----------KWVFK-LKQKPDGSIDRY----------------------KARLIDIQN 671
                       + V K  +QK     D                        +   +D++ 
Sbjct: 866  DGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKT 925

Query: 672  AFLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTP 731
            AFLHG L E++YM+QP GF  +     +CKL+KSLYGLKQAPR W+ +  S      +  
Sbjct: 926  AFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLK 985

Query: 732  SKADVSLFI--FNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYF 791
            + +D  ++   F++    + +L+YVDD++I+      I KL   L   F +KDLG     
Sbjct: 986  TYSDPCVYFKRFSENNF-IILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQI 1045

Query: 792  LGIEV--RHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDT 851
            LG+++    TS  L L+Q KYI  +L R +M  +K V TP+    KL     KK+ P   
Sbjct: 1046 LGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKL----SKKMCPTTV 1105

Query: 852  TR--------YRSVVGALQY-LSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHD 911
                      Y S VG+L Y +  TRPDI+  V  V +F+ +P   HW  VK ILRYL  
Sbjct: 1106 EEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRG 1165

Query: 912  TIDMSLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSS 971
            T    LC    S  +L  ++DAD AG+ D+R+S+ GY+  F G  ISW S+ Q  V+ S+
Sbjct: 1166 TTGDCLCF-GGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALST 1225

Query: 972  MEAEYKAVADATAELIWIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEV 1012
             EAEY A  +   E+IW++  L+ELG+ Q +   ++CD+  A  LS N ++H RTKH++V
Sbjct: 1226 TEAEYIAATETGKEMIWLKRFLQELGLHQ-KEYVVYCDSQSAIDLSKNSMYHARTKHIDV 1285

BLAST of CmoCh05G011780 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 271.2 bits (692), Expect = 4.7e-71
Identity = 152/411 (36.98%), Postives = 238/411 (57.91%), Query Frame = 1

Query: 618  KARLIDIQNAFLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSS 677
            K   +D++ AFL+G L E++YM+ P G   S +   +CKL+K++YGLKQA R WF     
Sbjct: 997  KVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQ 1056

Query: 678  KQLQLDFTPSKADVSLFIFNKTGIQ--MYILIYVDDIIIISSSSTAIEKLLTQLQDDFVV 737
               + +F  S  D  ++I +K  I   +Y+L+YVDD++I +   T +      L + F +
Sbjct: 1057 ALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRM 1116

Query: 738  KDLGLLSYFLGIEVRHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGK 797
             DL  + +F+GI +      + L+Q  Y++ +L++ +M     V TP LPS+   +N   
Sbjct: 1117 TDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTP-LPSK---INYEL 1176

Query: 798  KLSPED-TTRYRSVVGALQYLSL-TRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHD 857
              S ED  T  RS++G L Y+ L TRPD++  VN + ++ S   S  W  +KR+LRYL  
Sbjct: 1177 LNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKG 1236

Query: 858  TIDMSLCLTKSST--DLLSAFSDADWAGNPDDRRSTGGYVI-FFGGNLISWSSRKQSTVS 917
            TIDM L   K+    + +  + D+DWAG+  DR+ST GY+   F  NLI W++++Q++V+
Sbjct: 1237 TIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVA 1296

Query: 918  RSSMEAEYKAVADATAELIWIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKH 977
             SS EAEY A+ +A  E +W++ LL  + I       ++ DN G   ++ NP  H+R KH
Sbjct: 1297 ASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKH 1356

Query: 978  VEVDYHFVRERVSTRQLDVRVISSKDQLADIMTKPLPAPSFSYFRRHLNLV 1022
            +++ YHF RE+V    + +  I +++QLADI TKPLPA  F   R  L L+
Sbjct: 1357 IDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLL 1401

BLAST of CmoCh05G011780 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 1.8e-49
Identity = 109/228 (47.81%), Postives = 146/228 (64.04%), Query Frame = 1

Query: 703 MYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHK 762
           MY+L+YVDDI++  SS+T +  L+ QL   F +KDLG + YFLGI+++   SGL L+Q K
Sbjct: 1   MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 763 YIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDI 822
           Y   +L    ML  K + TP+       ++  K   P D   +RS+VGALQYL+LTRPDI
Sbjct: 61  YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSD---FRSIVGALQYLTLTRPDI 120

Query: 823 SFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPD 882
           S+ VN VCQ M  PT   +  +KR+LRY+  TI   L + K+S   + AF D+DWAG   
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180

Query: 883 DRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIW 931
            RRST G+  F G N+ISWS+++Q TVSRSS E EY+A+A   AEL W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh05G011780 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 178.3 bits (451), Expect = 4.2e-43
Identity = 101/310 (32.58%), Postives = 174/310 (56.13%), Query Frame = 1

Query: 622 IDIQNAFLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQ 681
           +D+  AFL+  ++E +Y+KQPPGFV+ ++P Y+ +L   +YGLKQAP  W   +++   +
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 682 LDFTPSKADVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLL 741
           + F   + +  L+  + +   +YI +YVDD+++ + S    +++  +L   + +KDLG +
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 742 SYFLGIEVRHTSSG-LILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPE 801
             FLG+ +  +S+G + L+   YI    + +++ T K   TP+  S+ L       L  +
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHL--K 180

Query: 802 DTTRYRSVVGALQYLSLT-RPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSL 861
           D T Y+S+VG L + + T RPDIS+ V+ + +F+  P +IH  + +R+LRYL+ T  M L
Sbjct: 181 DITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCL 240

Query: 862 CLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRK-QSTVSRSSMEAEY 921
                S   L+ + DA      D   STGGYV    G  ++WSS+K +  +   S EAEY
Sbjct: 241 KYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEY 300

Query: 922 KAVADATAEL 929
              ++   E+
Sbjct: 301 ITASETVMEI 308

BLAST of CmoCh05G011780 vs. Swiss-Prot
Match: YD11B_YEAST (Transposon Ty1-DR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-DR1 PE=3 SV=2)

HSP 1 Score: 110.2 bits (274), Expect = 1.4e-22
Identity = 107/420 (25.48%), Postives = 189/420 (45.00%), Query Frame = 1

Query: 615  DRYKARLIDIQNAFLHGFLNEDVYMKQPP--GFVDSQHPGYLCKLDKSLYGLKQAPRAWF 674
            + Y    +DI +A+L+  + E++Y++ PP  G  D      L +L KSLYGLKQ+   W+
Sbjct: 1338 NNYYITQLDISSAYLYADIKEELYIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWY 1397

Query: 675  SRLSSKQLQLDFTPSKADVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDD 734
              + S  ++          S    N    Q+ I ++VDD+I+ S    A +K++T L+  
Sbjct: 1398 ETIKSYLIKQCGMEEVRGWSCVFKNS---QVTICLFVDDMILFSKDLNANKKIITTLKKQ 1457

Query: 735  FVVKDLGL------LSY-FLGIEVRHTSSGLI-LTQHKYIRDLLARTDM--------LTS 794
            +  K + L      + Y  LG+E+++  S  + L   K + + L + ++        L +
Sbjct: 1458 YDTKIINLGEGDNEIQYDILGLEIKYQRSKYMKLGMEKSLTEKLPKLNVPLNPKGKKLRA 1517

Query: 795  KGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLT-RPDISFCVNRVCQFMSS 854
             G P   +  ++L ++  +    E     + ++G   Y+    R D+ + +N + Q +  
Sbjct: 1518 PGQPGHYIDQDELEIDEDEY--KEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILF 1577

Query: 855  PTSIHWATVKRILRYLHDTIDMSLCLTKSST----DLLSAFSDADWAGNPDDRRSTGGYV 914
            P+         +++++ DT D  L   K+      + L A SDA + GN    +S  G +
Sbjct: 1578 PSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTKPDNKLVAISDASY-GNQPYYKSQIGNI 1637

Query: 915  IFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIWIQVLLRELGISQARARSLWCD 974
                G +I   S K S    S+ EAE  AV++A   L  +  L++EL        SL   
Sbjct: 1638 FLLNGKVIGGKSTKASLTCTSTTEAEIHAVSEAIPLLNNLSHLVQELNKKPIIKGSLTDS 1697

Query: 975  NIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRVISSKDQLADIMTKPLPAPSF 1012
                + + +      R +        +R+ VS   L V  I +K  +AD+MTKPLP  +F
Sbjct: 1698 RSTISIIKSTNEEKFRNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTF 1746

BLAST of CmoCh05G011780 vs. TrEMBL
Match: Q2QY49_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g03850 PE=4 SV=1)

HSP 1 Score: 542.3 bits (1396), Expect = 1.2e-150
Identity = 299/623 (47.99%), Postives = 393/623 (63.08%), Query Frame = 1

Query: 450  NLENDHMHMSGPTNSLDAENLVSTSASELPQQSSASLPCESALVVPPMIEASAPPPADDI 509
            ++ + H    G T+S+   ++VS++A++    +  S       V  P    +A      I
Sbjct: 295  SITHPHDEPGGDTSSISPADVVSSAAADGMHATHGSATSSGGNVSSPHTFDAA----QQI 354

Query: 510  AQCPVESSAAGQPTVVASVAPLATTDTAIPSMWILHLLLIPRSSASEPTSHIIAMEHPLW 569
             Q PV  S  G         P   TD  +    +        +   EP+S   A+    W
Sbjct: 355  QQRPVTRSQHGVHR------PKKYTDGTVRYGCL--------TETGEPSSLQEALSSANW 414

Query: 570  RQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPDGSIDRYKARLIDIQNAFL 629
            +QAM+ EF AL  NKTWHLV P  G N+ID KWV+K+K+K DG+IDRYKARL+       
Sbjct: 415  KQAMDKEFSALLHNKTWHLVPPVKGKNIIDSKWVYKIKKKADGTIDRYKARLVAKGFKQR 474

Query: 630  HGFLNED------------------------------------------VYMKQPPGFVD 689
            +G   ED                                          VYM+QPPGF D
Sbjct: 475  YGIDYEDTFSPVVKAATIRLVLSIAMSQGWSLRQLDVQNAFLHGYLDEEVYMRQPPGFED 534

Query: 690  SQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKADVSLFIFNKTGIQMYILI 749
            ++ P +LCKLDK+LYGLKQAPRAW+SRLS K  +L F+ SKAD SLF +NK   +M++L+
Sbjct: 535  ARQPHFLCKLDKALYGLKQAPRAWYSRLSKKLQELGFSSSKADTSLFFYNKGHHKMFVLV 594

Query: 750  YVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHKYIRDL 809
            YVDDII+ SSSS A+  LL  L+ DF +KDLG L YFLGIEV+ T   L+L+Q +Y  +L
Sbjct: 595  YVDDIIVASSSSPAVNALLKDLEKDFALKDLGDLHYFLGIEVKRTPQTLLLSQERYTTEL 654

Query: 810  LARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDISFCVN 869
            L R +M + K V TP+  +EKL +  G +L P D T+YRS+VGALQYL+LTRPDIS+ VN
Sbjct: 655  LERVNMTSCKPVSTPLSTAEKLSVEIGDELGPSDVTQYRSIVGALQYLTLTRPDISYSVN 714

Query: 870  RVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPD----- 929
            +VCQF+ +PT+ HW+ VKRILRYL  T+D+ L + KSS++L+SAFSDADWAG+ D     
Sbjct: 715  KVCQFLQTPTTAHWSAVKRILRYLKGTLDLGLKIVKSSSNLVSAFSDADWAGSVDDRRST 774

Query: 930  ----DRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIWIQVLLREL 989
                DRRSTGG+ +FFG NLISWS+RKQ+TVSRSS EAEYKA+A+A AE+IW++ LL EL
Sbjct: 775  GGFADRRSTGGFAVFFGDNLISWSARKQATVSRSSTEAEYKALANAAAEIIWVRKLLTEL 834

Query: 990  GISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRVISSKDQL 1022
            GI    A  LWCDN+GATY++ANP+FH RTKH+EVDYHFVRE+V+ + LD+  +SS+DQ+
Sbjct: 835  GILHPNAARLWCDNLGATYMTANPVFHARTKHIEVDYHFVREQVAQKLLDIHFVSSQDQV 894

BLAST of CmoCh05G011780 vs. TrEMBL
Match: Q8W0X9_MAIZE (Putative copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=Z178A11.9 PE=4 SV=1)

HSP 1 Score: 542.0 bits (1395), Expect = 1.6e-150
Identity = 273/511 (53.42%), Postives = 351/511 (68.69%), Query Frame = 1

Query: 552  SSASEPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPD 611
            +S+ EP     A+    W+ AM+ E+ AL KNKTWHLV P+ G NVI CKWV+K+K+K D
Sbjct: 800  TSSGEPYDLNEALGDVNWKDAMDIEYSALMKNKTWHLVPPKKGRNVIGCKWVYKIKRKAD 859

Query: 612  GSIDRYKARL----------IDIQNAFLH----------------------------GFL 671
            GS+DRYKARL          ID  + F                               FL
Sbjct: 860  GSLDRYKARLVAKGYKQQYGIDYDDTFSPVVKHATIRIILSIAVSRGWSLCQLDVQNAFL 919

Query: 672  N----EDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKA 731
            +    E+VYM+QPPG+ DS    Y+CKLDK+LYGLKQAPRAW+SRLS+K L L F  SKA
Sbjct: 920  HGVLEEEVYMQQPPGYEDSTKLNYVCKLDKALYGLKQAPRAWYSRLSNKLLSLGFQASKA 979

Query: 732  DVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEV 791
            D SLF +NK  + +++L+YVDDII+ SS+  A E LL+ L  +F +KDLG L+YFLGIEV
Sbjct: 980  DTSLFFYNKGSVTIFVLVYVDDIIVASSTHKATEALLSDLNKEFALKDLGDLNYFLGIEV 1039

Query: 792  RHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVV 851
                 G+ILTQ KY  DLL +  M   K + TP+  SEKL ++ G  L  +D T+YRS+V
Sbjct: 1040 NKVRDGIILTQDKYASDLLKKVGMSDCKPISTPLSTSEKLSIHEGSPLGEKDITQYRSIV 1099

Query: 852  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLL 911
            GALQYL+LTRPDI+F VN+VCQF+ +PT++HWA VKRILRY+    ++ L + +S + L+
Sbjct: 1100 GALQYLTLTRPDIAFSVNKVCQFLHAPTTLHWAAVKRILRYIKQCTNLGLHIHRSDSTLV 1159

Query: 912  SAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELI 971
            SAFSDADWAG+ DDR+STGG+ +F G NL+SWS+RKQ TVSRSS E+EYKA+A+ATAELI
Sbjct: 1160 SAFSDADWAGSVDDRKSTGGFAVFLGSNLVSWSARKQPTVSRSSTESEYKALANATAELI 1219

Query: 972  WIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDV 1021
            W+Q+LL E+ I   RA  LWCDN+GA YLSANPIFH RTKH+EVDYHFVR+RV+ + LD+
Sbjct: 1220 WVQILLTEISIKSPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDYHFVRDRVAKKLLDI 1279

BLAST of CmoCh05G011780 vs. TrEMBL
Match: Q7XE22_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os10g30510 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 1.8e-149
Identity = 270/511 (52.84%), Postives = 343/511 (67.12%), Query Frame = 1

Query: 552  SSASEPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPD 611
            ++  EP +   AM +  WR AM  E+ A   NKTWHLV P  G N+IDCKW++K+K+K D
Sbjct: 421  TTTGEPENLREAMANSNWRLAMEQEYSAFMSNKTWHLVPPTQGKNIIDCKWMYKIKRKAD 480

Query: 612  GSIDRYKARLIDIQNAFLHGFLNEDV---------------------------------- 671
            GSIDRYKARL+       +G   ED                                   
Sbjct: 481  GSIDRYKARLVAKGFKQRYGIDYEDTFSLVVKAATIRLILSIAVSKGWSLRQLDVQNAFL 540

Query: 672  --------YMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKA 731
                    YM+QPPGF +   P YLCKLDK+LYGLKQAPRAW+SRLS+K  +L F  SKA
Sbjct: 541  HGYLEEEVYMRQPPGFENKGQPNYLCKLDKALYGLKQAPRAWYSRLSTKLQELGFISSKA 600

Query: 732  DVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEV 791
            D SLF +NK G  ++IL+YVDDII+ SSS+  +  LL  L+ DF +KDLG L YFLGIEV
Sbjct: 601  DTSLFFYNKGGCTIFILVYVDDIIVASSSAEVVAALLKDLEKDFALKDLGDLHYFLGIEV 660

Query: 792  RHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVV 851
            +  S GL+L+Q  Y  D+L R  M   K   TP+  +EKL +  G  L   D + YRS+V
Sbjct: 661  KKVSQGLVLSQAWYASDILKRAGMSICKPASTPLSTTEKLSIEDGDFLGQNDASHYRSIV 720

Query: 852  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLL 911
            GALQYL+LTR D+SF VN+VCQF+ SPT++HW+ VKRILRY+  T++  L   KS + L+
Sbjct: 721  GALQYLTLTRSDLSFLVNKVCQFLHSPTTVHWSAVKRILRYIKGTVEFGLRFGKSDSMLI 780

Query: 912  SAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELI 971
            SAFSDADWAG  DDRRSTGG+ +F G NLISWS+RKQ+TVSRSS EAEYKA+A+AT E+ 
Sbjct: 781  SAFSDADWAGCSDDRRSTGGFAVFLGPNLISWSARKQATVSRSSTEAEYKALANATTEVT 840

Query: 972  WIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDV 1021
            W++ +L EL I++     LWCDN+GATYLSANP+FH RTKH+E+DYHFVRE+V+ + LD+
Sbjct: 841  WVRKILDELRIARPSVAQLWCDNLGATYLSANPVFHARTKHIEIDYHFVREQVAKKLLDI 900

BLAST of CmoCh05G011780 vs. TrEMBL
Match: V9GZT4_MAIZE (Copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=gag PE=4 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 3.1e-146
Identity = 329/725 (45.38%), Postives = 421/725 (58.07%), Query Frame = 1

Query: 387  YSSSHKGYKCLDTDSGRVYMSRDVIFDENVFPFKRARPNS-----SPTMQSTHNAL--DL 446
            YS+ HKG+KCLD  +GR+Y+SRDV+FDE+VFPF     N+     S  +   H++   ++
Sbjct: 710  YSNMHKGFKCLDISTGRIYISRDVVFDEHVFPFASLNKNAGVKYTSEVLLLPHDSCGNNM 769

Query: 447  CTLHL-------------------GNSSTNLENDH---MHMSGPTNSLDAENLVSTS--- 506
             T H                    GNS     N+    +  SGP        LV +S   
Sbjct: 770  LTDHANNLPGSSSPLPFLAQHFLQGNSEVPTSNNTAMALPASGPNEVSVPPALVPSSLVP 829

Query: 507  -ASELPQQSSASLP----CESALVVPPMIEASAP--PPADDIAQCPVESSAAGQPTVVAS 566
             AS  P   SA+       +S    PP+   S    P AD + Q P  S A   P     
Sbjct: 830  AASPAPTGVSANAEPAPEADSLSSGPPVATESVTGVPDADPLLQAPGSSVAHQTP----D 889

Query: 567  VAPLATTDTAIPSMWILHLLLIPR-------------SSASEPTSHIIAMEHPLWRQAMN 626
             APL+    A P   + H +  P+             +  +EP+S   A+  P WR AM 
Sbjct: 890  SAPLSA---AAPRTRLQHGISKPKQFTDGTVRYGNAAARITEPSSVSEALADPQWRAAME 949

Query: 627  DEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPDGSIDRYKARLID----------- 686
             EFQALQKN TW LV P    N+IDCKWVFK+K   DGSIDR KARL+            
Sbjct: 950  AEFQALQKNNTWTLVPPDRTRNLIDCKWVFKVKYNADGSIDRLKARLVAKGFKQQYGIDY 1009

Query: 687  -------IQNAFLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAP-------- 746
                   ++++ +   L+  V  K     +D Q+      L++++Y +KQ P        
Sbjct: 1010 DDTFSPVVKHSTIRLVLSLAVSQKWSLRQLDVQNAFLHGILEETVY-MKQPPGFADTTHP 1069

Query: 747  -----------------RAWFSRLSSKQLQLDFTPSKADVSLFIFNKTGIQMYILIYVDD 806
                             RAW+SRLS K   L F PSKADVSLFI+N     +YIL+YVDD
Sbjct: 1070 NYHCHLQKSLYGLKQRPRAWYSRLSEKLQSLGFVPSKADVSLFIYNAHSTAIYILVYVDD 1129

Query: 807  IIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHKYIRDLLART 866
            III  SS  AI+ +L +L+DDF +KDLG L YFLGIEV     GL+L Q KY RDLL R 
Sbjct: 1130 IIITGSSPHAIDNVLAKLKDDFAIKDLGDLHYFLGIEVHRKGDGLLLCQEKYARDLLKRV 1189

Query: 867  DMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQ 926
             M   K V TP+  SEKL  + G  LSPE+TT+YRSVVGALQYL+LTRPD+S+ +NRVCQ
Sbjct: 1190 GMECCKPVHTPVATSEKLSASAGTLLSPEETTKYRSVVGALQYLTLTRPDLSYAINRVCQ 1249

Query: 927  FMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV 986
            F+ +PT +HW  VKRILR +  TI + L +  S + +LSAFSDADWAG PDDR+STGGY 
Sbjct: 1250 FLHAPTDLHWTAVKRILRNIQHTIGLGLTIRPSLSLMLSAFSDADWAGCPDDRKSTGGYA 1309

Query: 987  IFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIWIQVLLRELGISQARARSLWCD 1017
            +F G NLISW+S+KQSTVSRSS EAEYKA+A+ATAE+IW+Q LL ELGI       LWCD
Sbjct: 1310 LFLGPNLISWNSKKQSTVSRSSTEAEYKAMANATAEVIWLQSLLHELGIRLTGIPRLWCD 1369

BLAST of CmoCh05G011780 vs. TrEMBL
Match: Q6ATL7_ORYSJ (Putative polyprotein OS=Oryza sativa subsp. japonica GN=OSJNBb0021K20.25 PE=4 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 5.9e-145
Identity = 312/737 (42.33%), Postives = 419/737 (56.85%), Query Frame = 1

Query: 374  KSILRLNNISCA--SYSSSHKGYKCLDTDSGRVYMSRDVIFDENVFPFKRARPNSSPTMQ 433
            K  L+  + +C    YS+ HKG+KCLD  +GRVY+SRDV+FDE  FPF +  PN    ++
Sbjct: 693  KHKLQFRSTTCTFLGYSTLHKGFKCLDPSTGRVYISRDVVFDETQFPFTKLHPNVGAKLR 752

Query: 434  STHNALDLCTLHLGN-----------------SSTNLENDHMHMSGPTNSLD-AENLVST 493
            +    +      L                   S+ N++ D  + + P    D A + VS 
Sbjct: 753  AEIALVPELAASLPRGLQQISSVINTPENANVSNENMQQDSTYDNEPETETDGAPDTVSA 812

Query: 494  SA-------------------SELPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPV 553
            +A                   S+    S AS P  SA   P    + +  P    +Q   
Sbjct: 813  NAPAESSGSPPINEPASPFGESDSATASPASAPVNSA-PHPDAAASGSSAPRGSTSQGGT 872

Query: 554  ESSAAGQPTVVASVA------PLATTDTAIPSMWILHLLLIP---RSSASEPTSHIIAME 613
             S A   P    +V       P     + I    +     +     +S  EP +   A++
Sbjct: 873  PSVAIDDPHPATTVTGQEAQRPRTRLQSGIRKEKVYTDGTVKWGMLTSTGEPENLQDALQ 932

Query: 614  HPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPDGSIDRYKARLIDIQ 673
            +  W+ AM+ E+ AL KN TWHLV P+ G NVIDCKWV+K+K+K DGS+DRYKARL+   
Sbjct: 933  NNNWKCAMDAEYMALIKNNTWHLVPPQQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKG 992

Query: 674  NAFLHGFLNEDVY--------------MKQPPGF----VDSQHP---------------- 733
                +G   ED +              +    G+    +D Q+                 
Sbjct: 993  FKQRYGIDYEDTFSPVVKAATIRIILSIAVSRGWCLRQLDVQNAFLHGVLEEEVYMKQPP 1052

Query: 734  --------GYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKADVSLFIFNKTGIQM 793
                     Y+CKLDK+LYGLKQAPRAW+SRLS K   L F  SKAD SLF +NK  + +
Sbjct: 1053 GYENPSTPDYVCKLDKALYGLKQAPRAWYSRLSGKLHDLGFKGSKADTSLFFYNKGSLTI 1112

Query: 794  YILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHKY 853
            ++LIYVDDII++SS   A+  LL  LQ +F +KDLG L YFLGIEV     G++++Q KY
Sbjct: 1113 FLLIYVDDIIVVSSRKEAVSALLQDLQKEFALKDLGDLHYFLGIEVTKIPGGILMSQEKY 1172

Query: 854  IRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDIS 913
              DLL R +M   K V TP+  SEKL+   G  L P D T+YRS+VGALQYL+LTR DI+
Sbjct: 1173 ASDLLKRVNMSDCKSVATPLSASEKLIAGKGTILGPNDATQYRSIVGALQYLTLTRLDIA 1232

Query: 914  FCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPDD 973
            F VN+VCQF+ +PT+ HWA VKRILRY+     + L + KSS+ ++S +SDADWAG  DD
Sbjct: 1233 FSVNKVCQFLHNPTTEHWAAVKRILRYIKQCTGLGLRICKSSSMIVSGYSDADWAGCLDD 1292

Query: 974  RRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIWIQVLLRELGISQA 1021
            RRSTGG+ ++ G NL+SW+++KQ+TVSRSS EAEYKA+A+ATAE++W+Q LL+EL I   
Sbjct: 1293 RRSTGGFAVYLGDNLVSWNAKKQATVSRSSTEAEYKALANATAEIMWVQTLLQELNIVSP 1352

BLAST of CmoCh05G011780 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 296.6 bits (758), Expect = 5.9e-80
Identity = 156/373 (41.82%), Postives = 224/373 (60.05%), Query Frame = 1

Query: 613 SIDRYKARLIDIQNAFLHGFLNEDVYMKQPPGFV----DSQHPGYLCKLDKSLYGLKQAP 672
           +I  +    +DI NAFL+G L+E++YMK PPG+     DS  P  +C L KS+YGLKQA 
Sbjct: 184 AIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQAS 243

Query: 673 RAWFSRLSSKQLQLDFTPSKADVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQ 732
           R WF + S   +   F  S +D + F+     + + +L+YVDDIII S++  A+++L +Q
Sbjct: 244 RQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQ 303

Query: 733 LQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEK 792
           L+  F ++DLG L YFLG+E+  +++G+ + Q KY  DLL  T +L  K    PM PS  
Sbjct: 304 LKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVT 363

Query: 793 LLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRIL 852
              + G      D   YR ++G L YL +TR DISF VN++ QF  +P   H   V +IL
Sbjct: 364 FSAHSGGDFV--DAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKIL 423

Query: 853 RYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQST 912
            Y+  T+   L  +  +   L  FSDA +    D RRST GY +F G +LISW S+KQ  
Sbjct: 424 HYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQV 483

Query: 913 VSRSSMEAEYKAVADATAELIWIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRT 972
           VS+SS EAEY+A++ AT E++W+    REL +  ++   L+CDN  A +++ N +FH RT
Sbjct: 484 VSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERT 543

Query: 973 KHVEVDYHFVRER 982
           KH+E D H VRER
Sbjct: 544 KHIESDCHSVRER 554

BLAST of CmoCh05G011780 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 199.5 bits (506), Expect = 9.9e-51
Identity = 109/228 (47.81%), Postives = 146/228 (64.04%), Query Frame = 1

Query: 703 MYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHK 762
           MY+L+YVDDI++  SS+T +  L+ QL   F +KDLG + YFLGI+++   SGL L+Q K
Sbjct: 1   MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 763 YIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDI 822
           Y   +L    ML  K + TP+       ++  K   P D   +RS+VGALQYL+LTRPDI
Sbjct: 61  YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSD---FRSIVGALQYLTLTRPDI 120

Query: 823 SFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPD 882
           S+ VN VCQ M  PT   +  +KR+LRY+  TI   L + K+S   + AF D+DWAG   
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180

Query: 883 DRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIW 931
            RRST G+  F G N+ISWS+++Q TVSRSS E EY+A+A   AEL W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh05G011780 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 82.8 bits (203), Expect = 1.3e-15
Identity = 38/76 (50.00%), Postives = 48/76 (63.16%), Query Frame = 1

Query: 547 LLIPRSSASEPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKL 606
           L I  +   EP S I A++ P W QAM +E  AL +NKTW LV P    N++ CKWVFK 
Sbjct: 18  LTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKT 77

Query: 607 KQKPDGSIDRYKARLI 623
           K   DG++DR KARL+
Sbjct: 78  KLHSDGTLDRLKARLV 93

BLAST of CmoCh05G011780 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 68.9 bits (167), Expect = 2.0e-11
Identity = 32/77 (41.56%), Postives = 48/77 (62.34%), Query Frame = 1

Query: 814 YLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFS 873
           YL++TRPD++F VNR+ QF S+  +     V ++L Y+  T+   L  + +S   L AF+
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61

Query: 874 DADWAGNPDDRRSTGGY 891
           D+DWA  PD RRS  G+
Sbjct: 62  DSDWASCPDTRRSVTGF 78

BLAST of CmoCh05G011780 vs. NCBI nr
Match: gi|657965292|ref|XP_008374297.1| (PREDICTED: uncharacterized protein LOC103437591 [Malus domestica])

HSP 1 Score: 580.1 bits (1494), Expect = 7.6e-162
Identity = 397/1106 (35.90%), Postives = 556/1106 (50.27%), Query Frame = 1

Query: 16   TVTRARTTQIRVELATSKKRDQSAANYFCKIKGLATELAAAGSALQDDDVIAYLLAGLGP 75
            + ++ R  Q+R EL  + + D S A+Y  K+  LA  LA +G+ + + D++A +++ +GP
Sbjct: 119  STSQNRIIQMRTELMNTSRGDLSIADYLDKVNILADNLALSGALVSESDLVAIIMSKVGP 178

Query: 76   DYDPFVTSMTTKSEALTLDDVFAHLMMYEAHQLQHQAELQLNSGSSANYTGRAELQLNLG 135
             ++  V S   +   +T   + + L+  E           + + + A   G        G
Sbjct: 179  QFETTVASAQARDTPITYTALESLLLSAEQRHTAFSLPSDVGTSAFATVRGGRGTSRGRG 238

Query: 136  SSANYASRGGQQKNRGRRDRGRGRSQGYAPSRPAGDRR---------------------- 195
             +      GG  +  G      G    ++ S  A  RR                      
Sbjct: 239  GAFRGGRGGGYARGSGSSRNYHG-DFSHSSSPAASSRRQPASSNGILDPAPASGGFSPAP 298

Query: 196  --GPSARPSCQICGKVGHTAIRCWHRMDESYQDEPSSAPPTALAATSSYKIDPN------ 255
               PS R  CQIC + GH+AI C++R++ SY+    S+   A AA    +  P       
Sbjct: 299  AFSPSGRIQCQICQRYGHSAIDCYNRLNMSYEGRVPSSRLQAYAAAPQSRGVPXAPXASP 358

Query: 256  ---WYSDTGATDHITSDLDRLAVHEHYHGGEQVQVGNGAS-LRVLHTGHSLINTATRSLA 315
               W  D+GA  HIT+D+ +L+    YHG +Q+   +G S L +   G S ++T      
Sbjct: 359  AQPWLFDSGANSHITNDVGQLSNPREYHGTDQIXXXHGGSGLHISKIGDSFLHTKLAXFX 418

Query: 316  LRNILHVIEISKHLLSVHKFSRDNDVFFEFDPWHFSIKDRQSRKSLLNGRCESGLYPIKP 375
            L N L+    S +++S+++F+ DND FF   P  + ++D ++ K L  G   +GLYP   
Sbjct: 419  LLNTLYCPHASTNIISLNRFAADNDCFFTIHPRFYHVQDSRTGKILFQGPSNNGLYPSHS 478

Query: 376  SDVDNLKHVLVSRSTTHAQWHARLGHPSSQVVKSILRLNNISCASYSSSHKGYKCLDTDS 435
            S   +     V    + A WH+R                                LD  +
Sbjct: 479  SHQPSGVFACVGERVSDALWHSR--------------------------------LDPST 538

Query: 436  GRVYMSRDVIFDENVFPFKRARPNSSPTMQSTHNALDLCTLHLGNSSTNLENDHMHMSGP 495
            GRV++SR V FDE+ FP+K +   S   + S+ + +    L +G S +           P
Sbjct: 539  GRVFLSRHVXFDEHXFPYKESIVPSPSXLSSSXDPV----LTIGPSPS--------XGAP 598

Query: 496  TNSL--DAENLVSTSAS-----------ELPQQSSASLPCESALVVPPMIEASAPPPADD 555
            T S+   + ++ ST  S             P +     P  SA  + P   AS+P PA  
Sbjct: 599  TPSIPQPSSSIPSTLPSXPTQRPLQVXTRXPSRPXXFPPXTSASPLSP--TASSPQPAFP 658

Query: 556  IAQCPVESSAAGQPTVVASVAPLATTDTAIPSMWILHLLLIPRSSAS------------- 615
            I      +  A  P V  + +  +T + +  S       ++ RS                
Sbjct: 659  ITPTXPSAPGANLPPVAPAASVSSTPEVSDXSH-----SMVTRSKVGVRKPNPKYVFVTT 718

Query: 616  ------EPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQK 675
                  EPT    A +H  WR+AM DEF ALQ N T  LV     +NV+  KWV+K+K++
Sbjct: 719  FSDXLVEPTCFSQAHKHTEWRKAMADEFTALQXNGTXTLVPFHHTMNVLPNKWVYKIKRR 778

Query: 676  PDGSIDRYKARLI------------------------------------------DIQNA 735
             DGSI+RYKARL+                                          D QNA
Sbjct: 779  XDGSIERYKARLVXNGFHQQEGLDYGETFSPVVNHATIRLILSIXIHYNWPXXQLDXQNA 838

Query: 736  FLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPS 795
            FLHG LNEDVYM+QPPGFVDSQ P ++CKL +SLYGLKQAPRAWF   S     L F  S
Sbjct: 839  FLHGSLNEDVYMRQPPGFVDSQRPXHVCKLQRSLYGLKQAPRAWFQCFSQHLEHLGFVSS 898

Query: 796  KADVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGI 855
              D SLF F    + +Y+LIYVDDI+I  +S+  I +L+ QL   F +KDLG L Y LG+
Sbjct: 899  HXDSSLFTFFDGSVXLYLLIYVDDILITGNSTDHIARLIQQLGVLFSMKDLGPLHYXLGM 958

Query: 856  EVRHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRS 915
            EV  T++GL LTQ KYI DLL RT+ML  K +  P +P  +L L+ G+ LS  D T +RS
Sbjct: 959  EVHXTAAGLHLTQAKYITDLLKRTNMLDCKPISAPXIPGRRLXLSDGEXLS--DLTEFRS 1018

Query: 916  VVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTD 975
            VV ALQYL  TRPDI F VN+VCQFM  PT IH   VKRILRYL  T +  + + K S  
Sbjct: 1019 VVXALQYLLFTRPDIXFAVNQVCQFMHRPTXIHXVAVKRILRYLKATPNHGI-VYKPSPL 1078

Query: 976  LLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAE 1014
             L+A+ DAD+AG+PDDRRSTGGY IF G NLISWSS+KQ  VSRSS EAEY  +A     
Sbjct: 1079 HLTAYXDADYAGDPDDRRSTGGYCIFLGDNLISWSSKKQRGVSRSSTEAEYXQLAXTAXA 1138

BLAST of CmoCh05G011780 vs. NCBI nr
Match: gi|77552925|gb|ABA95721.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 542.3 bits (1396), Expect = 1.8e-150
Identity = 299/623 (47.99%), Postives = 393/623 (63.08%), Query Frame = 1

Query: 450  NLENDHMHMSGPTNSLDAENLVSTSASELPQQSSASLPCESALVVPPMIEASAPPPADDI 509
            ++ + H    G T+S+   ++VS++A++    +  S       V  P    +A      I
Sbjct: 295  SITHPHDEPGGDTSSISPADVVSSAAADGMHATHGSATSSGGNVSSPHTFDAA----QQI 354

Query: 510  AQCPVESSAAGQPTVVASVAPLATTDTAIPSMWILHLLLIPRSSASEPTSHIIAMEHPLW 569
             Q PV  S  G         P   TD  +    +        +   EP+S   A+    W
Sbjct: 355  QQRPVTRSQHGVHR------PKKYTDGTVRYGCL--------TETGEPSSLQEALSSANW 414

Query: 570  RQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPDGSIDRYKARLIDIQNAFL 629
            +QAM+ EF AL  NKTWHLV P  G N+ID KWV+K+K+K DG+IDRYKARL+       
Sbjct: 415  KQAMDKEFSALLHNKTWHLVPPVKGKNIIDSKWVYKIKKKADGTIDRYKARLVAKGFKQR 474

Query: 630  HGFLNED------------------------------------------VYMKQPPGFVD 689
            +G   ED                                          VYM+QPPGF D
Sbjct: 475  YGIDYEDTFSPVVKAATIRLVLSIAMSQGWSLRQLDVQNAFLHGYLDEEVYMRQPPGFED 534

Query: 690  SQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKADVSLFIFNKTGIQMYILI 749
            ++ P +LCKLDK+LYGLKQAPRAW+SRLS K  +L F+ SKAD SLF +NK   +M++L+
Sbjct: 535  ARQPHFLCKLDKALYGLKQAPRAWYSRLSKKLQELGFSSSKADTSLFFYNKGHHKMFVLV 594

Query: 750  YVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEVRHTSSGLILTQHKYIRDL 809
            YVDDII+ SSSS A+  LL  L+ DF +KDLG L YFLGIEV+ T   L+L+Q +Y  +L
Sbjct: 595  YVDDIIVASSSSPAVNALLKDLEKDFALKDLGDLHYFLGIEVKRTPQTLLLSQERYTTEL 654

Query: 810  LARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVVGALQYLSLTRPDISFCVN 869
            L R +M + K V TP+  +EKL +  G +L P D T+YRS+VGALQYL+LTRPDIS+ VN
Sbjct: 655  LERVNMTSCKPVSTPLSTAEKLSVEIGDELGPSDVTQYRSIVGALQYLTLTRPDISYSVN 714

Query: 870  RVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLLSAFSDADWAGNPD----- 929
            +VCQF+ +PT+ HW+ VKRILRYL  T+D+ L + KSS++L+SAFSDADWAG+ D     
Sbjct: 715  KVCQFLQTPTTAHWSAVKRILRYLKGTLDLGLKIVKSSSNLVSAFSDADWAGSVDDRRST 774

Query: 930  ----DRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELIWIQVLLREL 989
                DRRSTGG+ +FFG NLISWS+RKQ+TVSRSS EAEYKA+A+A AE+IW++ LL EL
Sbjct: 775  GGFADRRSTGGFAVFFGDNLISWSARKQATVSRSSTEAEYKALANAAAEIIWVRKLLTEL 834

Query: 990  GISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRVISSKDQL 1022
            GI    A  LWCDN+GATY++ANP+FH RTKH+EVDYHFVRE+V+ + LD+  +SS+DQ+
Sbjct: 835  GILHPNAARLWCDNLGATYMTANPVFHARTKHIEVDYHFVREQVAQKLLDIHFVSSQDQV 894

BLAST of CmoCh05G011780 vs. NCBI nr
Match: gi|18254413|gb|AAL66754.1|AF464738_5 (putative copia-like retrotransposon Hopscotch polyprotein [Zea mays])

HSP 1 Score: 542.0 bits (1395), Expect = 2.3e-150
Identity = 273/511 (53.42%), Postives = 351/511 (68.69%), Query Frame = 1

Query: 552  SSASEPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPD 611
            +S+ EP     A+    W+ AM+ E+ AL KNKTWHLV P+ G NVI CKWV+K+K+K D
Sbjct: 800  TSSGEPYDLNEALGDVNWKDAMDIEYSALMKNKTWHLVPPKKGRNVIGCKWVYKIKRKAD 859

Query: 612  GSIDRYKARL----------IDIQNAFLH----------------------------GFL 671
            GS+DRYKARL          ID  + F                               FL
Sbjct: 860  GSLDRYKARLVAKGYKQQYGIDYDDTFSPVVKHATIRIILSIAVSRGWSLCQLDVQNAFL 919

Query: 672  N----EDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKA 731
            +    E+VYM+QPPG+ DS    Y+CKLDK+LYGLKQAPRAW+SRLS+K L L F  SKA
Sbjct: 920  HGVLEEEVYMQQPPGYEDSTKLNYVCKLDKALYGLKQAPRAWYSRLSNKLLSLGFQASKA 979

Query: 732  DVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEV 791
            D SLF +NK  + +++L+YVDDII+ SS+  A E LL+ L  +F +KDLG L+YFLGIEV
Sbjct: 980  DTSLFFYNKGSVTIFVLVYVDDIIVASSTHKATEALLSDLNKEFALKDLGDLNYFLGIEV 1039

Query: 792  RHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVV 851
                 G+ILTQ KY  DLL +  M   K + TP+  SEKL ++ G  L  +D T+YRS+V
Sbjct: 1040 NKVRDGIILTQDKYASDLLKKVGMSDCKPISTPLSTSEKLSIHEGSPLGEKDITQYRSIV 1099

Query: 852  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLL 911
            GALQYL+LTRPDI+F VN+VCQF+ +PT++HWA VKRILRY+    ++ L + +S + L+
Sbjct: 1100 GALQYLTLTRPDIAFSVNKVCQFLHAPTTLHWAAVKRILRYIKQCTNLGLHIHRSDSTLV 1159

Query: 912  SAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELI 971
            SAFSDADWAG+ DDR+STGG+ +F G NL+SWS+RKQ TVSRSS E+EYKA+A+ATAELI
Sbjct: 1160 SAFSDADWAGSVDDRKSTGGFAVFLGSNLVSWSARKQPTVSRSSTESEYKALANATAELI 1219

Query: 972  WIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDV 1021
            W+Q+LL E+ I   RA  LWCDN+GA YLSANPIFH RTKH+EVDYHFVR+RV+ + LD+
Sbjct: 1220 WVQILLTEISIKSPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDYHFVRDRVAKKLLDI 1279

BLAST of CmoCh05G011780 vs. NCBI nr
Match: gi|31432318|gb|AAP53968.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 538.5 bits (1386), Expect = 2.5e-149
Identity = 270/511 (52.84%), Postives = 343/511 (67.12%), Query Frame = 1

Query: 552  SSASEPTSHIIAMEHPLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPD 611
            ++  EP +   AM +  WR AM  E+ A   NKTWHLV P  G N+IDCKW++K+K+K D
Sbjct: 421  TTTGEPENLREAMANSNWRLAMEQEYSAFMSNKTWHLVPPTQGKNIIDCKWMYKIKRKAD 480

Query: 612  GSIDRYKARLIDIQNAFLHGFLNEDV---------------------------------- 671
            GSIDRYKARL+       +G   ED                                   
Sbjct: 481  GSIDRYKARLVAKGFKQRYGIDYEDTFSLVVKAATIRLILSIAVSKGWSLRQLDVQNAFL 540

Query: 672  --------YMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTPSKA 731
                    YM+QPPGF +   P YLCKLDK+LYGLKQAPRAW+SRLS+K  +L F  SKA
Sbjct: 541  HGYLEEEVYMRQPPGFENKGQPNYLCKLDKALYGLKQAPRAWYSRLSTKLQELGFISSKA 600

Query: 732  DVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLGIEV 791
            D SLF +NK G  ++IL+YVDDII+ SSS+  +  LL  L+ DF +KDLG L YFLGIEV
Sbjct: 601  DTSLFFYNKGGCTIFILVYVDDIIVASSSAEVVAALLKDLEKDFALKDLGDLHYFLGIEV 660

Query: 792  RHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYRSVV 851
            +  S GL+L+Q  Y  D+L R  M   K   TP+  +EKL +  G  L   D + YRS+V
Sbjct: 661  KKVSQGLVLSQAWYASDILKRAGMSICKPASTPLSTTEKLSIEDGDFLGQNDASHYRSIV 720

Query: 852  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSSTDLL 911
            GALQYL+LTR D+SF VN+VCQF+ SPT++HW+ VKRILRY+  T++  L   KS + L+
Sbjct: 721  GALQYLTLTRSDLSFLVNKVCQFLHSPTTVHWSAVKRILRYIKGTVEFGLRFGKSDSMLI 780

Query: 912  SAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATAELI 971
            SAFSDADWAG  DDRRSTGG+ +F G NLISWS+RKQ+TVSRSS EAEYKA+A+AT E+ 
Sbjct: 781  SAFSDADWAGCSDDRRSTGGFAVFLGPNLISWSARKQATVSRSSTEAEYKALANATTEVT 840

Query: 972  WIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDV 1021
            W++ +L EL I++     LWCDN+GATYLSANP+FH RTKH+E+DYHFVRE+V+ + LD+
Sbjct: 841  WVRKILDELRIARPSVAQLWCDNLGATYLSANPVFHARTKHIEIDYHFVREQVAKKLLDI 900

BLAST of CmoCh05G011780 vs. NCBI nr
Match: gi|802566240|ref|XP_012067590.1| (PREDICTED: uncharacterized protein LOC105630400 [Jatropha curcas])

HSP 1 Score: 538.1 bits (1385), Expect = 3.3e-149
Identity = 331/855 (38.71%), Postives = 471/855 (55.09%), Query Frame = 1

Query: 223  DPNWYSDTGATDHITSDLDRLAVHEHYHGGEQVQVGNGASLRVLHTGHSLINTATRSLAL 282
            DP+W  D+GA  H+  DL  LA+H  Y G + V +G+G S  + HTG   +N+AT+ L+L
Sbjct: 50   DPHWVVDSGANQHVVEDLKNLAIHSDYEGSDNVFLGDGKSYPISHTGTVTLNSATKPLSL 109

Query: 283  RNILHVIEISKHLLSVHKFSRDNDVFFEFDPWHFSIKDRQSRKSLLNGRCESGLYPIKPS 342
            +N+L V  + K+L+SV+K   DN+V   F+P+ F++KD  +   L  G+  + +Y   P+
Sbjct: 110  KNVLCVPALDKNLISVYKLCTDNNVEVTFNPFLFTVKDLSTGVLLAEGKPINDIYEWPPN 169

Query: 343  ----DVDNLKHVLVSRSTTHAQWHARLGHPSSQVVKSILRL-------NNISCASYSSSH 402
                   N+ H++ S   +   WH RLGHPS  +++ IL+        NN  C S  S+ 
Sbjct: 170  LSRGSSLNINHIVAS---SQPLWHRRLGHPSIPILRLILQSFDLLVSNNNFFCDS--STQ 229

Query: 403  KGYKCLDTDSGRVYMSRDVIFDENVFPFKRARPNSSPTMQSTHNALDLCT-LH-LGNSST 462
              YKC D  S ++Y+SR V F E+ FPF      ++   Q   + LD  + +H +   S 
Sbjct: 230  SAYKCYDPHSSKIYLSRHVDFIESDFPFSSL---ATALEQPDSHTLDTWSPIHFILPMSA 289

Query: 463  NLENDHMHMSGPTNSLDAENLVSTSASELPQQSSASLPCESALVVPPMIEASAPPPADDI 522
             L++     SG  N ++ ++  STS S + QQ   S   +S L           P   D+
Sbjct: 290  GLDS-----SGQPNYVNPDS--STSPSLVQQQPEHSNISDSGLGTEISNSPILLPTPSDV 349

Query: 523  AQCPVESSAAGQPTVVASVAPLATTDTAIPSMWILHLLLIPRSSAS---EPTSHIIAMEH 582
                + +S   Q     +V P+ T         I  L L     +     P +   A++ 
Sbjct: 350  VSSLLLASNLNQ-----NVHPMVTRSKNQIRKPINRLCLSTEVQSDIQFVPKTVAQALKD 409

Query: 583  PLWRQAMNDEFQALQKNKTWHLVSPRAGLNVIDCKWVFKLKQKPDGSIDRYKARLIDIQN 642
            P WR AM +E+QAL K KTW LV                            KA  +++ N
Sbjct: 410  PKWRSAMEEEYQALMKQKTWSLVPLE-------------------------KANNVNVNN 469

Query: 643  AFLHGFLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGLKQAPRAWFSRLSSKQLQLDFTP 702
            AFLHG LNEDV+M+QPPGF+   +  ++CKL+KSLYGLKQAPR W+S L+S  +   F  
Sbjct: 470  AFLHGKLNEDVFMEQPPGFIQHHNLKFVCKLEKSLYGLKQAPRVWYSALTSFLIAAGFQQ 529

Query: 703  SKADVSLFIFNKTGIQMYILIYVDDIIIISSSSTAIEKLLTQLQDDFVVKDLGLLSYFLG 762
            SK+D  LFI+++ G  +Y+L+YVDDIII  SS+  IE  + QL   F +KDLG L YFLG
Sbjct: 530  SKSDSCLFIYHRQGTVLYLLVYVDDIIITGSSAMRIEAFINQLGKAFSIKDLGNLHYFLG 589

Query: 763  IEVRHTSSGLILTQHKYIRDLLARTDMLTSKGVPTPMLPSEKLLLNGGKKLSPEDTTRYR 822
            +EV+ T++GL L+QH YIRDLL +  M  +K   TPM  S  + LN       +D T YR
Sbjct: 590  VEVKRTATGLFLSQHNYIRDLLEKAHMHEAKSASTPM--SSTMTLNSSDSACLQDATVYR 649

Query: 823  SVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWATVKRILRYLHDTIDMSLCLTKSST 882
            +++G+LQYL LTRPD++F VN++ Q+ S PT  HW  +KR+LRYL  T D  L L + + 
Sbjct: 650  ALIGSLQYLLLTRPDVAFVVNKLAQYTSKPTEKHWTALKRLLRYLVCTFDFGLNLCRQTD 709

Query: 883  DLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSMEAEYKAVADATA 942
            D L AF DADWAGN DDR ST  Y+IF G N ++WSS+KQ TV+RSS EAEYK+VA   A
Sbjct: 710  DDLHAFFDADWAGNHDDRTSTSAYIIFLGKNPVAWSSKKQKTVARSSTEAEYKSVASTAA 769

Query: 943  ELIWIQVLLRELGISQARARSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERV---- 1002
            +L W++ LL EL   + ++  ++CDN+GATY++AN +FH R KH+ +DYHFVR+ V    
Sbjct: 770  DLAWVRNLLDELQFPRPKSPVIYCDNLGATYVAANSVFHSRMKHIAIDYHFVRQHVQNGC 829

Query: 1003 ----STRQLDVRVISSK----------------------------DQLADIMTKPLPAPS 1026
                +T      V  S+                            DQLAD++TKPL   +
Sbjct: 830  DNLGATYVAANSVFHSRMKHIAIDYHFVRQHVQNGTLRVTHISAKDQLADMLTKPLSFTA 857

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC2.1e-7134.94Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME4.7e-7136.98Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
M810_ARATH1.8e-4947.81Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YCH4_YEAST4.2e-4332.58Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YD11B_YEAST1.4e-2225.48Transposon Ty1-DR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
Q2QY49_ORYSJ1.2e-15047.99Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
Q8W0X9_MAIZE1.6e-15053.42Putative copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=Z178A11... [more]
Q7XE22_ORYSJ1.8e-14952.84Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
V9GZT4_MAIZE3.1e-14645.38Copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=gag PE=4 SV=1[more]
Q6ATL7_ORYSJ5.9e-14542.33Putative polyprotein OS=Oryza sativa subsp. japonica GN=OSJNBb0021K20.25 PE=4 SV... [more]
Match NameE-valueIdentityDescription
AT4G23160.15.9e-8041.82 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.19.9e-5147.81ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.11.3e-1550.00ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00240.12.0e-1141.56ATMG00240.1 Gag-Pol-related retrotransposon family protein[more]
Match NameE-valueIdentityDescription
gi|657965292|ref|XP_008374297.1|7.6e-16235.90PREDICTED: uncharacterized protein LOC103437591 [Malus domestica][more]
gi|77552925|gb|ABA95721.1|1.8e-15047.99retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
gi|18254413|gb|AAL66754.1|AF464738_52.3e-15053.42putative copia-like retrotransposon Hopscotch polyprotein [Zea mays][more]
gi|31432318|gb|AAP53968.1|2.5e-14952.84retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
gi|802566240|ref|XP_012067590.1|3.3e-14938.71PREDICTED: uncharacterized protein LOC105630400 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006259 DNA metabolic process
biological_process GO:0022900 electron transport chain
cellular_component GO:0005739 mitochondrion
molecular_function GO:0009055 electron carrier activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh05G011780.1CmoCh05G011780.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 622..783
score: 1.1
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 335..390
score: 8.
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 144..339
score: 0.0coord: 552..945
score: 0.0coord: 14..125
score: 0.0coord: 358..515
score:
NoneNo IPR availablePANTHERPTHR11439:SF185SUBFAMILY NOT NAMEDcoord: 14..125
score: 0.0coord: 144..339
score: 0.0coord: 552..945
score: 0.0coord: 358..515
score:
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 19..106
score: 2.8
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 583..763
score: 1.16E-30coord: 795..977
score: 1.16

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None