CmoCh04G023510 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G023510
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionTransposon Ty1-H Gag-Pol polyprotein
LocationCmo_Chr04 : 17472419 .. 17476429 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACACAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTAAATATGATTTACTCGTTGTAGAAATGTGTTTAGTAGAGTATGACAACTCAACTTGGATACTAGATTCAGGGGCGACTAATCATATTTGTTCTTTTTACCAGGAAACTAGCTCCTGGAGAATGCTTGCGGACGGCGAGATAACACTCAGGGTTGGAACAGAAGAGGTTGTCTCAGCAAGATCAGTGGGAAATTTAAAGTTGTTTTTTGGAGATAGATTCATTATATTAGATAATGTACTTTTTGTTCCAGGAATGAAAAGAAATCTAATATCCATCTCTTGTTTATTAGAACAGTTGTATAAAGTATCTTTTGAAATTAATGAAGTGTTCATTTGCAAAAGAGGTATTCATATTTGTTCTGCAAAACTAGAAAACAACTTATATATGTTAAAACCGAGCAAAACAAAAGCTATTTTAAATACTGAGATGTTTAAAACAGCTGAAACTCAAAATAAACGACAAAAGATTTCTCCTAATATCTTTCTTTGGCATTTAAGACTAGGCCACATTAATCTCAATAGGATTGAGAGATTGGTTAAAAGGGGACTTCTAAATAAGTTAGAAGACAATTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATGACTAAACGATCATTTTGCGAAAAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCACTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTAGTTTACGTCACTTCAGAATTTGGGGATGTCCAGCACATGTGTTGTTGCAAAATCCCAAGAAATTAGAACGTCGTTCAAAATTATGTCTATTCGTAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTAGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTTTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCATTAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGAGAATTCTGACTGACATTAAGCATTGGCTGGCAACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCGAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACTCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATTCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGTTAAGGATCTGATCCTTATAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGTTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAACGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA

mRNA sequence

ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACACAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCACTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTAGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGAGAATTCTGACTGACATTAAGCATTGGCTGGCAACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCGAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACTCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATTCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGTTAAGGATCTGATCCTTATAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGTTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAACGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA

Coding sequence (CDS)

ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTTTGTTTTAACTGAGGAGTGTCCTCCAAACACAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCACTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTAGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGAGAATTCTGACTGACATTAAGCATTGGCTGGCAACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCGAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACTCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATTCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGTTAAGGATCTGATCCTTATAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGTTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAACGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
BLAST of CmoCh04G023510 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 7.0e-80
Identity = 160/359 (44.57%), Postives = 226/359 (62.95%), Query Frame = 1

Query: 612  GVKPIGCKWIYK----RKRDQTGKNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVRI 671
            G+K    +W  K     K     K   +PCVY KR   +    L+LYVDD+L++G D  +
Sbjct: 962  GLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGL 1021

Query: 672  LTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKG 731
            +  +K  L+  F MKDLG AQ +LG++I+R R ++ L LSQ  YI+++L R+ M+++K  
Sbjct: 1022 IAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPV 1081

Query: 732  LLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQS 791
              P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  
Sbjct: 1082 STPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLE 1141

Query: 792  NPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLN 851
            NPG+ HW AVK IL+YLR T    L +G  D IL GYTD+D   D+D+RKS++G +FT +
Sbjct: 1142 NPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFS 1201

Query: 852  GGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNS 911
            GGAI W+S  Q C+A ST EAEY+ A E  KE +WL++FL +L +    ++   +YCD+ 
Sbjct: 1202 GGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYV---VYCDSQ 1261

Query: 912  GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFE 966
             A+  SK    H R KHI+ +YH IRE+V    + V +I++  N AD  TK +    FE
Sbjct: 1262 SAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFE 1317

BLAST of CmoCh04G023510 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 200.3 bits (508), Expect = 9.8e-50
Identity = 122/345 (35.36%), Postives = 191/345 (55.36%), Query Frame = 1

Query: 637  CVY---KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIE 696
            C+Y   K  +N  + +++LYVDD+++   D+  + + K +L  +F+M DL E +  +GI 
Sbjct: 1069 CIYILDKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIR 1128

Query: 697  IIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKH 756
            I    +   + LSQ++Y+ K+L ++ M++      P    ++     S       ED  +
Sbjct: 1129 I--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSD------EDC-N 1188

Query: 757  IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY 816
             P  S +G LMY MLCTRPD+  AV I+SRY S      W  +K +L+YL+ T D  L++
Sbjct: 1189 TPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIF 1248

Query: 817  GVKDLI----LIGYTDSDFQTDVDSRKSTSGSVFTL-NGGAIIWRSIKQGCIADSTMEAE 876
              K+L     +IGY DSD+      RKST+G +F + +   I W + +Q  +A S+ EAE
Sbjct: 1249 K-KNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAE 1308

Query: 877  YVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKY 936
            Y+   EA +E++WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KY
Sbjct: 1309 YMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKY 1368

Query: 937  HLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGL 974
            H  RE VQ   + +  I +E+ +AD FTKPL A  F      LGL
Sbjct: 1369 HFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of CmoCh04G023510 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 106.3 bits (264), Expect = 1.9e-21
Identity = 86/284 (30.28%), Postives = 125/284 (44.01%), Query Frame = 1

Query: 603 VWELVDQPDGVKPIGCKWIYKRKRDQTGKNVD------EPCVYKRIVNSTVAFLVLYVDD 662
           VWEL     G+K     W      + T K +       E  +Y R  +    ++ +YVDD
Sbjct: 33  VWELYGGMYGLKQAPLLW--NEHINNTLKKIGFCRHEGEHGLYFRSTSDGPIYIAVYVDD 92

Query: 663 ILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLI 722
           +L+     +I   +K  L   + MKDLG+    LG+ I ++  N  + LS   YI K   
Sbjct: 93  LLVAAPSPKIYDRVKQELTKLYSMKDLGKVDKFLGLNIHQS-SNGDITLSLQDYIAKAAS 152

Query: 723 RYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICY 782
             ++   K    P  +   L +  SP     ++D+   PY S VG L++     RPDI Y
Sbjct: 153 ESEINTFKLTQTPLCNSKPLFETTSP----HLKDIT--PYQSIVGQLLFCANTGRPDISY 212

Query: 783 AVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSR 842
            V ++SR+   P   H  + + +L+YL  TR   L Y     L L  Y D+      D  
Sbjct: 213 PVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKYRSGSQLALTVYCDASHGAIHDLP 272

Query: 843 KSTSGSVFTLNGGAIIWRSIK-QGCIADSTMEAEYVVACEAAKE 879
            ST G V  L G  + W S K +G I   + EAEY+ A E   E
Sbjct: 273 HSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYITASETVME 307

BLAST of CmoCh04G023510 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 6.2e-20
Identity = 76/234 (32.48%), Postives = 117/234 (50.00%), Query Frame = 1

Query: 649 FLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQA 708
           +L+LYVDDILL G+   +L  +   L++ F MKDLG   + LGI+I  +     L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 709 SYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAML 768
            Y +++L    M D K    P    ++ S   + K P   +      + S VG+L Y  L
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSD------FRSIVGALQYLTL 121

Query: 769 CTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSD 828
            TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSD 181

Query: 829 FQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVW 882
           +     +R+ST+G    L    I W + +Q  ++ S+ E EY      A E  W
Sbjct: 182 WAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh04G023510 vs. Swiss-Prot
Match: YP14B_YEAST (Transposon Ty1-PR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-PR3 PE=1 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 7.1e-16
Identity = 51/172 (29.65%), Postives = 90/172 (52.33%), Query Frame = 1

Query: 272 CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGGYEYFISFIDDYSR 331
           CP  L  K  +    +G R K     E  + +HTD+ GP++   +    YFISF D+ ++
Sbjct: 637 CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 332 YGYLYLMHHKSE--ALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR 391
           + ++Y +H + E   L+ F      ++N    ++  ++ DRG EY +     ++ ++GI 
Sbjct: 697 FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 392 SQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN 436
              + T   + +GV+ER NRTLLD  R+ +  + LP+  W  A+E +T + N
Sbjct: 757 PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRN 808

BLAST of CmoCh04G023510 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 6.2e-160
Identity = 273/314 (86.94%), Postives = 299/314 (95.22%), Query Frame = 1

Query: 631  KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
            +NVDEPCVYK+IVNS VAFL+LYVDDILLIGNDV  LTD+K WL TQFQMKDLGEAQ++L
Sbjct: 979  QNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYIL 1038

Query: 691  GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
            GI+I+RNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVED
Sbjct: 1039 GIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVED 1098

Query: 751  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
            M++IPY+SAVGSLMYAMLCTRPDICY+VGIVSRYQSNPGR HWTAVKNILKYLRRTR+YM
Sbjct: 1099 MRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYM 1158

Query: 811  LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYV 870
            L+YG KDLIL GYTDSDFQ+D D+RKSTSGSVFTLNGGA++WRS+KQ CIADSTMEAEYV
Sbjct: 1159 LVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTCIADSTMEAEYV 1218

Query: 871  VACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHL 930
             ACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRGKHIERKYHL
Sbjct: 1219 AACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHL 1278

Query: 931  IREIVQRGDVIVTQ 945
            IREIV RGDV+VTQ
Sbjct: 1279 IREIVHRGDVVVTQ 1292

BLAST of CmoCh04G023510 vs. TrEMBL
Match: A5AUE7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 2.7e-147
Identity = 254/346 (73.41%), Postives = 302/346 (87.28%), Query Frame = 1

Query: 631 KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
           +N+ EPCVYK+I    V FLVLYVDDILLIGNDV  L+ +K+WLA+QFQMKDLGEA ++L
Sbjct: 319 QNLGEPCVYKQIGGDKVVFLVLYVDDILLIGNDVESLSKVKNWLASQFQMKDLGEASYIL 378

Query: 691 GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
           GI++ R+RKN+ LALSQA+YIDK+L+++ M++SKKG LP RHGVHLSKEQ PKTPQ+ E 
Sbjct: 379 GIQMTRDRKNRLLALSQAAYIDKVLVKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEK 438

Query: 751 MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
           M+ +PYASAVGSLMYAMLCTRPDIC+AVG+VSRYQSNPG  HW AVK+ILKYLRRTR+YM
Sbjct: 439 MRRVPYASAVGSLMYAMLCTRPDICFAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYM 498

Query: 811 LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYV 870
           L+Y  ++LI IGYTDSDFQ+D DSRKSTS +VFTL GGAIIWRS+KQ C+ADSTMEAEYV
Sbjct: 499 LVYSGRELIPIGYTDSDFQSDRDSRKSTSEAVFTLGGGAIIWRSVKQTCVADSTMEAEYV 558

Query: 871 VACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHL 930
            ACEAAKE+VWLR+FL +LEVVPNMH P+ LYCDNSGAVAN+KEPR+H++GKHIERK+HL
Sbjct: 559 AACEAAKEAVWLREFLKELEVVPNMHEPIRLYCDNSGAVANAKEPRNHRKGKHIERKFHL 618

Query: 931 IREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM 977
           +REIV RGDV V +IAS +NIADPFTK L A+ FE HL  +GLR M
Sbjct: 619 VREIVSRGDVSVEKIASANNIADPFTKTLPARSFEQHLEGMGLREM 664

BLAST of CmoCh04G023510 vs. TrEMBL
Match: D3IVU0_9POAL (Putative retrotransposon protein OS=Phyllostachys edulis PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 2.5e-145
Identity = 287/621 (46.22%), Postives = 396/621 (63.77%), Query Frame = 1

Query: 366 GEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGY 425
           GE++ L F +++ + GI  QL+  G PQ NGVSERRNRTLLDMVRSMMS + LP  FWGY
Sbjct: 292 GEFLSLEFGNHLKQCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLYFWGY 351

Query: 426 AVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEED 485
           A+ETA +ILN +  KS  +  + +     GYP+ETKG  FY+ +E++VFV+ N  FLE++
Sbjct: 352 ALETAAFILNKLTPKS--DKCFFI-----GYPRETKGYYFYNREEDKVFVARNGVFLEKE 411

Query: 486 HVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSG 545
            +       K+ L EI + + + +T       P+  ++D         P  +   PRRS 
Sbjct: 412 FLSKGVSGRKVRLEEIRETSENVST-------PTEHLLDEQIVV---EPVDKAPAPRRSK 471

Query: 546 RVITQPDRY---LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLE--MESMYF 605
           R    P RY   + L    +    D     +  K A  + +  + +     E  ++    
Sbjct: 472 RPRQLPTRYGHDILLLNNAIAAYFDYEIWQMDVKTAFLNGNLHEDVYMTQPEGFVDPNNA 531

Query: 606 NSVWELVDQPDGVKPIGCKWIYKR----KRDQTGKNVDEPCVYKRIVNSTVAFLVLYVDD 665
           + V +L     G+K     W  +     KR    KN +EPCVY ++  ST+  L+LYVDD
Sbjct: 532 SKVCKLQKSIYGLKQASRSWNIRFDEEIKRFGFIKNKEEPCVYMKVSGSTLVILILYVDD 591

Query: 666 ILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLI 725
           ILL+GND+ +L  +K  L   F MKDLG+A ++LGI I R+R  + + LSQ  YIDK+L 
Sbjct: 592 ILLVGNDIPMLESVKSSLRKSFSMKDLGDAAYILGIRIYRDRSKRLIGLSQEMYIDKVLN 651

Query: 726 RYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICY 785
           R+ MQ+SK+G LP  HG++LSK Q P T  E + M  IPYASA+GS+MYAM+CTRPD+ Y
Sbjct: 652 RFNMQNSKRGFLPMAHGINLSKNQCPTTTDERDKMSDIPYASAIGSIMYAMICTRPDVSY 711

Query: 786 AVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSR 845
           A+ + SRYQ++P   HWTAVKNILKYLRRT+D  L+Y G ++L++ GYTD+ FQTD D  
Sbjct: 712 ALSVTSRYQADPSEGHWTAVKNILKYLRRTKDVFLVYGGDEELVVNGYTDASFQTDKDDY 771

Query: 846 KSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNM 905
           +S SG VF LNGGA+ W+S KQ  +ADST EAEY+ A EAAKE VW+R F+T+L +VP+ 
Sbjct: 772 RSQSGFVFILNGGAVSWKSSKQETVADSTTEAEYIAASEAAKEGVWIRNFITELGMVPSA 831

Query: 906 HLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF 965
             P+ LYCDN+GA+A +KEPRSH++ KHI R+YHLIRE+V RGDV + ++ ++ NIADP 
Sbjct: 832 SSPMDLYCDNNGAIAQAKEPRSHQKSKHILRRYHLIRELVDRGDVKICKVHTDLNIADPL 891

Query: 966 TKPLTAKVFEGHLVSLGLRVM 977
           TKPLT    E H  ++G+R +
Sbjct: 892 TKPLTQPKHEAHTRAIGIRYL 895

BLAST of CmoCh04G023510 vs. TrEMBL
Match: H2KWP6_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g25734 PE=4 SV=1)

HSP 1 Score: 520.0 bits (1338), Expect = 6.2e-144
Identity = 329/826 (39.83%), Postives = 448/826 (54.24%), Query Frame = 1

Query: 254 ADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGGY 313
           A   +CF C E  HWKRNC KYL + K +K Q G  +     +         N K +   
Sbjct: 132 AKDAECFFCKEADHWKRNCKKYLEQLK-QKQQDGKSSTSVYNI---------NAKRQRPN 191

Query: 314 EYFISFIDDYSRYGYLYLMHHKSEALEK-----------FREYKTEVQNLLGKTIK---T 373
           +   +FI       + +L H   + +EK           F  ++T    LLGK  K   T
Sbjct: 192 DLNPTFI------WHCHLGHINEKRMEKLHRDGLLHSFDFESFETCESCLLGKMTKAPFT 251

Query: 374 LRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLP 433
            +S+R  E + L + D   E GI  QL+  G PQ NGVSERRNRTLLDMVRSMMS   LP
Sbjct: 252 GQSERASELLGLVYTD---ECGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQTDLP 311

Query: 434 DPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------- 493
             FWGYA+ETA + LN VP+KS+ +TPYE+W G++                         
Sbjct: 312 LSFWGYALETAAFTLNRVPSKSLDKTPYEIWTGKRPSLSFLKIWGCEVYVKRLQSDKLTP 371

Query: 494 --------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEAT 553
                   GYPKETKG  FY+ +E++VFV+ +  FLE++ +        + L EI +   
Sbjct: 372 KSDKYFFVGYPKETKGYYFYNREEDKVFVARHDVFLEKEFISTKDSGIMVRLEEIQETPK 431

Query: 554 DKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY--LGLAETQVII 613
           + +T    Q      V        +  P  E    RRS R+   P RY  L   +  +++
Sbjct: 432 NASTSTQPQQDEQDVVQQVEQVVVE--PVVEAPASRRSERIWRTPARYALLTTGQRDILL 491

Query: 614 PDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKR 673
            D+  + P TY++AM   D  +W+ AM  E+ESM+ N VW LVD PDGVK I CKW++K+
Sbjct: 492 LDN--DKPTTYEEAMVGPDSKKWLGAMKSEIESMHVNQVWNLVDPPDGVKGIECKWVFKK 551

Query: 674 KRDQTGKNVDEPCVYK-RIVNSTVAFL--VLYVDDILLIG--NDVRILT------DIKHW 733
           K D  G NV    +YK R+V      +  V Y      +G    +RI+       D + W
Sbjct: 552 KTDLDG-NVH---IYKARLVAKGFRQIQSVDYDGTFSSVGMLKSIRIILAIPAYFDYEIW 611

Query: 734 ---LATQFQMKDLGEAQFVLGIE------------------IIRNRKN------------ 793
              + T F  ++L E + + G++                   ++N +             
Sbjct: 612 QMDVKTAFLNENLDEDKSIYGLKQASRSWNIRFDEVVKALGFVKNEEEPCVYKKISGSAL 671

Query: 794 ------------------KTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSP 853
                             + L  SQ++YIDK+L R+ MQDSKKG LP  HG++L K Q P
Sbjct: 672 DLREAAYILSIRIYRDRSRRLIGSQSTYIDKVLKRFNMQDSKKGFLPLSHGINLGKNQCP 731

Query: 854 KTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKY 913
           +T  E   M  IPYASA+GS+MYAMLCTRPD+ YA+   S+YQS+PG +HW AVKNILKY
Sbjct: 732 QTTDERNKMSVIPYASAIGSIMYAMLCTRPDVPYALSATSQYQSDPGESHWIAVKNILKY 791

Query: 914 LRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIA 968
           LRRT+D  L+Y G ++L++  YTD+ FQTD D  +S SG VF LNGGA+ W+S KQ  +A
Sbjct: 792 LRRTKDMFLIYGGQEELVVNNYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTVA 851

BLAST of CmoCh04G023510 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 1.1e-140
Identity = 246/316 (77.85%), Postives = 278/316 (87.97%), Query Frame = 1

Query: 631  KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
            +NVDE CVYK+I  S VAFL+LYVDDILLIGNDV  L D+K WL T F MKDLGEAQ++L
Sbjct: 995  QNVDESCVYKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYIL 1054

Query: 691  GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
            GI I R+R NKT+ +SQ++YIDK+L R+KMQDSKKGLLPFRHG+HLSKEQ PKTPQEVED
Sbjct: 1055 GIRIYRDRSNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVED 1114

Query: 751  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
            M++IPY+SA+GSLMYAMLCTRPD+CYA+ IVSRYQSNPGR HWTAVKNILKYLRRTR+  
Sbjct: 1115 MRNIPYSSAIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMF 1174

Query: 811  LMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEY 870
            L+Y G KDL + GYTDS FQTD D  KS SG VFTLNGGA+ WRS KQ C+ADST EAEY
Sbjct: 1175 LVYGGDKDLAVKGYTDSSFQTDKDDSKSQSG-VFTLNGGAVSWRSSKQTCVADSTCEAEY 1234

Query: 871  VVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYH 930
            V ACEAAKE+VW+RKFLTDL VVPNMHLP+TLYCDNSGAVAN+KEPRSHKRGKHIERKYH
Sbjct: 1235 VAACEAAKEAVWIRKFLTDLGVVPNMHLPITLYCDNSGAVANAKEPRSHKRGKHIERKYH 1294

Query: 931  LIREIVQRGDVIVTQI 946
            LIREIV+RGDV+V Q+
Sbjct: 1295 LIREIVERGDVVVCQM 1309

BLAST of CmoCh04G023510 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 137.5 bits (345), Expect = 4.4e-32
Identity = 100/343 (29.15%), Postives = 168/343 (48.98%), Query Frame = 1

Query: 596 ESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTG----KNVDEPCVYKRIVNSTVAFLV 655
           +S+  N+V  L     G+K    +W  K      G    ++  +   + +I  +    ++
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 656 LYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYI 715
           +YVDDI++  N+   + ++K  L + F+++DLG  ++ LG+EI R+     + + Q  Y 
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAG--INICQRKYA 341

Query: 716 DKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTR 775
             +L    +   K   +P    V  S         +  D K   Y   +G LMY  + TR
Sbjct: 342 LDLLDETGLLGCKPSSVPMDPSVTFSAHSGG----DFVDAK--AYRRLIGRLMYLQI-TR 401

Query: 776 PDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVK-DLILIGYTDSDFQT 835
            DI +AV  +S++   P  AH  AV  IL Y++ T    L Y  + ++ L  ++D+ FQ+
Sbjct: 402 LDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS 461

Query: 836 DVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLE 895
             D+R+ST+G    L    I W+S KQ  ++ S+ EAEY     A  E +WL +F  +L+
Sbjct: 462 CKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQ 521

Query: 896 VVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE 934
           +   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Sbjct: 522 L--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of CmoCh04G023510 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 101.3 bits (251), Expect = 3.5e-21
Identity = 76/234 (32.48%), Postives = 117/234 (50.00%), Query Frame = 1

Query: 649 FLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQA 708
           +L+LYVDDILL G+   +L  +   L++ F MKDLG   + LGI+I  +     L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 709 SYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAML 768
            Y +++L    M D K    P    ++ S   + K P   +      + S VG+L Y  L
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSD------FRSIVGALQYLTL 121

Query: 769 CTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSD 828
            TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSD 181

Query: 829 FQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVW 882
           +     +R+ST+G    L    I W + +Q  ++ S+ E EY      A E  W
Sbjct: 182 WAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh04G023510 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 573.2 bits (1476), Expect = 8.9e-160
Identity = 273/314 (86.94%), Postives = 299/314 (95.22%), Query Frame = 1

Query: 631  KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
            +NVDEPCVYK+IVNS VAFL+LYVDDILLIGNDV  LTD+K WL TQFQMKDLGEAQ++L
Sbjct: 979  QNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYIL 1038

Query: 691  GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
            GI+I+RNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVED
Sbjct: 1039 GIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVED 1098

Query: 751  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
            M++IPY+SAVGSLMYAMLCTRPDICY+VGIVSRYQSNPGR HWTAVKNILKYLRRTR+YM
Sbjct: 1099 MRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYM 1158

Query: 811  LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYV 870
            L+YG KDLIL GYTDSDFQ+D D+RKSTSGSVFTLNGGA++WRS+KQ CIADSTMEAEYV
Sbjct: 1159 LVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTCIADSTMEAEYV 1218

Query: 871  VACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHL 930
             ACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRGKHIERKYHL
Sbjct: 1219 AACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHL 1278

Query: 931  IREIVQRGDVIVTQ 945
            IREIV RGDV+VTQ
Sbjct: 1279 IREIVHRGDVVVTQ 1292

BLAST of CmoCh04G023510 vs. NCBI nr
Match: gi|147768021|emb|CAN69397.1| (hypothetical protein VITISV_021035 [Vitis vinifera])

HSP 1 Score: 531.2 bits (1367), Expect = 3.9e-147
Identity = 254/346 (73.41%), Postives = 302/346 (87.28%), Query Frame = 1

Query: 631 KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
           +N+ EPCVYK+I    V FLVLYVDDILLIGNDV  L+ +K+WLA+QFQMKDLGEA ++L
Sbjct: 319 QNLGEPCVYKQIGGDKVVFLVLYVDDILLIGNDVESLSKVKNWLASQFQMKDLGEASYIL 378

Query: 691 GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
           GI++ R+RKN+ LALSQA+YIDK+L+++ M++SKKG LP RHGVHLSKEQ PKTPQ+ E 
Sbjct: 379 GIQMTRDRKNRLLALSQAAYIDKVLVKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEK 438

Query: 751 MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
           M+ +PYASAVGSLMYAMLCTRPDIC+AVG+VSRYQSNPG  HW AVK+ILKYLRRTR+YM
Sbjct: 439 MRRVPYASAVGSLMYAMLCTRPDICFAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYM 498

Query: 811 LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYV 870
           L+Y  ++LI IGYTDSDFQ+D DSRKSTS +VFTL GGAIIWRS+KQ C+ADSTMEAEYV
Sbjct: 499 LVYSGRELIPIGYTDSDFQSDRDSRKSTSEAVFTLGGGAIIWRSVKQTCVADSTMEAEYV 558

Query: 871 VACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHL 930
            ACEAAKE+VWLR+FL +LEVVPNMH P+ LYCDNSGAVAN+KEPR+H++GKHIERK+HL
Sbjct: 559 AACEAAKEAVWLREFLKELEVVPNMHEPIRLYCDNSGAVANAKEPRNHRKGKHIERKFHL 618

Query: 931 IREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM 977
           +REIV RGDV V +IAS +NIADPFTK L A+ FE HL  +GLR M
Sbjct: 619 VREIVSRGDVSVEKIASANNIADPFTKTLPARSFEQHLEGMGLREM 664

BLAST of CmoCh04G023510 vs. NCBI nr
Match: gi|284434733|gb|ADB85430.1| (putative retrotransposon protein [Phyllostachys edulis])

HSP 1 Score: 524.6 bits (1350), Expect = 3.6e-145
Identity = 287/621 (46.22%), Postives = 396/621 (63.77%), Query Frame = 1

Query: 366 GEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGY 425
           GE++ L F +++ + GI  QL+  G PQ NGVSERRNRTLLDMVRSMMS + LP  FWGY
Sbjct: 292 GEFLSLEFGNHLKQCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLYFWGY 351

Query: 426 AVETATYILNMVPTKSVLETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEED 485
           A+ETA +ILN +  KS  +  + +     GYP+ETKG  FY+ +E++VFV+ N  FLE++
Sbjct: 352 ALETAAFILNKLTPKS--DKCFFI-----GYPRETKGYYFYNREEDKVFVARNGVFLEKE 411

Query: 486 HVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSG 545
            +       K+ L EI + + + +T       P+  ++D         P  +   PRRS 
Sbjct: 412 FLSKGVSGRKVRLEEIRETSENVST-------PTEHLLDEQIVV---EPVDKAPAPRRSK 471

Query: 546 RVITQPDRY---LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLE--MESMYF 605
           R    P RY   + L    +    D     +  K A  + +  + +     E  ++    
Sbjct: 472 RPRQLPTRYGHDILLLNNAIAAYFDYEIWQMDVKTAFLNGNLHEDVYMTQPEGFVDPNNA 531

Query: 606 NSVWELVDQPDGVKPIGCKWIYKR----KRDQTGKNVDEPCVYKRIVNSTVAFLVLYVDD 665
           + V +L     G+K     W  +     KR    KN +EPCVY ++  ST+  L+LYVDD
Sbjct: 532 SKVCKLQKSIYGLKQASRSWNIRFDEEIKRFGFIKNKEEPCVYMKVSGSTLVILILYVDD 591

Query: 666 ILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLI 725
           ILL+GND+ +L  +K  L   F MKDLG+A ++LGI I R+R  + + LSQ  YIDK+L 
Sbjct: 592 ILLVGNDIPMLESVKSSLRKSFSMKDLGDAAYILGIRIYRDRSKRLIGLSQEMYIDKVLN 651

Query: 726 RYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICY 785
           R+ MQ+SK+G LP  HG++LSK Q P T  E + M  IPYASA+GS+MYAM+CTRPD+ Y
Sbjct: 652 RFNMQNSKRGFLPMAHGINLSKNQCPTTTDERDKMSDIPYASAIGSIMYAMICTRPDVSY 711

Query: 786 AVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSR 845
           A+ + SRYQ++P   HWTAVKNILKYLRRT+D  L+Y G ++L++ GYTD+ FQTD D  
Sbjct: 712 ALSVTSRYQADPSEGHWTAVKNILKYLRRTKDVFLVYGGDEELVVNGYTDASFQTDKDDY 771

Query: 846 KSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNM 905
           +S SG VF LNGGA+ W+S KQ  +ADST EAEY+ A EAAKE VW+R F+T+L +VP+ 
Sbjct: 772 RSQSGFVFILNGGAVSWKSSKQETVADSTTEAEYIAASEAAKEGVWIRNFITELGMVPSA 831

Query: 906 HLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF 965
             P+ LYCDN+GA+A +KEPRSH++ KHI R+YHLIRE+V RGDV + ++ ++ NIADP 
Sbjct: 832 SSPMDLYCDNNGAIAQAKEPRSHQKSKHILRRYHLIRELVDRGDVKICKVHTDLNIADPL 891

Query: 966 TKPLTAKVFEGHLVSLGLRVM 977
           TKPLT    E H  ++G+R +
Sbjct: 892 TKPLTQPKHEAHTRAIGIRYL 895

BLAST of CmoCh04G023510 vs. NCBI nr
Match: gi|108862621|gb|ABG22008.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 520.0 bits (1338), Expect = 8.9e-144
Identity = 329/826 (39.83%), Postives = 448/826 (54.24%), Query Frame = 1

Query: 254 ADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGGY 313
           A   +CF C E  HWKRNC KYL + K +K Q G  +     +         N K +   
Sbjct: 132 AKDAECFFCKEADHWKRNCKKYLEQLK-QKQQDGKSSTSVYNI---------NAKRQRPN 191

Query: 314 EYFISFIDDYSRYGYLYLMHHKSEALEK-----------FREYKTEVQNLLGKTIK---T 373
           +   +FI       + +L H   + +EK           F  ++T    LLGK  K   T
Sbjct: 192 DLNPTFI------WHCHLGHINEKRMEKLHRDGLLHSFDFESFETCESCLLGKMTKAPFT 251

Query: 374 LRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLP 433
            +S+R  E + L + D   E GI  QL+  G PQ NGVSERRNRTLLDMVRSMMS   LP
Sbjct: 252 GQSERASELLGLVYTD---ECGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQTDLP 311

Query: 434 DPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------- 493
             FWGYA+ETA + LN VP+KS+ +TPYE+W G++                         
Sbjct: 312 LSFWGYALETAAFTLNRVPSKSLDKTPYEIWTGKRPSLSFLKIWGCEVYVKRLQSDKLTP 371

Query: 494 --------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEAT 553
                   GYPKETKG  FY+ +E++VFV+ +  FLE++ +        + L EI +   
Sbjct: 372 KSDKYFFVGYPKETKGYYFYNREEDKVFVARHDVFLEKEFISTKDSGIMVRLEEIQETPK 431

Query: 554 DKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY--LGLAETQVII 613
           + +T    Q      V        +  P  E    RRS R+   P RY  L   +  +++
Sbjct: 432 NASTSTQPQQDEQDVVQQVEQVVVE--PVVEAPASRRSERIWRTPARYALLTTGQRDILL 491

Query: 614 PDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKR 673
            D+  + P TY++AM   D  +W+ AM  E+ESM+ N VW LVD PDGVK I CKW++K+
Sbjct: 492 LDN--DKPTTYEEAMVGPDSKKWLGAMKSEIESMHVNQVWNLVDPPDGVKGIECKWVFKK 551

Query: 674 KRDQTGKNVDEPCVYK-RIVNSTVAFL--VLYVDDILLIG--NDVRILT------DIKHW 733
           K D  G NV    +YK R+V      +  V Y      +G    +RI+       D + W
Sbjct: 552 KTDLDG-NVH---IYKARLVAKGFRQIQSVDYDGTFSSVGMLKSIRIILAIPAYFDYEIW 611

Query: 734 ---LATQFQMKDLGEAQFVLGIE------------------IIRNRKN------------ 793
              + T F  ++L E + + G++                   ++N +             
Sbjct: 612 QMDVKTAFLNENLDEDKSIYGLKQASRSWNIRFDEVVKALGFVKNEEEPCVYKKISGSAL 671

Query: 794 ------------------KTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSP 853
                             + L  SQ++YIDK+L R+ MQDSKKG LP  HG++L K Q P
Sbjct: 672 DLREAAYILSIRIYRDRSRRLIGSQSTYIDKVLKRFNMQDSKKGFLPLSHGINLGKNQCP 731

Query: 854 KTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKY 913
           +T  E   M  IPYASA+GS+MYAMLCTRPD+ YA+   S+YQS+PG +HW AVKNILKY
Sbjct: 732 QTTDERNKMSVIPYASAIGSIMYAMLCTRPDVPYALSATSQYQSDPGESHWIAVKNILKY 791

Query: 914 LRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIA 968
           LRRT+D  L+Y G ++L++  YTD+ FQTD D  +S SG VF LNGGA+ W+S KQ  +A
Sbjct: 792 LRRTKDMFLIYGGQEELVVNNYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTVA 851

BLAST of CmoCh04G023510 vs. NCBI nr
Match: gi|1019597807|gb|AMY96445.1| (gag/pol protein [Momordica dioica])

HSP 1 Score: 509.2 bits (1310), Expect = 1.6e-140
Identity = 246/316 (77.85%), Postives = 278/316 (87.97%), Query Frame = 1

Query: 631  KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVL 690
            +NVDE CVYK+I  S VAFL+LYVDDILLIGNDV  L D+K WL T F MKDLGEAQ++L
Sbjct: 995  QNVDESCVYKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYIL 1054

Query: 691  GIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED 750
            GI I R+R NKT+ +SQ++YIDK+L R+KMQDSKKGLLPFRHG+HLSKEQ PKTPQEVED
Sbjct: 1055 GIRIYRDRSNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVED 1114

Query: 751  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM 810
            M++IPY+SA+GSLMYAMLCTRPD+CYA+ IVSRYQSNPGR HWTAVKNILKYLRRTR+  
Sbjct: 1115 MRNIPYSSAIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMF 1174

Query: 811  LMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEY 870
            L+Y G KDL + GYTDS FQTD D  KS SG VFTLNGGA+ WRS KQ C+ADST EAEY
Sbjct: 1175 LVYGGDKDLAVKGYTDSSFQTDKDDSKSQSG-VFTLNGGAVSWRSSKQTCVADSTCEAEY 1234

Query: 871  VVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYH 930
            V ACEAAKE+VW+RKFLTDL VVPNMHLP+TLYCDNSGAVAN+KEPRSHKRGKHIERKYH
Sbjct: 1235 VAACEAAKEAVWIRKFLTDLGVVPNMHLPITLYCDNSGAVANAKEPRSHKRGKHIERKYH 1294

Query: 931  LIREIVQRGDVIVTQI 946
            LIREIV+RGDV+V Q+
Sbjct: 1295 LIREIVERGDVVVCQM 1309

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC7.0e-8044.57Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME9.8e-5035.36Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YCH4_YEAST1.9e-2130.28Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
M810_ARATH6.2e-2032.48Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YP14B_YEAST7.1e-1629.65Transposon Ty1-PR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
E2GK51_BRYDI6.2e-16086.94Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1[more]
A5AUE7_VITVI2.7e-14773.41Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1[more]
D3IVU0_9POAL2.5e-14546.22Putative retrotransposon protein OS=Phyllostachys edulis PE=4 SV=1[more]
H2KWP6_ORYSJ6.2e-14439.83Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
A0A165U314_9ROSI1.1e-14077.85Gag/pol protein OS=Momordica dioica PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.14.4e-3229.15 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.13.5e-2132.48ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|299474487|gb|ADJ18449.1|8.9e-16086.94gag/pol protein [Bryonia dioica][more]
gi|147768021|emb|CAN69397.1|3.9e-14773.41hypothetical protein VITISV_021035 [Vitis vinifera][more]
gi|284434733|gb|ADB85430.1|3.6e-14546.22putative retrotransposon protein [Phyllostachys edulis][more]
gi|108862621|gb|ABG22008.1|8.9e-14439.83retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
gi|1019597807|gb|AMY96445.1|1.6e-14077.85gag/pol protein [Momordica dioica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR001878Znf_CCHC
IPR012337RNaseH-like_sf
IPR013103RVT_2
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO:0008270zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G023510.1CmoCh04G023510.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 293..408
score: 5.8
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 289..454
score: 21
IPR001878Zinc finger, CCHC-typeGENE3DG3DSA:4.10.60.10coord: 253..275
score: 6.
IPR001878Zinc finger, CCHC-typePFAMPF00098zf-CCHCcoord: 257..274
score: 5.
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 258..274
score: 0.
IPR001878Zinc finger, CCHC-typePROFILEPS50158ZF_CCHCcoord: 258..274
score: 9
IPR001878Zinc finger, CCHC-typeunknownSSF57756Retrovirus zinc finger-like domainscoord: 247..277
score: 2.9
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 286..447
score: 6.0
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 289..448
score: 4.86
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 633..729
score: 2.4E-21coord: 601..632
score: 3.
NoneNo IPR availableunknownCoilCoilcoord: 335..355
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 12..899
score: 1.2E
NoneNo IPR availablePANTHERPTHR11439:SF192SUBFAMILY NOT NAMEDcoord: 12..899
score: 1.2E
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 62..193
score: 3.3
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 740..929
score: 8.45E-6coord: 648..710
score: 8.4

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None