CmoCh04G021790 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G021790
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Retrotransposon protein, putative, Ty3-gypsy subclass
Location: Cmo_Chr04 : 15932703 .. 15934254 (+)
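
The location value can be split into its parts to get the span of the gene. The following is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this record does not state explicitly.

```python
import re

# Pattern for a value such as "Cmo_Chr04 : 15932703 .. 15934254 (+)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Return sequence id, start, end, strand and span of a location value.

    Assumption: start and end are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr04 : 15932703 .. 15934254 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 1552 bp on the forward strand of Cmo_Chr04.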
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAAGAGCTTCCGGTTTCAGGTTACGAAGCCGGGAAGTGTTTCTAAGACAATAATTGCTCTAAAGGCTAGAAAGTTGATTAGGCATGATGTTGTGGCGTTCCTGGCTAGTGTGGCAAAAGTGAGACGAGTTGATAAAGATGTATCAGTTGTACCTGTAGTGAATGAGTTCCTGGATGTCTTCCTGGATGAGATGCCTGGTTTGCCACCAGAGCGGGAAGTCAACCTTGACATTGAACTTGAACCGGGGATGACACTTAACTCTAAAGCTCCTTATAGGATGGCCCCCACGGAATTGAAAGAGTTGAAATTGCAACTACAAGAATTGTTGAACCAGGGGTTCATACGACCTAGTGTGTCACCGTGGGGAGCTCGTGTTGTTTGTGAAGAAGAAGGACGGTACACTCCGTTTCTGTATTGATTATAGGAAGCTGAATACGGTGACCATTAAAAATAAGTATCCCTTGCCGCGAATTGATGACTTGTTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGATCTTTGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAGTTTGTTGTGATGTCCTTTGGCTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAGGTTTTGTTGGTTCTGCGTAAACAAAGATTATATGCCAAGTTCTCAAAATGCGAGTTTTGACTTCAAAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAATTATAGGTTGGGTTCGACCAACTACAGTTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAAGGGAAGAAATTTGATTGGAGTCGAGCGTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTAACGGTACTGGAAACCTAGTAATTTATAGTGATGCCTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCCAACTCATGATTTAGAATTAGCTGCTGTGGGGTTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTACATACTGATCATAAGAGTCTTAAATATCTGTTCACCCAGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGATGCCTTGAGTCGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAATGA

mRNA sequence

ATGGGAAAGAGCTTCCGGTTTCAGGTTACGAAGCCGGGAAGTGTTTCTAAGACAATAATTGCTCTAAAGGCTAGAAAGTTGATTAGGCATGATGTTGTGGCGTTCCTGGCTAGTGTGGCAAAAGTGAGACGAGTTGATAAAGATGTATCAGTTGTACCTGTAGTGAATGAGTTCCTGGATGTCTTCCTGGATGAGATGCCTGGTTTGCCACCAGAGCGGGAAGTCAACCTTGACATTGAACTTGAACCGGGGATGACACTTAACTCTAAAGCTCCTTATAGGATGGCCCCCACGGAATTGAAAGAGTTGAAATTGCAACTACAAGAATTGTTGAACCAGGGGAAGCTGAATACGGTGACCATTAAAAATAAGTATCCCTTGCCGCGAATTGATGACTTGTTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGATCTTTGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAGTTTGTTGTGATGTCCTTTGGCTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAATTATAGGTTGGGTTCGACCAACTACAGTTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAAGGGAAGAAATTTGATTGGAGTCGAGCGTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTAACGGTACTGGAAACCTAGTAATTTATAGTGATGCCTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCCAACTCATGATTTAGAATTAGCTGCTGTGGGGTTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTACATACTGATCATAAGAGTCTTAAATATCTGTTCACCCAGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGATGCCTTGAGTCGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAATGA

Coding sequence (CDS)

ATGGGAAAGAGCTTCCGGTTTCAGGTTACGAAGCCGGGAAGTGTTTCTAAGACAATAATTGCTCTAAAGGCTAGAAAGTTGATTAGGCATGATGTTGTGGCGTTCCTGGCTAGTGTGGCAAAAGTGAGACGAGTTGATAAAGATGTATCAGTTGTACCTGTAGTGAATGAGTTCCTGGATGTCTTCCTGGATGAGATGCCTGGTTTGCCACCAGAGCGGGAAGTCAACCTTGACATTGAACTTGAACCGGGGATGACACTTAACTCTAAAGCTCCTTATAGGATGGCCCCCACGGAATTGAAAGAGTTGAAATTGCAACTACAAGAATTGTTGAACCAGGGGAAGCTGAATACGGTGACCATTAAAAATAAGTATCCCTTGCCGCGAATTGATGACTTGTTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGATCTTTGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAGTTTGTTGTGATGTCCTTTGGCTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAATTATAGGTTGGGTTCGACCAACTACAGTTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAAGGGAAGAAATTTGATTGGAGTCGAGCGTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTAACGGTACTGGAAACCTAGTAATTTATAGTGATGCCTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCCAACTCATGATTTAGAATTAGCTGCTGTGGGGTTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTACATACTGATCATAAGAGTCTTAAATATCTGTTCACCCAGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGATGCCTTGAGTCGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAATGA

BLAST of CmoCh04G021790 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 2.5e-72
Identity = 148/404 (36.63%), Positives = 236/404 (58.42%), Query Frame = 1

Query: 72  EREVNLDIE--LEPGMTLNSKAPYR----MAPTELKEL-KLQLQELLNQGKLNTVTIKNK 131
           E+EV   I+  L  G+   S +PY     + P +     K + + +++  KLN +T+ ++
Sbjct: 220 EQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDR 279

Query: 132 YPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLT 191
           +P+P +D++  +L     F+ IDL  G+HQI +  + V KTAF T++GHYE++ M FGL 
Sbjct: 280 HPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLK 339

Query: 192 NAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL----------------- 251
           NAPA F   MN + +  L+   +V++DDI+V+S + DEH + L                 
Sbjct: 340 NAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLD 399

Query: 252 ------RKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDF 311
                 ++  FLGHV++ DGI  +P K+EAI  +  PT   E+++FLGL GYYR+FI +F
Sbjct: 400 KCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNF 459

Query: 312 ARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHG 371
           A IA P+T+  +K  K D +    +S+F++LK  ++  P+L VP+ T    + +DAS   
Sbjct: 460 ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 519

Query: 372 LGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKS 431
           LG VL Q+G  ++Y SR L ++E +Y T + EL A+ +A K +RHYL G   ++ +DH+ 
Sbjct: 520 LGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQP 579

Query: 432 LKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR 445
           L +L+  K+   +  RW   + ++D +I +  GK N VADALSR
Sbjct: 580 LSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSR 623
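
The percentages on each HSP summary line follow directly from the raw counts: matches divided by alignment length. The sketch below recomputes them for the POL3_DROME hit above. It is an illustration only; the function name `hsp_percentages` and the regular expression are assumptions, not part of BLAST or of this database.

```python
import re

# Matches "Identity = a/b (...), Positives = c/d (...)" on an HSP summary line.
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 148/404 (36.63%), Positives = 236/404 (58.42%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 148 identical residues over an alignment length of 404 gives 36.63% identity, and 236 conservative-or-identical positions gives 58.42% positives, which agrees with the reported values.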

BLAST of CmoCh04G021790 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.2e-69
Identity = 147/394 (37.31%), Positives = 225/394 (57.11%), Query Frame = 1

Query: 81  LEPGMTLNSKAPYRMAPTELKELKL------QLQELLNQGKLNTVTIKNKYPLPRIDDLF 140
           L  G+   S +PY  +PT +   K       + + +++  KLN +TI ++YP+P +D++ 
Sbjct: 230 LNQGLIRESNSPYN-SPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEIL 289

Query: 141 DQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELM 200
            +L     F+ IDL  G+HQI + E+ + KTAF T+ GHYE++ M FGL NAPA F   M
Sbjct: 290 GKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCM 349

Query: 201 NRVFQDFLDSFVIVFIDDILVYSKTNDEH----------------------AEHLRKVV- 260
           N + +  L+   +V++DDI+++S +  EH                       E L+K   
Sbjct: 350 NNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEAN 409

Query: 261 FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQL 320
           FLGH+V+ DGI  +P KV+AI+ +  PT   E+R+FLGL GYYR+FI ++A IA P+T  
Sbjct: 410 FLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSC 469

Query: 321 TRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR 380
            +K  K D  +     +F++LK  +   P+L +P+     V+ +DAS   LG VL QNG 
Sbjct: 470 LKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGH 529

Query: 381 VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKEL 440
            I++ SR L D+E +Y   + EL A+ +A K +RHYL G +  + +DH+ L++L   KE 
Sbjct: 530 PISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEP 589

Query: 441 IMRQRRWLELVKDYDVEILHHPGKANVVADALSR 445
             +  RW   + +Y  +I +  GK N VADALSR
Sbjct: 590 GAKLERWRVRLSEYQFKIDYIKGKENSVADALSR 622

BLAST of CmoCh04G021790 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 1.9e-67
Identity = 154/417 (36.93%), Positives = 231/417 (55.40%), Query Frame = 1

Query: 72  EREVNLDIELEPGMTLNSKAPYR----MAPTELKEL-KLQLQELLNQGKLNTVTIKNKYP 131
           E E  +D  L+ G+   S +PY     + P + K   + Q + +++  +LNTVTI + YP
Sbjct: 138 EVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYP 197

Query: 132 LPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNA 191
           +P I+     L  A  F+ +DL SG+HQI +KE D+PKTAF T  G YEF+ + FGL NA
Sbjct: 198 IPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNA 257

Query: 192 PAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV--------------- 251
           PA+F  +++ + ++ +     V+IDDI+V+S+  D H ++LR V+               
Sbjct: 258 PAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKS 317

Query: 252 --------FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFAR 311
                   FLG++V+ DGI  DP KV AI     PT+V E++ FLG+  YYR+FI+D+A+
Sbjct: 318 HFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAK 377

Query: 312 IAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVI 371
           +A PLT LTR                        SF +LK  L S+ +L  P  T    +
Sbjct: 378 VAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHL 437

Query: 372 YSDASKHGLGCVLMQN----GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLY 431
            +DAS   +G VL Q+     R IAY SR L   E +Y T + E+ A+ ++L   R YLY
Sbjct: 438 TTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLY 497

Query: 432 GE-RIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR 445
           G   I+V+TDH+ L +    +    + +RW   +++Y+ E+++ PGK+NVVADALSR
Sbjct: 498 GAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554

BLAST of CmoCh04G021790 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.1e-59
Identity = 159/492 (32.32%), Positives = 249/492 (50.61%), Query Frame = 1

Query: 9   VTKPGSVSK----TIIALKARKLIRHDVVAFLASV-AKVRRVDKDV---SVVPVVNEFLD 68
           +  P SV K    TII  K      ++ + F  +V A +R VD +       P +    D
Sbjct: 136 IVVPDSVKKEFKDTIIRRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMGVSD 195

Query: 69  VFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKELK-------LQLQELLNQ 128
              +E+  L           L+ G+   S++PY  +PT + + K          + +++ 
Sbjct: 196 FVNNEVKQL-----------LKDGIIRPSRSPYN-SPTWVVDKKGTDAFGNPNKRLVIDF 255

Query: 129 GKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGH 188
            KLN  TI ++YP+P I  +   L  A  F+ +DL SGYHQI + E D  KT+F    G 
Sbjct: 256 RKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGK 315

Query: 189 YEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL------ 248
           YEF  + FGL NA ++F   ++ V ++ +     V++DD++++S+   +H  H+      
Sbjct: 316 YEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKC 375

Query: 249 -----------------RKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGL 308
                              V +LG +VSKDG   DP KV+AI  +  P  V +VRSFLGL
Sbjct: 376 LIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGL 435

Query: 309 AGYYRRFIKDFARIAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAP 368
           A YYR FIKDFA IA P+T + +           K    +++    ++FQ L+  LAS  
Sbjct: 436 ASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASED 495

Query: 369 VLI-VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGF 428
           V++  P+      + +DAS  G+G VL Q GR I   SR LK  E++Y T++ EL A+ +
Sbjct: 496 VILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVW 555

Query: 429 ALKVWRHYLYGER-IQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANV 450
           AL   +++LYG R I + TDH+ L +    +    + +RW   +  ++ ++ + PGK N 
Sbjct: 556 ALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENF 615

BLAST of CmoCh04G021790 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 226.1 bits (575), Expect = 7.9e-58
Identity = 146/450 (32.44%), Positives = 226/450 (50.22%), Query Frame = 1

Query: 57   EFLDVFLDEMPGLPPERE---VNLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQ 116
            ++ ++  +++P  P +     V  DIE++PG  L    PY +     +E+   +Q+LL+ 
Sbjct: 563  KYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDN 622

Query: 117  G------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVF 176
                                            LN  TI + +PLPRID+L  ++  A +F
Sbjct: 623  KFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIF 682

Query: 177  SKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLD 236
            + +DL SGYHQI ++  D  KTAF T  G YE+ VM FGL NAP+ F   M   F+D   
Sbjct: 683  TTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL-- 742

Query: 237  SFVIVFIDDILVYSKTNDEHAEHLRKVV-----------------------FLGHVVSKD 296
             FV V++DDIL++S++ +EH +HL  V+                       FLG+ +   
Sbjct: 743  RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQ 802

Query: 297  GITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDW 356
             I     K  AI  +  P TV + + FLG+  YYRRFI + ++IA P+        K  W
Sbjct: 803  KIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICD--KSQW 862

Query: 357  SRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR------VIAY 416
            +   + +  +LK+ L ++PVL+  N   N  + +DASK G+G VL +         V+ Y
Sbjct: 863  TEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGY 922

Query: 417  ASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQ 445
             S+ L+  +++YP  +LEL  +  AL  +R+ L+G+   + TDH SL  L  + E   R 
Sbjct: 923  FSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRV 982

BLAST of CmoCh04G021790 vs. TrEMBL
Match: M5WLY8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021229mg PE=4 SV=1)

HSP 1 Score: 579.7 bits (1493), Expect = 3.1e-162
Identity = 295/488 (60.45%), Positives = 357/488 (73.16%), Query Frame = 1

Query: 19  IIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLD 78
           I A+ A++L+R     ++A V   R     +  +PV+ +F DVF +++PGLPP RE+   
Sbjct: 197 ISAMTAKRLLRKGCSGYIAHVIDTRDNGLRLEDIPVIQDFPDVFPEDLPGLPPHREIEFV 256

Query: 79  IELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG------------------------ 138
           IEL PG    S+APYRMAP EL+ELK QLQEL+++G                        
Sbjct: 257 IELAPGTNPISQAPYRMAPAELRELKTQLQELVDKGFIRPSFSPWGAPVLFVKKKDGTMR 316

Query: 139 ------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFR 198
                 +LN +T++N+YPLPRIDDLFDQL+GA VFSKIDL SGYHQ+RV+E+D+PKTAFR
Sbjct: 317 LCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDMPKTAFR 376

Query: 199 TRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLR 258
           TRYGHYEF+VM FGLTNAPA FM+LMNRVF+ +LD FVIVFIDDILVYSK+   H +HL 
Sbjct: 377 TRYGHYEFLVMPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFIDDILVYSKSQKAHMKHLN 436

Query: 259 -----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVR 318
                                  +V FLGHV+S +GI VDP K+EA++ W+RPT+VTE+R
Sbjct: 437 LVLRTLRRRQLYAKFSKCQFWLDRVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTEIR 496

Query: 319 SFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPN 378
           SFLGLAGYYRRF++ F+ IAAPLT LTRKG KF WS  CE SF ELK RL +APVL +P+
Sbjct: 497 SFLGLAGYYRRFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELKTRLTTAPVLALPD 556

Query: 379 GTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRH 438
            +GN VIYSDAS+ GLGCVLMQ+GRVIAYASRQLK +E +YP HDLELAAV FALK+WRH
Sbjct: 557 DSGNFVIYSDASQQGLGCVLMQHGRVIAYASRQLKKHELNYPVHDLELAAVVFALKIWRH 616

Query: 439 YLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRK 454
           YLYGE  Q+ TDHKSLKYLFTQKEL +RQRRWLEL+KDYD  I HHPG+ANVVADALSRK
Sbjct: 617 YLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLELIKDYDCTIEHHPGRANVVADALSRK 676

BLAST of CmoCh04G021790 vs. TrEMBL
Match: Q84KB0_CUCME (Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.2e-161
Identity = 286/423 (67.61%), Positives = 336/423 (79.43%), Query Frame = 1

Query: 95  MAPTELKELKLQLQELLNQG------------------------------KLNTVTIKNK 154
           MAP ELKELK+QLQELL++G                              +LN VT+KN+
Sbjct: 1   MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR 60

Query: 155 YPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLT 214
           YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+K++DVPKTAFR+RYGHY+F+VMSFGLT
Sbjct: 61  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYQFIVMSFGLT 120

Query: 215 NAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV------------- 274
           NAPAVFM+LMNRVF++FLD+FVIVFIDDIL+YSKT  EH EHLR V+             
Sbjct: 121 NAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFS 180

Query: 275 ----------FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDF 334
                     FLGHVVSK G++VDPAK+EA+ GW RP+TV+EVRSFLGLAGYYRRF+++F
Sbjct: 181 KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENF 240

Query: 335 ARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGL 394
           +RIA PLTQLTRKG  F WS+ACE SFQ LK++L +APVL VP+G+GN VIYSDASK GL
Sbjct: 241 SRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGL 300

Query: 395 GCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSL 454
           GCVLMQ G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ+ TDHKSL
Sbjct: 301 GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSL 360

Query: 455 KYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQM 465
           KY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK +HS+A++TRQ  +  
Sbjct: 361 KYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHR 420

BLAST of CmoCh04G021790 vs. TrEMBL
Match: A0A061E6T4_THECC (DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_009549 PE=4 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 2.0e-161
Identity = 300/491 (61.10%), Positives = 356/491 (72.51%), Query Frame = 1

Query: 16  SKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREV 75
           S  I A+KA KL++     +LA V    + +  +  V +V+EF DVF D++PGLPP+RE+
Sbjct: 506 SCVISAIKASKLVQKGYSTYLAYVIDTSKGEPKLEDVSIVSEFPDVFPDDLPGLPPDREL 565

Query: 76  NLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG--------------------- 135
              I+L PG    S  PYRMAPTELKELK+QLQEL+++G                     
Sbjct: 566 EFPIDLLPGTAPISIPPYRMAPTELKELKVQLQELVDKGFIRPSISPWGAPILFVKKKDG 625

Query: 136 ---------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKT 195
                    +LN +TIKNKYPLPRIDDLFDQLQGA VFSK+DL SGYHQ+R+KE DVPKT
Sbjct: 626 TLRLCIDCRQLNRMTIKNKYPLPRIDDLFDQLQGATVFSKVDLRSGYHQLRIKEQDVPKT 685

Query: 196 AFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAE 255
           AFRTRYGHYEF+VM FGLTNAPA FM+LMNRVF  +LD FVIVFIDDILVYS+ NDEHA 
Sbjct: 686 AFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFHPYLDKFVIVFIDDILVYSRDNDEHAA 745

Query: 256 HLR-----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVT 315
           HLR                       +VVFLGH+VS+ GI VDP KVEAI+ W +P TVT
Sbjct: 746 HLRIVLQTLRERQLYAKFSKCEFWLQEVVFLGHIVSRTGIYVDPKKVEAILQWEQPKTVT 805

Query: 316 EVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLI 375
           E+RSFLGLAGYYRRF++ F+ +AAPLT+LTRKG KF W   CE+ FQELK RL SAPVL 
Sbjct: 806 EIRSFLGLAGYYRRFVQGFSLVAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTSAPVLT 865

Query: 376 VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKV 435
           +P      ++YSDASK GLGCVLMQ+ +V+AYASRQLK +E +YPTHDLELAAV FALK+
Sbjct: 866 LPVNGKGFIVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKI 925

Query: 436 WRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADAL 454
           WRHYLYGE  ++ TDHKSLKYL TQKEL +RQRRWLEL+KDYD+ I +H GKANVVADAL
Sbjct: 926 WRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHLGKANVVADAL 985

BLAST of CmoCh04G021790 vs. TrEMBL
Match: A0A061DW51_THECC (DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_003698 PE=4 SV=1)

HSP 1 Score: 568.9 bits (1465), Expect = 5.5e-159
Identity = 297/491 (60.49%), Positives = 353/491 (71.89%), Query Frame = 1

Query: 16  SKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREV 75
           S  I A+KA KL++     +LA V    + +  +  VP+V+EF DVF D++PGLPP+RE+
Sbjct: 469 SCVISAIKASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGLPPDREL 528

Query: 76  NLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG--------------------- 135
              I+L PG    S  PYRMAP ELKELK+QLQEL+++G                     
Sbjct: 529 EFPIDLLPGTAPISIPPYRMAPAELKELKVQLQELVDKGFIRPSISPWGAPILFVKKKDG 588

Query: 136 ---------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKT 195
                    +LN +TIKNKYPLPRIDD+FDQLQGA VFSK++L SGYHQ+R+KE DV KT
Sbjct: 589 TLRLCIDYRQLNRMTIKNKYPLPRIDDIFDQLQGATVFSKVNLRSGYHQLRIKEQDVLKT 648

Query: 196 AFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAE 255
            FRTRYGHYEF+VM FGLTNAPA FM+LM+RVF  +LD FVIVFIDDILVY + NDEHA 
Sbjct: 649 EFRTRYGHYEFLVMPFGLTNAPATFMDLMSRVFHPYLDKFVIVFIDDILVYLRDNDEHAA 708

Query: 256 HLR-----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVT 315
           HLR                       +VVFLGHVVS+ GI VDP KVEAI+ W +P TVT
Sbjct: 709 HLRIVLQTLRERQLYAKFSKCEFWLQEVVFLGHVVSRTGIYVDPKKVEAILQWEQPKTVT 768

Query: 316 EVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLI 375
           E+RSFLGLAGYYRRF++ F+ IAAPLT+LTRKG KF W   CE+ FQELK RL  APVL 
Sbjct: 769 EIRSFLGLAGYYRRFVQGFSLIAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTFAPVLT 828

Query: 376 VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKV 435
           +P      V+YSDASK GLGCVLMQ+ +V+AYASRQLK +E +YPTHDLELAAV FALK+
Sbjct: 829 LPVNGKGFVVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKI 888

Query: 436 WRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADAL 454
           WRHYLYGE  Q+ TDHKSLKYL TQKE+ +RQRRWLEL+KDYD+ I +HPGKANVVADAL
Sbjct: 889 WRHYLYGEHCQIFTDHKSLKYLLTQKEINLRQRRWLELIKDYDLVIDYHPGKANVVADAL 948

BLAST of CmoCh04G021790 vs. TrEMBL
Match: A2I5E5_BETVU (Retrotransposon protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.2e-158
Identity = 287/489 (58.69%), Positives = 351/489 (71.78%), Query Frame = 1

Query: 19  IIALKARKLIRHDVVAFLASVAKV-RRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNL 78
           I AL+ +KL+R     F  SV  V +  +  +  V +VNEF+DVF  E+ G+PP R V  
Sbjct: 491 ISALQVQKLMRKGCELFFCSVQDVSKEAELKLEDVSIVNEFMDVFPSEISGMPPARAVEF 550

Query: 79  DIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG----------------------- 138
            I+L PG    SKAPYRMAP E+ ELK QLQELL++G                       
Sbjct: 551 TIDLVPGTAPISKAPYRMAPPEMSELKTQLQELLDKGYIRPSASPWGAPVLFVKKKDGSM 610

Query: 139 -------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAF 198
                  +LN VTIKNKYPLPRIDDLFDQL GA+VFSKIDL SGYHQ+RV + DVPKTAF
Sbjct: 611 RLCIDYRELNNVTIKNKYPLPRIDDLFDQLNGASVFSKIDLRSGYHQLRVADKDVPKTAF 670

Query: 199 RTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL 258
           RTRYGHYEF VM FGLTNAPA+FM+LMNR+F +FLD FV+VFIDDIL+YS+   EH EHL
Sbjct: 671 RTRYGHYEFTVMPFGLTNAPAIFMDLMNRIFHEFLDKFVVVFIDDILIYSRNETEHDEHL 730

Query: 259 R-----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEV 318
           R                       KV FLGH VSK+G++VDPAK++A+  W  P +VT++
Sbjct: 731 RIILETLRKNQLYAKFSKCEFRLEKVAFLGHFVSKEGVSVDPAKIQAVSEWPTPKSVTDI 790

Query: 319 RSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVP 378
           RSFLGLAGYYRRF++DF++IA P+T L +K  KF+W+  CE +FQ LK+RL +APVL +P
Sbjct: 791 RSFLGLAGYYRRFVRDFSKIARPMTNLMKKETKFEWNEKCEEAFQILKDRLTTAPVLTLP 850

Query: 379 NGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWR 438
           +G     +YSDASK+GLGCVL QNG+VIAYAS QLK YE +YPTHDLELAA+ FALK+WR
Sbjct: 851 DGNEGFEVYSDASKNGLGCVLQQNGKVIAYASCQLKPYEANYPTHDLELAAIVFALKIWR 910

Query: 439 HYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR 454
           HYLYG   ++ TDHKSLKY+FTQK+L MRQRRWLEL+KDYD++I +H GKANVVADALSR
Sbjct: 911 HYLYGATCKIFTDHKSLKYIFTQKDLNMRQRRWLELIKDYDLDIQYHEGKANVVADALSR 970

BLAST of CmoCh04G021790 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 96.7 bits (239), Expect = 4.1e-20
Identity = 42/99 (42.42%), Positives = 67/99 (67.68%), Query Frame = 1

Query: 229 KVVFLGH--VVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAA 288
           ++ +LGH  ++S +G++ DPAK+EA++GW  P   TE+R FLGL GYYRRF+K++ +I  
Sbjct: 29  QIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVR 88

Query: 289 PLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPN 326
           PLT+L +K     W+     +F+ LK  + + PVL +P+
Sbjct: 89  PLTELLKK-NSLKWTEMAALAFKALKGAVTTLPVLALPD 126

BLAST of CmoCh04G021790 vs. NCBI nr
Match: gi|595885005|ref|XP_007213082.1| (hypothetical protein PRUPE_ppa021229mg [Prunus persica])

HSP 1 Score: 579.7 bits (1493), Expect = 4.5e-162
Identity = 295/488 (60.45%), Positives = 357/488 (73.16%), Query Frame = 1

Query: 19  IIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLD 78
           I A+ A++L+R     ++A V   R     +  +PV+ +F DVF +++PGLPP RE+   
Sbjct: 197 ISAMTAKRLLRKGCSGYIAHVIDTRDNGLRLEDIPVIQDFPDVFPEDLPGLPPHREIEFV 256

Query: 79  IELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG------------------------ 138
           IEL PG    S+APYRMAP EL+ELK QLQEL+++G                        
Sbjct: 257 IELAPGTNPISQAPYRMAPAELRELKTQLQELVDKGFIRPSFSPWGAPVLFVKKKDGTMR 316

Query: 139 ------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFR 198
                 +LN +T++N+YPLPRIDDLFDQL+GA VFSKIDL SGYHQ+RV+E+D+PKTAFR
Sbjct: 317 LCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDMPKTAFR 376

Query: 199 TRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLR 258
           TRYGHYEF+VM FGLTNAPA FM+LMNRVF+ +LD FVIVFIDDILVYSK+   H +HL 
Sbjct: 377 TRYGHYEFLVMPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFIDDILVYSKSQKAHMKHLN 436

Query: 259 -----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVR 318
                                  +V FLGHV+S +GI VDP K+EA++ W+RPT+VTE+R
Sbjct: 437 LVLRTLRRRQLYAKFSKCQFWLDRVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTEIR 496

Query: 319 SFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPN 378
           SFLGLAGYYRRF++ F+ IAAPLT LTRKG KF WS  CE SF ELK RL +APVL +P+
Sbjct: 497 SFLGLAGYYRRFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELKTRLTTAPVLALPD 556

Query: 379 GTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRH 438
            +GN VIYSDAS+ GLGCVLMQ+GRVIAYASRQLK +E +YP HDLELAAV FALK+WRH
Sbjct: 557 DSGNFVIYSDASQQGLGCVLMQHGRVIAYASRQLKKHELNYPVHDLELAAVVFALKIWRH 616

Query: 439 YLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRK 454
           YLYGE  Q+ TDHKSLKYLFTQKEL +RQRRWLEL+KDYD  I HHPG+ANVVADALSRK
Sbjct: 617 YLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLELIKDYDCTIEHHPGRANVVADALSRK 676

BLAST of CmoCh04G021790 vs. NCBI nr
Match: gi|1021486231|ref|XP_016186119.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107627811 [Arachis ipaensis])

HSP 1 Score: 578.2 bits (1489), Expect = 1.3e-161
Identity = 296/494 (59.92%), Positives = 360/494 (72.87%), Query Frame = 1

Query: 14   SVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPER 73
            +++  I ++ A +L+      FLA V  V      +  VP+V EF DVF DE+ G+PP+R
Sbjct: 623  TLASIISSMSAMQLMDKGNQGFLAVVRDVDAEVPSLDQVPIVREFPDVFPDELLGMPPDR 682

Query: 74   EVNLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG------------------- 133
            EV   IEL PG+   S  PYRMAPTEL+ELK+QL+++L +G                   
Sbjct: 683  EVEFSIELAPGVQPVSIPPYRMAPTELRELKVQLEDMLEKGFIRPSTSPWGAPVLFVKKK 742

Query: 134  -----------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVP 193
                       +LN +T++NKYPLPRIDDLFDQLQGA  FSKIDL SGYHQ+++KE+D+P
Sbjct: 743  DGTMRLCVDYRQLNKITVRNKYPLPRIDDLFDQLQGATCFSKIDLRSGYHQLKIKEEDIP 802

Query: 194  KTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH 253
            KTAFRTRYGHYEF+VMSFGLTNAPA FM+LMNRVF+ FLD FVIVFIDDILVYSK+  EH
Sbjct: 803  KTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDILVYSKSAAEH 862

Query: 254  AEHLR-----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTT 313
              HLR                       +V FLGHVVSKDGI VDP KVEA+  W RPTT
Sbjct: 863  EYHLRIVLQTLKDHKLYAKFSKCEFWLDQVTFLGHVVSKDGIMVDPKKVEAVQKWPRPTT 922

Query: 314  VTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPV 373
            VTE+RSFLGLAGYYRRFIKDF+RI+APLT+LT+K  KF WS ACE SFQ LK  L SAPV
Sbjct: 923  VTEIRSFLGLAGYYRRFIKDFSRISAPLTKLTQKNVKFQWSEACEESFQTLKACLTSAPV 982

Query: 374  LIVPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFAL 433
            L++P+G+G   ++ DAS+ GLGCVLM +GRVIAYASRQ K +E++YPTHDLE+AAV FAL
Sbjct: 983  LVLPSGSGGFSVFCDASRIGLGCVLMXHGRVIAYASRQPKKHEQNYPTHDLEMAAVVFAL 1042

Query: 434  KVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVAD 455
            K+WRHYLYGE  +++TDHKSLKY+F QK+L +RQRRW+EL+KDYD  IL+HPGKAN+VAD
Sbjct: 1043 KIWRHYLYGETCEIYTDHKSLKYIFQQKDLNLRQRRWMELLKDYDCTILYHPGKANIVAD 1102

BLAST of CmoCh04G021790 vs. NCBI nr
Match: gi|28558781|gb|AAO45752.1| (pol protein [Cucumis melo subsp. melo])

HSP 1 Score: 577.8 bits (1488), Expect = 1.7e-161
Identity = 286/423 (67.61%), Positives = 336/423 (79.43%), Query Frame = 1

Query: 95  MAPTELKELKLQLQELLNQG------------------------------KLNTVTIKNK 154
           MAP ELKELK+QLQELL++G                              +LN VT+KN+
Sbjct: 1   MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR 60

Query: 155 YPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLT 214
           YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+K++DVPKTAFR+RYGHY+F+VMSFGLT
Sbjct: 61  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYQFIVMSFGLT 120

Query: 215 NAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV------------- 274
           NAPAVFM+LMNRVF++FLD+FVIVFIDDIL+YSKT  EH EHLR V+             
Sbjct: 121 NAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFS 180

Query: 275 ----------FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDF 334
                     FLGHVVSK G++VDPAK+EA+ GW RP+TV+EVRSFLGLAGYYRRF+++F
Sbjct: 181 KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENF 240

Query: 335 ARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGL 394
           +RIA PLTQLTRKG  F WS+ACE SFQ LK++L +APVL VP+G+GN VIYSDASK GL
Sbjct: 241 SRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGL 300

Query: 395 GCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSL 454
           GCVLMQ G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ+ TDHKSL
Sbjct: 301 GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSL 360

Query: 455 KYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQM 465
           KY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK +HS+A++TRQ  +  
Sbjct: 361 KYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHR 420

BLAST of CmoCh04G021790 vs. NCBI nr
Match: gi|590693137|ref|XP_007044250.1| (DNA/RNA polymerases superfamily protein [Theobroma cacao])

HSP 1 Score: 577.0 bits (1486), Expect = 2.9e-161
Identity = 300/491 (61.10%), Positives = 356/491 (72.51%), Query Frame = 1

Query: 16  SKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREV 75
           S  I A+KA KL++     +LA V    + +  +  V +V+EF DVF D++PGLPP+RE+
Sbjct: 506 SCVISAIKASKLVQKGYSTYLAYVIDTSKGEPKLEDVSIVSEFPDVFPDDLPGLPPDREL 565

Query: 76  NLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG--------------------- 135
              I+L PG    S  PYRMAPTELKELK+QLQEL+++G                     
Sbjct: 566 EFPIDLLPGTAPISIPPYRMAPTELKELKVQLQELVDKGFIRPSISPWGAPILFVKKKDG 625

Query: 136 ---------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKT 195
                    +LN +TIKNKYPLPRIDDLFDQLQGA VFSK+DL SGYHQ+R+KE DVPKT
Sbjct: 626 TLRLCIDCRQLNRMTIKNKYPLPRIDDLFDQLQGATVFSKVDLRSGYHQLRIKEQDVPKT 685

Query: 196 AFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAE 255
           AFRTRYGHYEF+VM FGLTNAPA FM+LMNRVF  +LD FVIVFIDDILVYS+ NDEHA 
Sbjct: 686 AFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFHPYLDKFVIVFIDDILVYSRDNDEHAA 745

Query: 256 HLR-----------------------KVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVT 315
           HLR                       +VVFLGH+VS+ GI VDP KVEAI+ W +P TVT
Sbjct: 746 HLRIVLQTLRERQLYAKFSKCEFWLQEVVFLGHIVSRTGIYVDPKKVEAILQWEQPKTVT 805

Query: 316 EVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLI 375
           E+RSFLGLAGYYRRF++ F+ +AAPLT+LTRKG KF W   CE+ FQELK RL SAPVL 
Sbjct: 806 EIRSFLGLAGYYRRFVQGFSLVAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTSAPVLT 865

Query: 376 VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKV 435
           +P      ++YSDASK GLGCVLMQ+ +V+AYASRQLK +E +YPTHDLELAAV FALK+
Sbjct: 866 LPVNGKGFIVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKI 925

Query: 436 WRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADAL 454
           WRHYLYGE  ++ TDHKSLKYL TQKEL +RQRRWLEL+KDYD+ I +H GKANVVADAL
Sbjct: 926 WRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHLGKANVVADAL 985
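
The identity and positives percentages in an HSP summary line are simply the match counts divided by the alignment length. The following is a minimal sketch, assuming the summary line keeps the layout shown in this record; the helper names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool. The pattern tolerates both the "Postives" and "Positives" spellings.

```python
import re

# Pattern for the HSP summary line as printed in this record (assumed layout).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment length, reported percent) for both fields."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 300/491 (61.10%), Postives = 356/491 (72.51%), Query Frame = 1"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    # The recomputed value should agree with the reported one.
    print(name, matches, length, reported, percent(matches, length))
```

Recomputing the values this way is a quick consistency check when summary lines are copied between reports.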

BLAST of CmoCh04G021790 vs. NCBI nr
Match: gi|848889197|ref|XP_012844768.1| (PREDICTED: uncharacterized protein LOC105964809 [Erythranthe guttata])

HSP 1 Score: 575.5 bits (1482), Expect = 8.5e-161
Identity = 292/482 (60.58%), Positives = 356/482 (73.86%), Query Frame = 1

Query: 19  IIALKARKLIRHDVVAFLASVAKVRRVDK-DVSVVPVVNEFLDVFLDEMPGLPPEREVNL 78
           I A+KA K I+   +  L  + +    DK D+S VP V EF DVF +++PGLPP+RE+  
Sbjct: 436 ISAIKASKPIQKGHLCHLVDIVEALNEDKADISKVPTVCEFPDVFPEDLPGLPPDREIEF 495

Query: 79  DIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQG----------------------- 138
           +I+L PG    SK PYRMAP ELKELK Q+Q+L+++                        
Sbjct: 496 EIKLIPGSAPISKEPYRMAPLELKELKDQIQDLIDKKFIRPSFSPWGAPVLFVKKKDGTL 555

Query: 139 -------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAF 198
                  +LN VT+KNKYPLPRIDDLFDQLQG+ V+SKIDL SGYHQ+++K++ +PKTAF
Sbjct: 556 RMCIDYRELNKVTVKNKYPLPRIDDLFDQLQGSTVYSKIDLRSGYHQLKIKKEYIPKTAF 615

Query: 199 RTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL 258
           RTRYGHYEFVVMSFGLTNAPA F +LMNR+F+ +LD FVIVFIDDILVYS+ ++EH  HL
Sbjct: 616 RTRYGHYEFVVMSFGLTNAPAAFKDLMNRIFRKYLDQFVIVFIDDILVYSRNDEEHRNHL 675

Query: 259 -----------------------RKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEV 318
                                  R+V FLGHV+SK+GI VDP+K+EA+  W  P  V E+
Sbjct: 676 EMVLQTLRENQLYAKFSKCVFWLRQVHFLGHVISKEGIAVDPSKIEAVCNWSIPRNVGEI 735

Query: 319 RSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVP 378
           RSFLGLAGYYRRF+ +F++IAAPL++LTRK  K+ W+  CESSFQELK+RL SAP+L +P
Sbjct: 736 RSFLGLAGYYRRFVPNFSKIAAPLSRLTRKEVKYQWTPECESSFQELKKRLTSAPILALP 795

Query: 379 NGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWR 438
           + TGN V+YSDASK GLG VLMQNG VIAYASRQLKDYE++YPTHDLELAAV FALK+WR
Sbjct: 796 DKTGNYVVYSDASKRGLGAVLMQNGNVIAYASRQLKDYEKNYPTHDLELAAVVFALKIWR 855

Query: 439 HYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR 447
           HYLYGE+  + TDHKSLKY FTQKEL MRQRRWLELVKDYD +IL+HPG+ANVV DALSR
Sbjct: 856 HYLYGEKCSIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCKILYHPGQANVVPDALSR 915

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
POL3_DROME    2.5e-72   36.63         Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL2_DROME    1.2e-69   37.31         Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POL5_DROME    1.9e-67   36.93         Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
POLY_DROME    1.1e-59   32.32         Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
YG31B_YEAST   7.9e-58   32.44         Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name         E-value    Identity (%)  Description
M5WLY8_PRUPE       3.1e-162   60.45         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021229mg PE=4 SV=1
Q84KB0_CUCME       1.2e-161   67.61         Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A061E6T4_THECC   2.0e-161   61.10         DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_009549 PE=4 SV...
A0A061DW51_THECC   5.5e-159   60.49         DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_003698 PE=4 SV...
A2I5E5_BETVU       1.2e-158   58.69         Retrotransposon protein OS=Beta vulgaris PE=4 SV=1

Match Name    E-value   Identity (%)  Description
ATMG00860.1   4.1e-20   42.42         ATMG00860.1 DNA/RNA polymerases superfamily protein

Match Name                          E-value    Identity (%)  Description
gi|595885005|ref|XP_007213082.1|    4.5e-162   60.45         hypothetical protein PRUPE_ppa021229mg [Prunus persica]
gi|1021486231|ref|XP_016186119.1|   1.3e-161   59.92         PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107627811 [Arachis ip...
gi|28558781|gb|AAO45752.1|          1.7e-161   67.61         pol protein [Cucumis melo subsp. melo]
gi|590693137|ref|XP_007044250.1|    2.9e-161   61.10         DNA/RNA polymerases superfamily protein [Theobroma cacao]
gi|848889197|ref|XP_012844768.1|    8.5e-161   60.58         PREDICTED: uncharacterized protein LOC105964809 [Erythranthe guttata]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000477  RT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G021790.1  CmoCh04G021790.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 127..254; score: 4.3
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 1..272; score:
None       No IPR available              GENE3D   G3DSA:3.10.10.10   -                    coord: 115..190; score: 3.7
None       No IPR available              GENE3D   G3DSA:3.30.70.270  -                    coord: 191..249; score: 3.
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 94..462; score: 4.4E
None       No IPR available              PANTHER  PTHR24559:SF207    SUBFAMILY NOT NAMED  coord: 94..462; score: 4.4E
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 54..430; score: 2.45E
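
The `coord:` fields above give the span of each domain hit on the protein. A minimal sketch of how such spans might be compared is shown below. It assumes the coordinates are 1-based and inclusive, which is the usual convention for protein annotations but is not stated in this record; the helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

# Matches the "coord: START..END" field used in the annotation table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: START..END' field."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


pfam = parse_coord("coord: 127..254")    # PF00078 (RVT_1) hit
profile = parse_coord("coord: 1..272")   # PS50878 (RT_POL) hit

print(span_length(pfam))        # residues covered by the Pfam hit
print(span_length(profile))     # residues covered by the PROSITE profile hit
print(contains(profile, pfam))  # whether the Pfam hit lies inside the profile hit
```

Under that assumption, the Pfam hit falls entirely within the longer PROSITE profile hit, which is consistent with both describing the same reverse transcriptase domain (IPR000477).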

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmoCh04G021790  Wax gourd                     cmowgoB0824
CmoCh04G021790  Wax gourd                     cmowgoB0909
CmoCh04G021790  Cucurbita moschata (Rifu)     cmocmoB344
CmoCh04G021790  Cucurbita moschata (Rifu)     cmocmoB254
CmoCh04G021790  Cucurbita moschata (Rifu)     cmocmoB469
CmoCh04G021790  Cucumber (Gy14) v1            cgycmoB0777
CmoCh04G021790  Cucumber (Gy14) v1            cgycmoB0800
CmoCh04G021790  Cucurbita maxima (Rimu)       cmacmoB301
CmoCh04G021790  Cucurbita maxima (Rimu)       cmacmoB426
CmoCh04G021790  Cucurbita maxima (Rimu)       cmacmoB729
CmoCh04G021790  Cucurbita maxima (Rimu)       cmacmoB742
CmoCh04G021790  Wild cucumber (PI 183967)     cmocpiB733
CmoCh04G021790  Wild cucumber (PI 183967)     cmocpiB750
CmoCh04G021790  Cucumber (Chinese Long) v2    cmocuB724
CmoCh04G021790  Cucumber (Chinese Long) v2    cmocuB742
CmoCh04G021790  Melon (DHL92) v3.5.1          cmomeB627
CmoCh04G021790  Melon (DHL92) v3.5.1          cmomeB654
CmoCh04G021790  Watermelon (Charleston Gray)  cmowcgB636
CmoCh04G021790  Watermelon (Charleston Gray)  cmowcgB666
CmoCh04G021790  Watermelon (97103) v1         cmowmB705
CmoCh04G021790  Watermelon (97103) v1         cmowmB706
CmoCh04G021790  Cucurbita pepo (Zucchini)     cmocpeB650
CmoCh04G021790  Cucurbita pepo (Zucchini)     cmocpeB663
CmoCh04G021790  Cucurbita pepo (Zucchini)     cmocpeB673
CmoCh04G021790  Cucurbita pepo (Zucchini)     cmocpeB691
CmoCh04G021790  Bottle gourd (USVL1VR-Ls)     cmolsiB610
CmoCh04G021790  Bottle gourd (USVL1VR-Ls)     cmolsiB677
CmoCh04G021790  Cucumber (Gy14) v2            cgybcmoB637
CmoCh04G021790  Cucumber (Gy14) v2            cgybcmoB655
CmoCh04G021790  Melon (DHL92) v3.6.1          cmomedB716
CmoCh04G021790  Melon (DHL92) v3.6.1          cmomedB744
CmoCh04G021790  Silver-seed gourd             carcmoB1072
CmoCh04G021790  Silver-seed gourd             carcmoB1137
CmoCh04G021790  Silver-seed gourd             carcmoB1413
CmoCh04G021790  Cucumber (Chinese Long) v3    cmocucB0858
CmoCh04G021790  Cucumber (Chinese Long) v3    cmocucB0878
CmoCh04G021790  Watermelon (97103) v2         cmowmbB676
CmoCh04G021790  Watermelon (97103) v2         cmowmbB740