CmoCh06G004550.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh06G004550.1
Type: mRNA
Organism: Cucurbita moschata (Rifu)
Description: Copia protein
Location: Cmo_Chr06 : 2170667 .. 2175838 (+)
Sequence length: 1914
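
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of how it could be split apart and the genomic span computed, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom : start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr06 : 2170667 .. 2175838 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the genomic span is longer than the listed sequence length, as expected for a feature whose gene sequence includes introns.
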
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGAATCAGCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTAATCAACGTCAAATACATATTGATCAACGCGAAGAAGNNNNNNNNNNTTAAATTTTTATTTTTAATAAAAGTAAAGTGTTAAGTAAAGAATTATAATTCAATTTGAATATATATTTTTAAAAATTATCTTTAAATATTTTAACCCTCTAATTTATAAGAGTTGTTCTAGTTTAAAAAATTTACGGCTCGAACATTTTCACTAAATCTAAAAACTATCTCAATCTGACTCGTTTCAATCCACGAACACTTTTAATTCAAACTTCAATTCCACGTGCTATTACAAAAAACTAGGAATAATTTAATGAAAAAGGGTATTCCTAAAAAGGCTAATTTTTAAATATGAACAAATTCAACTATATTGGGCACTTTTATTCATTCCAGATTTAGAAATATATTTTTTTGAAGAACAAATTATAATATTTTAAATAAACATAATATTAAAAAAAATATATATTCCGTATATATATATATATATTCCCTATCTTTATAGGCTCAATTTTGGGGGCAAATAATTTACTTTGTTGACACGTCACAGTCGTACTCTCCCGAGCTAAGAAGCTGAGAGTGAATAATGGAATGTCTGCCGAAGAAACGTAACACCAACCCCTCACCTTCCCTTGTCTTATAATTCTTCAAATTTACGCTCAAAACCATAATAAAAAGATTGTAATATATATATATAATTTAAATTTTAAAAACATATTAAACATATTTTAGGTTTCGGCACCTAAAAACCGATTAGATACAACTTTTAACGTATAGTTACTTAACATTCAATAATTGATCTTTTCTTGAAAGAGAAAGAACAAAGCATGCTAGAATGACTCATGTTGATCCATAAGAACAGTCTTCATTCTTAAGAAGCGAGAAGTAGTTGTGACCATATCGTAGGAAAATGTCGAGAGTCATCAAGTACTAATCTTGATTCAAATTCTAGGCATGGGTCGTTACAAATAACGTCAGAGTTTGGAACATGTCATATTACGATAATATGGACCATATTGCAGGAGAATGTCAAGGGTCATCAGGTACTTTATTTAGATTTCAAATCTCGATGCGAATTCTGGGCATGGGTCGTTACAAATACATGGTCTATAGTTAGACTCGTATTAAGAGGATATTTTGCCGTGATAATTAGTTAGAATATATCTTAGATATTTAGGAAGCTGTTAGTATAGATTTGATTAACCGTATATTTTATTTATTTACCAGATTTAATTAGATATATATTTTTCTATTTTTAGGTATTAGTTGATAGTTTGTATCCTATTTAAACGTGTAAACCTGAATGAAGATCATACTTTCGATCCCAATTCTATTTCTATTTCTCATTCTTAACATGGTATCAGAGCACCGATCTTGGTGTTCTTAAATATCAAAATTTCCTTTATGGCGGAATCAGCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAAT
CAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGCTCATGGGGCTCAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGTAACTTTCGAACCTACTGAGAAAATCTCGATTGCATCAGCCGTGCAAAAGAAAACAATATATTCAAAATTCGCCAAGGACAAATTGTGTGAACACTGCAATAAAAGTGGTCATACAATTAATGAGTGTCGAATTCTTAAGTTTCACTGTAAGTTTTGTGATAGAAGGGGCCATACAGAAGATCGGTGTCGACAGAAAAATAATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCNNNNNNNNNNGAAAAATAATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCCGATGTTTCACAGTTGAATACAGAAGAACAGTCACCTAATTCCATTCCAAATGTTTCTTCTGAGCAATTACGACAGATAGCACAAGTCTTATCTGCAATCAATTATCACCCTTCTGGTAATTCTGACAATCACATCAATGTTGCAGGTTTGTTTCCCATATCTACATTATCTATTAACTCTGCGAGTTCTAATTCATGGATTCTCGATAGTGGAGCTACGGATCATATAGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCTTAAATTAAACAACGTTTTATGTGTGCCTTCATTCAATTTAAACCTAATGTCGATCAGCAAACTTACCAATAACTTGAAATGTTATGTCACCTTCTATCCTGATTCTTGTGTTATGCAGGACTTGACTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAGTCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGCCTAGGTCATCCTTCATTTTCTCGTTTTAAATTTCTAGCTGATCAATTGCATCTTAATAATGCGAGTTATTCTCATAATTGCAGTATCTGCCCGTTAGCAAAACAAACTAGGTTGTCTTTCCCAAGAAGTTCAATAACAACCCATTCTGCTTTTGATCTGATACATTGTGATGTTTGGGGACCATATAAAATTCCTACCCATTCTGGTTTGCGTTTTTTTCTCACTATTGTTGATGATTTTACTCGATGTACTTGGGTTTTTTTAATGCAACATAAGTCAGAAGTACATCATTTGTTGATGAACTTTGTTAAATTCGTTCAAACTCAATTTCATACTACTATCAAGATAGTTCGATCAGACAATGGGACTGAGTTCCTATCTTTGCAACCATTCTTTACTTCTTGTGGTATTGAATTTCAGCGCACTTGTGTCTATACTCCACAACAAAATGGAGTCGTAGAACGCAAGCATCGCCATATCCTAAATGTAGC
TAGGTCTCTTCTTTTTCAGTCACAAGTTCCACTTAATTTTTGAGGAGAGTGCATTTTAACGGCTGTTTATCTTATAAATAGAACGCCATCACCATTATTATCTAACAAGACACAGGTCTTACAGGAGCACGTCCAGACAAATTTCCTATGGAGCAAAATCTGAAACTTTCTTTAACCGAAGGAGAGAAGTTGAATGATCCAAGTAAATACAGACGGTTGATTGGCAGATTAATATATTTGACCGTCACTAGGCCTGACATAGCTTATTCAGTTCGTATGCTTAGCCAATTTATGCATGAACCAAGAAAACCACATTGGGAGGCAGCTCTTCGAGTTCTGAGGTACATTAAAGGCACTCCTGGTCAAGGACTTCTACTGCCATCTGAAAACAATTTAAGATTACAGGCATATTGCGATTCTGACTGGGGTGGTTGTCGAACTTCCAGACGATCTATTTCTGGGTTCTGCATTTTCCTCGGAAATTCAATTATTTCTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGAGTATCGAGCTATGGCAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAGCAGCATTACATATAGCAGCCAATCCAGTTTTTCATGAACGTACGAAACACATTGAAATAGATTGTCATATAGTTCGAGAAAAATTACAAGCTGGAATCATCAAACCGTGTTATGTTTCGACCAAAATGCAATTGGCAGATGTTTTTACTAAATCTTTGGGAAGACAGCAATTTGACTTTTTGAAGGACAAGTTGGGTGTGATCGACATACACTCTCCAACTTGAGGGGGAGTATTAAGAGGATATTTTGCCGTGATAATTAGTTAGAATATATCTTAGATATTTAGGAAGCTGTTAGTATAGATTTGNNNNNNNNNNACAGCACGTGTGTCACATACTGGCAATATTTCCCTTAGCCCTAACCTTAAATTAAACAACGTTTTATGTGTGCCTTCATTCAATTTAAACCTAATGTCGATCAGCAAACTTACCAATAACTTGAAATGTTATGTCACCTTCTATCCTGATTCTTGTGTTATGCAGGACTTGACTACGGGGAAGATGATTGGCTCGGGTAA

mRNA sequence

ATGGCGGAATCAGCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGCTCATGGGGCTCAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGACTTGACTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAGTCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGCCTAGGTCTTACAGGAGCACGTCCAGACAAATTTCCTATGGAGCAAAATCTGAAACTTTCTTTAACCGAAGGAGAGAAGTTGAATGATCCAAGTAAATACAGACGGTTGATTGGCAGATTAATATATTTGACCGTCACTAGGCCTGACATAGCTTATTCAGTTCGTATGCTTAGCCAATTTATGCATGAACCAAGAAAACCACATTGGGAGGCAGCTCTTCGAGTTCTGAGGTACATTAAAGGCACTCCTGGTCAAGGACTTCTACTGCCATCTGAAAACAATTTAAGATTACAGGCATATTGCGATTCTGACTGGGGTGGTTGTCGAACTTCCAGACGATCTATTTCTGGGTTCTGCATTTTCCTCGGAAATTCAATTATTTCTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGAGTATCGAGCTATGGCAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAGCAGCATTACATATAGCAGCCAATCCAGTTTTTCATGAACGACTTGACTACGGGGAAGATGATTGGCTCGGGTAA

Coding sequence (CDS)

ATGGCGGAATCAGCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTCCAAATCTAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTATCAATCCTGGAGGAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCTAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGCTCATGGGGCTCAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGACTTGACTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAGTCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGCCTAGGTCTTACAGGAGCACGTCCAGACAAATTTCCTATGGAGCAAAATCTGAAACTTTCTTTAACCGAAGGAGAGAAGTTGAATGATCCAAGTAAATACAGACGGTTGATTGGCAGATTAATATATTTGACCGTCACTAGGCCTGACATAGCTTATTCAGTTCGTATGCTTAGCCAATTTATGCATGAACCAAGAAAACCACATTGGGAGGCAGCTCTTCGAGTTCTGAGGTACATTAAAGGCACTCCTGGTCAAGGACTTCTACTGCCATCTGAAAACAATTTAAGATTACAGGCATATTGCGATTCTGACTGGGGTGGTTGTCGAACTTCCAGACGATCTATTTCTGGGTTCTGCATTTTCCTCGGAAATTCAATTATTTCTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGAGTATCGAGCTATGGCAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAGCAGCATTACATATAGCAGCCAATCCAGTTTTTCATGAACGACTTGACTACGGGGAAGATGATTGGCTCGGGTAA
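
The CDS above starts with ATG, ends with TAA and contains a run of N (unknown) bases. A small sketch of the basic sanity checks one might run on such a sequence; `cds_summary` is a hypothetical helper, and the `toy` string is an invented short example, not the record's sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def cds_summary(seq):
    """Return simple structural checks for a coding sequence string."""
    seq = seq.upper()
    return {
        "length": len(seq),
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        "length_multiple_of_3": len(seq) % 3 == 0,
        "n_count": seq.count("N"),  # unknown bases, e.g. assembly gaps
    }

# Invented toy sequence: ATG GCG GAA NNN TCA GCC TAA
toy = "ATGGCGGAANNNTCAGCCTAA"
summary = cds_summary(toy)
print(summary)
```

To check the real record, paste the full CDS string in place of `toy`.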
BLAST of CmoCh06G004550.1 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.5e-38
Identity = 78/157 (49.68%), Positives = 105/157 (66.88%), Query Frame = 1

Query: 434 GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
           G+   +P   P+   L  S++   K  DPS +R ++G L YLT+TRPDI+Y+V ++ Q M
Sbjct: 70  GMLDCKPMSTPLPLKLNSSVSTA-KYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRM 129

Query: 494 HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
           HEP    ++   RVLRY+KGT   GL +   + L +QA+CDSDW GC ++RRS +GFC F
Sbjct: 130 HEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTF 189

Query: 554 LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTW 591
           LG +IISW +K+Q  VSRSS E EYRA+A T  ELTW
Sbjct: 190 LGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
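
The percentage on an HSP's Identity line can be re-derived from its two counts (identical positions over alignment length). A minimal sketch, assuming the line keeps the `Identity = matches/aligned (pct%)` layout used in this record; `identity_percent` is a hypothetical helper name.

```python
import re

IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")

def identity_percent(line):
    """Return (matches, aligned, reported_pct, computed_pct) for an HSP stats line."""
    m = IDENTITY_RE.search(line)
    if m is None:
        raise ValueError(f"no Identity field in: {line!r}")
    matches, aligned = int(m.group(1)), int(m.group(2))
    reported = float(m.group(3))
    # Percent identity = identical positions / alignment length * 100
    computed = round(100.0 * matches / aligned, 2)
    return matches, aligned, reported, computed

result = identity_percent("Identity = 78/157 (49.68%)")
print(result)
```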

BLAST of CmoCh06G004550.1 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 4.8e-29
Identity = 75/203 (36.95%), Positives = 117/203 (57.64%), Query Frame = 1

Query: 432  RLGLTGARPDKFPMEQNLKLSL----TEGEKLNDPSK--YRRLIGRLIYLTV-TRPDIAY 491
            R  +  A+P   P+  +LKLS     T  E+  + +K  Y   +G L+Y  V TRPDIA+
Sbjct: 1072 RFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAH 1131

Query: 492  SVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSR 551
            +V ++S+F+  P K HWEA   +LRY++GT G  L     + + L+ Y D+D  G   +R
Sbjct: 1132 AVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDIDNR 1191

Query: 552  RSISGFCIFLGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSE 611
            +S +G+        ISW+SK Q  V+ S+ EAEY A   T  E+ WL+  LQ+L +   E
Sbjct: 1192 KSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKE 1251

Query: 612  PALLYCDNQAALHIAANPVFHER 628
              ++YCD+Q+A+ ++ N ++H R
Sbjct: 1252 -YVVYCDSQSAIDLSKNSMYHAR 1272

BLAST of CmoCh06G004550.1 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 121.3 bits (303), Expect = 3.8e-26
Identity = 71/190 (37.37%), Positives = 107/190 (56.32%), Query Frame = 1

Query: 444  PMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTV-TRPDIAYSVRMLSQFMHEPRKPHWE 503
            P + N +L L   E  N P   R LIG L+Y+ + TRPD+  +V +LS++  +     W+
Sbjct: 1163 PSKINYEL-LNSDEDCNTPC--RSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQ 1222

Query: 504  AALRVLRYIKGTPGQGLL----LPSENNLRLQAYCDSDWGGCRTSRRSISGFCIFLGN-S 563
               RVLRY+KGT    L+    L  EN  ++  Y DSDW G    R+S +G+   + + +
Sbjct: 1223 NLKRVLRYLKGTIDMKLIFKKNLAFEN--KIIGYVDSDWAGSEIDRKSTTGYLFKMFDFN 1282

Query: 564  IISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALH 623
            +I W +K+Q +V+ SS EAEY A+     E  WL+++L  +N+ L  P  +Y DNQ  + 
Sbjct: 1283 LICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCIS 1342

Query: 624  IAANPVFHER 628
            IA NP  H+R
Sbjct: 1343 IANNPSCHKR 1347

BLAST of CmoCh06G004550.1 vs. Swiss-Prot
Match: M240_ARATH (Uncharacterized mitochondrial protein AtMg00240 OS=Arabidopsis thaliana GN=AtMg00240 PE=4 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 5.5e-17
Identity = 38/79 (48.10%), Positives = 55/79 (69.62%), Query Frame = 1

Query: 473 IYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAY 532
           +YLT+TRPD+ ++V  LSQF    R    +A  +VL Y+KGT GQGL   + ++L+L+A+
Sbjct: 1   MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 533 CDSDWGGCRTSRRSISGFC 552
            DSDW  C  +RRS++GFC
Sbjct: 61  ADSDWASCPDTRRSVTGFC 79

BLAST of CmoCh06G004550.1 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 89.0 bits (219), Expect = 2.1e-16
Identity = 70/232 (30.17%), Positives = 114/232 (49.14%), Query Frame = 1

Query: 365 NILMMSPLPNVRQAYSLLVQEEMQRQDLTTGKMIGSGKQFGGL-YHISSS-----PIKSS 424
           ++L+ +P P +       V++E+ +  L + K +G   +F GL  H SS+      ++  
Sbjct: 90  DLLVAAPSPKIYDR----VKQELTK--LYSMKDLGKVDKFLGLNIHQSSNGDITLSLQDY 149

Query: 425 AHQVSQSSDLWHLRLGLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVT 484
             + +  S++   +L  T       P+  +  L  T    L D + Y+ ++G+L++   T
Sbjct: 150 IAKAASESEINTFKLTQT-------PLCNSKPLFETTSPHLKDITPYQSIVGQLLFCANT 209

Query: 485 -RPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDW 544
            RPDI+Y V +LS+F+ EPR  H E+A RVLRY+  T    L   S + L L  YCD+  
Sbjct: 210 GRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKYRSGSQLALTVYCDASH 269

Query: 545 GGCRTSRRSISGFCIFLGNSIISWKSKKQTNV-SRSSAEAEYRAMANTCLEL 589
           G       S  G+   L  + ++W SKK   V    S EAEY   + T +E+
Sbjct: 270 GAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYITASETVMEI 308

BLAST of CmoCh06G004550.1 vs. TrEMBL
Match: A5BNR5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035665 PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 1.4e-91
Identity = 162/201 (80.60%), Positives = 182/201 (90.55%), Query Frame = 1

Query: 434  GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
            GLTG +P+KFPMEQNLKL+  +GE L+DPS+YRRL+GRLIYLTVTRPDI YSVR LSQFM
Sbjct: 1476 GLTGVKPEKFPMEQNLKLTNEDGELLHDPSRYRRLVGRLIYLTVTRPDIVYSVRTLSQFM 1535

Query: 494  HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
            + PRKPHWEAALRVLRYIKG+PGQGL LPSENNL L A+CDSDWGGCR SRRS+SG+C+F
Sbjct: 1536 NTPRKPHWEAALRVLRYIKGSPGQGLFLPSENNLTLSAFCDSDWGGCRMSRRSVSGYCVF 1595

Query: 554  LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQ 613
            LG+S+ISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYIL+DL V L +PA L+CDNQ
Sbjct: 1596 LGSSLISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILKDLKVELDKPAPLFCDNQ 1655

Query: 614  AALHIAANPVFHERLDYGEDD 635
            AAL+IAANPVFHER  + E D
Sbjct: 1656 AALYIAANPVFHERTKHIEID 1676

BLAST of CmoCh06G004550.1 vs. TrEMBL
Match: A5BLV0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033389 PE=4 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 8.9e-75
Identity = 143/237 (60.34%), Positives = 175/237 (73.84%), Query Frame = 1

Query: 402  KQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL----GLTGARPDKFPMEQNLKLSLTEGE 461
            K  G L +     +  S   +S S   + L +    G  GA+P  FPMEQN KLS   GE
Sbjct: 991  KDLGDLKYFLGIEVSRSKKGISISQRKYTLEILKDGGFLGAKPVNFPMEQNTKLS-DSGE 1050

Query: 462  KLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQ 521
             L DPS+YRRL+GRLIYLT+TRPDI YSV +LS+FMH PR+PH EAALRVLRY+K +PGQ
Sbjct: 1051 LLKDPSQYRRLVGRLIYLTITRPDITYSVHVLSRFMHAPRRPHMEAALRVLRYLKNSPGQ 1110

Query: 522  GLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIFLGNSIISWKSKKQTNVSRSSAEAE 581
            GL  PS+N+L L+A+ DSDW GC  SRRS +G+C+FLG+S+ISW++K+Q  VS SSAEAE
Sbjct: 1111 GLFFPSQNDLSLRAFSDSDWAGCPISRRSTTGYCVFLGSSLISWRTKRQKTVSLSSAEAE 1170

Query: 582  YRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERLDYGEDD 635
            YRAMA TC EL+WLR +L+DL +   +PALLYCDN AALHIAANPVFHER  + E D
Sbjct: 1171 YRAMAGTCCELSWLRSLLKDLRILHPKPALLYCDNTAALHIAANPVFHERTRHIEMD 1226

BLAST of CmoCh06G004550.1 vs. TrEMBL
Match: A5AIJ8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031733 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 1.7e-73
Identity = 140/237 (59.07%), Positives = 173/237 (73.00%), Query Frame = 1

Query: 402  KQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL----GLTGARPDKFPMEQNLKLSLTEGE 461
            K  G L +     +  S   +S S   + L +    G  GA+P  FPMEQN+KLS    E
Sbjct: 1034 KDLGDLKYFLGIEVSRSKKGISISQRKYTLEILKDGGFLGAKPVNFPMEQNIKLS-DSSE 1093

Query: 462  KLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQ 521
             L DPS+YRRL+GRLIYLT+TRPDI YSV +LS+FMH PR+PH EAALRVLRY+K +PGQ
Sbjct: 1094 LLKDPSQYRRLVGRLIYLTITRPDITYSVHVLSRFMHAPRRPHMEAALRVLRYLKNSPGQ 1153

Query: 522  GLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIFLGNSIISWKSKKQTNVSRSSAEAE 581
            GL  PS+N+L L+A+ DSDW GC  SRRS +G+C+FLG+S+ISW++K+Q  VS SSAEAE
Sbjct: 1154 GLFFPSQNDLSLRAFSDSDWAGCPISRRSTTGYCVFLGSSLISWRTKRQKTVSLSSAEAE 1213

Query: 582  YRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERLDYGEDD 635
            YRAM  TC EL+WLR +L+DL +   +PALLYCDN  ALHIAANPVFHER  + E D
Sbjct: 1214 YRAMTGTCCELSWLRSLLKDLRILHPKPALLYCDNTTALHIAANPVFHERTRHIEMD 1269

BLAST of CmoCh06G004550.1 vs. TrEMBL
Match: A0A0B2QQ23_GLYSO (Copia protein (Fragment) OS=Glycine soja GN=glysoja_029824 PE=4 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 6.4e-73
Identity = 132/168 (78.57%), Positives = 147/168 (87.50%), Query Frame = 1

Query: 467 RLIGRLIYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENN 526
           RL+GRLIYLTVTRPDI YSV+ LSQFMHEPRKPHW+AALRVLRYIKGTPGQGLL  S N+
Sbjct: 1   RLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALRVLRYIKGTPGQGLLFSSAND 60

Query: 527 LRLQAYCDSDWGGCRTSRRSISGFCIFLGNSIISWKSKKQTNVSRSSAEAEYRAMANTCL 586
           L L+A+CDSDWGGC  +R+S++GFC FLGNS+ISWKSKKQ  VSRSSAE+EYRAMANTCL
Sbjct: 61  LTLKAFCDSDWGGCHATRKSVTGFCFFLGNSLISWKSKKQVVVSRSSAESEYRAMANTCL 120

Query: 587 ELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERLDYGEDD 635
           ELTWLR+ILQDL VP   P  L+CDNQAALHIAANPVFHER  + E D
Sbjct: 121 ELTWLRFILQDLKVPQDAPTPLFCDNQAALHIAANPVFHERTKHIEID 168

BLAST of CmoCh06G004550.1 vs. TrEMBL
Match: A5ART6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_028058 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 2.4e-72
Identity = 139/237 (58.65%), Positives = 171/237 (72.15%), Query Frame = 1

Query: 402  KQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL----GLTGARPDKFPMEQNLKLSLTEGE 461
            K  G L +     +  S   +S S   + L +    G  GA+P  FPMEQN KLS   GE
Sbjct: 837  KDLGDLKYFLGIEVSRSKKGISISQRKYTLEILKDGGFLGAKPVNFPMEQNTKLS-DSGE 896

Query: 462  KLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQ 521
             L  PS+YRRL+GRLIYLT+TRPDI YSV +LS+FMH PR+PH EAALRVLRY+K +PGQ
Sbjct: 897  LLKGPSQYRRLVGRLIYLTITRPDITYSVHVLSRFMHAPRRPHMEAALRVLRYLKNSPGQ 956

Query: 522  GLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIFLGNSIISWKSKKQTNVSRSSAEAE 581
            GL  PS+N+L L+A+ D DW GC  SRRS +G+C+FLG+S+ISW++K+Q  VS SS EAE
Sbjct: 957  GLFFPSQNDLSLRAFSDXDWAGCPISRRSXTGYCVFLGSSLISWRTKRQKTVSLSSXEAE 1016

Query: 582  YRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERLDYGEDD 635
            YRAMA TC EL+WLR +L+DL +   +PALLYCDN AALHIA NPVFHER  + E D
Sbjct: 1017 YRAMAGTCCELSWLRSLLKDLRILHPKPALLYCDNTAALHIAVNPVFHERTRHIEMD 1072

BLAST of CmoCh06G004550.1 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 214.9 bits (546), Expect = 1.4e-55
Identity = 101/201 (50.25%), Positives = 137/201 (68.16%), Query Frame = 1

Query: 434 GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
           GL G +P   PM+ ++  S   G    D   YRRLIGRL+YL +TR DI+++V  LSQF 
Sbjct: 347 GLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFS 406

Query: 494 HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
             PR  H +A +++L YIKGT GQGL   S+  ++LQ + D+ +  C+ +RRS +G+C+F
Sbjct: 407 EAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMF 466

Query: 554 LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQ 613
           LG S+ISWKSKKQ  VS+SSAEAEYRA++    E+ WL    ++L +PLS+P LL+CDN 
Sbjct: 467 LGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNT 526

Query: 614 AALHIAANPVFHERLDYGEDD 635
           AA+HIA N VFHER  + E D
Sbjct: 527 AAIHIATNAVFHERTKHIESD 547

BLAST of CmoCh06G004550.1 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 162.5 bits (410), Expect = 8.3e-40
Identity = 78/157 (49.68%), Positives = 105/157 (66.88%), Query Frame = 1

Query: 434 GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
           G+   +P   P+   L  S++   K  DPS +R ++G L YLT+TRPDI+Y+V ++ Q M
Sbjct: 70  GMLDCKPMSTPLPLKLNSSVSTA-KYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRM 129

Query: 494 HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
           HEP    ++   RVLRY+KGT   GL +   + L +QA+CDSDW GC ++RRS +GFC F
Sbjct: 130 HEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTF 189

Query: 554 LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTW 591
           LG +IISW +K+Q  VSRSS E EYRA+A T  ELTW
Sbjct: 190 LGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh06G004550.1 vs. TAIR10
Match: AT1G21280.1 (AT1G21280.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 113.2 bits (282), Expect = 5.8e-25
Identity = 72/232 (31.03%), Positives = 114/232 (49.14%), Query Frame = 1

Query: 168 TSKSSFKISDVDLTHPYY----IHHSDQPGYSLVPIKLNGANYQSWRKSVMHALIAKKKI 227
           T KS    SD D   PYY    IHH     +S+  +  +  NY +W+      L   KK 
Sbjct: 4   TIKSVSPTSDPD--SPYYLPPDIHHPSD--FSIQKLSKDEDNYVAWKIRFRSFLRVTKKF 63

Query: 228 GFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQVWVDLHD 287
           GFIDGT+ +P  D  S  ++ W QCN+M++ WL +S+   + + +++A+TAH++W DL  
Sbjct: 64  GFIDGTLPKP--DPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRR 123

Query: 288 QFSQKNALAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRT-------PFTCXXX 347
            F     L I+Q++  +AT+ QG  ++  YF KL  +W EL  Y            C   
Sbjct: 124 VFVPCVDLKIYQLRRRLATLRQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGCNCECT 183

Query: 348 XXXXXXXXXXXXXXLLMG--LNQSYKTVRSNILMMSPLPNVRQAYSLLVQEE 387
                          LMG  LNQ ++ V + I+   P P++ +A++++   E
Sbjct: 184 KRAEEAREKEQRYEFLMGLKLNQGFEAVTTKIMFQKPPPSLHEAFAMVKDAE 229

BLAST of CmoCh06G004550.1 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 90.9 bits (224), Expect = 3.1e-18
Identity = 38/79 (48.10%), Positives = 55/79 (69.62%), Query Frame = 1

Query: 473 IYLTVTRPDIAYSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAY 532
           +YLT+TRPD+ ++V  LSQF    R    +A  +VL Y+KGT GQGL   + ++L+L+A+
Sbjct: 1   MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 533 CDSDWGGCRTSRRSISGFC 552
            DSDW  C  +RRS++GFC
Sbjct: 61  ADSDWASCPDTRRSVTGFC 79

BLAST of CmoCh06G004550.1 vs. NCBI nr
Match: gi|147783627|emb|CAN68148.1| (hypothetical protein VITISV_035665 [Vitis vinifera])

HSP 1 Score: 345.5 bits (885), Expect = 2.0e-91
Identity = 162/201 (80.60%), Positives = 182/201 (90.55%), Query Frame = 1

Query: 434  GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
            GLTG +P+KFPMEQNLKL+  +GE L+DPS+YRRL+GRLIYLTVTRPDI YSVR LSQFM
Sbjct: 1476 GLTGVKPEKFPMEQNLKLTNEDGELLHDPSRYRRLVGRLIYLTVTRPDIVYSVRTLSQFM 1535

Query: 494  HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
            + PRKPHWEAALRVLRYIKG+PGQGL LPSENNL L A+CDSDWGGCR SRRS+SG+C+F
Sbjct: 1536 NTPRKPHWEAALRVLRYIKGSPGQGLFLPSENNLTLSAFCDSDWGGCRMSRRSVSGYCVF 1595

Query: 554  LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQ 613
            LG+S+ISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYIL+DL V L +PA L+CDNQ
Sbjct: 1596 LGSSLISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILKDLKVELDKPAPLFCDNQ 1655

Query: 614  AALHIAANPVFHERLDYGEDD 635
            AAL+IAANPVFHER  + E D
Sbjct: 1656 AALYIAANPVFHERTKHIEID 1676

BLAST of CmoCh06G004550.1 vs. NCBI nr
Match: gi|971549811|ref|XP_015164069.1| (PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Solanum tuberosum])

HSP 1 Score: 329.3 bits (843), Expect = 1.5e-86
Identity = 153/194 (78.87%), Positives = 172/194 (88.66%), Query Frame = 1

Query: 434 GLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIAYSVRMLSQFM 493
           G+ GARP+ FPMEQNLKL+ T+G  LNDP+KYRRL+GRLIYLTVTRPDI YSVR LSQFM
Sbjct: 15  GILGARPELFPMEQNLKLTSTDGVVLNDPTKYRRLVGRLIYLTVTRPDIVYSVRTLSQFM 74

Query: 494 HEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTSRRSISGFCIF 553
            EPRKPHW+AA+R+L+YIKGTPGQGLL PS NNL L+A+CDSDWG CR +RRS++G+CIF
Sbjct: 75  QEPRKPHWDAAVRILKYIKGTPGQGLLFPSTNNLILKAFCDSDWGSCRATRRSVTGYCIF 134

Query: 554 LGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQ 613
           LGNS+ISWKSKKQ  VSRSSAEAEYRAMANTCLELTWL YILQDL +    PA L+CDNQ
Sbjct: 135 LGNSLISWKSKKQLVVSRSSAEAEYRAMANTCLELTWLSYILQDLRISNVTPAKLFCDNQ 194

Query: 614 AALHIAANPVFHER 628
           AALHIAANPVFHER
Sbjct: 195 AALHIAANPVFHER 208

BLAST of CmoCh06G004550.1 vs. NCBI nr
Match: gi|731433688|ref|XP_010644745.1| (PREDICTED: uncharacterized protein LOC104877672 [Vitis vinifera])

HSP 1 Score: 322.4 bits (825), Expect = 1.8e-84
Identity = 156/213 (73.24%), Positives = 173/213 (81.22%), Query Frame = 1

Query: 178 VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWRKSVMHALIAKKKIGFIDGTIEEPSQDA 237
           +D +HP YIHHSDQPG+ LVPIKLNG NYQSW KSV+HAL AKKKIGF+DGTIEEPSQ+ 
Sbjct: 15  IDPSHPLYIHHSDQPGHVLVPIKLNGVNYQSWSKSVIHALTAKKKIGFVDGTIEEPSQED 74

Query: 238 NSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNALAIFQIQ 297
               FE WNQCNSMI+SWLTH VE+DIA+GIIHAKTA +VWVDL DQFSQKNA A+FQIQ
Sbjct: 75  EPFMFEQWNQCNSMILSWLTHVVESDIAEGIIHAKTAREVWVDLRDQFSQKNAPAVFQIQ 134

Query: 298 NSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCXXXXXXXXXXXXXXXXXLLMGLNQ 357
            SIATMSQGTM ++ YFTK+KALWDELE YR+P TC                  LMGLN+
Sbjct: 135 KSIATMSQGTMTVAAYFTKIKALWDELETYRSPLTCNQRQAHLEQREEDRLMQFLMGLNE 194

Query: 358 SYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ 391
           SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQ
Sbjct: 195 SYKAVRSNILMMSPLPNVRQAYSLIVQEEMQRQ 227

BLAST of CmoCh06G004550.1 vs. NCBI nr
Match: gi|731439152|ref|XP_010646703.1| (PREDICTED: uncharacterized protein LOC104878279 [Vitis vinifera])

HSP 1 Score: 319.7 bits (818), Expect = 1.2e-83
Identity = 152/213 (71.36%), Positives = 174/213 (81.69%), Query Frame = 1

Query: 178 VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWRKSVMHALIAKKKIGFIDGTIEEPSQDA 237
           +D +HP YIHHSDQPG+ LVPIKLNG NYQSW K+V+HAL AKKKIGF++GT+EEPSQ+ 
Sbjct: 15  IDPSHPLYIHHSDQPGHVLVPIKLNGVNYQSWSKAVIHALTAKKKIGFVNGTVEEPSQED 74

Query: 238 NSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNALAIFQIQ 297
               FE WNQCNSMI+SWLTH+VE+DIA+GIIHAKTA +VWVDL DQFSQKNA A+FQIQ
Sbjct: 75  EPFMFEQWNQCNSMILSWLTHAVESDIAEGIIHAKTAREVWVDLRDQFSQKNAPAVFQIQ 134

Query: 298 NSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCXXXXXXXXXXXXXXXXXLLMGLNQ 357
            SIATMSQGTM ++ YFTK+KALWDELE YR+P TC                  LMGLN+
Sbjct: 135 KSIATMSQGTMTVAAYFTKIKALWDELETYRSPLTCNQRQAHLEQREEDRLMQFLMGLNE 194

Query: 358 SYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ 391
           SYK VRSNILMMSPLPNVRQAYSL++QEEMQRQ
Sbjct: 195 SYKAVRSNILMMSPLPNVRQAYSLIIQEEMQRQ 227

BLAST of CmoCh06G004550.1 vs. NCBI nr
Match: gi|747105150|ref|XP_011100841.1| (PREDICTED: uncharacterized protein LOC105178952 [Sesamum indicum])

HSP 1 Score: 313.9 bits (803), Expect = 6.3e-82
Identity = 166/451 (36.81%), Positives = 256/451 (56.76%), Query Frame = 1

Query: 186 IHHSDQPGYSLVPIKLNGANYQSWRKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELW 245
           +H SD PG SLV   L+G+N+ SW +S+   L AK K+ FI    E P +  N+ EFE W
Sbjct: 10  LHPSDNPGLSLVTTILDGSNFLSWSRSIRLVLTAKTKMSFISKDAEIPEK--NTKEFEQW 69

Query: 246 NQCNSMIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNALAIFQIQNSIATMSQ 305
            + +SM+ SW+ +S+  DI +  ++ KT+ ++W +L +++ Q N    ++++  +A++SQ
Sbjct: 70  IKVDSMVTSWILNSITRDIVESFMYTKTSKELWTELENRYGQSNGPMEYRLKRELASLSQ 129

Query: 306 GTMALSTYFTKLKALWDELEAYRTPFTCXXXXXXXXXXXXXXXXXLLMGLNQSYKTVRSN 365
                    +      D+L  +                        LMGL  S   VRS 
Sbjct: 130 ARENADIKSS------DQLMQF------------------------LMGLKDSCDHVRSQ 189

Query: 366 ILMMSPLPNVRQAYSLLVQEEMQRQDLTTGKMIGSGKQFGGLYHISSSPIKSSAHQVSQS 425
           ILMM P PNV +A+S++++ E Q++                   +++ P   +   +SQ+
Sbjct: 190 ILMMEPYPNVSKAFSMVLRIEKQKE-------------------VNTEPQSKTGIVISQT 249

Query: 426 SDLWHL--RLGLTGARPDKFPMEQNLKLSLTEGEKLNDPSKYRRLIGRLIYLTVTRPDIA 485
             +  +   +GL+  +    P+   +KLS ++ E+L +P  +RRL+GRL+YL  TRPDI 
Sbjct: 250 KYIKDIIDDVGLSEVKATSTPLPMGIKLSSSQEEQLENPESFRRLLGRLLYLGFTRPDIC 309

Query: 486 YSVRMLSQFMHEPRKPHWEAALRVLRYIKGTPGQGLLLPSENNLRLQAYCDSDWGGCRTS 545
           +  + LSQ+M  P K HW AAL ++RY+K T  +GL L ++++  L+A+CD DW  C+ S
Sbjct: 310 HGTQQLSQYMQFPCKAHWNAALHLVRYLKTTMNRGLQLNTDDSFELKAFCDVDWASCKDS 369

Query: 546 RRSISGFCIFLGNSIISWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLS 605
           R+S++ +C+FLG S ISWK+KKQT VSRSS E EYR+M  T  E  W+  +LQD  +   
Sbjct: 370 RKSLTRYCVFLGGSFISWKTKKQTTVSRSSGEVEYRSMGTTTCEPIWIFNLLQDFKIIPP 409

Query: 606 EPALLYCDNQAALHIAANPVFHERLDYGEDD 635
            P   YCDNQAAL+I ANP+FHER  + E D
Sbjct: 430 TPIKFYCDNQAALYITANPIFHERTKHIEID 409

The following BLAST results are available for this feature:

BLAST vs. Swiss-Prot
Match Name | E-value | Identity (%) | Description
M810_ARATH | 1.5e-38 | 49.68 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
POLX_TOBAC | 4.8e-29 | 36.95 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME | 3.8e-26 | 37.37 | Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M240_ARATH | 5.5e-17 | 48.10 | Uncharacterized mitochondrial protein AtMg00240 OS=Arabidopsis thaliana GN=AtMg0...
YCH4_YEAST | 2.1e-16 | 30.17 | Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

BLAST vs. TrEMBL
Match Name | E-value | Identity (%) | Description
A5BNR5_VITVI | 1.4e-91 | 80.60 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035665 PE=4 SV=1
A5BLV0_VITVI | 8.9e-75 | 60.34 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033389 PE=4 SV=1
A5AIJ8_VITVI | 1.7e-73 | 59.07 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031733 PE=4 SV=1
A0A0B2QQ23_GLYSO | 6.4e-73 | 78.57 | Copia protein (Fragment) OS=Glycine soja GN=glysoja_029824 PE=4 SV=1
A5ART6_VITVI | 2.4e-72 | 58.65 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_028058 PE=4 SV=1

BLAST vs. TAIR10
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 1.4e-55 | 50.25 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 8.3e-40 | 49.68 | DNA/RNA polymerases superfamily protein
AT1G21280.1 | 5.8e-25 | 31.03 | Retrotransposon gag protein (InterPro:IPR005162)
ATMG00240.1 | 3.1e-18 | 48.10 | Gag-Pol-related retrotransposon family protein

BLAST vs. NCBI nr
Match Name | E-value | Identity (%) | Description
gi|147783627|emb|CAN68148.1| | 2.0e-91 | 80.60 | hypothetical protein VITISV_035665 [Vitis vinifera]
gi|971549811|ref|XP_015164069.1| | 1.5e-86 | 78.87 | PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Solanum tuberos...
gi|731433688|ref|XP_010644745.1| | 1.8e-84 | 73.24 | PREDICTED: uncharacterized protein LOC104877672 [Vitis vinifera]
gi|731439152|ref|XP_010646703.1| | 1.2e-83 | 71.36 | PREDICTED: uncharacterized protein LOC104878279 [Vitis vinifera]
gi|747105150|ref|XP_011100841.1| | 6.3e-82 | 36.81 | PREDICTED: uncharacterized protein LOC105178952 [Sesamum indicum]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term | Definition
IPR005162 | Retrotrans_gag_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0008270 zinc ion binding
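
The GO assignments are easier to work with once they are grouped by category. A minimal sketch, assuming each row is laid out as category, accession and term name separated by whitespace, as in the table above; `group_go_terms` is a hypothetical helper.

```python
import re
from collections import defaultdict

rows = """\
biological_process GO:0015074 DNA integration
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0008270 zinc ion binding
"""

def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        m = re.match(r"(\w+)\s+(GO:\d{7})\s+(.+)", line.strip())
        if m:  # silently skip lines that are not GO rows (e.g. a header)
            grouped[m.group(1)].append((m.group(2), m.group(3)))
    return dict(grouped)

grouped = group_go_terms(rows)
for category, terms in grouped.items():
    print(category, [accession for accession, _ in terms])
```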

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmoCh06G004550 | CmoCh06G004550 | gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CmoCh06G004550.1 | CmoCh06G004550.1-protein | polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh06G004550.1.exon.1 | CmoCh06G004550.1.exon.1 | exon
CmoCh06G004550.1.exon.2 | CmoCh06G004550.1.exon.2 | exon
CmoCh06G004550.1.exon.3 | CmoCh06G004550.1.exon.3 | exon
CmoCh06G004550.1.exon.4 | CmoCh06G004550.1.exon.4 | exon
CmoCh06G004550.1.exon.5 | CmoCh06G004550.1.exon.5 | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh06G004550.1.CDS.1 | CmoCh06G004550.1.CDS.1 | CDS
CmoCh06G004550.1.CDS.2 | CmoCh06G004550.1.CDS.2 | CDS
CmoCh06G004550.1.CDS.3 | CmoCh06G004550.1.CDS.3 | CDS
CmoCh06G004550.1.CDS.4 | CmoCh06G004550.1.CDS.4 | CDS
CmoCh06G004550.1.CDS.5 | CmoCh06G004550.1.CDS.5 | CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 90..162; score: 3.
None | No IPR available | PANTHER | PTHR11439 | GAG-POL-RELATED RETROTRANSPOSON | coord: 434..607; score: 4.4
None | No IPR available | PANTHER | PTHR11439:SF191 | SUBFAMILY NOT NAMED | coord: 434..607; score: 4.4
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 245..391; score: 9.
None | No IPR available | unknown | SSF56672 | DNA/RNA polymerases | coord: 456..628; score: 2.22
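
The `coord` ranges above can be compared directly to see how long each domain hit is and where hits overlap. A rough sketch, assuming the coordinates are 1-based inclusive residue positions on the predicted protein (the page does not say so explicitly); the helper names are hypothetical.

```python
# (source term, start, end) taken from the coord fields of the annotation table
hits = [
    ("PF03732", 90, 162),
    ("PF14223", 245, 391),
    ("PTHR11439", 434, 607),
    ("SSF56672", 456, 628),
]

def hit_length(start, end):
    """Length of an inclusive coordinate range."""
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two (name, start, end) hits; 0 if disjoint."""
    lo = max(a[1], b[1])
    hi = min(a[2], b[2])
    return max(0, hi - lo + 1)

lengths = {name: hit_length(start, end) for name, start, end in hits}
print(lengths)
print(overlap(hits[2], hits[3]))  # PTHR11439 vs SSF56672
print(overlap(hits[0], hits[1]))  # PF03732 vs PF14223
```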