CmoCh20G009950 (gene) Cucurbita moschata (Rifu)

Name: CmoCh20G009950
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Retrotransposon protein, putative, Ty3-gypsy subclass
Location: Cmo_Chr20 : 5925057 .. 5928087 (-)
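
The location field packs chromosome, start, end and strand into one string. The following is a minimal sketch of how it could be split apart, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Cmo_Chr20 : 5925057 .. 5928087 (-)' into fields."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr20 : 5925057 .. 5928087 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3031 bp on the minus strand of Cmo_Chr20.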
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGAAAGCTCAACGATGCCGAGCGAAGATACACTGTCTCCGAGAAAGAAATGCTGGCTGTAGTCCATTGCCTTTGAGTCTGGAGACAGTATCTCTTAGGATCACAATTCGTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAA
GCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTTGCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGTATGAAAGTAGTCGGTTGACAGGGACGTCAACCAATTAGGTGGGGGAGGATGTCACGGTCATGCTTGTCCAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG

mRNA sequence

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAAGCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTT
GCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG

Coding sequence (CDS)

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAAGCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTT
GCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG
BLAST of CmoCh20G009950 vs. Swiss-Prot
Match: TF29_SCHPO (Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 3.0e-64
Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1

Query: 18  EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELLP-GVKQPAKNAYRMASPELAELRKQ 77
           E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78  LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
           +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
           + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
           +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258 GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
           G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318 KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL 364
           KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+
Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAV 719
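
The percentages in an HSP summary line are the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them from the raw counts; the regular expression and the function name `check_hsp` are illustrative assumptions, written only against the summary format shown in this record.

```python
import re

# Matches the summary line format used in this record.
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp(line):
    """Recompute identity/positives percentages from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, alen, ident_pct, pos, plen, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(alen), 2),
        "positives": round(100 * int(pos) / int(plen), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }

stats = check_hsp(
    "Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1"
)
print(stats)
```

For the hit above, 130/348 and 200/348 reproduce the reported 37.36% and 57.47%.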

BLAST of CmoCh20G009950 vs. Swiss-Prot
Match: TF26_SCHPO (Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 3.0e-64
Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1

Query: 18  EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELLP-GVKQPAKNAYRMASPELAELRKQ 77
           E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78  LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
           +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
           + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
           +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258 GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
           G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318 KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL 364
           KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+
Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAV 719

BLAST of CmoCh20G009950 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 3.0e-64
Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1

Query: 18  EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELLP-GVKQPAKNAYRMASPELAELRKQ 77
           E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78  LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
           +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
           + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
           +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258 GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
           G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318 KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL 364
           KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+
Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAV 719

BLAST of CmoCh20G009950 vs. Swiss-Prot
Match: TF24_SCHPO (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 3.0e-64
Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1

Query: 18  EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELLP-GVKQPAKNAYRMASPELAELRKQ 77
           E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78  LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
           +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
           + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
           +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258 GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
           G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318 KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL 364
           KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+
Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAV 719

BLAST of CmoCh20G009950 vs. Swiss-Prot
Match: TF23_SCHPO (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 3.0e-64
Identity = 130/348 (37.36%), Positives = 200/348 (57.47%), Query Frame = 1

Query: 18  EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELLP-GVKQPAKNAYRMASPELAELRKQ 77
           E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78  LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
           +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
           + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
           +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258 GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
           G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318 KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL 364
           KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+
Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAV 719

BLAST of CmoCh20G009950 vs. TrEMBL
Match: A5BX03_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032357 PE=4 SV=1)

HSP 1 Score: 1052.0 bits (2719), Expect = 4.3e-304
Identity = 526/945 (55.66%), Positives = 661/945 (69.95%), Query Frame = 1

Query: 13   ETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAEL 72
            E + +EIK VLD + D+M   L + L PRR  +H+I+L  G K  A   YRMA PEL EL
Sbjct: 589  EPMPKEIKGVLDEFKDVMXPELPKRLXPRREEBHKIKLEXGAKPRAMGPYRMAPPELEEL 648

Query: 73   RKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDL 132
            R+QL ELL AGFI+P+KAPYGAPVLFQKK +G+LR+CIDYRALNKVTV+NKYP+P+I+DL
Sbjct: 649  RRQLKELLDAGFIQPSKAPYGAPVLFQKKHDGSLRMCIDYRALNKVTVKNKYPIPLIADL 708

Query: 133  FDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTL 192
            FD+L  A+YFTKLDLRSGYYQVRIAEGDEPKTTCVTRYG++EFLVMPF LTNAPATFCTL
Sbjct: 709  FDQLGRARYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGSYEFLVMPFGLTNAPATFCTL 768

Query: 193  MNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI 252
            MN++F+ YLD+FV+ YLDDIV+YS TL+EH+ HL+ VF  LRQN+LYVKKEKC+FA+  +
Sbjct: 769  MNKIFHPYLDKFVVXYLDDIVIYSNTLKEHEEHLRKVFKILRQNKLYVKKEKCSFAKEEV 828

Query: 253  NFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTE 312
            NFLGH +R G++ MD  K+KAIQEW  PT V +LRSFLGL NYYRRF++G+S RAAPLT+
Sbjct: 829  NFLGHRIRDGKLMMDDSKVKAIQEWDPPTKVPQLRSFLGLVNYYRRFIKGYSGRAAPLTD 888

Query: 313  LLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL--------- 372
            LLKK+  W W   CQ AFE+LK  +T  PVL L D TK FEV TDASDFA+         
Sbjct: 889  LLKKNKAWEWDGRCQQAFEDLKKAVTEEPVLALPDHTKVFEVHTDASDFAIGGVLMQERH 948

Query: 373  ------VKTDNS-------------------------------------ATCHFFDQPKL 432
                   K +N+                                     AT +F  Q KL
Sbjct: 949  PIAFESRKLNNAERRYTVQEKEMTAIVHCLRTWRHYLLGSHFIVKTDNVATSYFQTQKKL 1008

Query: 433  TAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDII 492
            + KQARWQ+ LAEFD+  E+K G +N  ADALSRK E A++       SS+  G I  ++
Sbjct: 1009 SPKQARWQDFLAEFDYTLEYKPGSANHVADALSRKAELASI-------SSQPQGDIMYLL 1068

Query: 493  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLW 552
            +E L  DP AK+++ LA  GKT++FWVE  LL TKG RLYVP+ G +R+ LI++CHDT W
Sbjct: 1069 REGLQHDPVAKSLIALAHEGKTKRFWVEDGLLYTKGRRLYVPKWGNIRRNLIKECHDTKW 1128

Query: 553  AGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPW 612
            AGHPG +RT AL++  Y+WP +RD++  Y +TCL+CQQDKVE+ +  GLLEPLPV  RPW
Sbjct: 1129 AGHPGQRRTRALLESAYYWPQIRDEVEAYVRTCLVCQQDKVEQRQPRGLLEPLPVAERPW 1188

Query: 613  ESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIP 672
            +SV++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+AE TA+LF KHVVK  G+P
Sbjct: 1189 DSVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTAEETARLFLKHVVKYWGLP 1248

Query: 673  SSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQ 732
              IISDRD RF G FWT+LF  +G+ L+ S+S+HPQTD Q ER N LLE YLRHFV A Q
Sbjct: 1249 KFIISDRDPRFTGKFWTELFKLMGSELHFSTSFHPQTDGQTERXNALLELYLRHFVSANQ 1308

Query: 733  KNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWK 792
            ++W +LLD+AQF +N Q S +T KSPFE+ +G+QP  PH +   Y G++P A  F + W 
Sbjct: 1309 RDWAKLLDIAQFSYNLQRSEATNKSPFELATGQQPLTPHTLXIGYTGRSPAAFKFAKGWH 1368

Query: 793  QTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYE 852
            +  DIA +YL+KA+K MKKWADKKRR  +++ GD VL+KL P+Q +      + LVR+YE
Sbjct: 1369 EQADIAXSYLDKAAKKMKKWADKKRRHTEYKVGDMVLVKLLPQQFKSLRPVHKGLVRRYE 1428

Query: 853  GPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKET 905
            GP  +L K+G  SY+V LP  +KIHPV HVS L PYH D DD  R  + R    +     
Sbjct: 1429 GPFPILGKVGKVSYKVELPPRLKIHPVFHVSYLNPYHEDKDDPSRGLSKRAPTAVVTSYD 1488

BLAST of CmoCh20G009950 vs. TrEMBL
Match: A5AQF1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_014071 PE=4 SV=1)

HSP 1 Score: 959.9 bits (2480), Expect = 2.2e-276
Identity = 488/945 (51.64%), Positives = 630/945 (66.67%), Query Frame = 1

Query: 13   ETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAEL 72
            E + +EI+ VLD + D+MP  L + LPP+R  DH+IEL PG K PA   YRMA PEL EL
Sbjct: 293  EPMPKEIEGVLDEFKDVMPPELPKRLPPKREEDHKIELEPGAKPPAMGPYRMAPPELEEL 352

Query: 73   RKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDL 132
            R+QL ELL AGFI+P+KAPYGAPVLFQKK +G+LR+CIDYRALNKVTV+NKYP+P+I+DL
Sbjct: 353  RRQLKELLDAGFIQPSKAPYGAPVLFQKKHDGSLRMCIDYRALNKVTVKNKYPIPLIADL 412

Query: 133  FDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTL 192
            FD+L  A+YFTKLDLR                     YG++EFLVMPF LTNAP  FCTL
Sbjct: 413  FDQLGRARYFTKLDLR---------------------YGSYEFLVMPFGLTNAPTMFCTL 472

Query: 193  MNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI 252
            MN++F+ YLD+FV+VYLDDIV+YS TL+EH+ HL+ VF  LRQN+LYVKKEKC+FA+  +
Sbjct: 473  MNKIFHPYLDKFVVVYLDDIVIYSNTLKEHEEHLRKVFKILRQNKLYVKKEKCSFAKEEV 532

Query: 253  NFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTE 312
            +FLGH +R G++ MD  K+KAIQEW  PT V +LRSFL L NYYRRF++G+S RAAPLT+
Sbjct: 533  SFLGHRIRDGKLMMDDSKVKAIQEWDPPTKVPQLRSFLSLVNYYRRFIKGYSGRAAPLTD 592

Query: 313  LLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL--------- 372
            LLKK+  W W   CQ AFENLK  +T  PVL L D TK FEV TDASDFA+         
Sbjct: 593  LLKKNKAWEWDERCQHAFENLKKAVTEEPVLALPDHTKVFEVHTDASDFAIGGVLMQERH 652

Query: 373  ------VKTDNSATCHFFDQPKLTA----------------------------------- 432
                   K +++   +   + ++TA                                   
Sbjct: 653  LIAFESRKLNDAERRYTVQEKEMTAIVHCLHTWRHYLLGSHFIVKTDNVATSYFQTQKKL 712

Query: 433  --KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDII 492
              KQARWQ+ LAEFD+  E+K G +N  A ALS K E  ++       +S+  G I D++
Sbjct: 713  SPKQARWQDFLAEFDYTLEYKPGSANHVAGALSHKAELTSM-------TSQPQGDIMDLL 772

Query: 493  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLW 552
            +E L  DP AK+++ LA  GKT++FWVE DLL TKG RLYVP+ G +R+ LI++CHDT W
Sbjct: 773  REGLQHDPMAKSLIALAHEGKTKRFWVEDDLLYTKGRRLYVPKWGNIRRNLIKECHDTKW 832

Query: 553  AGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPW 612
            AGHPG +RT AL++  Y+WP +RD++  Y           VE+ +  GLLEPLP+  RPW
Sbjct: 833  AGHPGQRRTRALLESAYYWPQIRDEVEAY-----------VEQRQPRGLLEPLPIAERPW 892

Query: 613  ESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIP 672
            ++V++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+AE T +LF KHVVK  G+P
Sbjct: 893  DNVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTAEETTRLFLKHVVKYWGLP 952

Query: 673  SSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQ 732
              IISDRD RF G FWT+LF  +G+ L+ S+S+HPQT+ Q ER N LLE YLRHFV A Q
Sbjct: 953  KYIISDRDPRFTGKFWTELFKLMGSELHFSTSFHPQTNGQTERVNALLELYLRHFVSANQ 1012

Query: 733  KNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWK 792
            ++W +LLD+AQF +N Q S +T KSPF++ +G+QP  PH++   Y G++P A  F + W 
Sbjct: 1013 RDWAKLLDIAQFSYNLQMSEATNKSPFKLATGQQPLTPHMLTIGYTGRSPAAFKFAKGWH 1072

Query: 793  QTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYE 852
            +  DIAR+YL+KA+K MKKWADKKR   +++ GD VL+KL P+Q +      + LVR+YE
Sbjct: 1073 EQADIARSYLDKATKKMKKWADKKRHHTEYKVGDMVLVKLLPQQFKSLRPVHKSLVRRYE 1132

Query: 853  GPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKET 905
            GP  +L K+G  SY+V LP  +KIHPV HVS LKPYH D DD  R  + R    +     
Sbjct: 1133 GPFPILGKVGKVSYKVELPPRLKIHPVFHVSYLKPYHEDKDDPSRGLSKRAPTAVVTSYD 1192

BLAST of CmoCh20G009950 vs. TrEMBL
Match: Q9ZS84_SOLLC (Polyprotein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 935.3 bits (2416), Expect = 5.9e-269
Identity = 480/942 (50.96%), Positives = 617/942 (65.50%), Query Frame = 1

Query: 19   IKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAELRKQLDE 78
            + E+L  Y D+MP  L + LPPRR IDH+IELLPG   PA+  YRMA  EL ELRKQL+E
Sbjct: 575  VAELLKQYADVMPPELPKKLPPRRDIDHKIELLPGTVAPAQAPYRMAPKELVELRKQLNE 634

Query: 79   LLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDKLHG 138
            LL AG I+P+KAPYGAPVLFQKK++GT+R+C+DYRALNK T++NKY +P++ DL D+L  
Sbjct: 635  LLDAGLIQPSKAPYGAPVLFQKKQDGTMRMCVDYRALNKATIKNKYSVPLVQDLMDRLSK 694

Query: 139  AKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQVFY 198
            A +FTKLDLR+GY+QVRIAEGDEPKTTCVTRYG++EFLVMPF LTNAPATFC LMN V +
Sbjct: 695  ACWFTKLDLRAGYWQVRIAEGDEPKTTCVTRYGSYEFLVMPFGLTNAPATFCNLMNNVLF 754

Query: 199  EYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHV 258
            +YLD FV+VYLDDIV+YS TLEEH  HL LV  +LR+  LYVK EKC FAQ  I FLGH+
Sbjct: 755  DYLDDFVVVYLDDIVIYSRTLEEHVNHLSLVLSQLRKYTLYVKMEKCEFAQQEIKFLGHL 814

Query: 259  VRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDH 318
            V   Q+ MD  K++AI +W+ P  V +LRSFLGLANYYR+F+ G+S++AA LT+LLKKD 
Sbjct: 815  VSKNQVRMDPKKVQAIVDWQAPRHVKDLRSFLGLANYYRKFIAGYSKKAASLTDLLKKDA 874

Query: 319  PWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD------------------ 378
             W WS  C+ AF+NLK  +   P+L L D   PFEV TDASD                  
Sbjct: 875  KWVWSEQCEKAFQNLKNAIASEPILKLPDFELPFEVHTDASDKAIGGVLVQEGHPVAFES 934

Query: 379  ---------FALVKTDNSATCHFFD-------------------------QPKLTAKQAR 438
                     ++  + +  A  H                            Q KL+ KQAR
Sbjct: 935  RKLNDAEQRYSTHEKEMVAVVHCLQVWRVYLLGTRFVVRTDNVANTFFKTQKKLSPKQAR 994

Query: 439  WQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDIIKEHLHK 498
            WQE LAE+DF +EHK GK NQ ADALSRK    A+  +     SK++    D I+     
Sbjct: 995  WQEFLAEYDFMWEHKPGKHNQVADALSRKEVFVAVYSI-----SKLETDFYDRIRLCAAN 1054

Query: 499  DPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLWAGHPGW 558
            D      +   + G  R++W+E DLL  KG R+ VP  G LRK L+++ +D+ WAGHPG 
Sbjct: 1055 DSLYVKWMGQVQDGTMRRYWIEDDLLYFKGGRIVVPNQGGLRKDLMKEAYDSAWAGHPGV 1114

Query: 559  QRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPWESVSLD 618
            +R  AL+ + YFWP M DDI  Y KTC +CQ DK E+ K  GLL+PLP+P RPW SVS+D
Sbjct: 1115 ERMLALLSRVYFWPKMEDDIEAYVKTCHVCQVDKTERKKEAGLLQPLPIPERPWLSVSMD 1174

Query: 619  FITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIPSSIISD 678
            FI+  PKV    +I+V+VDRFSKY+ FI  P+LCS+E+ A+LF+KHV+K  G+P+ I+SD
Sbjct: 1175 FISGFPKVDGKASIMVVVDRFSKYSVFIAAPELCSSEVAAELFYKHVIKYFGVPADIVSD 1234

Query: 679  RDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQKNWIQL 738
            RD RF G FWT LF  +GT L  S++ HPQTD Q ER N LLEEYLRH+V A Q+NW++L
Sbjct: 1235 RDTRFTGRFWTALFNMMGTELKFSTANHPQTDGQTERINHLLEEYLRHYVTASQRNWVEL 1294

Query: 739  LDVAQFCFNCQTSSSTGKSPFEIVSGRQPALP-HIIDYPYAGKNPQAHNFTREWKQTTDI 798
            LD AQFC+N   SS+T  SPFEIV G+QP  P  +      GK P A+   R+  +    
Sbjct: 1295 LDTAQFCYNLHKSSATEMSPFEIVLGKQPMTPLDVAKSKNQGKCPAAYRVARDRLEMLSE 1354

Query: 799  ARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPE---QIRFRSRKDQRLVRKYEGP 858
            A+  L KA + MKK+AD+ RR ++F  GD+VL+KL P+   QI  ++R  + L+ KY+GP
Sbjct: 1355 AQDSLRKAQQRMKKYADQHRRSVEFSVGDKVLLKLTPQIWKQIVSKTR-HRGLIPKYDGP 1414

Query: 859  VEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKETKE 905
             EV+K++G  +YR+ LP  +KIHP  HVS LKPY  D DD  RN + R    +  +   E
Sbjct: 1415 FEVVKRVGEVAYRLKLPERLKIHPTFHVSFLKPYFADEDDPDRNRSKRAPPSVPTQYDAE 1474

BLAST of CmoCh20G009950 vs. TrEMBL
Match: A5AM46_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_003931 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 8.9e-241
Identity = 432/837 (51.61%), Positives = 550/837 (65.71%), Query Frame = 1

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            +++  + II DLFD+L  A+YFTKLDLRSGYYQVRIAEGDEPKTTCVTRYG++EFLVMPF
Sbjct: 216  KSRVAISIILDLFDQLGRARYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGSYEFLVMPF 275

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMN++F+ YLD+ V+VYLDDIV+YS TL+EH+ HL+ VF  LRQN+LYV
Sbjct: 276  GLTNAPATFCTLMNKIFHPYLDKLVVVYLDDIVIYSNTLKEHEEHLRKVFKILRQNELYV 335

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            KKEKC+FA+  ++FLGH +R G++ MD  K+KAIQEW  PT V +LRSFLGL NYYRR  
Sbjct: 336  KKEKCSFAKEEVSFLGHRIRDGKLMMDDSKVKAIQEWDPPTKVPQLRSFLGLVNYYRR-- 395

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
                                     CQ AFE+LK  +T  PVL L D TK FEV TDASD
Sbjct: 396  -------------------------CQQAFEDLKKAVTEEPVLALPDHTKVFEVHTDASD 455

Query: 361  FAL---------------VKTDNSATCHFFDQPKLTA----------------------- 420
            FA+                K +++   +   + ++T                        
Sbjct: 456  FAIGGVLMQERHPIAFESRKLNDAERRYTVQEKEMTVIVHCLRTWRHYLLGSHFIVKTDN 515

Query: 421  --------------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
                          +QARWQ+ LAEFD   E+K G  N  ADALSRK E A++       
Sbjct: 516  VATSYFQTQKKLSPQQARWQDFLAEFDSTLEYKPGSPNHVADALSRKAELASM------- 575

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
            +S+  G I D+++E L  DP AK+++ LA  GKT++FWVE  LL TKG RLYVP+ G +R
Sbjct: 576  TSQPQGDIMDLLREGLQHDPVAKSLIALAHEGKTKRFWVEDGLLYTKGRRLYVPKWGNIR 635

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            + LI++CHDT WAGHPG +RT AL++  Y+WP +RD++  Y +TCL+CQQDKVE+ +  G
Sbjct: 636  RNLIKECHDTKWAGHPGQRRTRALLESAYYWPQIRDEVEAYVRTCLVCQQDKVEQRQPKG 695

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LLEPLP+  RPW++V++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+ E TA+L
Sbjct: 696  LLEPLPIAERPWDNVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTTEETARL 755

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            F KHVVK  G+P  IISDRD RF G FWT+LF  +G+ L+ S+S+HPQTD QIER N LL
Sbjct: 756  FLKHVVKYWGLPKFIISDRDPRFTGKFWTELFKLMGSELHFSTSFHPQTDGQIERVNALL 815

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            E YLRHFV A QK+W +LLD+AQF +N Q S +T KSPFE+ +G+QP  PH +   Y G+
Sbjct: 816  ELYLRHFVSANQKDWAKLLDIAQFSYNLQRSEATNKSPFELATGQQPLTPHTLTIGYTGR 875

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            +P    F + W +  DIAR+YL+KA+K MKKWADKKRR  +++ GD VL+KL P+Q +  
Sbjct: 876  SPTDFKFAKRWYEQADIARSYLDKAAKKMKKWADKKRRHTEYKVGDMVLVKLLPQQFKSL 935

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
                + LVR+YEGP   L K+G  SYRV LP  +KIHPV H S LKPYH D DD  R  +
Sbjct: 936  RPVHKGLVRRYEGPFLELGKVGKVSYRVELPPRLKIHPVFHASYLKPYHGDNDDPSRGLS 995

Query: 901  TRLNIDLQQKETKEVEEILVDRV-RKIGRPVRTIREFLIKWKNLPTEETSWERAEDL 905
             R    +     KEVE +LVDRV R+ G P  T  E+L+KWK L   E SWE  E L
Sbjct: 996  KRAPTAVVTSYDKEVEHVLVDRVIRRRGVPPAT--EYLVKWKGLLESEASWEPTEAL 1016

BLAST of CmoCh20G009950 vs. TrEMBL
Match: A5AZ16_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016761 PE=4 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 2.6e-208
Identity = 372/673 (55.27%), Positives = 466/673 (69.24%), Query Frame = 1

Query: 13   ETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAEL 72
            E + +EI+ VLD + D+MP  L + LPPRR  DH+IEL PG K PA   YRMA PEL EL
Sbjct: 551  EPMPKEIEGVLDEFKDVMPPELPKRLPPRREEDHKIELEPGSKPPAMGPYRMAPPELEEL 610

Query: 73   RKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDL 132
            R+QL ELL AGFI+P+KAPYGAPVLFQKK +G+LR+CIDYRALNKVTV+NKYP+P+I+DL
Sbjct: 611  RRQLKELLDAGFIQPSKAPYGAPVLFQKKHDGSLRMCIDYRALNKVTVKNKYPIPLIADL 670

Query: 133  FDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTL 192
            FD+L  AKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYG++EFLVMPF LTNAPATFCTL
Sbjct: 671  FDQLGRAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGSYEFLVMPFGLTNAPATFCTL 730

Query: 193  MNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI 252
            MN++F+ YLD+FV+VYLDDIV+YS TL+EH+ HL+ VF  LRQN+LYVKKEKC+FA+  +
Sbjct: 731  MNKIFHPYLDKFVVVYLDDIVIYSNTLKEHEEHLRKVFKILRQNELYVKKEKCSFAKEEV 790

Query: 253  NFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTE 312
            +FLGH +R G++ MD  K+KAIQEW  PT                   +G+S RAAPLT+
Sbjct: 791  SFLGHRIRDGKLMMDDSKVKAIQEWDPPT-------------------KGYSARAAPLTD 850

Query: 313  LLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL--------- 372
            LLKK+  W W   CQ AFENLK  +T  PVL L D TK FEV TDASDFA+         
Sbjct: 851  LLKKNKAWEWDERCQQAFENLKKAVTEEPVLALPDHTKVFEVHTDASDFAIGGVLMQDRH 910

Query: 373  ------VKTDNSATCHFFDQPKLTA----------------------------------- 432
                   K +++   +   + ++TA                                   
Sbjct: 911  PIAFESRKLNDTERRYTVQEKEMTAIIHCLRTWRHYLLGSHFIVKTDNIATSYFQTQKKL 970

Query: 433  --KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDII 492
              KQARWQ+ LAEFD+  E+K G +N  A+ALSRK E A++       +S+  G I D++
Sbjct: 971  SPKQARWQDFLAEFDYTLEYKPGSANHVANALSRKVELASM-------TSQPQGDIIDLL 1030

Query: 493  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLW 552
            +E L  DP  K+++ LA  GKT+ FWVE  LL TKG RLYVP+ G +R+ LI++CHDT W
Sbjct: 1031 REGLQHDPVVKSLIALAHEGKTKWFWVEDGLLYTKGRRLYVPKWGNIRRNLIKECHDTKW 1090

Query: 553  AGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPW 612
            AGHPG +RT AL++  Y+WP +RD++  Y +TCL+CQQDKVE+ +  GLLEPLPV   PW
Sbjct: 1091 AGHPGQRRTRALLESAYYWPQIRDEVEAYVRTCLVCQQDKVEQQQPRGLLEPLPVAEHPW 1150

Query: 613  ESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIP 634
            +SV++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+AE TA+LF KHVVK  G+P
Sbjct: 1151 DSVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTAEETARLFLKHVVKYWGLP 1197

BLAST of CmoCh20G009950 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 108.2 bits (269), Expect = 2.7e-23
Identity = 60/158 (37.97%), Positives = 87/158 (55.06%), Query Frame = 1

Query: 225 HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGH--VVRCGQISMDSDKIKAIQEWKVPTS 284
           HL +V     Q+Q Y  ++KCAF Q  I +LGH  ++    +S D  K++A+  W  P +
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 285 VSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPV 344
            +ELR FLGL  YYRRFV+ + +   PLTELLKK +   W+    +AF+ LK  +T  PV
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK-NSLKWTEMAALAFKALKGAVTTLPV 122

Query: 345 LGLVDVTKPFEVETDASDFALVKTDNSATCHFFDQPKL 381
           L L D+  PF       +++   T   A C    QP++
Sbjct: 123 LALPDLKLPFVTRVGKWNWSCFITREQACC--VSQPRV 157

BLAST of CmoCh20G009950 vs. NCBI nr
Match: gi|147826806|emb|CAN63950.1| (hypothetical protein VITISV_032357 [Vitis vinifera])

HSP 1 Score: 1052.0 bits (2719), Expect = 6.2e-304
Identity = 526/945 (55.66%), Positives = 661/945 (69.95%), Query Frame = 1

Query: 13   ETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAEL 72
            E + +EIK VLD + D+M   L + L PRR  +H+I+L  G K  A   YRMA PEL EL
Sbjct: 589  EPMPKEIKGVLDEFKDVMXPELPKRLXPRREEBHKIKLEXGAKPRAMGPYRMAPPELEEL 648

Query: 73   RKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDL 132
            R+QL ELL AGFI+P+KAPYGAPVLFQKK +G+LR+CIDYRALNKVTV+NKYP+P+I+DL
Sbjct: 649  RRQLKELLDAGFIQPSKAPYGAPVLFQKKHDGSLRMCIDYRALNKVTVKNKYPIPLIADL 708

Query: 133  FDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTL 192
            FD+L  A+YFTKLDLRSGYYQVRIAEGDEPKTTCVTRYG++EFLVMPF LTNAPATFCTL
Sbjct: 709  FDQLGRARYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGSYEFLVMPFGLTNAPATFCTL 768

Query: 193  MNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI 252
            MN++F+ YLD+FV+ YLDDIV+YS TL+EH+ HL+ VF  LRQN+LYVKKEKC+FA+  +
Sbjct: 769  MNKIFHPYLDKFVVXYLDDIVIYSNTLKEHEEHLRKVFKILRQNKLYVKKEKCSFAKEEV 828

Query: 253  NFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTE 312
            NFLGH +R G++ MD  K+KAIQEW  PT V +LRSFLGL NYYRRF++G+S RAAPLT+
Sbjct: 829  NFLGHRIRDGKLMMDDSKVKAIQEWDPPTKVPQLRSFLGLVNYYRRFIKGYSGRAAPLTD 888

Query: 313  LLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL--------- 372
            LLKK+  W W   CQ AFE+LK  +T  PVL L D TK FEV TDASDFA+         
Sbjct: 889  LLKKNKAWEWDGRCQQAFEDLKKAVTEEPVLALPDHTKVFEVHTDASDFAIGGVLMQERH 948

Query: 373  ------VKTDNS-------------------------------------ATCHFFDQPKL 432
                   K +N+                                     AT +F  Q KL
Sbjct: 949  PIAFESRKLNNAERRYTVQEKEMTAIVHCLRTWRHYLLGSHFIVKTDNVATSYFQTQKKL 1008

Query: 433  TAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDII 492
            + KQARWQ+ LAEFD+  E+K G +N  ADALSRK E A++       SS+  G I  ++
Sbjct: 1009 SPKQARWQDFLAEFDYTLEYKPGSANHVADALSRKAELASI-------SSQPQGDIMYLL 1068

Query: 493  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLW 552
            +E L  DP AK+++ LA  GKT++FWVE  LL TKG RLYVP+ G +R+ LI++CHDT W
Sbjct: 1069 REGLQHDPVAKSLIALAHEGKTKRFWVEDGLLYTKGRRLYVPKWGNIRRNLIKECHDTKW 1128

Query: 553  AGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPW 612
            AGHPG +RT AL++  Y+WP +RD++  Y +TCL+CQQDKVE+ +  GLLEPLPV  RPW
Sbjct: 1129 AGHPGQRRTRALLESAYYWPQIRDEVEAYVRTCLVCQQDKVEQRQPRGLLEPLPVAERPW 1188

Query: 613  ESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIP 672
            +SV++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+AE TA+LF KHVVK  G+P
Sbjct: 1189 DSVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTAEETARLFLKHVVKYWGLP 1248

Query: 673  SSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQ 732
              IISDRD RF G FWT+LF  +G+ L+ S+S+HPQTD Q ER N LLE YLRHFV A Q
Sbjct: 1249 KFIISDRDPRFTGKFWTELFKLMGSELHFSTSFHPQTDGQTERXNALLELYLRHFVSANQ 1308

Query: 733  KNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWK 792
            ++W +LLD+AQF +N Q S +T KSPFE+ +G+QP  PH +   Y G++P A  F + W 
Sbjct: 1309 RDWAKLLDIAQFSYNLQRSEATNKSPFELATGQQPLTPHTLXIGYTGRSPAAFKFAKGWH 1368

Query: 793  QTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYE 852
            +  DIA +YL+KA+K MKKWADKKRR  +++ GD VL+KL P+Q +      + LVR+YE
Sbjct: 1369 EQADIAXSYLDKAAKKMKKWADKKRRHTEYKVGDMVLVKLLPQQFKSLRPVHKGLVRRYE 1428

Query: 853  GPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKET 905
            GP  +L K+G  SY+V LP  +KIHPV HVS L PYH D DD  R  + R    +     
Sbjct: 1429 GPFPILGKVGKVSYKVELPPRLKIHPVFHVSYLNPYHEDKDDPSRGLSKRAPTAVVTSYD 1488

BLAST of CmoCh20G009950 vs. NCBI nr
Match: gi|659121350|ref|XP_008460615.1| (PREDICTED: uncharacterized protein LOC103499392 [Cucumis melo])

HSP 1 Score: 1004.2 bits (2595), Expect = 1.5e-289
Identity = 507/953 (53.20%), Positives = 653/953 (68.52%), Query Frame = 1

Query: 6    IEEATTEET-VLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRM 65
            +EE  T+E  V + I++VL+ Y DIMP  L + LPPRR +DHEIEL  G K PA   YRM
Sbjct: 468  VEEVKTDEPPVPDNIQKVLNEYKDIMPSELPKKLPPRREVDHEIELESGAKPPAMAPYRM 527

Query: 66   ASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKY 125
            A PEL ELR+QL ELL AG+I+P+KAPYGAPVLFQKKK+G+LRLCIDYRALNK+T++N+Y
Sbjct: 528  APPELEELRRQLKELLDAGYIQPSKAPYGAPVLFQKKKDGSLRLCIDYRALNKITIKNRY 587

Query: 126  PLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTN 185
            P+P+I+DLFD+L  A++F+K+DLRSGYYQVRI +GDE KT CVTRYGA+EFLVMPF LTN
Sbjct: 588  PIPLIADLFDQLGKARWFSKIDLRSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTN 647

Query: 186  APATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEK 245
            APATFCTLMN++F  +LD+FVIVYLDDIVVYS TLEEH  HL+ VF  LR N+LY+K EK
Sbjct: 648  APATFCTLMNKLFQPFLDRFVIVYLDDIVVYSQTLEEHVQHLRQVFQVLRDNELYIKLEK 707

Query: 246  CAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFS 305
            C+FA+  + FLGH ++ G++ MD+ K++AI EWK PT V ELRSFLG  NYYRRF++G+S
Sbjct: 708  CSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKTPTKVPELRSFLGFVNYYRRFIKGYS 767

Query: 306  RRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFA-- 365
              AAPLT LLKK+  W W+ +CQ AF+ LK  ++  PV+ L D TKPFEV TDASDFA  
Sbjct: 768  DVAAPLTNLLKKNQTWGWTEECQRAFDRLKHAVSEEPVMVLADHTKPFEVHTDASDFAIG 827

Query: 366  ----------------LVKTDNSATC----------------HFFDQPKLTAKQAR---- 425
                            L  T+   T                 H+    K T         
Sbjct: 828  GVLMQDGHPIAFESRKLNDTERRYTVQEKEMTAIVHCLRTWRHYLLGSKFTVMTDNVATS 887

Query: 426  --------------WQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKI 485
                          WQ+ LAEFDFK E+K G++N  ADALSRK E      L  I  S  
Sbjct: 888  YFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRANVVADALSRKAE------LNIITRSMP 947

Query: 486  DGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI 545
              +  + IKE +  D  AK +++LAK GKTR+FW     L+T GNRL+VPR G LRK ++
Sbjct: 948  TSNFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENDGTLLTTGNRLFVPRWGALRKDVL 1007

Query: 546  QKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEP 605
            ++CHD+LWAGHPG  RT AL+   Y+WP M+DDI  Y KTCL+CQQDK E+    GLLEP
Sbjct: 1008 RECHDSLWAGHPGMNRTLALVYDKYYWPRMQDDIESYVKTCLVCQQDKGEQQLPAGLLEP 1067

Query: 606  LPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKH 665
            LP+  +PW+S+++DFI   PK   +  I+V+VDRFSKYATFIP       +  A+LFFK+
Sbjct: 1068 LPIAEKPWDSLTMDFIVALPKSHGFGTIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKN 1127

Query: 666  VVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYL 725
            VVKL GIP SIISDRD RF G FW +LF  +GT LN S+S+HPQ+D Q ER N LLE+YL
Sbjct: 1128 VVKLWGIPKSIISDRDPRFTGKFWRELFKLMGTDLNFSTSFHPQSDGQTERINALLEQYL 1187

Query: 726  RHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQA 785
            RH+V A QK+W+ LLDVAQF +N Q S +TGKSPFE++  +QP  P  +  PY G NP A
Sbjct: 1188 RHYVSAHQKDWVALLDVAQFSYNLQRSEATGKSPFELIMNQQPNTPGALIAPYEGPNPSA 1247

Query: 786  HNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKD 845
             NF ++W +  DI+RA LEKA++ MKKWADKKRRP ++  GD+VL+KL P Q +   +  
Sbjct: 1248 FNFAKQWHEEQDISRACLEKAARRMKKWADKKRRPKEYEIGDKVLVKLLPNQFKSLRKVH 1307

Query: 846  QRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLN 905
            + LVR+YEGP  +++++G  +Y+V LP  +KIH V HVS LKP+H D +D  R+ T+R  
Sbjct: 1308 KGLVRRYEGPFSIIERVGKAAYKVELPPRLKIHNVFHVSMLKPFHEDQEDPNRSKTSRAP 1367

BLAST of CmoCh20G009950 vs. NCBI nr
Match: gi|147772919|emb|CAN64786.1| (hypothetical protein VITISV_014071 [Vitis vinifera])

HSP 1 Score: 959.9 bits (2480), Expect = 3.2e-276
Identity = 488/945 (51.64%), Positives = 630/945 (66.67%), Query Frame = 1

Query: 13   ETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAEL 72
            E + +EI+ VLD + D+MP  L + LPP+R  DH+IEL PG K PA   YRMA PEL EL
Sbjct: 293  EPMPKEIEGVLDEFKDVMPPELPKRLPPKREEDHKIELEPGAKPPAMGPYRMAPPELEEL 352

Query: 73   RKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDL 132
            R+QL ELL AGFI+P+KAPYGAPVLFQKK +G+LR+CIDYRALNKVTV+NKYP+P+I+DL
Sbjct: 353  RRQLKELLDAGFIQPSKAPYGAPVLFQKKHDGSLRMCIDYRALNKVTVKNKYPIPLIADL 412

Query: 133  FDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTL 192
            FD+L  A+YFTKLDLR                     YG++EFLVMPF LTNAP  FCTL
Sbjct: 413  FDQLGRARYFTKLDLR---------------------YGSYEFLVMPFGLTNAPTMFCTL 472

Query: 193  MNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI 252
            MN++F+ YLD+FV+VYLDDIV+YS TL+EH+ HL+ VF  LRQN+LYVKKEKC+FA+  +
Sbjct: 473  MNKIFHPYLDKFVVVYLDDIVIYSNTLKEHEEHLRKVFKILRQNKLYVKKEKCSFAKEEV 532

Query: 253  NFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTE 312
            +FLGH +R G++ MD  K+KAIQEW  PT V +LRSFL L NYYRRF++G+S RAAPLT+
Sbjct: 533  SFLGHRIRDGKLMMDDSKVKAIQEWDPPTKVPQLRSFLSLVNYYRRFIKGYSGRAAPLTD 592

Query: 313  LLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL--------- 372
            LLKK+  W W   CQ AFENLK  +T  PVL L D TK FEV TDASDFA+         
Sbjct: 593  LLKKNKAWEWDERCQHAFENLKKAVTEEPVLALPDHTKVFEVHTDASDFAIGGVLMQERH 652

Query: 373  ------VKTDNSATCHFFDQPKLTA----------------------------------- 432
                   K +++   +   + ++TA                                   
Sbjct: 653  LIAFESRKLNDAERRYTVQEKEMTAIVHCLHTWRHYLLGSHFIVKTDNVATSYFQTQKKL 712

Query: 433  --KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDII 492
              KQARWQ+ LAEFD+  E+K G +N  A ALS K E  ++       +S+  G I D++
Sbjct: 713  SPKQARWQDFLAEFDYTLEYKPGSANHVAGALSHKAELTSM-------TSQPQGDIMDLL 772

Query: 493  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLW 552
            +E L  DP AK+++ LA  GKT++FWVE DLL TKG RLYVP+ G +R+ LI++CHDT W
Sbjct: 773  REGLQHDPMAKSLIALAHEGKTKRFWVEDDLLYTKGRRLYVPKWGNIRRNLIKECHDTKW 832

Query: 553  AGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPW 612
            AGHPG +RT AL++  Y+WP +RD++  Y           VE+ +  GLLEPLP+  RPW
Sbjct: 833  AGHPGQRRTRALLESAYYWPQIRDEVEAY-----------VEQRQPRGLLEPLPIAERPW 892

Query: 613  ESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIP 672
            ++V++DFI   PK  +  +I+V+VDRFSKYATFI  P  C+AE T +LF KHVVK  G+P
Sbjct: 893  DNVTMDFIIGLPKSEDSGSIIVVVDRFSKYATFIAAPTDCTAEETTRLFLKHVVKYWGLP 952

Query: 673  SSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQ 732
              IISDRD RF G FWT+LF  +G+ L+ S+S+HPQT+ Q ER N LLE YLRHFV A Q
Sbjct: 953  KYIISDRDPRFTGKFWTELFKLMGSELHFSTSFHPQTNGQTERVNALLELYLRHFVSANQ 1012

Query: 733  KNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWK 792
            ++W +LLD+AQF +N Q S +T KSPF++ +G+QP  PH++   Y G++P A  F + W 
Sbjct: 1013 RDWAKLLDIAQFSYNLQMSEATNKSPFKLATGQQPLTPHMLTIGYTGRSPAAFKFAKGWH 1072

Query: 793  QTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYE 852
            +  DIAR+YL+KA+K MKKWADKKR   +++ GD VL+KL P+Q +      + LVR+YE
Sbjct: 1073 EQADIARSYLDKATKKMKKWADKKRHHTEYKVGDMVLVKLLPQQFKSLRPVHKSLVRRYE 1132

Query: 853  GPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKET 905
            GP  +L K+G  SY+V LP  +KIHPV HVS LKPYH D DD  R  + R    +     
Sbjct: 1133 GPFPILGKVGKVSYKVELPPRLKIHPVFHVSYLKPYHEDKDDPSRGLSKRAPTAVVTSYD 1192

BLAST of CmoCh20G009950 vs. NCBI nr
Match: gi|4235644|gb|AAD13304.1| (polyprotein [Solanum lycopersicum])

HSP 1 Score: 935.3 bits (2416), Expect = 8.5e-269
Identity = 480/942 (50.96%), Positives = 617/942 (65.50%), Query Frame = 1

Query: 19   IKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAELRKQLDE 78
            + E+L  Y D+MP  L + LPPRR IDH+IELLPG   PA+  YRMA  EL ELRKQL+E
Sbjct: 575  VAELLKQYADVMPPELPKKLPPRRDIDHKIELLPGTVAPAQAPYRMAPKELVELRKQLNE 634

Query: 79   LLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDKLHG 138
            LL AG I+P+KAPYGAPVLFQKK++GT+R+C+DYRALNK T++NKY +P++ DL D+L  
Sbjct: 635  LLDAGLIQPSKAPYGAPVLFQKKQDGTMRMCVDYRALNKATIKNKYSVPLVQDLMDRLSK 694

Query: 139  AKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQVFY 198
            A +FTKLDLR+GY+QVRIAEGDEPKTTCVTRYG++EFLVMPF LTNAPATFC LMN V +
Sbjct: 695  ACWFTKLDLRAGYWQVRIAEGDEPKTTCVTRYGSYEFLVMPFGLTNAPATFCNLMNNVLF 754

Query: 199  EYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHV 258
            +YLD FV+VYLDDIV+YS TLEEH  HL LV  +LR+  LYVK EKC FAQ  I FLGH+
Sbjct: 755  DYLDDFVVVYLDDIVIYSRTLEEHVNHLSLVLSQLRKYTLYVKMEKCEFAQQEIKFLGHL 814

Query: 259  VRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDH 318
            V   Q+ MD  K++AI +W+ P  V +LRSFLGLANYYR+F+ G+S++AA LT+LLKKD 
Sbjct: 815  VSKNQVRMDPKKVQAIVDWQAPRHVKDLRSFLGLANYYRKFIAGYSKKAASLTDLLKKDA 874

Query: 319  PWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD------------------ 378
             W WS  C+ AF+NLK  +   P+L L D   PFEV TDASD                  
Sbjct: 875  KWVWSEQCEKAFQNLKNAIASEPILKLPDFELPFEVHTDASDKAIGGVLVQEGHPVAFES 934

Query: 379  ---------FALVKTDNSATCHFFD-------------------------QPKLTAKQAR 438
                     ++  + +  A  H                            Q KL+ KQAR
Sbjct: 935  RKLNDAEQRYSTHEKEMVAVVHCLQVWRVYLLGTRFVVRTDNVANTFFKTQKKLSPKQAR 994

Query: 439  WQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDIIKEHLHK 498
            WQE LAE+DF +EHK GK NQ ADALSRK    A+  +     SK++    D I+     
Sbjct: 995  WQEFLAEYDFMWEHKPGKHNQVADALSRKEVFVAVYSI-----SKLETDFYDRIRLCAAN 1054

Query: 499  DPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLWAGHPGW 558
            D      +   + G  R++W+E DLL  KG R+ VP  G LRK L+++ +D+ WAGHPG 
Sbjct: 1055 DSLYVKWMGQVQDGTMRRYWIEDDLLYFKGGRIVVPNQGGLRKDLMKEAYDSAWAGHPGV 1114

Query: 559  QRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPWESVSLD 618
            +R  AL+ + YFWP M DDI  Y KTC +CQ DK E+ K  GLL+PLP+P RPW SVS+D
Sbjct: 1115 ERMLALLSRVYFWPKMEDDIEAYVKTCHVCQVDKTERKKEAGLLQPLPIPERPWLSVSMD 1174

Query: 619  FITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIPSSIISD 678
            FI+  PKV    +I+V+VDRFSKY+ FI  P+LCS+E+ A+LF+KHV+K  G+P+ I+SD
Sbjct: 1175 FISGFPKVDGKASIMVVVDRFSKYSVFIAAPELCSSEVAAELFYKHVIKYFGVPADIVSD 1234

Query: 679  RDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQKNWIQL 738
            RD RF G FWT LF  +GT L  S++ HPQTD Q ER N LLEEYLRH+V A Q+NW++L
Sbjct: 1235 RDTRFTGRFWTALFNMMGTELKFSTANHPQTDGQTERINHLLEEYLRHYVTASQRNWVEL 1294

Query: 739  LDVAQFCFNCQTSSSTGKSPFEIVSGRQPALP-HIIDYPYAGKNPQAHNFTREWKQTTDI 798
            LD AQFC+N   SS+T  SPFEIV G+QP  P  +      GK P A+   R+  +    
Sbjct: 1295 LDTAQFCYNLHKSSATEMSPFEIVLGKQPMTPLDVAKSKNQGKCPAAYRVARDRLEMLSE 1354

Query: 799  ARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPE---QIRFRSRKDQRLVRKYEGP 858
            A+  L KA + MKK+AD+ RR ++F  GD+VL+KL P+   QI  ++R  + L+ KY+GP
Sbjct: 1355 AQDSLRKAQQRMKKYADQHRRSVEFSVGDKVLLKLTPQIWKQIVSKTR-HRGLIPKYDGP 1414

Query: 859  VEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKETKE 905
             EV+K++G  +YR+ LP  +KIHP  HVS LKPY  D DD  RN + R    +  +   E
Sbjct: 1415 FEVVKRVGEVAYRLKLPERLKIHPTFHVSFLKPYFADEDDPDRNRSKRAPPSVPTQYDAE 1474

BLAST of CmoCh20G009950 vs. NCBI nr
Match: gi|697156465|ref|XP_009586987.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC104084743 [Nicotiana tomentosiformis])

HSP 1 Score: 926.4 bits (2393), Expect = 4.0e-266
Identity = 479/939 (51.01%), Positives = 627/939 (66.77%), Query Frame = 1

Query: 5    LIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRM 64
            ++E +     +   I++VLD   D+MPE L + LPP+R +DH+IEL+PG K PA + YRM
Sbjct: 1246 IVEHSLEAVALPPRIEQVLDENKDVMPEELPKHLPPQREVDHQIELVPGAKPPAMSPYRM 1305

Query: 65   ASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKY 124
            A PEL ELRKQL ELL+A  IRP+KAPYGAPVLFQKKK GT+RLCIDYRALNKVTV+NKY
Sbjct: 1306 APPELEELRKQLKELLEAVHIRPSKAPYGAPVLFQKKKEGTMRLCIDYRALNKVTVKNKY 1365

Query: 125  PLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTN 184
             +P+I+DLFD+L  AK FTK+DLR GYYQV IA+GDEPKTTCVTRYGAFE+LVMPF LTN
Sbjct: 1366 HIPLIADLFDRLGQAKVFTKMDLRKGYYQVWIADGDEPKTTCVTRYGAFEWLVMPFGLTN 1425

Query: 185  APATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEK 244
            APATFCTLMN++F+ +LDQFV++YLDDIVVYS  +E+H  HL+ VF  LR+N L VK+EK
Sbjct: 1426 APATFCTLMNKLFHPFLDQFVVIYLDDIVVYSNNMEDHAEHLRKVFKVLRENDLCVKREK 1485

Query: 245  CAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFS 304
            C+FAQ  + FLGH +  G+I MD DK++AIQ+W+ PT V ELRSFLGLANYYRRF+ G+S
Sbjct: 1486 CSFAQPIVQFLGHTISYGEIRMDRDKVEAIQDWEAPTKVPELRSFLGLANYYRRFIFGYS 1545

Query: 305  RRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFA-- 364
              A+PLT+LLKKD  W WS+ CQ AFE LK  +T+ PVL L D +K FEV+TDASDFA  
Sbjct: 1546 TIASPLTDLLKKDREWEWSDICQKAFEKLKAAITKEPVLALPDFSKVFEVQTDASDFAIG 1605

Query: 365  --LVKTDNSATCHFFDQPKLTAKQAR-----------------WQESL--AEFDFKFEH- 424
              L++  +      F+  KL   + R                 W+  L  A F  K ++ 
Sbjct: 1606 GVLMQEGHPIA---FESRKLNDAERRYTVQEKEMTVVVHCLRTWRHYLLGAHFVVKTDNV 1665

Query: 425  --------------KAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDIIKEHLHK 484
                          +A K+N  ADALSRK       +LA++ SS   G I+ I      K
Sbjct: 1666 ATSYFQTQKKLSAKQARKANVVADALSRK------TILANMVSSASSGIIKTI------K 1725

Query: 485  DPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLWAGHPGW 544
            +  AK    L+  GK      E  LL T G R+YVP+   LR+ L+++ HD+ WAGHPG 
Sbjct: 1726 EVKAK----LSAFGKE-----EDGLLYTTGKRVYVPKWANLRRTLLKEGHDSAWAGHPGQ 1785

Query: 545  QRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPWESVSLD 604
            +RT ALI+  Y+WP MRDDI  Y +TCL+CQQDKVE     GLLEPLPV  +PW+S+++D
Sbjct: 1786 KRTMALIESSYYWPRMRDDIEVYVRTCLVCQQDKVESKLPGGLLEPLPVAAKPWDSITMD 1845

Query: 605  FITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIPSSIISD 664
            FIT  P    +  I+V+VDRF+KYATF  T   C A+  A++F + +VK  G+P  I+SD
Sbjct: 1846 FITCLPNSEGFGTIMVVVDRFTKYATFSTTTAGCKAKEAARIFLRDIVKYWGVPKHIVSD 1905

Query: 665  RDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQKNWIQL 724
            RD RF G FW  LF+ LGT L+ S+S+HPQTD Q ER N LLE YLRH+V A Q++W +L
Sbjct: 1906 RDPRFTGAFWKKLFSLLGTQLHFSTSFHPQTDGQTERINALLECYLRHYVSANQRDWAKL 1965

Query: 725  LDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWKQTTDIA 784
            LD+AQF +N Q S +TGKSPFE+ +G+QP  P  +      KNP A +  + W++  D+A
Sbjct: 1966 LDIAQFSYNLQCSEATGKSPFELATGQQPNTPQSLPANVGLKNPGAFHMAKFWEEQADLA 2025

Query: 785  RAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYEGPVEVL 844
            R+YL+KA++ MKK+AD+KRRP+ +R GD+V++KL P Q +        L+R+YEGP E++
Sbjct: 2026 RSYLDKAARKMKKFADRKRRPVNYRIGDRVMVKLNPRQFKSLRGVHHSLIRRYEGPFEIV 2085

Query: 845  KKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKETKEVEEI 904
             K+G  SYR+ +P  +KI+PV H S LKPY  D +D  R  ++R         T +  E 
Sbjct: 2086 AKVGIISYRLDMPRHLKIYPVFHASQLKPYFEDKEDKDRVQSSRSQFFATPPATDKQLEA 2145
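
Each HSP header above reports its identity and positive counts as a fraction followed by a percentage (for example `Identity = 372/673 (55.27%)`). The following is a minimal Python sketch, assuming that line layout holds for every hit, that recomputes the percentages from the raw counts so they can be checked against the reported values. The function name `parse_hsp_stats` and the dictionary keys are illustrative choices, not part of the database or of any BLAST tool.

```python
import re

# Accepts both the "Postives" spelling seen in some BLAST reports
# and the standard "Positives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Percent = matching columns / alignment length * 100.
        "computed_identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "computed_positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 372/673 (55.27%), Postives = 466/673 (69.24%), Query Frame = 1"
    )
    print(stats)
```

Comparing the `computed_*` and `reported_*` values for each hit is a quick consistency check when copying these numbers into another table.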

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
TF29_SCHPO  3.0e-64   37.36     Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF26_SCHPO  3.0e-64   37.36     Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF25_SCHPO  3.0e-64   37.36     Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF24_SCHPO  3.0e-64   37.36     Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF23_SCHPO  3.0e-64   37.36     Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name    E-value   Identity  Description
A5BX03_VITVI  4.3e-304  55.66     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032357 PE=4 SV=1
A5AQF1_VITVI  2.2e-276  51.64     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_014071 PE=4 SV=1
Q9ZS84_SOLLC  5.9e-269  50.96     Polyprotein OS=Solanum lycopersicum PE=4 SV=1
A5AM46_VITVI  8.9e-241  51.61     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_003931 PE=4 SV=1
A5AZ16_VITVI  2.6e-208  55.27     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016761 PE=4 SV=1

Match Name   E-value   Identity  Description
ATMG00860.1  2.7e-23   37.97     ATMG00860.1 DNA/RNA polymerases superfamily protein

Match Name                        E-value   Identity  Description
gi|147826806|emb|CAN63950.1|      6.2e-304  55.66     hypothetical protein VITISV_032357 [Vitis vinifera]
gi|659121350|ref|XP_008460615.1|  1.5e-289  53.20     PREDICTED: uncharacterized protein LOC103499392 [Cucumis melo]
gi|147772919|emb|CAN64786.1|      3.2e-276  51.64     hypothetical protein VITISV_014071 [Vitis vinifera]
gi|4235644|gb|AAD13304.1|         8.5e-269  50.96     polyprotein [Solanum lycopersicum]
gi|697156465|ref|XP_009586987.1|  4.0e-266  51.01     PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC104084743 [Nicotiana ...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000477  RT_dom
IPR000953  Chromo/chromo_shadow_dom
IPR001584  Integrase_cat-core
IPR012337  RNaseH-like_sf
IPR016197  Chromo-like_dom_sf
IPR023780  Chromo_domain
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0044238 primary metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh20G009950.1  CmoCh20G009950.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 100..258, score: 3.3
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 80..259, score: 12
IPR000953  Chromo/chromo shadow domain   PROFILE  PS50013            CHROMO_2             coord: 861..933, score: 11
IPR001584  Integrase, catalytic core     PFAM     PF00665            rve                  coord: 557..670, score: 1.6
IPR001584  Integrase, catalytic core     PROFILE  PS50994            INTEGRASE            coord: 555..714, score: 21
IPR012337  Ribonuclease H-like domain    GENE3D   G3DSA:3.30.420.10                       coord: 558..719, score: 2.2
IPR012337  Ribonuclease H-like domain    unknown  SSF53098           Ribonuclease H-like  coord: 552..708, score: 2.28
IPR016197  Chromo domain-like            unknown  SSF54160           Chromo domain-like   coord: 822..906, score: 3.53
IPR023780  Chromo domain                 PFAM     PF00385            Chromo               coord: 862..910, score: 3.7
None       No IPR available              GENE3D   G3DSA:2.40.50.40                        coord: 857..905, score: 3.
None       No IPR available              GENE3D   G3DSA:3.10.10.10                        coord: 48..180, score: 4.7
None       No IPR available              GENE3D   G3DSA:3.30.70.270                       coord: 181..258, score: 1.
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 92..905, score: 1.9E
None       No IPR available              PANTHER  PTHR24559:SF201    SUBFAMILY NOT NAMED  coord: 92..905, score: 1.9E
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 22..400, score: 5.34E
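
The alignment entries above give each domain hit as a residue range such as `coord: 100..258`. The sketch below, in Python, shows one way to turn those ranges into numbers and compare hits, for instance to see that the PFAM and PROFILE reverse transcriptase hits cover nearly the same region. It assumes the coordinates are 1-based and inclusive at both ends, which the page does not state; the helper names `parse_coord`, `span_length`, and `overlap` are hypothetical.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract the (start, end) pair from text containing 'coord: A..B'."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError(f"no coord range in: {text!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    pfam_rt = parse_coord("coord: 100..258")
    profile_rt = parse_coord("coord: 80..259")
    print(span_length(*pfam_rt), overlap(pfam_rt, profile_rt))
```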

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmoCh20G009950  Cucurbita maxima (Rimu)       cmacmoB551
CmoCh20G009950  Cucurbita maxima (Rimu)       cmacmoB606
CmoCh20G009950  Cucurbita maxima (Rimu)       cmacmoB649
CmoCh20G009950  Wild cucumber (PI 183967)     cmocpiB535
CmoCh20G009950  Wild cucumber (PI 183967)     cmocpiB544
CmoCh20G009950  Wild cucumber (PI 183967)     cmocpiB554
CmoCh20G009950  Cucumber (Chinese Long) v2    cmocuB529
CmoCh20G009950  Cucumber (Chinese Long) v2    cmocuB539
CmoCh20G009950  Cucumber (Chinese Long) v2    cmocuB548
CmoCh20G009950  Melon (DHL92) v3.5.1          cmomeB488
CmoCh20G009950  Melon (DHL92) v3.5.1          cmomeB510
CmoCh20G009950  Watermelon (Charleston Gray)  cmowcgB477
CmoCh20G009950  Watermelon (Charleston Gray)  cmowcgB497
CmoCh20G009950  Watermelon (Charleston Gray)  cmowcgB503
CmoCh20G009950  Watermelon (97103) v1         cmowmB510
CmoCh20G009950  Watermelon (97103) v1         cmowmB528
CmoCh20G009950  Cucurbita pepo (Zucchini)     cmocpeB517
CmoCh20G009950  Cucurbita pepo (Zucchini)     cmocpeB534
CmoCh20G009950  Bottle gourd (USVL1VR-Ls)     cmolsiB504
CmoCh20G009950  Cucumber (Gy14) v2            cgybcmoB776
CmoCh20G009950  Melon (DHL92) v3.6.1          cmomedB586
CmoCh20G009950  Silver-seed gourd             carcmoB0550
CmoCh20G009950  Silver-seed gourd             carcmoB0555
CmoCh20G009950  Cucumber (Chinese Long) v3    cmocucB0641
CmoCh20G009950  Cucumber (Chinese Long) v3    cmocucB0655
CmoCh20G009950  Watermelon (97103) v2         cmowmbB540
CmoCh20G009950  Watermelon (97103) v2         cmowmbB563
CmoCh20G009950  Wax gourd                     cmowgoB0683
CmoCh20G009950  Wax gourd                     cmowgoB0695
CmoCh20G009950  Cucurbita moschata (Rifu)     cmocmoB404
CmoCh20G009950  Cucumber (Gy14) v1            cgycmoB0222
CmoCh20G009950  Cucumber (Gy14) v1            cgycmoB1016