CmoCh20G007300.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh20G007300.1
Type: mRNA
Organism: Cucurbita moschata (Rifu)
Description: Gag-Pol polyprotein
Location: Cmo_Chr20: 3619388..3623333 (+)
Sequence length: 3855
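
The location and the sequence length above describe different things: the coordinates cover the genomic span, while 3855 is the length of the spliced mRNA. The sketch below checks how the two relate. It assumes the coordinates are 1-based and inclusive, which is an assumption about this database and is not stated in the record; the variable names are illustrative only.

```python
# Values copied from the record above.
start, end = 3619388, 3623333
mrna_length = 3855


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


genomic_span = span_length(start, end)
# Bases inside the genomic span that are absent from the mRNA
# (expected to be intronic if the assumption above holds).
non_mrna_bases = genomic_span - mrna_length

print(genomic_span, non_mrna_bases)
```

Under that assumption the span is 3946 bp, leaving 91 bp that do not appear in the mRNA, which is consistent with the gene sequence below containing a single short intron.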
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGGCTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACATTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGTCCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGTACTATGTCCCATAGAAGAAGACGAGCTAGCACTCATGGCGACAATGGAAGACCATATCAACTATGAGAATGACTGGATCGTTGATTCAGGATGTTCAAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGGAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTG
ACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATACATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATGTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCTTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA

mRNA sequence

ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGGCTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACATTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGTCCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGATGTTCAAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGGAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTGACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATAC
ATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATGTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCTTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA

Coding sequence (CDS)

ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGGCTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACATTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGTCCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGATGTTCAAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGGAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTGACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATAC
ATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATGTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCTTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA
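
The BLAST hits listed for this feature report "Query Frame = 1", meaning the query protein is read from the first base of the CDS. As a minimal sketch of that translation step, the snippet below builds the standard genetic code and translates the first 39 bases of the CDS above. The helper name `translate` is hypothetical, and the code assumes an unambiguous A/C/G/T sequence (any other character raises `KeyError`).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 39 bases of the CDS shown above.
cds_prefix = "ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGA"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The translated prefix ends in `MSDFQIVGG`, which is the stretch that starts at query position 5 in the A5AKW8_VITVI alignment further down.
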
BLAST of CmoCh20G007300.1 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 3.1e-197
Identity = 398/991 (40.16%), Positives = 594/991 (59.94%), Query Frame = 1

Query: 307  NKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRV 366
            N   T+ L +V HVP ++ NL+S   L   G    F  +  ++ +   +I K    G   
Sbjct: 343  NVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARG--- 402

Query: 367  ESVYVLSAE--SAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDV 426
             ++Y  +AE     ++  +   + DLWH R+GH+S   L+++ +KS++       VK   
Sbjct: 403  -TLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKP-- 462

Query: 427  VCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVW 486
             C  C +GK H++ ++ SS +    L+LV+SD+ GP++  S+ G +Y VTFIDD SR +W
Sbjct: 463  -CDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLW 522

Query: 487  IFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTC 546
            ++ +K K   F  FQ+F  +VE E G K++ LRSDNGGEYTS EF++Y    GIR + T 
Sbjct: 523  VYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTV 582

Query: 547  ANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPF 606
              TPQ NGVAER NR + E  RSML    +P  FW EA++TA ++IN+ P   L F  P 
Sbjct: 583  PGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPE 642

Query: 607  EILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGK 666
             +  + + + S+ +VFGC  +  VP   R+K D K++ C+F+GY ++  G+R  DP   K
Sbjct: 643  RVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKK 702

Query: 667  YYTSRDVVFDEASTWWSSEKK-------------VLSDSNIEEILQQKLGEQTTQIQSNV 726
               SRDVVF E+    +++               + S SN     +    E + Q +   
Sbjct: 703  VIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPG 762

Query: 727  DASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIV---EDRVYE 786
            +  E    +D   +EV   ++ ++    HQ LRRS R R  + +Y +   V   +DR  E
Sbjct: 763  EVIEQGEQLDEGVEEVEHPTQGEEQ---HQPLRRSERPRVESRRYPSTEYVLISDDR--E 822

Query: 787  PETYEEA---SQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGS 846
            PE+ +E     + +   KAM+EE+ +L+ N T++LV      +P+ CKWV+K+K+  D  
Sbjct: 823  PESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCK 882

Query: 847  IERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHG 906
            + RYKARLV +GF Q+ G+D+DE FSPV K+T++R +L+LAAS D ++ Q+DVK AFLHG
Sbjct: 883  LVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHG 942

Query: 907  ELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADS 966
            +L+ EIYM QP+GFE A   + VCKL K+LYGLKQAPR WY K   F+    Y   ++D 
Sbjct: 943  DLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDP 1002

Query: 967  SLFIKE-REGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEV- 1026
             ++ K   E N  I+L+YVDD++I G D+  I + + +LS  F MK+LG  +  LG+++ 
Sbjct: 1003 CVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIV 1062

Query: 1027 -DRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTP----MEINAKICAHEGKELND--ETT 1086
             +RT   L+L Q+KY   +L++FNM   K VSTP    ++++ K+C    +E  +  +  
Sbjct: 1063 RERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVP 1122

Query: 1087 YRQLVGSLIY-LTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKR 1146
            Y   VGSL+Y +  TRPDI++AVGV+SR++++P K H +A + ILRY++GT    L +  
Sbjct: 1123 YSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGG 1182

Query: 1147 SKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAG 1206
            S D  L GY DAD AGD D R+S+TGY+F F  G ISW SK Q  V+LSTTEAEY AA  
Sbjct: 1183 S-DPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATE 1242

Query: 1207 AAQESTWLKLLMED--LHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREK 1265
              +E  WLK  +++  LHQK +Y +   CD+QSAI L++N ++HARTKH++V YH+IRE 
Sbjct: 1243 TGKEMIWLKRFLQELGLHQK-EYVV--YCDSQSAIDLSKNSMYHARTKHIDVRYHWIREM 1302
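
Each hit in this listing carries a summary line of the form "Identity = n/len (pct%), Positives = n/len (pct%)". The sketch below shows one way to parse such a line and recompute the percentages from the raw counts. The function name `parse_hsp_stats` and the returned dictionary layout are hypothetical, and the pattern is written only for the line format seen on this page (it also tolerates the spelling "Postives").

```python
import re

# Matches the identity/positives part of an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts and reported percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(id_n), int(id_len)),
        "identity_pct": float(id_pct),
        "positives": (int(pos_n), int(pos_len)),
        "positives_pct": float(pos_pct),
    }


def percent(count: int, length: int) -> float:
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100 * count / length, 2)


# Summary line of the POLX_TOBAC hit.
stats = parse_hsp_stats(
    "Identity = 398/991 (40.16%), Positives = 594/991 (59.94%), Query Frame = 1"
)
recomputed_identity = percent(*stats["identity"])
recomputed_positives = percent(*stats["positives"])
print(recomputed_identity, recomputed_positives)
```

Recomputing the percentages from the counts is a cheap consistency check when these lines are copied or scraped by hand.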

BLAST of CmoCh20G007300.1 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 372.1 bits (954), Expect = 2.5e-101
Identity = 220/605 (36.36%), Positives = 350/605 (57.85%), Query Frame = 1

Query: 693  EEILQQKLGEQTTQIQSNVDASENPSDIDIDKQEVTQSSESDKNET----THQQLRRSNR 752
            ++ L +  G          + +E+  +I ID        E     +    T  Q+  +  
Sbjct: 813  DDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEE 872

Query: 753  IRRPNPKYANAAIVEDRVYEPETYEEAS---QNSVWQKAMEEEIIALEHNQTWELVPRLG 812
                N    NA  + + V  P +++E       S W++A+  E+ A + N TW +  R  
Sbjct: 873  DNSLNKVVLNAHTIFNDV--PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPE 932

Query: 813  DIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLAL 872
            +   V  +WV+ +K    G+  RYKARLVARGF+Q+Y +DY+ETF+PVA+I++ R +L+L
Sbjct: 933  NKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSL 992

Query: 873  AASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAW 932
                + K+ QMDVK AFL+G L  EIYM  P+G   + N + VCKL KA+YGLKQA R W
Sbjct: 993  VIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCW 1052

Query: 933  YGKIAEFLTQSGYSVAHADSSLFIKEREGNLT---IVLVYVDDLIITGDDEREIYQTREN 992
            +    + L +  +  +  D  ++I ++ GN+     VL+YVDD++I   D   +   +  
Sbjct: 1053 FEVFEQALKECEFVNSSVDRCIYILDK-GNINENIYVLLYVDDVVIATGDMTRMNNFKRY 1112

Query: 993  LSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM--EIN 1052
            L  +F+M +L E+KHF+G+ ++  ++ ++L Q  Y + +L KFNM  C  VSTP+  +IN
Sbjct: 1113 LMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKIN 1172

Query: 1053 AKICAHEGKELNDETTYRQLVGSLIYLTL-TRPDISYAVGVMSRYMQSPKKPHLDAARRI 1112
             ++      + +  T  R L+G L+Y+ L TRPD++ AV ++SRY            +R+
Sbjct: 1173 YELL---NSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRV 1232

Query: 1113 LRYIKGTIDYGLLYKRSK--DCKLVGYCDADYAGDHDTRRSTTGYVFK-FGSGTISWCSK 1172
            LRY+KGTID  L++K++   + K++GY D+D+AG    R+STTGY+FK F    I W +K
Sbjct: 1233 LRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTK 1292

Query: 1173 RQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVF 1232
            RQ +V+ S+TEAEY A   A +E+ WLK L+  ++ K++ PI +  DNQ  I +A NP  
Sbjct: 1293 RQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSC 1352

Query: 1233 HARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESFRCQLNMMQRM 1282
            H R KH+++ YHF RE+V    I ++ I T++Q+AD+FTK L  ++    R +L ++Q  
Sbjct: 1353 HKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDD 1409

BLAST of CmoCh20G007300.1 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 1.2e-47
Identity = 99/224 (44.20%), Positives = 136/224 (60.71%), Query Frame = 1

Query: 959  VLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYT 1018
            +L+YVDD+++TG     +      LS  F MK+LG + +FLG+++     GLFL Q KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1019 RDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAV 1078
              +L    ML+CK +STP+ +         K   D + +R +VG+L YLTLTRPDISYAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAK-YPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1079 GVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRS 1138
             ++ + M  P     D  +R+LRY+KGTI +GL   ++    +  +CD+D+AG   TRRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1139 TTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTW 1183
            TTG+    G   ISW +KRQPTVS S+TE EYRA A  A E TW
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh20G007300.1 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 186.4 bits (472), Expect = 1.9e-45
Identity = 103/307 (33.55%), Positives = 160/307 (52.12%), Query Frame = 1

Query: 876  MDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQ 935
            MDV  AFL+  +D  IY+ QP GF +  NP+YV +L   +YGLKQAP  W   I   L +
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 936  SGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGEL 995
             G+     +  L+ +        + VYVDDL++     +   + ++ L+  + MK+LG++
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 996  KHFLGLEVDRTDEG-LFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDE 1055
              FLGL + ++  G + L  Q Y      +  +   K   TP+  +  +       L D 
Sbjct: 121  DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 1056 TTYRQLVGSLIYLTLT-RPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLY 1115
            T Y+ +VG L++   T RPDISY V ++SR+++ P+  HL++ARR+LRY+  T    L Y
Sbjct: 181  TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 1116 KRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKR-QPTVSLSTTEAEYRA 1175
            +      L  YCDA +   HD   ST GYV       ++W SK+ +  + + +TEAEY  
Sbjct: 241  RSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYIT 300

Query: 1176 AAGAAQE 1180
            A+    E
Sbjct: 301  ASETVME 307

BLAST of CmoCh20G007300.1 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 139.8 bits (351), Expect = 2.1e-31
Identity = 117/462 (25.32%), Positives = 220/462 (47.62%), Query Frame = 1

Query: 829  YKARLVARGFSQQYGLDYDETFSPVAKITT----VRVLLALAASKDWKLWQMDVKNAFLH 888
            YKAR+V RG +Q       +T+S +   +     +++ L +A +++  +  +D+ +AFL+
Sbjct: 1337 YKARIVCRGDTQS-----PDTYSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLY 1396

Query: 889  GELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHAD 948
             +L+ EIY+  P       +   V KL KALYGLKQ+P+ W   + ++L   G       
Sbjct: 1397 AKLEEEIYIPHPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT 1456

Query: 949  SSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGEL------KHF 1008
              L+  + E    ++ VYVDD +I   +E+ + +    L   F++K  G L         
Sbjct: 1457 PGLY--QTEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDI 1516

Query: 1009 LGLEV--DRTDEGLFLCQQKYTRDMLQKFN--MLECKQVSTPMEINAKICAHEGKELNDE 1068
            LG+++  ++    + L  + +   M +K+N  + + ++ S P     KI   +      E
Sbjct: 1517 LGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE 1576

Query: 1069 TTYR-------QLVGSLIYLT-LTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGT 1128
              +R       QL+G L Y+    R DI +AV  ++R +  P +       +I++Y+   
Sbjct: 1577 EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRY 1636

Query: 1129 IDYGLLYKR--SKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLS 1188
             D G+ Y R  +KD K++   DA    ++D  +S  G +  +G    +  S +     +S
Sbjct: 1637 KDIGIHYDRDCNKDKKVIAITDASVGSEYDA-QSRIGVILWYGMNIFNVYSNKSTNRCVS 1696

Query: 1189 TTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVE 1248
            +TEAE  A      +S  LK+ +++L +  +  I ++ D++ AI+         + K   
Sbjct: 1697 STEAELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTW 1756

Query: 1249 VHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESF 1267
            +    I+EK+ ++ I++ +I     +ADL TK ++ S  + F
Sbjct: 1757 IKTEIIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDFKRF 1784

BLAST of CmoCh20G007300.1 vs. TrEMBL
Match: A5AKW8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1)

HSP 1 Score: 1641.3 bits (4249), Expect = 0.0e+00
Identity = 851/1351 (62.99%), Positives = 1002/1351 (74.17%), Query Frame = 1

Query: 5    MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
            M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T P+ ED+N  LRKW+I
Sbjct: 1    MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPKVEDANGILRKWKI 60

Query: 65   KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
            KAGKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+EL S++QRD
Sbjct: 61   KAGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELFSVAQRD 120

Query: 125  MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
            +TIAQYFHKVK++CREI+ELD ++ I E+ MKRIIIHGLRPE+R F+AA+QGW  QPSLV
Sbjct: 121  LTIAQYFHKVKTLCREISELDLEAPIGETXMKRIIIHGLRPEFRGFVAAIQGWQNQPSLV 180

Query: 185  EFENLLASQEAMAKQMGGFTLK-------------------------GEEALYTSESQSN 244
            EFENLLA QEA+AKQMGG +LK                          E+    S+ + +
Sbjct: 181  EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGGWNSXQHTVRRTKKNEDKAKCSQGERS 240

Query: 245  NR-------PSTRRGYNGD----KRRSH--------QGIAQPERAQKNDNKSFQRTRF-- 304
             R       P T + + G      ++ H        +G+ +           +    F  
Sbjct: 241  ARVEGDSKNPGTXKKFEGKCYNCXKKGHMAKDCWSKKGLVESNATTSKSEDEWDAQAFFA 300

Query: 305  ----GGICYNCGKKGHMSRDCWSRKKSIENNVAISKKKIED--EWDAEDVQIMPGNK--- 364
                        ++    +D W       N++   K+K++D  E+    + +   N    
Sbjct: 301  AIGESAFIATTSEQIDYEKD-WIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNSKLP 360

Query: 365  --------------SDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVK 424
                          ++ VSL NVYHVPG+KKNLLSV+QLT+SG  VLFGP+DVKVY D++
Sbjct: 361  IAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFGPQDVKVYHDLE 420

Query: 425  IIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKG 484
            ++ +P I+GRR+ESVYV+SAE+AYVDKTRKNET DLWH RL HISY KL ++M+KSMLKG
Sbjct: 421  VMEEPVIKGRRLESVYVMSAETAYVDKTRKNETADLWHMRLSHISYSKLTMMMKKSMLKG 480

Query: 485  LPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVT 544
            LPQLEV+   +CA CQYGKAHQLPY+ES +KAK PLEL+HSD+FGPVKQAS+SGM     
Sbjct: 481  LPQLEVRKXTICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGM----- 540

Query: 545  FIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLH 604
                  +Y+  F      D FS+    +M                    +TS E  +Y  
Sbjct: 541  ------KYMVTFI-----DDFSRRVYLQMSF------------------FTSSENXEYAI 600

Query: 605  ECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLP 664
                   FTCANTPQQNGV ERKNRHLAE CRSMLHAKNVPG FWAE M+TAA VIN+LP
Sbjct: 601  S------FTCANTPQQNGVXERKNRHLAEICRSMLHAKNVPGXFWAEXMKTAAFVINRLP 660

Query: 665  QPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKG 724
            Q +L F SPFE LW++KPT+SYFRVFGCVCYVFVP+HLRSK DKKAV+CV VGYD+QRK 
Sbjct: 661  QQRLNFSSPFEKLWNIKPTVSYFRVFGCVCYVFVPNHLRSKMDKKAVRCVLVGYDSQRKX 720

Query: 725  WRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNVDAS 784
            WRCCDPT+GK YTSR+VVFDE+S+WWSSEK++L DS+   + + +L  Q+ +IQ ++  +
Sbjct: 721  WRCCDPTTGKCYTSRNVVFDESSSWWSSEKEILXDSB---VFKDEL--QSARIQLSLGEA 780

Query: 785  ENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVED-RVYEPETYE 844
            EN  D DI   + TQS      +T         R ++PNPKYAN AIVED    EP T+ 
Sbjct: 781  ENAXDGDIG-DDXTQSPW----QTGVHGQPSEERTKKPNPKYANVAIVEDANAKEPXTFA 840

Query: 845  EASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARL 904
            EA QNS W KAM EEI AL+ NQTWELVP+  D++P SCKWVYKIKRR DGSIER+KA L
Sbjct: 841  EAFQNSDWSKAMXEEIAALKRNQTWELVPKPRDVEPXSCKWVYKIKRRTDGSIERHKAXL 900

Query: 905  VARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYM 964
            VARGFSQQYGLDYDETFSPV K+TTVRVLLALAA+KDW LWQMDVKNAFLHGELDREIYM
Sbjct: 901  VARGFSQQYGLDYDETFSPVXKLTTVRVLLALAANKDWDLWQMDVKNAFLHGELDREIYM 960

Query: 965  NQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKERE 1024
            NQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV  ADSSLF+K   
Sbjct: 961  NQPMGFQSQGHPEYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVTPADSSLFVKANG 1020

Query: 1025 GNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLC 1084
            G L IVLVYVDDLIITGDD  EI++T+ENLS+RF+MKELG+LKHFLGLEVDRT+EG+FLC
Sbjct: 1021 GKLAIVLVYVDDLIITGDDVEEIFRTKENLSVRFEMKELGQLKHFLGLEVDRTNEGIFLC 1080

Query: 1085 QQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPD 1144
            QQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQLVGSL+YLTLT PD
Sbjct: 1081 QQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQLVGSLLYLTLTXPD 1140

Query: 1145 ISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDH 1204
            ISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +DCKLVGYCDADYAGDH
Sbjct: 1141 ISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKXEDCKLVGYCDADYAGDH 1200

Query: 1205 DTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQK 1264
            DTR STTGYVF  GSG ISWCSKRQPTVSLSTTEAEYRAAA A QES WL  LM DLHQ 
Sbjct: 1201 DTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLMNDLHQL 1260

Query: 1265 IDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADL 1285
            +DY + L CDNQSA+RLAENPVFHARTKHVEVHYHFIREKVLKEE+E+ QIK++DQVADL
Sbjct: 1261 VDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSEDQVADL 1300

BLAST of CmoCh20G007300.1 vs. TrEMBL
Match: I1J0P4_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1399.0 bits (3620), Expect = 0.0e+00
Identity = 672/979 (68.64%), Positives = 813/979 (83.04%), Query Frame = 1

Query: 303  IMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIE 362
            ++P   +  + L  VYHVPG+KKNLLSV QLT  G YVLFGP++V +++ +K+IG P +E
Sbjct: 446  VVPRYGTQQLQLERVYHVPGLKKNLLSVPQLTAEGKYVLFGPQEVAIFRRLKVIGTPIME 505

Query: 363  GRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKT 422
            G++ +S++VLSAESAYVDKTRKNET DLWHARLGH+SYHKLK +MEK ++KGLP L+++T
Sbjct: 506  GKKRDSLFVLSAESAYVDKTRKNETADLWHARLGHVSYHKLKEMMEKHVVKGLPDLDIRT 565

Query: 423  DVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRY 482
            D+VCAGCQYGKAHQLPYKES  ++K PLEL+HSD+FGPVKQ S+ GMRYMVTFIDD+SRY
Sbjct: 566  DMVCAGCQYGKAHQLPYKESQHQSKVPLELIHSDVFGPVKQISLGGMRYMVTFIDDFSRY 625

Query: 483  VWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQF 542
            VW++FMKEKS+TF KF+EFK M+EGE+  KIRCLR+DNG EY S+EF  YL +  I+RQ 
Sbjct: 626  VWVYFMKEKSETFIKFKEFKDMIEGELEYKIRCLRTDNGREYLSNEFTIYLKKKKIKRQL 685

Query: 543  TCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVS 602
            TC NTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE MRTAAHVINKLPQ +LGF S
Sbjct: 686  TCPNTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAECMRTAAHVINKLPQVRLGFKS 745

Query: 603  PFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTS 662
            P E LW MKP IS+ +VFGCVCY+FVPDHLR+KF+KKA +C+FVGYD+ RKGWRCCDPT+
Sbjct: 746  PHEKLWRMKPAISHLKVFGCVCYIFVPDHLRTKFEKKAKRCIFVGYDDARKGWRCCDPTT 805

Query: 663  GKYYTSRDVVFDEASTWWSSEKKVLSDS-NIEEILQ----QKLGEQTTQIQSNVDASENP 722
            GK +TSR++VFDEAS+WWS +K+ + +S ++EE+++    QKL   +   +S+   +++P
Sbjct: 806  GKCHTSRNIVFDEASSWWSPKKEEIPESPDVEEVIEEERDQKLETPSEGERSSPSKTKSP 865

Query: 723  SDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRV-YEPETYEEAS 782
                I + E  Q+ E D      Q+LRRS R R+PNP+YANAA+ ++ +  EP +YEEA+
Sbjct: 866  WKTGIHQPEEPQTEEHD------QELRRSTRPRKPNPRYANAALADESLPIEPSSYEEAA 925

Query: 783  QNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVAR 842
            +   WQKAMEEEI AL+ NQTW+LVP+  D+KP+SCKWVYK+K R DGSIERYKARLVAR
Sbjct: 926  RGPEWQKAMEEEIKALKENQTWDLVPKPKDVKPISCKWVYKVKTRTDGSIERYKARLVAR 985

Query: 843  GFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQP 902
            GFSQ+YGLDY+ETFSPVAKITTVRVLLALAASK W+LWQMDVKNAFLHGELD+EIYM QP
Sbjct: 986  GFSQEYGLDYEETFSPVAKITTVRVLLALAASKSWELWQMDVKNAFLHGELDKEIYMEQP 1045

Query: 903  KGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNL 962
            KGFES   P +VCKL+KALYGLKQAPRAWYGKI EFL  +G+ VA +DSSLF+  +EG L
Sbjct: 1046 KGFESKKYPEHVCKLKKALYGLKQAPRAWYGKIGEFLVHNGFKVAPSDSSLFVMAKEGRL 1105

Query: 963  TIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQK 1022
             IVLVYVDDLIITGD   EI +TRENLS+RFQMKELGEL+HFLGLEV+ T  G+FL QQK
Sbjct: 1106 AIVLVYVDDLIITGDYSEEIERTRENLSVRFQMKELGELRHFLGLEVEHTKNGIFLGQQK 1165

Query: 1023 YTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISY 1082
            Y +D+L+++ ML+CK +STPM+ N ++   +GK L D T YRQLVGSLIYLTL+RPDISY
Sbjct: 1166 YAKDLLKRYGMLDCKPISTPMDPNTRLQEDKGKNLEDATMYRQLVGSLIYLTLSRPDISY 1225

Query: 1083 AVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTR 1142
            AVGV SRYM +PKKPHLDA RRILRY+KGT++YG+LYK++K+C+++GYCDADYAGD DTR
Sbjct: 1226 AVGVASRYMSTPKKPHLDAIRRILRYVKGTLNYGILYKKTKECQVIGYCDADYAGDCDTR 1285

Query: 1143 RSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDY 1202
            RSTTGY+F  GSG I+WCSKRQPTV+LS+TEAEYR+AA AAQESTWLK LMEDLHQ    
Sbjct: 1286 RSTTGYLFSLGSGAITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLMEDLHQTPKD 1345

Query: 1203 PISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTK 1262
             + + CDN S IRLAENPVFHARTKH+EVHYH+IREKVLK EIEM   KT+DQ AD+ TK
Sbjct: 1346 QVWIFCDNLSTIRLAENPVFHARTKHIEVHYHYIREKVLKGEIEMVPTKTEDQTADILTK 1405

Query: 1263 GLNTSKHESFRCQLNMMQR 1276
             LN SK E FR  L M+ +
Sbjct: 1406 SLNKSKFEKFREALGMVTK 1418

BLAST of CmoCh20G007300.1 vs. TrEMBL
Match: I1IA27_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1398.3 bits (3618), Expect = 0.0e+00
Identity = 671/970 (69.18%), Positives = 810/970 (83.51%), Query Frame = 1

Query: 312  VSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRVESVYV 371
            + L  VYHVPG+KKNLLSV QLT  G YVLFGP++V +++ +K+IG P +EG++ +S++V
Sbjct: 455  LQLERVYHVPGLKKNLLSVPQLTAEGKYVLFGPQEVAIFRRLKVIGTPIMEGKKRDSLFV 514

Query: 372  LSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQY 431
            LSAESAYVDKTRKNET DLWHARLGH+SYHKLK +MEK ++KGLP L+++TD+VCAGCQY
Sbjct: 515  LSAESAYVDKTRKNETADLWHARLGHVSYHKLKEMMEKHVVKGLPDLDIRTDMVCAGCQY 574

Query: 432  GKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEK 491
            GKAHQLPYKES  ++K PLEL+HSD+FGPVKQ S+ GMRYMVTFI+D+SRYVW++FMKEK
Sbjct: 575  GKAHQLPYKESQHQSKVPLELIHSDVFGPVKQISLGGMRYMVTFINDFSRYVWVYFMKEK 634

Query: 492  SDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQN 551
            S+TF KF+EFK M+EGE+  KIRCLR+DNG EY S+EF  YL +  I+RQ TC NTPQQN
Sbjct: 635  SETFMKFKEFKDMIEGELEYKIRCLRTDNGREYLSNEFTIYLKKNKIKRQLTCPNTPQQN 694

Query: 552  GVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMK 611
            GVAERKNRHLAETCRSMLHAKNVPGRFWAE MRTAAHVINKLPQ +LGF SP E LW MK
Sbjct: 695  GVAERKNRHLAETCRSMLHAKNVPGRFWAECMRTAAHVINKLPQVRLGFKSPHEKLWRMK 754

Query: 612  PTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDV 671
            P IS+ +VFGCVCY+FVPDHLR+KF+KKA +C+FVGYD+ RKGWRCCDPT+GK + SR++
Sbjct: 755  PAISHLKVFGCVCYIFVPDHLRTKFEKKAKRCIFVGYDDARKGWRCCDPTTGKCHISRNI 814

Query: 672  VFDEASTWWSSEKKVLSDS-NIEEILQ----QKLGEQTTQIQSNVDASENPSDIDIDKQE 731
            VFDEAS+WWS +K+ + +S ++EE+++    QKL   +   +S+   +++P    I + E
Sbjct: 815  VFDEASSWWSPKKEEIPESPDVEEVIEEERDQKLETPSEGERSSPSKTKSPWKTGIHQPE 874

Query: 732  VTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRV-YEPETYEEASQNSVWQKAM 791
              Q+ E D      Q+LRRS R R+PNP+YANAA+ ++ +  EP +YEEA++   WQKAM
Sbjct: 875  EPQTEEHD------QELRRSTRPRKPNPRYANAALADESLPIEPSSYEEAARGPEWQKAM 934

Query: 792  EEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLD 851
            EEEI AL+ NQTW+LVP+  D+KP+SCKWVYK+K R DGSIERYKARLVARGFSQ+YGLD
Sbjct: 935  EEEIKALKENQTWDLVPKPKDVKPISCKWVYKVKTRTDGSIERYKARLVARGFSQEYGLD 994

Query: 852  YDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANP 911
            Y+ETFSPVAKITTVRVLLALAASK W+LWQMDVKNAFLHGELD+EIYM QPKGFES   P
Sbjct: 995  YEETFSPVAKITTVRVLLALAASKSWELWQMDVKNAFLHGELDKEIYMEQPKGFESKKYP 1054

Query: 912  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDD 971
             +VCKL+KALYGLKQAPRAWYGKI EFL  +G+ VA +DSSLF+  +EG L IVLVYVDD
Sbjct: 1055 EHVCKLKKALYGLKQAPRAWYGKIGEFLVHNGFKVAPSDSSLFVMAKEGRLAIVLVYVDD 1114

Query: 972  LIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKF 1031
            LIITGD   EI +TRENLS+RFQMKELGEL+HFLGLEV+ T  G+FL QQKY +D+L+++
Sbjct: 1115 LIITGDYSEEIERTRENLSVRFQMKELGELRHFLGLEVEHTKNGIFLGQQKYAKDLLKRY 1174

Query: 1032 NMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYM 1091
             ML+CK +STPM+ NA++  H+GK L D T YRQLVGSLIYLTL+RPDISYAVGV SRYM
Sbjct: 1175 GMLDCKPISTPMDPNARLQEHKGKNLEDATMYRQLVGSLIYLTLSRPDISYAVGVASRYM 1234

Query: 1092 QSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFK 1151
             +PKKPHLDA RRILRY+KGT++YG+LYK++K+C+++GYCDADYAGD DTRRSTTGY+F 
Sbjct: 1235 STPKKPHLDAIRRILRYVKGTLNYGILYKKTKECQVIGYCDADYAGDCDTRRSTTGYLFS 1294

Query: 1152 FGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQ 1211
             GSG I+WCSKRQPTV+LS+TEAEYR+AA AAQESTWLK LMEDLHQ     + + CDN 
Sbjct: 1295 LGSGAITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLMEDLHQTPKDQVWIFCDNL 1354

Query: 1212 SAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHES 1271
            S IRLAENPVFHARTKH+EVHYH+IREKVLK EIEM   KT+DQ AD+ TK LN SK E 
Sbjct: 1355 STIRLAENPVFHARTKHIEVHYHYIREKVLKGEIEMVPTKTEDQTADILTKSLNKSKFEK 1414

Query: 1272 FRCQLNMMQR 1276
            FR  L M+ +
Sbjct: 1415 FREALGMVTK 1418

BLAST of CmoCh20G007300.1 vs. TrEMBL
Match: I1HD26_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1395.2 bits (3610), Expect = 0.0e+00
Identity = 671/978 (68.61%), Positives = 812/978 (83.03%), Query Frame = 1

Query: 304  MPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEG 363
            +P   +  + L  VYHVPG+KKNLLSV QLT  G YVLFGP++V +++ +K+IG P +EG
Sbjct: 463  VPRYGTQQLQLERVYHVPGLKKNLLSVPQLTAEGKYVLFGPQEVAIFRRLKVIGTPIMEG 522

Query: 364  RRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTD 423
            ++ +S++VLSAESAYVDKTRKNET DLWHARLGH+SYHKLK +MEK ++KGLP L+++TD
Sbjct: 523  KKRDSLFVLSAESAYVDKTRKNETADLWHARLGHVSYHKLKEMMEKHVVKGLPDLDIRTD 582

Query: 424  VVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYV 483
            +VCAGCQYGKAHQLPYKES  ++K PLEL+HSD+FGPVKQ S+ GMRYMVTFIDD+SRYV
Sbjct: 583  MVCAGCQYGKAHQLPYKESQHQSKVPLELIHSDVFGPVKQISLGGMRYMVTFIDDFSRYV 642

Query: 484  WIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFT 543
            W++FMKEKS+TF KF+EFK M+EGE+  KIRCLR+DNG EY S+EF  YL +  I+RQ T
Sbjct: 643  WVYFMKEKSETFMKFKEFKDMIEGELEYKIRCLRTDNGREYLSNEFTIYLKKNKIKRQLT 702

Query: 544  CANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSP 603
            C NTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE MRTAAHVINKLPQ +LGF SP
Sbjct: 703  CPNTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAECMRTAAHVINKLPQVRLGFKSP 762

Query: 604  FEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSG 663
             E LW MKP IS+ +VFGCVCY+FVPDHLR+KF+KKA +C+FVGYD+ RKGWRCCDPT+G
Sbjct: 763  HEKLWRMKPAISHLKVFGCVCYIFVPDHLRTKFEKKAKRCIFVGYDDARKGWRCCDPTTG 822

Query: 664  KYYTSRDVVFDEASTWWSSEKKVLSDS-NIEEILQ----QKLGEQTTQIQSNVDASENPS 723
            K +TSR++VFDEAS+WWS +K+ + +S ++EE+++    QKL   +   +S+   +++P 
Sbjct: 823  KCHTSRNIVFDEASSWWSPKKEEIPESPDVEEVIEEERDQKLEIPSEGERSSPSKTKSPW 882

Query: 724  DIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRV-YEPETYEEASQ 783
               I + E  Q+ E D      Q+LRRS R R+PNP+YANAA+ ++ +  EP +YEEA++
Sbjct: 883  KTGIHQPEEPQTEEHD------QELRRSTRPRKPNPRYANAALADESLPIEPSSYEEAAR 942

Query: 784  NSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARG 843
               WQKAMEEEI AL+ NQTW+LVP+  ++KP+SCKWVYK+K R DGSIERYKARLVARG
Sbjct: 943  GPEWQKAMEEEIKALKENQTWDLVPKPKNVKPISCKWVYKVKTRTDGSIERYKARLVARG 1002

Query: 844  FSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPK 903
            FSQ+YGLDY+ETFSPVAKITTVRVLLALAASK W+LWQMDVKNAFLHGELD+EIYM QPK
Sbjct: 1003 FSQEYGLDYEETFSPVAKITTVRVLLALAASKSWELWQMDVKNAFLHGELDKEIYMEQPK 1062

Query: 904  GFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLT 963
            GFES   P +VCKL+KALYGLKQAPRAWYGKI EFL  +G+ VA +DSSLF+  +EG L 
Sbjct: 1063 GFESKKYPEHVCKLKKALYGLKQAPRAWYGKIGEFLVHNGFKVAPSDSSLFVMAKEGRLA 1122

Query: 964  IVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKY 1023
            IVLVYVDDLIITGD   EI +TRENLS+RFQMKELGEL+HFLGLEV+ T  G+FL QQKY
Sbjct: 1123 IVLVYVDDLIITGDYSEEIERTRENLSVRFQMKELGELRHFLGLEVEHTKNGIFLGQQKY 1182

Query: 1024 TRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYA 1083
             +D+L+++ ML+CK +STPM+ NA++   +GK L D T YRQLVGSLIYLTL+RPDISYA
Sbjct: 1183 AKDLLKRYGMLDCKPISTPMDPNARLQEDKGKNLEDATMYRQLVGSLIYLTLSRPDISYA 1242

Query: 1084 VGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRR 1143
            VGV SRYM +PKKPHLDA RRILRY+KGT++YG+LYK++K+C+++GYCDADYAGD DTRR
Sbjct: 1243 VGVASRYMSTPKKPHLDAIRRILRYVKGTLNYGILYKKTKECQVIGYCDADYAGDCDTRR 1302

Query: 1144 STTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYP 1203
            STTGY+F  GSG I+WCSKRQPTV+LS+TEAEYR+AA AAQESTWLK LMEDLHQ     
Sbjct: 1303 STTGYLFSLGSGAITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLMEDLHQTPKDQ 1362

Query: 1204 ISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKG 1263
            + + CDN S IRLAENPVFHARTKH+EVHYH+IREKVLK EIEM   KT+D  AD+ TK 
Sbjct: 1363 VWIFCDNLSTIRLAENPVFHARTKHIEVHYHYIREKVLKGEIEMVPTKTEDHTADILTKS 1422

Query: 1264 LNTSKHESFRCQLNMMQR 1276
            LN SK E FR  L M+ +
Sbjct: 1423 LNKSKFEKFREALGMVTK 1434

BLAST of CmoCh20G007300.1 vs. TrEMBL
Match: I1H466_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1394.8 bits (3609), Expect = 0.0e+00
Identity = 669/970 (68.97%), Positives = 809/970 (83.40%), Query Frame = 1

Query: 312  VSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRVESVYV 371
            + L  VYHVPG+KKNLLSV QLT  G YVLFGP++V +++ +K+IG P +EG++ +S++V
Sbjct: 455  LQLERVYHVPGLKKNLLSVPQLTAEGKYVLFGPQEVAIFRRLKVIGTPIMEGKKRDSLFV 514

Query: 372  LSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQY 431
            LSAESAYVDKTRKNET DLWHARLGH+SYHKLK +MEK ++KGLP L+++TD+VCAGCQY
Sbjct: 515  LSAESAYVDKTRKNETADLWHARLGHVSYHKLKEMMEKHVVKGLPDLDIRTDMVCAGCQY 574

Query: 432  GKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEK 491
            GKAHQLPYKES  ++K PLEL+HSD+FGPVKQ S+ GMRYMVTFIDD+SRYVW++FMKEK
Sbjct: 575  GKAHQLPYKESQHQSKVPLELIHSDVFGPVKQISLGGMRYMVTFIDDFSRYVWVYFMKEK 634

Query: 492  SDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQN 551
            S+TF KF+EFK M+EGE+  KIRCLR+DNG EY S+EF  YL +  I+RQ TC NTPQQN
Sbjct: 635  SETFMKFKEFKDMIEGELEYKIRCLRTDNGREYLSNEFTIYLKKNKIKRQLTCPNTPQQN 694

Query: 552  GVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMK 611
            GVAERKNRHLAETCRSMLHAKNVPGRFWAE MRTAAHVINKLPQ +LGF SP E LW MK
Sbjct: 695  GVAERKNRHLAETCRSMLHAKNVPGRFWAECMRTAAHVINKLPQVRLGFKSPHEKLWRMK 754

Query: 612  PTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDV 671
            P IS+ +VFGCVCY+FVPDHLR+KF+KKA +C+FVGYD+ +KGWRCCDPT+GK +TSR++
Sbjct: 755  PAISHLKVFGCVCYIFVPDHLRTKFEKKAKRCIFVGYDDAQKGWRCCDPTTGKCHTSRNI 814

Query: 672  VFDEASTWWSSEKKVLSDS-NIEEILQ----QKLGEQTTQIQSNVDASENPSDIDIDKQE 731
            VFDEAS+WWS +K+ + +S ++EE+++    QKL   +   +S+   +++P    I + E
Sbjct: 815  VFDEASSWWSPKKEEIPESPDVEEVIEEERDQKLETPSEGERSSPSKTKSPWKTGIHQPE 874

Query: 732  VTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRV-YEPETYEEASQNSVWQKAM 791
              Q+ E D      Q+LRRS R R+PNP+YANAA+ ++ +  EP +YEEA++   WQKAM
Sbjct: 875  EPQTEEHD------QELRRSTRPRKPNPRYANAALADESLPIEPSSYEEAARGPEWQKAM 934

Query: 792  EEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLD 851
            EEEI AL+ NQTW+LVP+  D+KP+SCKWVYK+K R DGSIERYKARLVARGFSQ+YGLD
Sbjct: 935  EEEIKALKENQTWDLVPKPKDVKPISCKWVYKVKTRTDGSIERYKARLVARGFSQEYGLD 994

Query: 852  YDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANP 911
            Y+ETFSPVAKITTVRVLLALAASK W+LWQMDVKNAFLHGELD+EIYM QPKGFES   P
Sbjct: 995  YEETFSPVAKITTVRVLLALAASKSWELWQMDVKNAFLHGELDKEIYMEQPKGFESKKYP 1054

Query: 912  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDD 971
             +VCKL+KALYGLKQAPRAWYGKI EFL  +G+ VA +DSSLF+  +EG L IVLVYVDD
Sbjct: 1055 EHVCKLKKALYGLKQAPRAWYGKIGEFLVHNGFKVAPSDSSLFVMAKEGRLAIVLVYVDD 1114

Query: 972  LIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKF 1031
            LIITGD   EI +TRENLS+RFQMKELGEL+HFLGLEV+ T  G+FL QQKY +D+L+++
Sbjct: 1115 LIITGDYSEEIERTRENLSVRFQMKELGELRHFLGLEVEHTKNGIFLGQQKYAKDLLKRY 1174

Query: 1032 NMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYM 1091
             ML+CK +STPM+ NA++   +GK L D T YRQLVGSLIYLTL+RPDISYAVGV SRYM
Sbjct: 1175 GMLDCKPISTPMDPNARLQEDKGKNLEDATMYRQLVGSLIYLTLSRPDISYAVGVASRYM 1234

Query: 1092 QSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFK 1151
             +PKKPHLDA RRILRY+KGT++YG+LYK++K+C+++GYCDADYAGD DTRRSTTGY+F 
Sbjct: 1235 STPKKPHLDAIRRILRYVKGTLNYGILYKKTKECQVIGYCDADYAGDCDTRRSTTGYLFS 1294

Query: 1152 FGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQ 1211
             GSG I+WCSKRQPTV+LS+TEAEYR+AA AAQESTWLK LMEDLHQ     + + CDN 
Sbjct: 1295 LGSGAITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLMEDLHQTPKDQVWIFCDNL 1354

Query: 1212 SAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHES 1271
            + IRLAENPVFHARTKH+EVHYH+IRE VLK EIEM   KT+DQ AD+ TK LN SK E 
Sbjct: 1355 TTIRLAENPVFHARTKHIEVHYHYIRENVLKGEIEMVPTKTEDQTADILTKSLNKSKFEK 1414

Query: 1272 FRCQLNMMQR 1276
            FR  L M+ +
Sbjct: 1415 FRKALGMVTK 1418

BLAST of CmoCh20G007300.1 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 419.5 bits (1077), Expect = 7.6e-117
Identity = 230/578 (39.79%), Positives = 341/578 (59.00%), Query Frame = 1

Query: 709  SNVDASENPSDIDIDKQEVTQSSESDKN-ETTHQQLRRSNRIR----------------- 768
            S+ DAS + S IDI      Q+   + +  T+H++ R+   ++                 
Sbjct: 3    SDADASTSSSSIDIMPSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIHDISQ 62

Query: 769  -----RPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLG 828
                 + +P Y +  +   +  EP TY EA +  VW  AM++EI A+E   TWE+     
Sbjct: 63   FLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPP 122

Query: 829  DIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLAL 888
            + KP+ CKWVYKIK   DG+IERYKARLVA+G++QQ G+D+ ETFSPV K+T+V+++LA+
Sbjct: 123  NKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAI 182

Query: 889  AASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAAN----PNYVCKLRKALYGLKQA 948
            +A  ++ L Q+D+ NAFL+G+LD EIYM  P G+ +       PN VC L+K++YGLKQA
Sbjct: 183  SAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQA 242

Query: 949  PRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRE 1008
             R W+ K +  L   G+  +H+D + F+K        VLVYVDD+II  +++  + + + 
Sbjct: 243  SRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKS 302

Query: 1009 NLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINA 1068
             L   F++++LG LK+FLGLE+ R+  G+ +CQ+KY  D+L +  +L CK  S PM+ + 
Sbjct: 303  QLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSV 362

Query: 1069 KICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILR 1128
               AH G +  D   YR+L+G L+YL +TR DIS+AV  +S++ ++P+  H  A  +IL 
Sbjct: 363  TFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILH 422

Query: 1129 YIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTV 1188
            YIKGT+  GL Y    + +L  + DA +    DTRRST GY    G+  ISW SK+Q  V
Sbjct: 423  YIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVV 482

Query: 1189 SLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTK 1248
            S S+ EAEYRA + A  E  WL     +L   +  P  L CDN +AI +A N VFH RTK
Sbjct: 483  SKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTK 542

Query: 1249 HVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLN 1260
            H+E   H +RE+ + +       +  D+  D FT+ L+
Sbjct: 543  HIESDCHSVRERSVYQATLSYSFQAYDE-QDGFTEYLS 579

BLAST of CmoCh20G007300.1 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 193.7 bits (491), Expect = 6.8e-49
Identity = 99/224 (44.20%), Positives = 136/224 (60.71%), Query Frame = 1

Query: 959  VLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYT 1018
            +L+YVDD+++TG     +      LS  F MK+LG + +FLG+++     GLFL Q KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1019 RDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAV 1078
              +L    ML+CK +STP+ +         K   D + +R +VG+L YLTLTRPDISYAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAK-YPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1079 GVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRS 1138
             ++ + M  P     D  +R+LRY+KGTI +GL   ++    +  +CD+D+AG   TRRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1139 TTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTW 1183
            TTG+    G   ISW +KRQPTVS S+TE EYRA A  A E TW
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh20G007300.1 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 94.7 bits (234), Expect = 4.3e-19
Identity = 50/118 (42.37%), Positives = 75/118 (63.56%), Query Frame = 1

Query: 749 IRRPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIK 808
           I + NPKY+   I      EP++   A ++  W +AM+EE+ AL  N+TW LVP   +  
Sbjct: 9   INKLNPKYS-LTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQN 68

Query: 809 PVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALA 867
            + CKWV+K K   DG+++R KARLVA+GF Q+ G+ + ET+SPV +  T+R +L +A
Sbjct: 69  ILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh20G007300.1 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 75.9 bits (185), Expect = 2.1e-13
Identity = 31/78 (39.74%), Positives = 52/78 (66.67%), Query Frame = 1

Query: 1065 IYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGY 1124
            +YLT+TRPD+++AV  +S++  + +   + A  ++L Y+KGT+  GL Y  + D +L  +
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1125 CDADYAGDHDTRRSTTGY 1143
             D+D+A   DTRRS TG+
Sbjct: 61   ADSDWASCPDTRRSVTGF 78

BLAST of CmoCh20G007300.1 vs. TAIR10
Match: ATMG00300.1 (ATMG00300.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 65.5 bits (158), Expect = 2.8e-10
Identity = 35/103 (33.98%), Positives = 57/103 (55.34%), Query Frame = 1

Query: 361 IEGRRVESVYVLSAE----SAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLP 420
           ++G R +S+Y+L        + + +T K+ET  LWH+RL H+S   ++L+++K  L    
Sbjct: 39  LKGNRHDSLYILQGSVETGESNLAETAKDETR-LWHSRLAHMSQRGMELLVKKGFLDSSK 98

Query: 421 QLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFG 460
              +K    C  C YGK H++ +       K PL+ VHSDL+G
Sbjct: 99  VSSLK---FCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137

BLAST of CmoCh20G007300.1 vs. NCBI nr
Match: gi|147794801|emb|CAN71427.1| (hypothetical protein VITISV_027864 [Vitis vinifera])

HSP 1 Score: 1641.3 bits (4249), Expect = 0.0e+00
Identity = 851/1351 (62.99%), Positives = 1002/1351 (74.17%), Query Frame = 1

Query: 5    MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
            M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T P+ ED+N  LRKW+I
Sbjct: 1    MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPKVEDANGILRKWKI 60

Query: 65   KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
            KAGKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+EL S++QRD
Sbjct: 61   KAGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELFSVAQRD 120

Query: 125  MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
            +TIAQYFHKVK++CREI+ELD ++ I E+ MKRIIIHGLRPE+R F+AA+QGW  QPSLV
Sbjct: 121  LTIAQYFHKVKTLCREISELDLEAPIGETXMKRIIIHGLRPEFRGFVAAIQGWQNQPSLV 180

Query: 185  EFENLLASQEAMAKQMGGFTLK-------------------------GEEALYTSESQSN 244
            EFENLLA QEA+AKQMGG +LK                          E+    S+ + +
Sbjct: 181  EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGGWNSXQHTVRRTKKNEDKAKCSQGERS 240

Query: 245  NR-------PSTRRGYNGD----KRRSH--------QGIAQPERAQKNDNKSFQRTRF-- 304
             R       P T + + G      ++ H        +G+ +           +    F  
Sbjct: 241  ARVEGDSKNPGTXKKFEGKCYNCXKKGHMAKDCWSKKGLVESNATTSKSEDEWDAQAFFA 300

Query: 305  ----GGICYNCGKKGHMSRDCWSRKKSIENNVAISKKKIED--EWDAEDVQIMPGNK--- 364
                        ++    +D W       N++   K+K++D  E+    + +   N    
Sbjct: 301  AIGESAFIATTSEQIDYEKD-WIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNSKLP 360

Query: 365  --------------SDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVK 424
                          ++ VSL NVYHVPG+KKNLLSV+QLT+SG  VLFGP+DVKVY D++
Sbjct: 361  IAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFGPQDVKVYHDLE 420

Query: 425  IIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKG 484
            ++ +P I+GRR+ESVYV+SAE+AYVDKTRKNET DLWH RL HISY KL ++M+KSMLKG
Sbjct: 421  VMEEPVIKGRRLESVYVMSAETAYVDKTRKNETADLWHMRLSHISYSKLTMMMKKSMLKG 480

Query: 485  LPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVT 544
            LPQLEV+   +CA CQYGKAHQLPY+ES +KAK PLEL+HSD+FGPVKQAS+SGM     
Sbjct: 481  LPQLEVRKXTICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGM----- 540

Query: 545  FIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLH 604
                  +Y+  F      D FS+    +M                    +TS E  +Y  
Sbjct: 541  ------KYMVTFI-----DDFSRRVYLQMSF------------------FTSSENXEYAI 600

Query: 605  ECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLP 664
                   FTCANTPQQNGV ERKNRHLAE CRSMLHAKNVPG FWAE M+TAA VIN+LP
Sbjct: 601  S------FTCANTPQQNGVXERKNRHLAEICRSMLHAKNVPGXFWAEXMKTAAFVINRLP 660

Query: 665  QPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKG 724
            Q +L F SPFE LW++KPT+SYFRVFGCVCYVFVP+HLRSK DKKAV+CV VGYD+QRK 
Sbjct: 661  QQRLNFSSPFEKLWNIKPTVSYFRVFGCVCYVFVPNHLRSKMDKKAVRCVLVGYDSQRKX 720

Query: 725  WRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNVDAS 784
            WRCCDPT+GK YTSR+VVFDE+S+WWSSEK++L DS+   + + +L  Q+ +IQ ++  +
Sbjct: 721  WRCCDPTTGKCYTSRNVVFDESSSWWSSEKEILXDSB---VFKDEL--QSARIQLSLGEA 780

Query: 785  ENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVED-RVYEPETYE 844
            EN  D DI   + TQS      +T         R ++PNPKYAN AIVED    EP T+ 
Sbjct: 781  ENAXDGDIG-DDXTQSPW----QTGVHGQPSEERTKKPNPKYANVAIVEDANAKEPXTFA 840

Query: 845  EASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARL 904
            EA QNS W KAM EEI AL+ NQTWELVP+  D++P SCKWVYKIKRR DGSIER+KA L
Sbjct: 841  EAFQNSDWSKAMXEEIAALKRNQTWELVPKPRDVEPXSCKWVYKIKRRTDGSIERHKAXL 900

Query: 905  VARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYM 964
            VARGFSQQYGLDYDETFSPV K+TTVRVLLALAA+KDW LWQMDVKNAFLHGELDREIYM
Sbjct: 901  VARGFSQQYGLDYDETFSPVXKLTTVRVLLALAANKDWDLWQMDVKNAFLHGELDREIYM 960

Query: 965  NQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKERE 1024
            NQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV  ADSSLF+K   
Sbjct: 961  NQPMGFQSQGHPEYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVTPADSSLFVKANG 1020

Query: 1025 GNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLC 1084
            G L IVLVYVDDLIITGDD  EI++T+ENLS+RF+MKELG+LKHFLGLEVDRT+EG+FLC
Sbjct: 1021 GKLAIVLVYVDDLIITGDDVEEIFRTKENLSVRFEMKELGQLKHFLGLEVDRTNEGIFLC 1080

Query: 1085 QQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPD 1144
            QQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQLVGSL+YLTLT PD
Sbjct: 1081 QQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQLVGSLLYLTLTXPD 1140

Query: 1145 ISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDH 1204
            ISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +DCKLVGYCDADYAGDH
Sbjct: 1141 ISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKXEDCKLVGYCDADYAGDH 1200

Query: 1205 DTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQK 1264
            DTR STTGYVF  GSG ISWCSKRQPTVSLSTTEAEYRAAA A QES WL  LM DLHQ 
Sbjct: 1201 DTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLMNDLHQL 1260

Query: 1265 IDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADL 1285
            +DY + L CDNQSA+RLAENPVFHARTKHVEVHYHFIREKVLKEE+E+ QIK++DQVADL
Sbjct: 1261 VDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSEDQVADL 1300

BLAST of CmoCh20G007300.1 vs. NCBI nr
Match: gi|147817226|emb|CAN75363.1| (hypothetical protein VITISV_026292 [Vitis vinifera])

HSP 1 Score: 911.0 bits (2353), Expect = 2.4e-261
Identity = 471/753 (62.55%), Positives = 546/753 (72.51%), Query Frame = 1

Query: 462  KQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNG 521
            KQAS+SGM+YMVTFI+D+S YVW++FMKEKS+TFSK++EFK M E +V  +IRCLR+DNG
Sbjct: 461  KQASLSGMKYMVTFINDFSNYVWVYFMKEKSETFSKYKEFKEMTEVKVDKRIRCLRTDNG 520

Query: 522  GEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE 581
            GEYTSDEF  +L EC +R QFTCANTPQQN VAERKNRHLAE CRSMLHAKNVP RFWAE
Sbjct: 521  GEYTSDEFFYFLRECRVRHQFTCANTPQQNSVAERKNRHLAEICRSMLHAKNVPRRFWAE 580

Query: 582  AMRTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAV 641
            AM+T A VIN+LPQ KL F SPFE LW++KPTISYFRVFGCVCYVFVP+HLRSK DKK +
Sbjct: 581  AMKTVAFVINRLPQQKLNFSSPFEKLWNIKPTISYFRVFGCVCYVFVPNHLRSKMDKKEI 640

Query: 642  KCVFVGYDNQRKGWRC-CDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKL 701
                  + ++ +  R        +   + D+  DE  + W +    +     EE    + 
Sbjct: 641  LPDSDVFKDELQSARIQLSLGEXENAANGDIXDDETQSPWQTG---VHGQXSEEGEPSET 700

Query: 702  GEQTTQIQSNVDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAA 761
                   +S      NP   ++    + + + + + ET  +  +        NP ++ A 
Sbjct: 701  EAPIPLRRSARTKKPNPKYANV---AIVEDANAKEPETFAEAFQ--------NPDWSKAM 760

Query: 762  IVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKR 821
                     E      +N  W+                   P+  D++P+SCKWVYKIKR
Sbjct: 761  --------KEEIAALKRNQTWELV-----------------PKXRDVEPISCKWVYKIKR 820

Query: 822  RPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKN 881
            R DG IER+KARLVARGFSQQYGLDYDETFSPVAK+TTVRVLLALAA+KDW L QMDVKN
Sbjct: 821  RTDGLIERHKARLVARGFSQQYGLDYDETFSPVAKLTTVRVLLALAANKDWDLRQMDVKN 880

Query: 882  AFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV 941
            AFLHGELDREIYMNQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYS+
Sbjct: 881  AFLHGELDREIYMNQPMGFQSQGHPKYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSI 940

Query: 942  AHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLG 1001
              ADSSLF+K   G L IVL Y                  ENLS+RF+MKELG+LKHFLG
Sbjct: 941  TPADSSLFVKANGGKLAIVLAY------------------ENLSVRFEMKELGQLKHFLG 1000

Query: 1002 LEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQL 1061
            LEVDRT EG+FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQL
Sbjct: 1001 LEVDRTHEGIFLCQQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQL 1060

Query: 1062 VGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCK 1121
            VGSL+YLT TR DISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +D K
Sbjct: 1061 VGSLLYLTFTRTDISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKGEDYK 1120

Query: 1122 LVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQES 1181
            LVGYCDADY GDHDTRRSTTGYVF  GS  ISWCSKRQPTVSLST E EYRAA  AAQES
Sbjct: 1121 LVGYCDADYVGDHDTRRSTTGYVFMLGSRAISWCSKRQPTVSLSTMEXEYRAAPMAAQES 1156

Query: 1182 TWLKLLMEDLHQKIDYPISLLCDNQSAIRLAEN 1214
            TWL  LM DLHQ +DY   L CDNQSA+RLAEN
Sbjct: 1181 TWLIRLMNDLHQXVDYAXPLYCDNQSAVRLAEN 1156

BLAST of CmoCh20G007300.1 vs. NCBI nr
Match: gi|147810137|emb|CAN73532.1| (hypothetical protein VITISV_012827 [Vitis vinifera])

HSP 1 Score: 881.3 bits (2276), Expect = 2.0e-252
Identity = 495/936 (52.88%), Positives = 617/936 (65.92%), Query Frame = 1

Query: 5   MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
           M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T PE ED N  LRKW+I
Sbjct: 1   MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPEAEDVNGILRKWKI 60

Query: 65  KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
           K GKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+ELLSI+QRD
Sbjct: 61  KXGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELLSIAQRD 120

Query: 125 MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
           +TI  YFHKVK++CREI+ELD ++ I E+RMKRIIIHGLRP++R F+AA+QGW  QPSLV
Sbjct: 121 LTITHYFHKVKTLCREISELDLEAPIGETRMKRIIIHGLRPKFRGFVAAIQGWQNQPSLV 180

Query: 185 EFENLLASQEAMAKQMGGFTLKGEE-ALYTSESQSNNR---------------------- 244
           EFENLLA QEA+AKQMGG +LKGEE ALY  + + N++                      
Sbjct: 181 EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGRWNSKQHTVGRTKKNEDKAKXSQGERS 240

Query: 245 ---------PSTRRG-----YNGDKRR-------SHQGIAQPERAQKNDNKSFQRTRF-- 304
                    P TR+      YN  K+        S +G+ +   A       +    F  
Sbjct: 241 ARVEGDSKNPXTRKKXEGKCYNCGKKGHMAKDCWSKKGLVESNAATSESEBEWDAQAFFV 300

Query: 305 ----GGICYNCGKKGHMSRDCWSRKKSIENNVAISKKKIED--EWDAEDVQIMPGNK--- 364
                       ++    +D W       N++   K+K+ D  E+    + +   N    
Sbjct: 301 AXGESXFIATTSEQIDYEKD-WIIDSGCSNHMTGDKEKLXDLSEYKGRHMVVTXNNSKJP 360

Query: 365 --------------SDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVK 424
                         ++ VSL NVYHVPG+KKNLLSV+QLT+SG +VLFGP+DVKVY+D++
Sbjct: 361 IAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHFVLFGPQDVKVYRDLE 420

Query: 425 IIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKG 484
           I+ +P I   R+ESVYV+SAE+A VDKTRKNETTDL H RL H+SY KL ++M+KSMLKG
Sbjct: 421 IMEEPVIXRWRLESVYVMSAETAXVDKTRKNETTDLXHMRLSHVSYSKLTVMMKKSMLKG 480

Query: 485 LPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVT 544
           LPQLEV+ D +CAGC YGKAHQLPY+ES +K K PLEL+HSD+FGPVK AS+SGM+    
Sbjct: 481 LPQLEVRKDTICAGCXYGKAHQLPYEESKWKTKGPLELIHSDVFGPVKXASLSGMK---- 540

Query: 545 FIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVE-GEVGAKIRCLRSDNGGEYTSDEFDQYL 604
                  Y+  F      D FS++     M E  E  +K +             EF + +
Sbjct: 541 -------YMXTFI-----DDFSRYVWVHFMKEKSETFSKFK-------------EFKE-M 600

Query: 605 HECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKL 664
            E  + ++  C      NG              SMLH KNVPGRFW EAM+TAA VIN+L
Sbjct: 601 TEAEVDKRIRCLRX--DNGG------------ESMLHXKNVPGRFWVEAMKTAAFVINRL 660

Query: 665 PQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRK 724
           PQ +L F SPFE LW++KP +SYFRVFGCVCY FVP+H RSK DKK V+CV VGYD+Q K
Sbjct: 661 PQQRLNFSSPFEKLWNIKPIVSYFRVFGCVCYAFVPNHXRSKMDKKXVRCVLVGYDSQXK 720

Query: 725 GWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNVDA 784
            WRCC+PT+GKYYTSR+VVFDE+S+WWSSEK++L DS   ++ + +L  Q+ +IQ ++  
Sbjct: 721 RWRCCNPTTGKYYTSRNVVFDESSSWWSSEKEILPDS---DVFKDEL--QSARIQLSLGE 780

Query: 785 SENPSDIDIDKQEVTQS--------SESDKNETTHQQ----LRRSNRIRRPNPKYANAAI 844
           +EN +D DI   E TQS          S++ E +  +    LRRS R ++PNPKYAN AI
Sbjct: 781 AENAADGDIGDDE-TQSPWQTGVHGQPSEEGEPSEIEAPIPLRRSARTKKPNPKYANVAI 840

Query: 845 VED-RVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKR 857
           VED    EPET+ EA QN  W KAM+EEI AL+ NQTWELVP+  D++P+SCKWVYKIKR
Sbjct: 841 VEDANTKEPETFAEAFQNPDWSKAMKEEIAALKRNQTWELVPKPRDVEPISCKWVYKIKR 885

BLAST of CmoCh20G007300.1 vs. NCBI nr
Match: gi|147798853|emb|CAN61340.1| (hypothetical protein VITISV_007301 [Vitis vinifera])

HSP 1 Score: 831.6 bits (2147), Expect = 1.8e-237
Identity = 427/629 (67.89%), Positives = 490/629 (77.90%), Query Frame = 1

Query: 674  DEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNVDA------SENPSDIDIDKQEV 733
            +E    W      +S S +  ++++ + +   +++   D       +EN +D DI+  E 
Sbjct: 362  NETINLWHMRLSHISYSKLTVMMKKSMLKGLPELEMRKDTICAGCEAENVADGDIEDDET 421

Query: 734  TQ----------SSESDKNETTHQ-QLRRSNRIRRPNPKYANAAIVED-RVYEPETYEEA 793
                        S E + +ET     LRRS R ++PNPKYAN AIVED    EPET+ EA
Sbjct: 422  QSPWQTGVHGQPSEEGEPSETEAPIPLRRSARTKKPNPKYANVAIVEDANAKEPETFAEA 481

Query: 794  SQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVA 853
             QN  W KA++EEI AL+ NQTWELVP+  D++P+SCKWVYKIKRR DGSIER+KARLVA
Sbjct: 482  FQNPDWTKAIKEEIAALKQNQTWELVPKPRDVEPISCKWVYKIKRRTDGSIERHKARLVA 541

Query: 854  RGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQ 913
            RGFSQQYGLDYDETFSPVAK+TTVRVLLALAA+KDW LWQMDVKNAFLHGELDREIYMNQ
Sbjct: 542  RGFSQQYGLDYDETFSPVAKLTTVRVLLALAANKDWDLWQMDVKNAFLHGELDREIYMNQ 601

Query: 914  PKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGN 973
            P GF S  +P YVCKLRKALYGLKQAPRAWY                 DSSLF+K   G 
Sbjct: 602  PXGFXSQGHPEYVCKLRKALYGLKQAPRAWY-----------------DSSLFVKANGGK 661

Query: 974  LTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQ 1033
            L IVLVYVDDLIIT DD  EI++T ENLS+RF+MKELG+LKHFLGLEVD T EG+FLCQQ
Sbjct: 662  LVIVLVYVDDLIITRDDVEEIFRTEENLSVRFEMKELGQLKHFLGLEVDCTHEGIFLCQQ 721

Query: 1034 KYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDIS 1093
            KY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQLVGSL+YLTLTRPDIS
Sbjct: 722  KYAKDLLKKFGMLECKSISTPMEPNAKMCEHEGKDLKDATMYRQLVGSLVYLTLTRPDIS 781

Query: 1094 YAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDT 1153
            YAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +DCKLVGYCDADYAGDHDT
Sbjct: 782  YAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKGEDCKLVGYCDADYAGDHDT 841

Query: 1154 RRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKID 1213
            RRSTTGYVF  GSG ISWCSKRQPTVSL TTEAEYRAAA AAQESTWL  LM DLHQ +D
Sbjct: 842  RRSTTGYVFMLGSGAISWCSKRQPTVSLLTTEAEYRAAAMAAQESTWLIRLMNDLHQLVD 901

Query: 1214 YPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFT 1273
            Y + L CDNQSA+RLAENPVFHARTKHVEVHYHFIREKVL+EE+E++QIK+ DQVADLFT
Sbjct: 902  YAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLEEEVELKQIKSKDQVADLFT 961

Query: 1274 KGLNTSKHESFRCQLNMMQRMRTSAEGEC 1285
            KGL+ SK E F  QL M++ +    EGEC
Sbjct: 962  KGLSGSKFECFCHQLGMVKILEADVEGEC 973

BLAST of CmoCh20G007300.1 vs. NCBI nr
Match: gi|47824985|gb|AAT38758.1| (Putative gag-pol polyprotein, identical [Solanum demissum])

HSP 1 Score: 818.1 bits (2112), Expect = 2.1e-233
Identity = 493/1327 (37.15%), Positives = 749/1327 (56.44%), Query Frame = 1

Query: 21   NYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFALKTTIGEEM 80
            NY  W+  M +  + Q+LW+IV   ET  PE ++N  +R+ R +  KA+F ++  + +E+
Sbjct: 21   NYQFWSLKMKTLFKSQELWDIV---ETGIPEGNANQ-MREHRKRDSKALFTIQQALDDEI 80

Query: 81   LEHIWDDKTPKEAWDTFVMLF---SKKNDTRLQLLENELLSISQRDMTIAQ-YFHKVKSI 140
               I   +T K+AW+     +    K    +LQ L  +  ++   +    Q Y  +  +I
Sbjct: 81   FPRISAVETSKQAWEILKQEYFGDDKVITVKLQTLRRDFETLFMNENESVQGYLSRTSAI 140

Query: 141  CREITELDPK--SAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLAS--- 200
               +     K  + IV S+    ++  L  ++   + A++      S   F+ L++S   
Sbjct: 141  VNRMRSYGEKIDNQIVVSK----VLRSLTTKFEHVVTAIEE-SKDLSTYSFDELMSSLLA 200

Query: 201  ------------QEAMAKQMGGFTLKG--EEALYTSESQSNNRPSTR----RGYN--GDK 260
                        QE   +  G F+ KG  E +      + N R   R    RG N  G+ 
Sbjct: 201  HEDRLNRSREKVQEKAFQVKGEFSYKGKAENSAGRGHGRGNFRGRGRGGSGRGRNQVGEF 260

Query: 261  RRSHQGI----------------AQPERAQKNDNKSFQRTRFGGICYNCGKKGHMSRDCW 320
            R+    I                 + +  QK+ N +        +     +    +   W
Sbjct: 261  RQYKSNIQCRYCKKFGHKEVDCWTKQKDEQKDANFTQNVEEESKLFMASSQITESANAVW 320

Query: 321  SRKKSIENNVAISKKKIEDEWDAEDVQIMPGN-----------------KSDTVSLHNVY 380
                   N+++ SK    D  +++  ++  G+                 + +   L++V 
Sbjct: 321  FIDSGCSNHMSSSKSLFRDLDESQKSEVRLGDDKQVHIEGKGTVEIKTVQGNVKFLYDVQ 380

Query: 381  HVPGIKKNLLSVSQLTTSGSYVLFGPE--DVKVYQDVKIIGKPTIEGRRVESVYVLSAES 440
            +VP +  NLLSV QL TSG  V+F     D+K  +  + I +  +   ++  + + +  +
Sbjct: 381  YVPTLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGRTIARVPMTQNKMFPLDISNVGN 440

Query: 441  AYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQ 500
            + +    KNET +LWH R GH++ + LKL+++K M+ GLP   +K   +C GC YGK  +
Sbjct: 441  SALVVKEKNET-NLWHLRYGHLNVNWLKLLVQKDMVIGLPN--IKELDLCEGCIYGKQTR 500

Query: 501  LPYKES-SFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTF 560
              +    S++A   LELVH+DL GP+K  S+ G RY + F DDYSR+ W++F+K KS+TF
Sbjct: 501  KSFPVGKSWRATTCLELVHADLCGPMKMESLGGSRYFLMFTDDYSRFSWVYFLKFKSETF 560

Query: 561  SKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAE 620
              F++FK  VE + G KI+ LR+D GGE+ S++F+ +  E GIRR+ T   TP+QNGVAE
Sbjct: 561  ETFKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAE 620

Query: 621  RKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMKPTIS 680
            RKNR + E  RS L AK +P  FW EA+ T  + +N  P   +   +P E     KP +S
Sbjct: 621  RKNRTVVEMARSSLKAKGLPDYFWGEAVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVS 680

Query: 681  YFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDE 740
            + R+FGC+ Y  V  H  SK D+K+ KC+FVGY  Q K +R  +P SGK   SR+VVF+E
Sbjct: 681  HLRIFGCIAYALVNFH--SKLDEKSTKCIFVGYSLQSKAYRLYNPISGKVIISRNVVFNE 740

Query: 741  ASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNVDASENPSDIDIDKQEVTQSSES-- 800
              +W  +   ++S  NI+ +         T  +S VD   +P+   +     +  + S  
Sbjct: 741  DVSWNFNSGNMMS--NIQLL--------PTDEESAVDFGNSPNSSPVSSSVSSPIAPSTT 800

Query: 801  ---DKNETTHQQLRRSNRIRRPNPKYANAAIVEDR----VYEPETYEEASQNSVWQKAME 860
               D++      LRRS R ++PNPKY+N      +    V +P  YEEA + S W+ AM 
Sbjct: 801  VAPDESSVEPIPLRRSTREKKPNPKYSNTVNTSCQFALLVSDPICYEEAVEQSEWKNAMI 860

Query: 861  EEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDY 920
            EEI A+E N TWELV        +  KWV++ K   DGSI+++KARLVA+G+SQQ G+D+
Sbjct: 861  EEIQAIERNSTWELVDAPEGKNVIGLKWVFRTKYNADGSIQKHKARLVAKGYSQQQGVDF 920

Query: 921  DETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPN 980
            DETFSPVA+  TVRV+LALAA     ++Q DVK+AFL+G+L+ E+Y++QP+GF    N N
Sbjct: 921  DETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVYVSQPQGFMITGNEN 980

Query: 981  YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKER-EGNLTIVLVYVDD 1040
             V KLRKALYGLKQAPRAWY KI  F   SG+  +  + +L++K++      +V +YVDD
Sbjct: 981  KVYKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNEPTLYLKKQGTDEFLLVCLYVDD 1040

Query: 1041 LIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKF 1100
            +I  G  +  +   + N+   F+M +LG LK+FLGLEV +  +G+F+ Q+KY  D+L+KF
Sbjct: 1041 MIYIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEVIQDKDGIFISQKKYAEDLLKKF 1100

Query: 1101 NMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYM 1160
             M+ C+  +TPM IN K+   +G E  +   +R LVG L YLT TRPDI+++V V+SR++
Sbjct: 1101 QMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGGLNYLTHTRPDIAFSVSVVSRFL 1160

Query: 1161 QSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFK 1220
            QSP K H  AA+R+LRY+ GT D+G+ Y ++ + +LVG+ D+DYAG  D R+ST+G  F 
Sbjct: 1161 QSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVGFTDSDYAGCLDDRKSTSGSCFS 1220

Query: 1221 FGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQ 1273
            FGSG ++W SK+Q TV+LST+EAEY AA+ AA+++ WL+ L+ED   +      +  D++
Sbjct: 1221 FGSGVVTWSSKKQETVALSTSEAEYTAASLAARQALWLRKLLEDFSYEQKESTEIFSDSK 1280
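
The Identity and Positives percentages in the HSP header lines are match counts divided by the aligned length (for example, 427 identical positions out of 629 aligned). The following is a minimal Python sketch, assuming header lines shaped like the ones in this record, that recomputes those percentages as a consistency check. The function name `check_hsp_stats` and the regular expression are illustrative and are not part of any BLAST tooling.

```python
import re

# Matches "Identity = 427/629 (67.89%)" and the Positives field.
# "Posi?tives" also accepts the "Postives" spelling found in some exports.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_stats(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, hits, length, reported in HSP_STATS.findall(line):
        # Normalise the label so both spellings map to "Positives".
        key = "Identity" if label == "Identity" else "Positives"
        recomputed = round(100 * int(hits) / int(length), 2)
        stats[key] = (int(hits), int(length), float(reported), recomputed)
    return stats


if __name__ == "__main__":
    headers = [
        "Identity = 427/629 (67.89%), Positives = 490/629 (77.90%), Query Frame = 1",
        "Identity = 493/1327 (37.15%), Positives = 749/1327 (56.44%), Query Frame = 1",
    ]
    for header in headers:
        for key, (hits, length, reported, recomputed) in check_hsp_stats(header).items():
            print(f"{key}: {hits}/{length} reported={reported:.2f} recomputed={recomputed:.2f}")
```

For the two HSPs shown above, the recomputed values agree with the reported ones to two decimal places.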

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
POLX_TOBAC | 3.1e-197 | 40.16 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME | 2.5e-101 | 36.36 | Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M810_ARATH | 1.2e-47 | 44.20 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
YCH4_YEAST | 1.9e-45 | 33.55 | Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YJ41B_YEAST | 2.1e-31 | 25.32 | Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name | E-value | Identity (%) | Description
A5AKW8_VITVI | 0.0e+00 | 62.99 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1
I1J0P4_BRADI | 0.0e+00 | 68.64 | Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1
I1IA27_BRADI | 0.0e+00 | 69.18 | Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1
I1HD26_BRADI | 0.0e+00 | 68.61 | Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1
I1H466_BRADI | 0.0e+00 | 68.97 | Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1

Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 7.6e-117 | 39.79 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 6.8e-49 | 44.20 | ATMG00810.1 DNA/RNA polymerases superfamily protein
ATMG00820.1 | 4.3e-19 | 42.37 | ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1 | 2.1e-13 | 39.74 | ATMG00240.1 Gag-Pol-related retrotransposon family protein
ATMG00300.1 | 2.8e-10 | 33.98 | ATMG00300.1 Gag-Pol-related retrotransposon family protein

Match Name | E-value | Identity (%) | Description
gi|147794801|emb|CAN71427.1| | 0.0e+00 | 62.99 | hypothetical protein VITISV_027864 [Vitis vinifera]
gi|147817226|emb|CAN75363.1| | 2.4e-261 | 62.55 | hypothetical protein VITISV_026292 [Vitis vinifera]
gi|147810137|emb|CAN73532.1| | 2.0e-252 | 52.88 | hypothetical protein VITISV_012827 [Vitis vinifera]
gi|147798853|emb|CAN61340.1| | 1.8e-237 | 67.89 | hypothetical protein VITISV_007301 [Vitis vinifera]
gi|47824985|gb|AAT38758.1| | 2.1e-233 | 37.15 | Putative gag-pol polyprotein, identical [Solanum demissum]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term | Definition
IPR001584 | Integrase_cat-core
IPR001878 | Znf_CCHC
IPR012337 | RNaseH-like_sf
IPR013103 | RVT_2
IPR025314 | DUF4219
IPR025724 | GAG-pre-integrase_dom
Vocabulary: Biological Process
Term | Definition
GO:0015074 | DNA integration
Vocabulary: Molecular Function
Term | Definition
GO:0003676 | nucleic acid binding
GO:0008270 | zinc ion binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmoCh20G007300 | CmoCh20G007300 | gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CmoCh20G007300.1 | CmoCh20G007300.1-protein | polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh20G007300.1.exon.1 | CmoCh20G007300.1.exon.1 | exon
CmoCh20G007300.1.exon.2 | CmoCh20G007300.1.exon.2 | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh20G007300.1.CDS.1 | CmoCh20G007300.1.CDS.1 | CDS
CmoCh20G007300.1.CDS.2 | CmoCh20G007300.1.CDS.2 | CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 447..563; score: 2.4
IPR001584 | Integrase, catalytic core | PROFILE | PS50994 | INTEGRASE | coord: 445..611; score: 23
IPR001878 | Zinc finger, CCHC-type | GENE3D | G3DSA:4.10.60.10 | - | coord: 252..278; score: 1.
IPR001878 | Zinc finger, CCHC-type | PFAM | PF00098 | zf-CCHC | coord: 260..274; score: 1.
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 260..276; score: 1.
IPR001878 | Zinc finger, CCHC-type | PROFILE | PS50158 | ZF_CCHC | coord: 261..274; score: 11
IPR001878 | Zinc finger, CCHC-type | unknown | SSF57756 | Retrovirus zinc finger-like domains | coord: 253..283; score: 1.5
IPR012337 | Ribonuclease H-like domain | GENE3D | G3DSA:3.30.420.10 | - | coord: 443..604; score: 5.6
IPR012337 | Ribonuclease H-like domain | unknown | SSF53098 | Ribonuclease H-like | coord: 448..620; score: 1.52
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 795..1038; score: 8.1E
IPR025314 | Domain of unknown function DUF4219 | PFAM | PF13961 | DUF4219 | coord: 17..42; score: 2.7
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 382..434; score: 1.6
None | No IPR available | PANTHER | PTHR11439 | GAG-POL-RELATED RETROTRANSPOSON | coord: 16..1197; score:
None | No IPR available | PANTHER | PTHR11439:SF164 | SUBFAMILY NOT NAMED | coord: 16..1197; score:
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 61..189; score: 4.8
None | No IPR available | unknown | SSF56672 | DNA/RNA polymerases | coord: 1047..1228; score: 5.48E-38
None | No IPR available | unknown | SSF56672 | DNA/RNA polymerases | coord: 794..1017; score: 5.48