CmaCh19G005910 (gene) Cucurbita maxima (Rimu)

NameCmaCh19G005910
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionGag-Pol polyprotein
LocationCma_Chr19 : 6541477 .. 6545538 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGACTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACGTTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGACCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGTACTATGTCCCATAGAAGAAGACGAGCTAGCACTCATGGCGACAATGGAAGACCATATCAACTATGAGAATGACTGGATCGTTGATTCAGGATGTTCAAATCACATGACTGGCGATCAGAGAAAATTGCAAGACAAGAAAGAGTACAAAGGAAGTCGTATGGTACGTACAGCAAATAACACACAGTTATCAATTGCTCAAATCGGCAATACGACAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGAAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTGACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATACATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATTTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCGTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA

mRNA sequence

ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGACTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACGTTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGACCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGTACTATGTCCCATAGAAGAAGACGAGCTAGCACTCATGGCGACAATGGAAGACCATATCAACTATGAGAATGACTGGATCGTTGATTCAGGATGTTCAAATCACATGACTGGCGATCAGAGAAAATTGCAAGACAAGAAAGAGTACAAAGGAAGTCGTATGGTACGTACAGCAAATAACACACAGTTATCAATTGCTCAAATCGGCAATACGACAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGAAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTGACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATACATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATTTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCGTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA

Coding sequence (CDS)

ATGGCCAACACGATGAGTGATTTCCAAATCGTTGGAGGAATTAAGAAACTCAACAACAACAACTACAACACGTGGGCAACATGCATGATGTCCTATTTACAAGGACAGGATCTTTGGGAGATCGTTGGCGGGTGTGAAACTACGCCGCCAGAGGAGGATTCTAACGACGCCTTGCGCAAATGGAGAATCAAAGCAGGCAAAGCAATGTTTGCCTTGAAGACCACCATCGGAGAAGAGATGTTAGAACATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACATTCGTGATGTTGTTCTCAAAGAAGAATGATACGAGACTACAACTTCTGGAGAATGAGTTGTTGTCAATTTCACAACGTGATATGACGATTGCTCAGTACTTCCACAAGGTCAAATCGATCTGTCGGGAGATTACTGAACTAGACCCAAAGTCCGCCATTGTAGAATCTCGAATGAAGAGGATTATAATCCACGGATTGCGACCAGAATATCGAAGCTTCATTGCTGCTGTACAAGGATGGCCCACTCAACCATCACTGGTAGAGTTCGAAAATTTGTTAGCCAGCCAAGAAGCCATGGCTAAACAAATGGGAGGCTTCACGTTGAAGGGTGAAGAAGCACTCTACACGAGTGAAAGTCAGAGCAATAATAGGCCGTCTACCAGACGTGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGGATTGCACAACCTGAGAGAGCTCAGAAGAACGACAACAAGAGTTTTCAAAGAACGAGATTTGGTGGTATTTGCTATAACTGCGGGAAAAAGGGCCATATGTCTAGAGATTGTTGGTCCAGGAAAAAGACCATTGAAAACAATGTGGCAATATCCAAAAAGAAGATAGAAGATGAATGGGATGCAGAGGTACTATGTCCCATAGAAGAAGACGAGCTAGCACTCATGGCGACAATGGAAGACCATATCAACTATGAGAATGACTGGATCGTTGATTCAGGATGTTCAAATCACATGACTGGCGATCAGAGAAAATTGCAAGACAAGAAAGAGTACAAAGGAAGTCGTATGGTACGTACAGCAAATAACACACAGTTATCAATTGCTCAAATCGGCAATACGACAATCATGCCTGGCAATAAGTCTGATACAGTGTCGCTACATAATGTTTATCATGTACCGGGTATAAAGAAGAACTTGTTGTCAGTGTCACAACTAACAACATCAGGAAGCTATGTCTTGTTTGGGCCAGAAGATGTAAAAGTTTATCAAGATGTTAAGATAATAGGAAAGCCGACGATAGAAGGACGAAGAGTGGAGTCTGTCTACGTTCTATCTGCAGAGTCTGCCTATGTTGACAAGACCCGAAAGAATGAGACGACAGATCTATGGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCTATGCTCAAAGGTCTACCACAACTGGAAGTCAAAACAGACGTTGTCTGCGCTGGATGTCAGTATGGTAAAGCTCATCAATTACCATACAAAGAGTCAAGTTTCAAAGCAAAGAAACCATTAGAGTTAGTTCACTCTGATTTGTTCGGCCCAGTCAAACAAGCATCGATCAGCGGAATGCGGTATATGGTGACATTTATTGACGACTACTCAAGATATGTGTGGATTTTCTTTATGAAAGAAAAGTCTGACACGTTCTCAAAGTTTCAAGAATTCAAGATGATGGTCGAAGGAGAAGTAGGAGCGAAGATTCGTTGTCTACGTTCAGACAATGGCGGAGAATACACGTCAGATGAGTTCGATCAATATTTACACGAGTGTGGGATACGACGTCAATTTACATGTGCCAACACGCCACAACAAAATGGTGTAGCAGAAAGAAAGAATCGACACCTTGCAGAAACCTGTCGAAGCATGTTACACGCAAAGAACGTTCCAGGAAGATTTTGGGCTGAAGCTATGCGAACTGCTGCCCATGTGATCAACAAGCTTCCTCAACCAAAGCTAGGGTTCGTCTCACCATTTGAGATACTATGGGATATGAAACCTACAATTAGTTACTTCCGAGTATTTGGCTGTGTTTGCTATGTATTTGTGCCTGACCATCTACGTAGCAAGTTTGACAAGAAAGCAGTCAAGTGTGTATTTGTTGGATACGACAATCAAAGAAAAGGATGGAGGTGCTGTGATCCAACAAGTGGAAAATACTATACATCAAGAGATGTAGTTTTTGATGAAGCATCTACATGGTGGTCCTCGGAGAAGAAAGTCTTATCAGATTCAAACATTGAAGAAATTCTACAACAGAAGCTGGGGGAGCAAACTACACAAATTCAATCAAATTTCGATGCATCTGAAAATCCAAGCGACATTGATATTGACAAGCAGGAGGTGACTCAATCAAGCGAATCTGATAAAAATGAAACAACACATCAACAACTTAGGCGATCAAATAGAATCCGAAGGCCAAATCCTAAGTATGCAAATGCAGCTATTGTAGAAGATAGAGTTTACGAACCAGAGACATATGAAGAAGCATCACAAAACTCGGTTTGGCAGAAAGCGATGGAGGAAGAAATTATAGCCTTGGAGCATAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATATCAAACCCGTCTCTTGCAAGTGGGTCTATAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGATTATGATGAAACATTCAGTCCAGTGGCAAAGATTACTACCGTACGAGTTCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACTGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGCTAGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAACAAGCACCGAGAGCTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATGCAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTGTTGGTCTACGTGGACGATTTGATTATCACCGGGGACGATGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAATACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACACTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGACTGTTTCTCTGCCAACAAAAGTATACCAGAGACATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTTCAACACCGATGGAGATAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACGTACCGACAACTAGTAGGTAGTCTTATCTACCTAACTTTAACTCGACCTGATATCTCGTATGCAGTTGGGGTTATGAGTCGATACATGCAAAGTCCAAAGAAGCCTCATCTGGATGCAGCTCGACGGATCTTGAGATATATCAAAGGTACAATCGACTATGGTCTTTTGTACAAAAGAAGCAAAGACTGCAAGCTAGTTGGATACTGTGATGCTGACTATGCAGGAGACCACGATACTCGGAGGTCAACCACTGGGTATGTGTTCAAGTTTGGTTCGGGAACAATTTCTTGGTGTAGCAAGAGACAACCAACAGTATCATTATCAACTACAGAAGCAGAGTATAGAGCAGCGGCTGGAGCAGCCCAGGAAAGTACATGGCTAAAACTCTTGATGGAAGATTTGCACCAGAAAATTGACTATCCAATATCACTTCTTTGCGACAACCAATCTGCGATTCGCCTTGCAGAAAATCCAGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACTACCATTTCATTAGAGAGAAGGTCCTAAAGGAAGAAATTGAGATGCAGCAGATCAAGACAGATGACCAAGTGGCAGACTTGTTTACAAAAGGGTTGAATACTAGCAAACATGAGAGCTTTCGCTGTCAGCTCAACATGATGCAGCGAATGAGGACTAGTGCTGAGGGGGAGTGTTGA

Protein sequence

MANTMSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRDMTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLASQEAMAKQMGGFTLKGEEALYTSESQSNNRPSTRRGYNGDKRRSHQGIAQPERAQKNDNKSFQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEVLCPIEEDELALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNTQLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESFRCQLNMMQRMRTSAEGEC
BLAST of CmaCh19G005910 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 748.0 bits (1930), Expect = 1.7e-214
Identity = 484/1359 (35.61%), Postives = 740/1359 (54.45%), Query Frame = 1

Query: 14   IKKLNNNN-YNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFAL 73
            + K N +N ++TW   M   L  Q L +++      P    + D    W     +A  A+
Sbjct: 8    VAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAED----WADLDERAASAI 67

Query: 74   KTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRDMT-IAQYFH 133
            +  + ++++ +I D+ T +  W     L+  K  T    L+ +L ++   + T    + +
Sbjct: 68   RLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLN 127

Query: 134  KVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLAS 193
                +  ++  L  K  I E     ++++ L   Y +    +    T   L +  + L  
Sbjct: 128  VFNGLITQLANLGVK--IEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLL 187

Query: 194  QEAMAKQMGGFTLKGEEALYTSESQSNNRPSTRRGYNGDKRRSHQGIAQPERAQKNDNKS 253
             E M K+      +G+  +     +S  R S   G +G + +S           KN +KS
Sbjct: 188  NEKMRKKPEN---QGQALITEGRGRSYQRSSNNYGRSGARGKS-----------KNRSKS 247

Query: 254  FQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEVLCPIEEDELAL 313
              R      CYNC + GH  RDC + +K        S +K +D   A V      D + L
Sbjct: 248  RVRN-----CYNCNQPGHFKRDCPNPRK---GKGETSGQKNDDNTAAMVQ---NNDNVVL 307

Query: 314  MATMEDHINY----ENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNTQLSIAQI 373
                E+   +    E++W+VD+  S+H T   R L  +        V+  N +   IA I
Sbjct: 308  FINEEEECMHLSGPESEWVVDTAASHHAT-PVRDLFCRYVAGDFGTVKMGNTSYSKIAGI 367

Query: 374  GNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGK 433
            G+  I   N   T+ L +V HVP ++ NL+S   L   G    F  +  ++ +   +I K
Sbjct: 368  GDICIKT-NVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAK 427

Query: 434  PTIEGRRVESVYVLSAE--SAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLP 493
                G    ++Y  +AE     ++  +   + DLWH R+GH+S   L+++ +KS++    
Sbjct: 428  GVARG----TLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAK 487

Query: 494  QLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFI 553
               VK    C  C +GK H++ ++ SS +    L+LV+SD+ GP++  S+ G +Y VTFI
Sbjct: 488  GTTVKP---CDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFI 547

Query: 554  DDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHEC 613
            DD SR +W++ +K K   F  FQ+F  +VE E G K++ LRSDNGGEYTS EF++Y    
Sbjct: 548  DDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSH 607

Query: 614  GIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQP 673
            GIR + T   TPQ NGVAER NR + E  RSML    +P  FW EA++TA ++IN+ P  
Sbjct: 608  GIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSV 667

Query: 674  KLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWR 733
             L F  P  +  + + + S+ +VFGC  +  VP   R+K D K++ C+F+GY ++  G+R
Sbjct: 668  PLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYR 727

Query: 734  CCDPTSGKYYTSRDVVFDEASTWWS---SEK----------KVLSDSNIEEILQQKLGEQ 793
              DP   K   SRDVVF E+    +   SEK           + S SN     +    E 
Sbjct: 728  LWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEV 787

Query: 794  TTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIV- 853
            + Q +   +  E    +D   +EV   ++    E  HQ LRRS R R  + +Y +   V 
Sbjct: 788  SEQGEQPGEVIEQGEQLDEGVEEVEHPTQ---GEEQHQPLRRSERPRVESRRYPSTEYVL 847

Query: 854  --EDRVYEPETYEEA---SQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYK 913
              +DR  EPE+ +E     + +   KAM+EE+ +L+ N T++LV      +P+ CKWV+K
Sbjct: 848  ISDDR--EPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFK 907

Query: 914  IKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMD 973
            +K+  D  + RYKARLV +GF Q+ G+D+DE FSPV K+T++R +L+LAAS D ++ Q+D
Sbjct: 908  LKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLD 967

Query: 974  VKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG 1033
            VK AFLHG+L+ EIYM QP+GFE A   + VCKL K+LYGLKQAPR WY K   F+    
Sbjct: 968  VKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQT 1027

Query: 1034 YSVAHADSSLFIKE-REGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELK 1093
            Y   ++D  ++ K   E N  I+L+YVDD++I G D+  I + + +LS  F MK+LG  +
Sbjct: 1028 YLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQ 1087

Query: 1094 HFLGLEV--DRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTP----MEINAKICAHEGKE 1153
              LG+++  +RT   L+L Q+KY   +L++FNM   K VSTP    ++++ K+C    +E
Sbjct: 1088 QILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEE 1147

Query: 1154 LND--ETTYRQLVGSLIY-LTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTI 1213
              +  +  Y   VGSL+Y +  TRPDI++AVGV+SR++++P K H +A + ILRY++GT 
Sbjct: 1148 KGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTT 1207

Query: 1214 DYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTE 1273
               L +  S D  L GY DAD AGD D R+S+TGY+F F  G ISW SK Q  V+LSTTE
Sbjct: 1208 GDCLCFGGS-DPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTE 1267

Query: 1274 AEYRAAAGAAQESTWLKLLMED--LHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEV 1333
            AEY AA    +E  WLK  +++  LHQK +Y +   CD+QSAI L++N ++HARTKH++V
Sbjct: 1268 AEYIAATETGKEMIWLKRFLQELGLHQK-EYVV--YCDSQSAIDLSKNSMYHARTKHIDV 1317

BLAST of CmaCh19G005910 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 372.9 bits (956), Expect = 1.5e-101
Identity = 220/605 (36.36%), Postives = 350/605 (57.85%), Query Frame = 1

Query: 762  EEILQQKLGEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNET----THQQLRRSNR 821
            ++ L +  G          + +E+  +I ID        E     +    T  Q+  +  
Sbjct: 813  DDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEE 872

Query: 822  IRRPNPKYANAAIVEDRVYEPETYEEAS---QNSVWQKAMEEEIIALEHNQTWELVPRLG 881
                N    NA  + + V  P +++E       S W++A+  E+ A + N TW +  R  
Sbjct: 873  DNSLNKVVLNAHTIFNDV--PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPE 932

Query: 882  DIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLAL 941
            +   V  +WV+ +K    G+  RYKARLVARGF+Q+Y +DY+ETF+PVA+I++ R +L+L
Sbjct: 933  NKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSL 992

Query: 942  AASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAW 1001
                + K+ QMDVK AFL+G L  EIYM  P+G   + N + VCKL KA+YGLKQA R W
Sbjct: 993  VIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCW 1052

Query: 1002 YGKIAEFLTQSGYSVAHADSSLFIKEREGNLT---IVLVYVDDLIITGDDEREIYQTREN 1061
            +    + L +  +  +  D  ++I ++ GN+     VL+YVDD++I   D   +   +  
Sbjct: 1053 FEVFEQALKECEFVNSSVDRCIYILDK-GNINENIYVLLYVDDVVIATGDMTRMNNFKRY 1112

Query: 1062 LSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM--EIN 1121
            L  +F+M +L E+KHF+G+ ++  ++ ++L Q  Y + +L KFNM  C  VSTP+  +IN
Sbjct: 1113 LMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKIN 1172

Query: 1122 AKICAHEGKELNDETTYRQLVGSLIYLTL-TRPDISYAVGVMSRYMQSPKKPHLDAARRI 1181
             ++      + +  T  R L+G L+Y+ L TRPD++ AV ++SRY            +R+
Sbjct: 1173 YELL---NSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRV 1232

Query: 1182 LRYIKGTIDYGLLYKRSK--DCKLVGYCDADYAGDHDTRRSTTGYVFK-FGSGTISWCSK 1241
            LRY+KGTID  L++K++   + K++GY D+D+AG    R+STTGY+FK F    I W +K
Sbjct: 1233 LRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTK 1292

Query: 1242 RQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVF 1301
            RQ +V+ S+TEAEY A   A +E+ WLK L+  ++ K++ PI +  DNQ  I +A NP  
Sbjct: 1293 RQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSC 1352

Query: 1302 HARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESFRCQLNMMQRM 1351
            H R KH+++ YHF RE+V    I ++ I T++Q+AD+FTK L  ++    R +L ++Q  
Sbjct: 1353 HKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDD 1409

BLAST of CmaCh19G005910 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 1.3e-47
Identity = 99/224 (44.20%), Postives = 136/224 (60.71%), Query Frame = 1

Query: 1028 VLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYT 1087
            +L+YVDD+++TG     +      LS  F MK+LG + +FLG+++     GLFL Q KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1088 RDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAV 1147
              +L    ML+CK +STP+ +         K   D + +R +VG+L YLTLTRPDISYAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAK-YPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1148 GVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRS 1207
             ++ + M  P     D  +R+LRY+KGTI +GL   ++    +  +CD+D+AG   TRRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1208 TTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTW 1252
            TTG+    G   ISW +KRQPTVS S+TE EYRA A  A E TW
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmaCh19G005910 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 186.4 bits (472), Expect = 2.0e-45
Identity = 103/307 (33.55%), Postives = 160/307 (52.12%), Query Frame = 1

Query: 945  MDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQ 1004
            MDV  AFL+  +D  IY+ QP GF +  NP+YV +L   +YGLKQAP  W   I   L +
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1005 SGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGEL 1064
             G+     +  L+ +        + VYVDDL++     +   + ++ L+  + MK+LG++
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 1065 KHFLGLEVDRTDEG-LFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDE 1124
              FLGL + ++  G + L  Q Y      +  +   K   TP+  +  +       L D 
Sbjct: 121  DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 1125 TTYRQLVGSLIYLTLT-RPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLY 1184
            T Y+ +VG L++   T RPDISY V ++SR+++ P+  HL++ARR+LRY+  T    L Y
Sbjct: 181  TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 1185 KRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKR-QPTVSLSTTEAEYRA 1244
            +      L  YCDA +   HD   ST GYV       ++W SK+ +  + + +TEAEY  
Sbjct: 241  RSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYIT 300

Query: 1245 AAGAAQE 1249
            A+    E
Sbjct: 301  ASETVME 307

BLAST of CmaCh19G005910 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 139.8 bits (351), Expect = 2.2e-31
Identity = 117/462 (25.32%), Postives = 220/462 (47.62%), Query Frame = 1

Query: 898  YKARLVARGFSQQYGLDYDETFSPVAKITT----VRVLLALAASKDWKLWQMDVKNAFLH 957
            YKAR+V RG +Q       +T+S +   +     +++ L +A +++  +  +D+ +AFL+
Sbjct: 1337 YKARIVCRGDTQS-----PDTYSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLY 1396

Query: 958  GELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHAD 1017
             +L+ EIY+  P       +   V KL KALYGLKQ+P+ W   + ++L   G       
Sbjct: 1397 AKLEEEIYIPHPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT 1456

Query: 1018 SSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGEL------KHF 1077
              L+  + E    ++ VYVDD +I   +E+ + +    L   F++K  G L         
Sbjct: 1457 PGLY--QTEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDI 1516

Query: 1078 LGLEV--DRTDEGLFLCQQKYTRDMLQKFN--MLECKQVSTPMEINAKICAHEGKELNDE 1137
            LG+++  ++    + L  + +   M +K+N  + + ++ S P     KI   +      E
Sbjct: 1517 LGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE 1576

Query: 1138 TTYR-------QLVGSLIYLT-LTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGT 1197
              +R       QL+G L Y+    R DI +AV  ++R +  P +       +I++Y+   
Sbjct: 1577 EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRY 1636

Query: 1198 IDYGLLYKR--SKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLS 1257
             D G+ Y R  +KD K++   DA    ++D  +S  G +  +G    +  S +     +S
Sbjct: 1637 KDIGIHYDRDCNKDKKVIAITDASVGSEYDA-QSRIGVILWYGMNIFNVYSNKSTNRCVS 1696

Query: 1258 TTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVE 1317
            +TEAE  A      +S  LK+ +++L +  +  I ++ D++ AI+         + K   
Sbjct: 1697 STEAELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTW 1756

Query: 1318 VHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESF 1336
            +    I+EK+ ++ I++ +I     +ADL TK ++ S  + F
Sbjct: 1757 IKTEIIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDFKRF 1784

BLAST of CmaCh19G005910 vs. TrEMBL
Match: A5AKW8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1)

HSP 1 Score: 1808.9 bits (4684), Expect = 0.0e+00
Identity = 917/1354 (67.73%), Postives = 1065/1354 (78.66%), Query Frame = 1

Query: 5    MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
            M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T P+ ED+N  LRKW+I
Sbjct: 1    MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPKVEDANGILRKWKI 60

Query: 65   KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
            KAGKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+EL S++QRD
Sbjct: 61   KAGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELFSVAQRD 120

Query: 125  MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
            +TIAQYFHKVK++CREI+ELD ++ I E+ MKRIIIHGLRPE+R F+AA+QGW  QPSLV
Sbjct: 121  LTIAQYFHKVKTLCREISELDLEAPIGETXMKRIIIHGLRPEFRGFVAAIQGWQNQPSLV 180

Query: 185  EFENLLASQEAMAKQMGGFTLKGEE-ALYTSESQSNNRPST--RRGYNGDKRRSHQGIAQ 244
            EFENLLA QEA+AKQMGG +LKGEE ALY  +   N+   T  R   N DK +  QG  +
Sbjct: 181  EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGGWNSXQHTVRRTKKNEDKAKCSQG-ER 240

Query: 245  PERAQKNDNKSFQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEV 304
              R + +        +F G CYNC KKGHM++DCWS+K  +E+N   SK   EDEWDA+ 
Sbjct: 241  SARVEGDSKNPGTXKKFEGKCYNCXKKGHMAKDCWSKKGLVESNATTSKS--EDEWDAQA 300

Query: 305  LCPIEEDELALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNT 364
                   E A +AT  + I+YE DWI+DSGCSNHMTGD+ KLQD  EYKG  MV TANN+
Sbjct: 301  FFAAI-GESAFIATTSEQIDYEKDWIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNS 360

Query: 365  QLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQ 424
            +L IA IGNT +     ++ VSL NVYHVPG+KKNLLSV+QLT+SG  VLFGP+DVKVY 
Sbjct: 361  KLPIAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFGPQDVKVYH 420

Query: 425  DVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSM 484
            D++++ +P I+GRR+ESVYV+SAE+AYVDKTRKNET DLWH RL HISY KL ++M+KSM
Sbjct: 421  DLEVMEEPVIKGRRLESVYVMSAETAYVDKTRKNETADLWHMRLSHISYSKLTMMMKKSM 480

Query: 485  LKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRY 544
            LKGLPQLEV+   +CA CQYGKAHQLPY+ES +KAK PLEL+HSD+FGPVKQAS+SGM  
Sbjct: 481  LKGLPQLEVRKXTICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGM-- 540

Query: 545  MVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQ 604
                     +Y+  F      D FS+    +M                    +TS E  +
Sbjct: 541  ---------KYMVTFI-----DDFSRRVYLQMSF------------------FTSSENXE 600

Query: 605  YLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVIN 664
            Y         FTCANTPQQNGV ERKNRHLAE CRSMLHAKNVPG FWAE M+TAA VIN
Sbjct: 601  YAIS------FTCANTPQQNGVXERKNRHLAEICRSMLHAKNVPGXFWAEXMKTAAFVIN 660

Query: 665  KLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQ 724
            +LPQ +L F SPFE LW++KPT+SYFRVFGCVCYVFVP+HLRSK DKKAV+CV VGYD+Q
Sbjct: 661  RLPQQRLNFSSPFEKLWNIKPTVSYFRVFGCVCYVFVPNHLRSKMDKKAVRCVLVGYDSQ 720

Query: 725  RKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNF 784
            RK WRCCDPT+GK YTSR+VVFDE+S+WWSSEK++L DS+   + + +L  Q+ +IQ + 
Sbjct: 721  RKXWRCCDPTTGKCYTSRNVVFDESSSWWSSEKEILXDSB---VFKDEL--QSARIQLSL 780

Query: 785  DASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVED-RVYEPE 844
              +EN  D DI   + TQS      +T         R ++PNPKYAN AIVED    EP 
Sbjct: 781  GEAENAXDGDIG-DDXTQSPW----QTGVHGQPSEERTKKPNPKYANVAIVEDANAKEPX 840

Query: 845  TYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYK 904
            T+ EA QNS W KAM EEI AL+ NQTWELVP+  D++P SCKWVYKIKRR DGSIER+K
Sbjct: 841  TFAEAFQNSDWSKAMXEEIAALKRNQTWELVPKPRDVEPXSCKWVYKIKRRTDGSIERHK 900

Query: 905  ARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDRE 964
            A LVARGFSQQYGLDYDETFSPV K+TTVRVLLALAA+KDW LWQMDVKNAFLHGELDRE
Sbjct: 901  AXLVARGFSQQYGLDYDETFSPVXKLTTVRVLLALAANKDWDLWQMDVKNAFLHGELDRE 960

Query: 965  IYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIK 1024
            IYMNQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV  ADSSLF+K
Sbjct: 961  IYMNQPMGFQSQGHPEYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVTPADSSLFVK 1020

Query: 1025 EREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGL 1084
               G L IVLVYVDDLIITGDD  EI++T+ENLS+RF+MKELG+LKHFLGLEVDRT+EG+
Sbjct: 1021 ANGGKLAIVLVYVDDLIITGDDVEEIFRTKENLSVRFEMKELGQLKHFLGLEVDRTNEGI 1080

Query: 1085 FLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLT 1144
            FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQLVGSL+YLTLT
Sbjct: 1081 FLCQQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQLVGSLLYLTLT 1140

Query: 1145 RPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYA 1204
             PDISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +DCKLVGYCDADYA
Sbjct: 1141 XPDISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKXEDCKLVGYCDADYA 1200

Query: 1205 GDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDL 1264
            GDHDTR STTGYVF  GSG ISWCSKRQPTVSLSTTEAEYRAAA A QES WL  LM DL
Sbjct: 1201 GDHDTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLMNDL 1260

Query: 1265 HQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQV 1324
            HQ +DY + L CDNQSA+RLAENPVFHARTKHVEVHYHFIREKVLKEE+E+ QIK++DQV
Sbjct: 1261 HQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSEDQV 1300

Query: 1325 ADLFTKGLNTSKHESFRCQLNMMQRMRTSAEGEC 1354
            ADLFTKGL+ SK ESF  QL M++ +    EGEC
Sbjct: 1321 ADLFTKGLSGSKFESFCHQLGMVKILEADVEGEC 1300

BLAST of CmaCh19G005910 vs. TrEMBL
Match: I1J3P8_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1761.9 bits (4562), Expect = 0.0e+00
Identity = 880/1386 (63.49%), Postives = 1075/1386 (77.56%), Query Frame = 1

Query: 13   GIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFAL 72
            GI+KLN++NY  W TCM SYLQGQDLWE+V G E  PPE  + DALRKW+IK+GKAMF L
Sbjct: 10   GIRKLNSHNYGYWQTCMESYLQGQDLWEVVAGTEAFPPE--NADALRKWKIKSGKAMFVL 69

Query: 73   KTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRDMTIAQYFHK 132
            KTTI E++LEHI D+KTPKEAW+T   LFS+KN+ RLQLLENEL  ISQ +++I+QYF K
Sbjct: 70   KTTIEEDLLEHIRDEKTPKEAWETLAKLFSRKNEARLQLLENELAGISQGNLSISQYFSK 129

Query: 133  VKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLASQ 192
            VK ICREI++L P   + E+RMKRIIIHGLRPEY  F+AAV GWPT PS+VE ENLL +Q
Sbjct: 130  VKFICREISQLAPDEKVSETRMKRIIIHGLRPEYNGFMAAVMGWPTPPSVVELENLLVNQ 189

Query: 193  EAMAKQMGGFTLKGEE-ALYTSESQSNNR---------------PSTRRGYNGDKRRSHQ 252
            E +AK+MG  T+K EE AL+T +   + +               P  R   +G  +R  Q
Sbjct: 190  EELAKKMGSITIKEEEEALFTKKKSPHQKQWKPKPKWTEGGKEHPKERSNSSGGDQRERQ 249

Query: 253  GIAQPER-AQKNDNKSFQRTRFGGI-------------CYNCGKKGHMSRDCWSRKKTIE 312
            G    E+ A++  +  F   + G               C NCGKKGH++R+C   ++  E
Sbjct: 250  GPKWYEKNARRPSDGCFNCGKAGHFARECRFPRRSNDGCSNCGKKGHLTRECRYPRRRYE 309

Query: 313  NNVAISKKK---------IEDEWDAEV-------LCPIEED--ELALMATMEDHINYEND 372
             NVA +K+K          E+EWDAE        +  +EED    AL A  +  INY++D
Sbjct: 310  GNVATTKEKEEITLEASMSEEEWDAEAGYSQEVDIEDLEEDMEAPALAAIKDPKINYKDD 369

Query: 373  WIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNTQLSIAQIGNTTIMPGNKSDTVSLH 432
            WIVDSGCSNHMT D  KLQD  EYKG +MV TANN++LSI+ +G T + P   +  + L 
Sbjct: 370  WIVDSGCSNHMTNDDNKLQDMTEYKGGKMVLTANNSKLSISHVGKTAV-PRYGTQQLQLE 429

Query: 433  NVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRVESVYVLSAE 492
             VYHVPG+KKNLLSV QLT  G YVLFGP++V +++ +K+IG P  EG++ +S++VLSAE
Sbjct: 430  RVYHVPGLKKNLLSVPQLTAEGKYVLFGPQEVAIFRRLKVIGTPIKEGKKRDSLFVLSAE 489

Query: 493  SAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAH 552
            SAYVDKTRKNET DLWHARLGHISYHKLK +MEK ++KGL  L+++TD+VCAGCQYGKAH
Sbjct: 490  SAYVDKTRKNETADLWHARLGHISYHKLKEMMEKHVVKGLSDLDIRTDMVCAGCQYGKAH 549

Query: 553  QLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTF 612
            QLPYKES  ++K PLEL+HSD+FGPVKQ S+ GMRYMVTFIDD+SRYVW++FMKEKS+TF
Sbjct: 550  QLPYKESQHQSKVPLELIHSDVFGPVKQISLGGMRYMVTFIDDFSRYVWVYFMKEKSETF 609

Query: 613  SKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAE 672
             KF+EFK M+EGE+  KIRCLR+DNG EY S+EF  YL +  I+RQ TC NTPQQNGVAE
Sbjct: 610  MKFKEFKDMIEGELEYKIRCLRTDNGREYLSNEFTIYLKKNKIKRQLTCPNTPQQNGVAE 669

Query: 673  RKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMKPTIS 732
            RKNRHLAETCRSMLHAKNVPGRFWAE MRTAAHVINKLPQ +LGF SP E LW MKP IS
Sbjct: 670  RKNRHLAETCRSMLHAKNVPGRFWAECMRTAAHVINKLPQVRLGFKSPHEKLWRMKPAIS 729

Query: 733  YFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDE 792
            + +VFGCVCY+FVPDHLR+KF+KKA +C+FVGYD+ RKGWRCCDPT+GK +TSR++VFDE
Sbjct: 730  HLKVFGCVCYIFVPDHLRTKFEKKAKRCIFVGYDDARKGWRCCDPTTGKCHTSRNIVFDE 789

Query: 793  ASTWWSSEKKVLSDS-NIEEILQ----QKLGEQTTQIQSNFDASENPSDIDIDKQEVTQS 852
            AS+WWS +K+ + +S ++EE+++    QKL   +   +S+   +++P    I + E  Q+
Sbjct: 790  ASSWWSPKKEEIPESPDVEEVIEEERDQKLETPSEGERSSPSKTKSPWKTGIHQPEEPQT 849

Query: 853  SESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRV-YEPETYEEASQNSVWQKAMEEEI 912
             E D      Q+LRRS R R+PNP+YANAA+ ++ +  EP +YEEA++   WQKAMEEEI
Sbjct: 850  EEHD------QELRRSTRPRKPNPRYANAALADESLPIEPSSYEEAARGPEWQKAMEEEI 909

Query: 913  IALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDET 972
             AL+ NQTW+LVP+  D+KP+SCKWVYK+K R DGSIERYKARLVARGFSQ+YGLDY+ET
Sbjct: 910  KALKENQTWDLVPKPKDVKPISCKWVYKVKTRTDGSIERYKARLVARGFSQEYGLDYEET 969

Query: 973  FSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVC 1032
            FSPVAKITTVRVLLALAASK W+LWQMDVKNAFLHGELD+EIYM QPKGFES   P +VC
Sbjct: 970  FSPVAKITTVRVLLALAASKSWELWQMDVKNAFLHGELDKEIYMEQPKGFESKKYPEHVC 1029

Query: 1033 KLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDDLIIT 1092
            KL+KALYGLKQAPRAWYGKI EFL  +G+ VA +DSSLF+  +EG L IVLVYVDDLIIT
Sbjct: 1030 KLKKALYGLKQAPRAWYGKIGEFLVHNGFKVAPSDSSLFVMAKEGRLAIVLVYVDDLIIT 1089

Query: 1093 GDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLE 1152
            GD   EI +TRENLS+RFQMKELGEL+HFLGLEV+ T  G+FL QQKY +D+L+++ ML+
Sbjct: 1090 GDYSEEIERTRENLSVRFQMKELGELRHFLGLEVEHTKNGIFLGQQKYAKDLLKRYGMLD 1149

Query: 1153 CKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPK 1212
            CK +STPM+ NA++   +GK L D T YRQLVGSLIYLTL+RPDISYAVG+ SRYM +PK
Sbjct: 1150 CKPISTPMDPNARLQEDKGKNLEDATMYRQLVGSLIYLTLSRPDISYAVGMASRYMSTPK 1209

Query: 1213 KPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSG 1272
            KPHLDA RRILRY+KGT++YG+LYK++K+C+++GYCDADYAGD DTRRSTTGY+F  GSG
Sbjct: 1210 KPHLDAIRRILRYVKGTLNYGILYKKTKECQVIGYCDADYAGDCDTRRSTTGYLFSLGSG 1269

Query: 1273 TISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIR 1332
             I+WCSKRQPTV+LS+TEAEYR+AA AAQESTWLK LMEDLHQ     + + CDN S IR
Sbjct: 1270 AITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLMEDLHQTPKDQVWIFCDNLSTIR 1329

Query: 1333 LAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESFRCQ 1345
            LAENPVFHARTKH+EVHYH+IREKVLK EIEM   KT+DQ AD+ TK LN SK E FR  
Sbjct: 1330 LAENPVFHARTKHIEVHYHYIREKVLKGEIEMVPTKTEDQTADILTKSLNKSKFEKFREA 1386

BLAST of CmaCh19G005910 vs. TrEMBL
Match: I1J0P4_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1758.4 bits (4553), Expect = 0.0e+00
Identity = 884/1427 (61.95%), Postives = 1083/1427 (75.89%), Query Frame = 1

Query: 5    MSDFQIVG-GIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRI 64
            M+D    G GI+KLN++NY  W TCM SYLQGQDLWE+V G E  PPE+   DALRKW+I
Sbjct: 1    MADLASSGSGIRKLNSHNYGYWQTCMESYLQGQDLWEVVAGTEAFPPEDA--DALRKWKI 60

Query: 65   KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
            K+GKAMF LKTTI E++LEHI D+KTPKEAW+T   LFS+KN+ RLQLLENEL  ISQ +
Sbjct: 61   KSGKAMFVLKTTIEEDLLEHIRDEKTPKEAWETLAKLFSRKNEARLQLLENELAGISQGN 120

Query: 125  MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
            ++I+QYF KVK ICREI++L P   + E+RMKRIIIHGLRPEY  F+AAV GWPT PS+V
Sbjct: 121  LSISQYFSKVKFICREISQLAPDEKVSETRMKRIIIHGLRPEYNGFMAAVMGWPTPPSVV 180

Query: 185  EFENLLASQEAMAKQMGGFTLKGEE-ALYTSESQSNNR---------------PSTRRGY 244
            E ENLL +QE +AK+MG  T+K EE AL+T +   + +               P  RR  
Sbjct: 181  ELENLLVNQEELAKKMGSITIKEEEEALFTKKKSPHQKQWKPKPKWTEGGKEHPKERRNS 240

Query: 245  NGDKRRSHQGIAQPER-AQKNDNKSFQRTRFGGI-------------CYNCGKKGHMSRD 304
            +G  +R  QG    E+ A++  +  F   + G               C NCGKKGH++R+
Sbjct: 241  SGGDQRERQGPKWYEKNARRPSDGCFNCGKAGHFARECRFPRRSNDGCSNCGKKGHLTRE 300

Query: 305  CWSRKKTIENNVAISKKK---------IEDEWDAEV------------------LCPIEE 364
            C   ++  E NVA +K+K          E+EWDAE                      ++E
Sbjct: 301  CRYPRRRYEGNVATTKEKEEITLEASMSEEEWDAEAGYSQEVDIKDLEEDMEAPAFAMDE 360

Query: 365  DEL-----------------------ALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQ 424
            +EL                       AL A  +  INY++DWIVDSGCSNHMT D  KLQ
Sbjct: 361  EELEEDMKPPVFATDEEELEEDMEAPALAAIKDPKINYKDDWIVDSGCSNHMTNDDNKLQ 420

Query: 425  DKKEYKGSRMVRTANNTQLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLT 484
            D  EYKG +MV TANN++LSI+ +G T ++P   +  + L  VYHVPG+KKNLLSV QLT
Sbjct: 421  DMTEYKGGKMVLTANNSKLSISHVGKT-VVPRYGTQQLQLERVYHVPGLKKNLLSVPQLT 480

Query: 485  TSGSYVLFGPEDVKVYQDVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHAR 544
              G YVLFGP++V +++ +K+IG P +EG++ +S++VLSAESAYVDKTRKNET DLWHAR
Sbjct: 481  AEGKYVLFGPQEVAIFRRLKVIGTPIMEGKKRDSLFVLSAESAYVDKTRKNETADLWHAR 540

Query: 545  LGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVH 604
            LGH+SYHKLK +MEK ++KGLP L+++TD+VCAGCQYGKAHQLPYKES  ++K PLEL+H
Sbjct: 541  LGHVSYHKLKEMMEKHVVKGLPDLDIRTDMVCAGCQYGKAHQLPYKESQHQSKVPLELIH 600

Query: 605  SDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIR 664
            SD+FGPVKQ S+ GMRYMVTFIDD+SRYVW++FMKEKS+TF KF+EFK M+EGE+  KIR
Sbjct: 601  SDVFGPVKQISLGGMRYMVTFIDDFSRYVWVYFMKEKSETFIKFKEFKDMIEGELEYKIR 660

Query: 665  CLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNV 724
            CLR+DNG EY S+EF  YL +  I+RQ TC NTPQQNGVAERKNRHLAETCRSMLHAKNV
Sbjct: 661  CLRTDNGREYLSNEFTIYLKKKKIKRQLTCPNTPQQNGVAERKNRHLAETCRSMLHAKNV 720

Query: 725  PGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRS 784
            PGRFWAE MRTAAHVINKLPQ +LGF SP E LW MKP IS+ +VFGCVCY+FVPDHLR+
Sbjct: 721  PGRFWAECMRTAAHVINKLPQVRLGFKSPHEKLWRMKPAISHLKVFGCVCYIFVPDHLRT 780

Query: 785  KFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDS-NIE 844
            KF+KKA +C+FVGYD+ RKGWRCCDPT+GK +TSR++VFDEAS+WWS +K+ + +S ++E
Sbjct: 781  KFEKKAKRCIFVGYDDARKGWRCCDPTTGKCHTSRNIVFDEASSWWSPKKEEIPESPDVE 840

Query: 845  EILQ----QKLGEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRI 904
            E+++    QKL   +   +S+   +++P    I + E  Q+ E D      Q+LRRS R 
Sbjct: 841  EVIEEERDQKLETPSEGERSSPSKTKSPWKTGIHQPEEPQTEEHD------QELRRSTRP 900

Query: 905  RRPNPKYANAAIVEDRV-YEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIK 964
            R+PNP+YANAA+ ++ +  EP +YEEA++   WQKAMEEEI AL+ NQTW+LVP+  D+K
Sbjct: 901  RKPNPRYANAALADESLPIEPSSYEEAARGPEWQKAMEEEIKALKENQTWDLVPKPKDVK 960

Query: 965  PVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAAS 1024
            P+SCKWVYK+K R DGSIERYKARLVARGFSQ+YGLDY+ETFSPVAKITTVRVLLALAAS
Sbjct: 961  PISCKWVYKVKTRTDGSIERYKARLVARGFSQEYGLDYEETFSPVAKITTVRVLLALAAS 1020

Query: 1025 KDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGK 1084
            K W+LWQMDVKNAFLHGELD+EIYM QPKGFES   P +VCKL+KALYGLKQAPRAWYGK
Sbjct: 1021 KSWELWQMDVKNAFLHGELDKEIYMEQPKGFESKKYPEHVCKLKKALYGLKQAPRAWYGK 1080

Query: 1085 IAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQ 1144
            I EFL  +G+ VA +DSSLF+  +EG L IVLVYVDDLIITGD   EI +TRENLS+RFQ
Sbjct: 1081 IGEFLVHNGFKVAPSDSSLFVMAKEGRLAIVLVYVDDLIITGDYSEEIERTRENLSVRFQ 1140

Query: 1145 MKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEG 1204
            MKELGEL+HFLGLEV+ T  G+FL QQKY +D+L+++ ML+CK +STPM+ N ++   +G
Sbjct: 1141 MKELGELRHFLGLEVEHTKNGIFLGQQKYAKDLLKRYGMLDCKPISTPMDPNTRLQEDKG 1200

Query: 1205 KELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTID 1264
            K L D T YRQLVGSLIYLTL+RPDISYAVGV SRYM +PKKPHLDA RRILRY+KGT++
Sbjct: 1201 KNLEDATMYRQLVGSLIYLTLSRPDISYAVGVASRYMSTPKKPHLDAIRRILRYVKGTLN 1260

Query: 1265 YGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEA 1324
            YG+LYK++K+C+++GYCDADYAGD DTRRSTTGY+F  GSG I+WCSKRQPTV+LS+TEA
Sbjct: 1261 YGILYKKTKECQVIGYCDADYAGDCDTRRSTTGYLFSLGSGAITWCSKRQPTVALSSTEA 1320

Query: 1325 EYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYH 1345
            EYR+AA AAQESTWLK LMEDLHQ     + + CDN S IRLAENPVFHARTKH+EVHYH
Sbjct: 1321 EYRSAAAAAQESTWLKQLMEDLHQTPKDQVWIFCDNLSTIRLAENPVFHARTKHIEVHYH 1380

BLAST of CmaCh19G005910 vs. TrEMBL
Match: I1IA27_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1756.5 bits (4548), Expect = 0.0e+00
Identity = 880/1418 (62.06%), Postives = 1079/1418 (76.09%), Query Frame = 1

Query: 13   GIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFAL 72
            GI+KLN++NY  W TCM SYLQGQDLWE+V G E  PPE  + DALRKW+IK+GKAMF L
Sbjct: 10   GIRKLNSHNYGYWQTCMESYLQGQDLWEVVAGTEAFPPE--NADALRKWKIKSGKAMFVL 69

Query: 73   KTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRDMTIAQYFHK 132
            KTTIGE++LEHI D+KTPKEAW+T   LFS+KN+ RLQLLENEL+ ISQ +++I+QYF K
Sbjct: 70   KTTIGEDLLEHIRDEKTPKEAWETLAKLFSRKNEARLQLLENELVGISQGNLSISQYFSK 129

Query: 133  VKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLASQ 192
            VK ICREI++L P   + E+RMKRIIIHGLRPEY  F+AAV GWPT PS+VE ENLL +Q
Sbjct: 130  VKFICREISQLAPDEKVSETRMKRIIIHGLRPEYNGFMAAVMGWPTPPSVVELENLLVNQ 189

Query: 193  EAMAKQMGGFTLKGEE-ALYTSESQSNNR---------------PSTRRGYNGDKRRSHQ 252
            E +AK+MG  T+K EE AL+T +   + +               P  R   +G  +R  Q
Sbjct: 190  EELAKKMGSITIKEEEEALFTKKKSPHQKQWKPKPKWTEGGKEHPKERSNSSGGDQRERQ 249

Query: 253  GIAQPER-AQKNDNKSFQRTRFGGI-------------CYNCGKKGHMSRDCWSRKKTIE 312
            G    E+ A++  +  F   + G               C NCGKKGH++R+C   ++  E
Sbjct: 250  GPKWYEKNARRPSDGCFNCGKAGHFARECRFPRRSNDGCSNCGKKGHLTRECRYPRRRYE 309

Query: 313  NNVAISKKK---------IEDEWDAEV------------------LCPIEEDEL------ 372
             NVA +K+K          E+EWDAE                      ++E+EL      
Sbjct: 310  GNVATTKEKEEITLEASMSEEEWDAEAGYSQEVDIEDLEEDMEAPAFAMDEEELEEDMEP 369

Query: 373  -----------------ALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSR 432
                             AL A  +  INY++DWIVDSGCSN+MT D  KLQD  EYKG +
Sbjct: 370  PVFARDEEELEEDMEAPALAAIKDPKINYKDDWIVDSGCSNYMTNDDNKLQDMTEYKGGK 429

Query: 433  MVRTANNTQLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFG 492
            MV TANN++LSI+ +G T + P      + L  VYHVPG+KKNLLSV QLT  G YVLFG
Sbjct: 430  MVLTANNSKLSISHVGKTAV-PRYGPQQLQLERVYHVPGLKKNLLSVPQLTAEGKYVLFG 489

Query: 493  PEDVKVYQDVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKL 552
            P++V +++ +K+IG P +EG++ +S++VLSAESAYVDKTRKNET DLWHARLGH+SYHKL
Sbjct: 490  PQEVAIFRRLKVIGTPIMEGKKRDSLFVLSAESAYVDKTRKNETADLWHARLGHVSYHKL 549

Query: 553  KLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQ 612
            K +MEK ++KGLP L+++TD+VCAGCQYGKAHQLPYKES  ++K PLEL+HSD+FGPVKQ
Sbjct: 550  KEMMEKHVVKGLPDLDIRTDMVCAGCQYGKAHQLPYKESQHQSKVPLELIHSDVFGPVKQ 609

Query: 613  ASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGE 672
             S+ GMRYMVTFI+D+SRYVW++FMKEKS+TF KF+EFK M+EGE+  KIRCLR+DNG E
Sbjct: 610  ISLGGMRYMVTFINDFSRYVWVYFMKEKSETFMKFKEFKDMIEGELEYKIRCLRTDNGRE 669

Query: 673  YTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAM 732
            Y S+EF  YL +  I+RQ TC NTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE M
Sbjct: 670  YLSNEFTIYLKKNKIKRQLTCPNTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAECM 729

Query: 733  RTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKC 792
            RTAAHVINKLPQ +LGF SP E LW MKP IS+ +VFGCVCY+FVPDHLR+KF+KKA +C
Sbjct: 730  RTAAHVINKLPQVRLGFKSPHEKLWRMKPAISHLKVFGCVCYIFVPDHLRTKFEKKAKRC 789

Query: 793  VFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDS-NIEEILQ----Q 852
            +FVGYD+ RKGWRCCDPT+GK + SR++VFDEAS+WWS +K+ + +S ++EE+++    Q
Sbjct: 790  IFVGYDDARKGWRCCDPTTGKCHISRNIVFDEASSWWSPKKEEIPESPDVEEVIEEERDQ 849

Query: 853  KLGEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYAN 912
            KL   +   +S+   +++P    I + E  Q+ E D      Q+LRRS R R+PNP+YAN
Sbjct: 850  KLETPSEGERSSPSKTKSPWKTGIHQPEEPQTEEHD------QELRRSTRPRKPNPRYAN 909

Query: 913  AAIVEDRV-YEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYK 972
            AA+ ++ +  EP +YEEA++   WQKAMEEEI AL+ NQTW+LVP+  D+KP+SCKWVYK
Sbjct: 910  AALADESLPIEPSSYEEAARGPEWQKAMEEEIKALKENQTWDLVPKPKDVKPISCKWVYK 969

Query: 973  IKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMD 1032
            +K R DGSIERYKARLVARGFSQ+YGLDY+ETFSPVAKITTVRVLLALAASK W+LWQMD
Sbjct: 970  VKTRTDGSIERYKARLVARGFSQEYGLDYEETFSPVAKITTVRVLLALAASKSWELWQMD 1029

Query: 1033 VKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG 1092
            VKNAFLHGELD+EIYM QPKGFES   P +VCKL+KALYGLKQAPRAWYGKI EFL  +G
Sbjct: 1030 VKNAFLHGELDKEIYMEQPKGFESKKYPEHVCKLKKALYGLKQAPRAWYGKIGEFLVHNG 1089

Query: 1093 YSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKH 1152
            + VA +DSSLF+  +EG L IVLVYVDDLIITGD   EI +TRENLS+RFQMKELGEL+H
Sbjct: 1090 FKVAPSDSSLFVMAKEGRLAIVLVYVDDLIITGDYSEEIERTRENLSVRFQMKELGELRH 1149

Query: 1153 FLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTY 1212
            FLGLEV+ T  G+FL QQKY +D+L+++ ML+CK +STPM+ NA++  H+GK L D T Y
Sbjct: 1150 FLGLEVEHTKNGIFLGQQKYAKDLLKRYGMLDCKPISTPMDPNARLQEHKGKNLEDATMY 1209

Query: 1213 RQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSK 1272
            RQLVGSLIYLTL+RPDISYAVGV SRYM +PKKPHLDA RRILRY+KGT++YG+LYK++K
Sbjct: 1210 RQLVGSLIYLTLSRPDISYAVGVASRYMSTPKKPHLDAIRRILRYVKGTLNYGILYKKTK 1269

Query: 1273 DCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAA 1332
            +C+++GYCDADYAGD DTRRSTTGY+F  GSG I+WCSKRQPTV+LS+TEAEYR+AA AA
Sbjct: 1270 ECQVIGYCDADYAGDCDTRRSTTGYLFSLGSGAITWCSKRQPTVALSSTEAEYRSAAAAA 1329

Query: 1333 QESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKE 1345
            QESTWLK LMEDLHQ     + + CDN S IRLAENPVFHARTKH+EVHYH+IREKVLK 
Sbjct: 1330 QESTWLKQLMEDLHQTPKDQVWIFCDNLSTIRLAENPVFHARTKHIEVHYHYIREKVLKG 1389

BLAST of CmaCh19G005910 vs. TrEMBL
Match: I1H466_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 1749.2 bits (4529), Expect = 0.0e+00
Identity = 877/1418 (61.85%), Postives = 1075/1418 (75.81%), Query Frame = 1

Query: 13   GIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFAL 72
            GI+KLN++NY  W TCM SYLQGQDLWE+V G E  PPE  + DALRKW+IK+GKAMF L
Sbjct: 10   GIRKLNSHNYGYWQTCMESYLQGQDLWEVVAGTEAFPPE--NADALRKWKIKSGKAMFVL 69

Query: 73   KTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRDMTIAQYFHK 132
            KTTI E++LEHI D+KTPKEAW+T   LFS+KN+ RLQLLENEL  ISQ +++I+QYF K
Sbjct: 70   KTTIEEDLLEHIRDEKTPKEAWETLAKLFSRKNEARLQLLENELAGISQGNLSISQYFSK 129

Query: 133  VKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLASQ 192
            VK ICREI++L P   + E+RMKRIIIHGLRPEY  F+AAV GWPT PS+VE ENLL +Q
Sbjct: 130  VKFICREISQLAPDEKVSEARMKRIIIHGLRPEYNGFMAAVMGWPTPPSVVELENLLVNQ 189

Query: 193  EAMAKQMGGFTLKGEE-ALYTSESQSNNR---------------PSTRRGYNGDKRRSHQ 252
            E +AK+MG  T+K EE AL+T +   + +               P  R   +G  +R  Q
Sbjct: 190  EELAKKMGSITIKEEEEALFTKKKSPHQKQWKPKPKWTEGGKEHPKERSNSSGGDQRERQ 249

Query: 253  GIAQPER-AQKNDNKSFQRTRFGGI-------------CYNCGKKGHMSRDCWSRKKTIE 312
            G    E+ A++  +  F   + G               C NC KKGH++R+C   ++  E
Sbjct: 250  GPKWYEKNARRPSDGCFNCGKAGHFARECRFPRRSNDGCSNCDKKGHLTRECRYPRRRYE 309

Query: 313  NNVAISKKK---------IEDEWDAEV------------------LCPIEEDEL------ 372
             NVA +K+K          E+EWDAE                      ++E+EL      
Sbjct: 310  GNVATTKEKEEITLEVSMSEEEWDAEAGYSQEVDIEDLEEDMEVPALAMDEEELEEDMEP 369

Query: 373  -----------------ALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSR 432
                             AL A  +  INY++DWIVDSGCSNHMT D  KLQD  EYKG +
Sbjct: 370  PVFATDEEELEKDMEASALAAIKDPKINYKDDWIVDSGCSNHMTNDDNKLQDMTEYKGGK 429

Query: 433  MVRTANNTQLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFG 492
            MV TANN++LSI+ +G T + P      + L  VYHVPG+KKNLLSV QLT  G YVLFG
Sbjct: 430  MVLTANNSKLSISHVGKTAV-PRYGPQQLQLERVYHVPGLKKNLLSVPQLTAEGKYVLFG 489

Query: 493  PEDVKVYQDVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKL 552
            P++V +++ +K+IG P +EG++ +S++VLSAESAYVDKTRKNET DLWHARLGH+SYHKL
Sbjct: 490  PQEVAIFRRLKVIGTPIMEGKKRDSLFVLSAESAYVDKTRKNETADLWHARLGHVSYHKL 549

Query: 553  KLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQ 612
            K +MEK ++KGLP L+++TD+VCAGCQYGKAHQLPYKES  ++K PLEL+HSD+FGPVKQ
Sbjct: 550  KEMMEKHVVKGLPDLDIRTDMVCAGCQYGKAHQLPYKESQHQSKVPLELIHSDVFGPVKQ 609

Query: 613  ASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGE 672
             S+ GMRYMVTFIDD+SRYVW++FMKEKS+TF KF+EFK M+EGE+  KIRCLR+DNG E
Sbjct: 610  ISLGGMRYMVTFIDDFSRYVWVYFMKEKSETFMKFKEFKDMIEGELEYKIRCLRTDNGRE 669

Query: 673  YTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAM 732
            Y S+EF  YL +  I+RQ TC NTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE M
Sbjct: 670  YLSNEFTIYLKKNKIKRQLTCPNTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAECM 729

Query: 733  RTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKC 792
            RTAAHVINKLPQ +LGF SP E LW MKP IS+ +VFGCVCY+FVPDHLR+KF+KKA +C
Sbjct: 730  RTAAHVINKLPQVRLGFKSPHEKLWRMKPAISHLKVFGCVCYIFVPDHLRTKFEKKAKRC 789

Query: 793  VFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDS-NIEEILQ----Q 852
            +FVGYD+ +KGWRCCDPT+GK +TSR++VFDEAS+WWS +K+ + +S ++EE+++    Q
Sbjct: 790  IFVGYDDAQKGWRCCDPTTGKCHTSRNIVFDEASSWWSPKKEEIPESPDVEEVIEEERDQ 849

Query: 853  KLGEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYAN 912
            KL   +   +S+   +++P    I + E  Q+ E D      Q+LRRS R R+PNP+YAN
Sbjct: 850  KLETPSEGERSSPSKTKSPWKTGIHQPEEPQTEEHD------QELRRSTRPRKPNPRYAN 909

Query: 913  AAIVEDRV-YEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYK 972
            AA+ ++ +  EP +YEEA++   WQKAMEEEI AL+ NQTW+LVP+  D+KP+SCKWVYK
Sbjct: 910  AALADESLPIEPSSYEEAARGPEWQKAMEEEIKALKENQTWDLVPKPKDVKPISCKWVYK 969

Query: 973  IKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMD 1032
            +K R DGSIERYKARLVARGFSQ+YGLDY+ETFSPVAKITTVRVLLALAASK W+LWQMD
Sbjct: 970  VKTRTDGSIERYKARLVARGFSQEYGLDYEETFSPVAKITTVRVLLALAASKSWELWQMD 1029

Query: 1033 VKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG 1092
            VKNAFLHGELD+EIYM QPKGFES   P +VCKL+KALYGLKQAPRAWYGKI EFL  +G
Sbjct: 1030 VKNAFLHGELDKEIYMEQPKGFESKKYPEHVCKLKKALYGLKQAPRAWYGKIGEFLVHNG 1089

Query: 1093 YSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKH 1152
            + VA +DSSLF+  +EG L IVLVYVDDLIITGD   EI +TRENLS+RFQMKELGEL+H
Sbjct: 1090 FKVAPSDSSLFVMAKEGRLAIVLVYVDDLIITGDYSEEIERTRENLSVRFQMKELGELRH 1149

Query: 1153 FLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTY 1212
            FLGLEV+ T  G+FL QQKY +D+L+++ ML+CK +STPM+ NA++   +GK L D T Y
Sbjct: 1150 FLGLEVEHTKNGIFLGQQKYAKDLLKRYGMLDCKPISTPMDPNARLQEDKGKNLEDATMY 1209

Query: 1213 RQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSK 1272
            RQLVGSLIYLTL+RPDISYAVGV SRYM +PKKPHLDA RRILRY+KGT++YG+LYK++K
Sbjct: 1210 RQLVGSLIYLTLSRPDISYAVGVASRYMSTPKKPHLDAIRRILRYVKGTLNYGILYKKTK 1269

Query: 1273 DCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAA 1332
            +C+++GYCDADYAGD DTRRSTTGY+F  GSG I+WCSKRQPTV+LS+TEAEYR+AA AA
Sbjct: 1270 ECQVIGYCDADYAGDCDTRRSTTGYLFSLGSGAITWCSKRQPTVALSSTEAEYRSAAAAA 1329

Query: 1333 QESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKE 1345
            QESTWLK LMEDLHQ     + + CDN + IRLAENPVFHARTKH+EVHYH+IRE VLK 
Sbjct: 1330 QESTWLKQLMEDLHQTPKDQVWIFCDNLTTIRLAENPVFHARTKHIEVHYHYIRENVLKG 1389

BLAST of CmaCh19G005910 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 419.1 bits (1076), Expect = 1.0e-116
Identity = 230/578 (39.79%), Postives = 341/578 (59.00%), Query Frame = 1

Query: 778  SNFDASENPSDIDIDKQEVTQSSESDKN-ETTHQQLRRSNRIR----------------- 837
            S+ DAS + S IDI      Q+   + +  T+H++ R+   ++                 
Sbjct: 3    SDADASTSSSSIDIMPSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIHDISQ 62

Query: 838  -----RPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLG 897
                 + +P Y +  +   +  EP TY EA +  VW  AM++EI A+E   TWE+     
Sbjct: 63   FLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPP 122

Query: 898  DIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLAL 957
            + KP+ CKWVYKIK   DG+IERYKARLVA+G++QQ G+D+ ETFSPV K+T+V+++LA+
Sbjct: 123  NKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAI 182

Query: 958  AASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAAN----PNYVCKLRKALYGLKQA 1017
            +A  ++ L Q+D+ NAFL+G+LD EIYM  P G+ +       PN VC L+K++YGLKQA
Sbjct: 183  SAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQA 242

Query: 1018 PRAWYGKIAEFLTQSGYSVAHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRE 1077
             R W+ K +  L   G+  +H+D + F+K        VLVYVDD+II  +++  + + + 
Sbjct: 243  SRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKS 302

Query: 1078 NLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINA 1137
             L   F++++LG LK+FLGLE+ R+  G+ +CQ+KY  D+L +  +L CK  S PM+ + 
Sbjct: 303  QLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSV 362

Query: 1138 KICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILR 1197
               AH G +  D   YR+L+G L+YL +TR DIS+AV  +S++ ++P+  H  A  +IL 
Sbjct: 363  TFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILH 422

Query: 1198 YIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTV 1257
            YIKGT+  GL Y    + +L  + DA +    DTRRST GY    G+  ISW SK+Q  V
Sbjct: 423  YIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVV 482

Query: 1258 SLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTK 1317
            S S+ EAEYRA + A  E  WL     +L   +  P  L CDN +AI +A N VFH RTK
Sbjct: 483  SKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTK 542

Query: 1318 HVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLN 1329
            H+E   H +RE+ + +       +  D+  D FT+ L+
Sbjct: 543  HIESDCHSVRERSVYQATLSYSFQAYDE-QDGFTEYLS 579

BLAST of CmaCh19G005910 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 193.7 bits (491), Expect = 7.2e-49
Identity = 99/224 (44.20%), Postives = 136/224 (60.71%), Query Frame = 1

Query: 1028 VLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYT 1087
            +L+YVDD+++TG     +      LS  F MK+LG + +FLG+++     GLFL Q KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1088 RDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAV 1147
              +L    ML+CK +STP+ +         K   D + +R +VG+L YLTLTRPDISYAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAK-YPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1148 GVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRS 1207
             ++ + M  P     D  +R+LRY+KGTI +GL   ++    +  +CD+D+AG   TRRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1208 TTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTW 1252
            TTG+    G   ISW +KRQPTVS S+TE EYRA A  A E TW
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmaCh19G005910 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 94.7 bits (234), Expect = 4.5e-19
Identity = 50/118 (42.37%), Postives = 75/118 (63.56%), Query Frame = 1

Query: 818 IRRPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIK 877
           I + NPKY+   I      EP++   A ++  W +AM+EE+ AL  N+TW LVP   +  
Sbjct: 9   INKLNPKYS-LTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQN 68

Query: 878 PVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALA 936
            + CKWV+K K   DG+++R KARLVA+GF Q+ G+ + ET+SPV +  T+R +L +A
Sbjct: 69  ILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmaCh19G005910 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 75.9 bits (185), Expect = 2.2e-13
Identity = 31/78 (39.74%), Postives = 52/78 (66.67%), Query Frame = 1

Query: 1134 IYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGY 1193
            +YLT+TRPD+++AV  +S++  + +   + A  ++L Y+KGT+  GL Y  + D +L  +
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1194 CDADYAGDHDTRRSTTGY 1212
             D+D+A   DTRRS TG+
Sbjct: 61   ADSDWASCPDTRRSVTGF 78

BLAST of CmaCh19G005910 vs. TAIR10
Match: ATMG00300.1 (ATMG00300.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 65.5 bits (158), Expect = 2.9e-10
Identity = 35/103 (33.98%), Postives = 57/103 (55.34%), Query Frame = 1

Query: 430 IEGRRVESVYVLSAE----SAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLP 489
           ++G R +S+Y+L        + + +T K+ET  LWH+RL H+S   ++L+++K  L    
Sbjct: 39  LKGNRHDSLYILQGSVETGESNLAETAKDETR-LWHSRLAHMSQRGMELLVKKGFLDSSK 98

Query: 490 QLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFG 529
              +K    C  C YGK H++ +       K PL+ VHSDL+G
Sbjct: 99  VSSLK---FCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137

BLAST of CmaCh19G005910 vs. NCBI nr
Match: gi|147794801|emb|CAN71427.1| (hypothetical protein VITISV_027864 [Vitis vinifera])

HSP 1 Score: 1808.9 bits (4684), Expect = 0.0e+00
Identity = 917/1354 (67.73%), Postives = 1065/1354 (78.66%), Query Frame = 1

Query: 5    MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
            M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T P+ ED+N  LRKW+I
Sbjct: 1    MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPKVEDANGILRKWKI 60

Query: 65   KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
            KAGKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+EL S++QRD
Sbjct: 61   KAGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELFSVAQRD 120

Query: 125  MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
            +TIAQYFHKVK++CREI+ELD ++ I E+ MKRIIIHGLRPE+R F+AA+QGW  QPSLV
Sbjct: 121  LTIAQYFHKVKTLCREISELDLEAPIGETXMKRIIIHGLRPEFRGFVAAIQGWQNQPSLV 180

Query: 185  EFENLLASQEAMAKQMGGFTLKGEE-ALYTSESQSNNRPST--RRGYNGDKRRSHQGIAQ 244
            EFENLLA QEA+AKQMGG +LKGEE ALY  +   N+   T  R   N DK +  QG  +
Sbjct: 181  EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGGWNSXQHTVRRTKKNEDKAKCSQG-ER 240

Query: 245  PERAQKNDNKSFQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEV 304
              R + +        +F G CYNC KKGHM++DCWS+K  +E+N   SK   EDEWDA+ 
Sbjct: 241  SARVEGDSKNPGTXKKFEGKCYNCXKKGHMAKDCWSKKGLVESNATTSKS--EDEWDAQA 300

Query: 305  LCPIEEDELALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNT 364
                   E A +AT  + I+YE DWI+DSGCSNHMTGD+ KLQD  EYKG  MV TANN+
Sbjct: 301  FFAAI-GESAFIATTSEQIDYEKDWIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNS 360

Query: 365  QLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQ 424
            +L IA IGNT +     ++ VSL NVYHVPG+KKNLLSV+QLT+SG  VLFGP+DVKVY 
Sbjct: 361  KLPIAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFGPQDVKVYH 420

Query: 425  DVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSM 484
            D++++ +P I+GRR+ESVYV+SAE+AYVDKTRKNET DLWH RL HISY KL ++M+KSM
Sbjct: 421  DLEVMEEPVIKGRRLESVYVMSAETAYVDKTRKNETADLWHMRLSHISYSKLTMMMKKSM 480

Query: 485  LKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRY 544
            LKGLPQLEV+   +CA CQYGKAHQLPY+ES +KAK PLEL+HSD+FGPVKQAS+SGM  
Sbjct: 481  LKGLPQLEVRKXTICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGM-- 540

Query: 545  MVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQ 604
                     +Y+  F      D FS+    +M                    +TS E  +
Sbjct: 541  ---------KYMVTFI-----DDFSRRVYLQMSF------------------FTSSENXE 600

Query: 605  YLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVIN 664
            Y         FTCANTPQQNGV ERKNRHLAE CRSMLHAKNVPG FWAE M+TAA VIN
Sbjct: 601  YAIS------FTCANTPQQNGVXERKNRHLAEICRSMLHAKNVPGXFWAEXMKTAAFVIN 660

Query: 665  KLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQ 724
            +LPQ +L F SPFE LW++KPT+SYFRVFGCVCYVFVP+HLRSK DKKAV+CV VGYD+Q
Sbjct: 661  RLPQQRLNFSSPFEKLWNIKPTVSYFRVFGCVCYVFVPNHLRSKMDKKAVRCVLVGYDSQ 720

Query: 725  RKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNF 784
            RK WRCCDPT+GK YTSR+VVFDE+S+WWSSEK++L DS+   + + +L  Q+ +IQ + 
Sbjct: 721  RKXWRCCDPTTGKCYTSRNVVFDESSSWWSSEKEILXDSB---VFKDEL--QSARIQLSL 780

Query: 785  DASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAAIVED-RVYEPE 844
              +EN  D DI   + TQS      +T         R ++PNPKYAN AIVED    EP 
Sbjct: 781  GEAENAXDGDIG-DDXTQSPW----QTGVHGQPSEERTKKPNPKYANVAIVEDANAKEPX 840

Query: 845  TYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYK 904
            T+ EA QNS W KAM EEI AL+ NQTWELVP+  D++P SCKWVYKIKRR DGSIER+K
Sbjct: 841  TFAEAFQNSDWSKAMXEEIAALKRNQTWELVPKPRDVEPXSCKWVYKIKRRTDGSIERHK 900

Query: 905  ARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDRE 964
            A LVARGFSQQYGLDYDETFSPV K+TTVRVLLALAA+KDW LWQMDVKNAFLHGELDRE
Sbjct: 901  AXLVARGFSQQYGLDYDETFSPVXKLTTVRVLLALAANKDWDLWQMDVKNAFLHGELDRE 960

Query: 965  IYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIK 1024
            IYMNQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV  ADSSLF+K
Sbjct: 961  IYMNQPMGFQSQGHPEYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVTPADSSLFVK 1020

Query: 1025 EREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGL 1084
               G L IVLVYVDDLIITGDD  EI++T+ENLS+RF+MKELG+LKHFLGLEVDRT+EG+
Sbjct: 1021 ANGGKLAIVLVYVDDLIITGDDVEEIFRTKENLSVRFEMKELGQLKHFLGLEVDRTNEGI 1080

Query: 1085 FLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLT 1144
            FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQLVGSL+YLTLT
Sbjct: 1081 FLCQQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQLVGSLLYLTLT 1140

Query: 1145 RPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYA 1204
             PDISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +DCKLVGYCDADYA
Sbjct: 1141 XPDISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKXEDCKLVGYCDADYA 1200

Query: 1205 GDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDL 1264
            GDHDTR STTGYVF  GSG ISWCSKRQPTVSLSTTEAEYRAAA A QES WL  LM DL
Sbjct: 1201 GDHDTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLMNDL 1260

Query: 1265 HQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQV 1324
            HQ +DY + L CDNQSA+RLAENPVFHARTKHVEVHYHFIREKVLKEE+E+ QIK++DQV
Sbjct: 1261 HQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSEDQV 1300

Query: 1325 ADLFTKGLNTSKHESFRCQLNMMQRMRTSAEGEC 1354
            ADLFTKGL+ SK ESF  QL M++ +    EGEC
Sbjct: 1321 ADLFTKGLSGSKFESFCHQLGMVKILEADVEGEC 1300

BLAST of CmaCh19G005910 vs. NCBI nr
Match: gi|147810137|emb|CAN73532.1| (hypothetical protein VITISV_012827 [Vitis vinifera])

HSP 1 Score: 1037.3 bits (2681), Expect = 2.3e-299
Identity = 549/939 (58.47%), Postives = 677/939 (72.10%), Query Frame = 1

Query: 5   MSDFQIVGGIKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE-EDSNDALRKWRI 64
           M D Q++GGIKKLNN NYNTW+TCMMSY+QGQDLWE+V G E T PE ED N  LRKW+I
Sbjct: 1   MGDLQVIGGIKKLNNQNYNTWSTCMMSYMQGQDLWEVVNGSEITQPEAEDVNGILRKWKI 60

Query: 65  KAGKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQRD 124
           K GKAMFALKTTI E++LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+ELLSI+QRD
Sbjct: 61  KXGKAMFALKTTIEEDVLEHIRDAKTPYEAWNTFTKLFSKKNDTRLQLLESELLSIAQRD 120

Query: 125 MTIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLV 184
           +TI  YFHKVK++CREI+ELD ++ I E+RMKRIIIHGLRP++R F+AA+QGW  QPSLV
Sbjct: 121 LTITHYFHKVKTLCREISELDLEAPIGETRMKRIIIHGLRPKFRGFVAAIQGWQNQPSLV 180

Query: 185 EFENLLASQEAMAKQMGGFTLKGEE-ALYTSESQSNNRPST--RRGYNGDKRRSHQGIAQ 244
           EFENLLA QEA+AKQMGG +LKGEE ALY  + + N++  T  R   N DK +  QG  +
Sbjct: 181 EFENLLAGQEALAKQMGGVSLKGEEEALYAHKGRWNSKQHTVGRTKKNEDKAKXSQG-ER 240

Query: 245 PERAQKNDNKSFQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEV 304
             R + +      R +  G CYNCGKKGHM++DCWS+K  +E+N A S+   E+EWDA+ 
Sbjct: 241 SARVEGDSKNPXTRKKXEGKCYNCGKKGHMAKDCWSKKGLVESNAATSES--EBEWDAQA 300

Query: 305 LCPIEEDELALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNT 364
              +   E   +AT  + I+YE DWI+DSGCSNHMTGD+ KL D  EYKG  MV T NN+
Sbjct: 301 FF-VAXGESXFIATTSEQIDYEKDWIIDSGCSNHMTGDKEKLXDLSEYKGRHMVVTXNNS 360

Query: 365 QLSIAQIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPEDVKVYQ 424
           ++ IA IGNT +     ++ VSL NVYHVPG+KKNLLSV+QLT+SG +VLFGP+DVKVY+
Sbjct: 361 KJPIAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHFVLFGPQDVKVYR 420

Query: 425 DVKIIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSM 484
           D++I+ +P I   R+ESVYV+SAE+A VDKTRKNETTDL H RL H+SY KL ++M+KSM
Sbjct: 421 DLEIMEEPVIXRWRLESVYVMSAETAXVDKTRKNETTDLXHMRLSHVSYSKLTVMMKKSM 480

Query: 485 LKGLPQLEVKTDVVCAGCQYGKAHQLPYKESSFKAKKPLELVHSDLFGPVKQASISGMRY 544
           LKGLPQLEV+ D +CAGC YGKAHQLPY+ES +K K PLEL+HSD+FGPVK AS+SGM+ 
Sbjct: 481 LKGLPQLEVRKDTICAGCXYGKAHQLPYEESKWKTKGPLELIHSDVFGPVKXASLSGMK- 540

Query: 545 MVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVE-GEVGAKIRCLRSDNGGEYTSDEFD 604
                     Y+  F      D FS++     M E  E  +K +             EF 
Sbjct: 541 ----------YMXTFI-----DDFSRYVWVHFMKEKSETFSKFK-------------EFK 600

Query: 605 QYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVI 664
           + + E  + ++  C      NG              SMLH KNVPGRFW EAM+TAA VI
Sbjct: 601 E-MTEAEVDKRIRCLRX--DNGG------------ESMLHXKNVPGRFWVEAMKTAAFVI 660

Query: 665 NKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDN 724
           N+LPQ +L F SPFE LW++KP +SYFRVFGCVCY FVP+H RSK DKK V+CV VGYD+
Sbjct: 661 NRLPQQRLNFSSPFEKLWNIKPIVSYFRVFGCVCYAFVPNHXRSKMDKKXVRCVLVGYDS 720

Query: 725 QRKGWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSN 784
           Q K WRCC+PT+GKYYTSR+VVFDE+S+WWSSEK++L DS   ++ + +L  Q+ +IQ +
Sbjct: 721 QXKRWRCCNPTTGKYYTSRNVVFDESSSWWSSEKEILPDS---DVFKDEL--QSARIQLS 780

Query: 785 FDASENPSDIDIDKQEVTQS--------SESDKNETTHQQ----LRRSNRIRRPNPKYAN 844
              +EN +D DI   E TQS          S++ E +  +    LRRS R ++PNPKYAN
Sbjct: 781 LGEAENAADGDIGDDE-TQSPWQTGVHGQPSEEGEPSEIEAPIPLRRSARTKKPNPKYAN 840

Query: 845 AAIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYK 904
            AIVED    EPET+ EA QN  W KAM+EEI AL+ NQTWELVP+  D++P+SCKWVYK
Sbjct: 841 VAIVEDANTKEPETFAEAFQNPDWSKAMKEEIAALKRNQTWELVPKPRDVEPISCKWVYK 885

Query: 905 IKRRPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKI 926
           IKRR  GSIE +KARLVARGFSQQYGLDYDETFS VAK+
Sbjct: 901 IKRRTXGSIEMHKARLVARGFSQQYGLDYDETFSXVAKL 885

BLAST of CmaCh19G005910 vs. NCBI nr
Match: gi|147817226|emb|CAN75363.1| (hypothetical protein VITISV_026292 [Vitis vinifera])

HSP 1 Score: 913.3 bits (2359), Expect = 5.0e-262
Identity = 472/753 (62.68%), Postives = 547/753 (72.64%), Query Frame = 1

Query: 531  KQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNG 590
            KQAS+SGM+YMVTFI+D+S YVW++FMKEKS+TFSK++EFK M E +V  +IRCLR+DNG
Sbjct: 461  KQASLSGMKYMVTFINDFSNYVWVYFMKEKSETFSKYKEFKEMTEVKVDKRIRCLRTDNG 520

Query: 591  GEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAE 650
            GEYTSDEF  +L EC +R QFTCANTPQQN VAERKNRHLAE CRSMLHAKNVP RFWAE
Sbjct: 521  GEYTSDEFFYFLRECRVRHQFTCANTPQQNSVAERKNRHLAEICRSMLHAKNVPRRFWAE 580

Query: 651  AMRTAAHVINKLPQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAV 710
            AM+T A VIN+LPQ KL F SPFE LW++KPTISYFRVFGCVCYVFVP+HLRSK DKK +
Sbjct: 581  AMKTVAFVINRLPQQKLNFSSPFEKLWNIKPTISYFRVFGCVCYVFVPNHLRSKMDKKEI 640

Query: 711  KCVFVGYDNQRKGWRC-CDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKL 770
                  + ++ +  R        +   + D+  DE  + W +    +     EE    + 
Sbjct: 641  LPDSDVFKDELQSARIQLSLGEXENAANGDIXDDETQSPWQTG---VHGQXSEEGEPSET 700

Query: 771  GEQTTQIQSNFDASENPSDIDIDKQEVTQSSESDKNETTHQQLRRSNRIRRPNPKYANAA 830
                   +S      NP   ++    + + + + + ET  +  +        NP ++ A 
Sbjct: 701  EAPIPLRRSARTKKPNPKYANV---AIVEDANAKEPETFAEAFQ--------NPDWSKAM 760

Query: 831  IVEDRVYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKR 890
                     E      +N  W+                  VP+  D++P+SCKWVYKIKR
Sbjct: 761  --------KEEIAALKRNQTWEL-----------------VPKXRDVEPISCKWVYKIKR 820

Query: 891  RPDGSIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKN 950
            R DG IER+KARLVARGFSQQYGLDYDETFSPVAK+TTVRVLLALAA+KDW L QMDVKN
Sbjct: 821  RTDGLIERHKARLVARGFSQQYGLDYDETFSPVAKLTTVRVLLALAANKDWDLRQMDVKN 880

Query: 951  AFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV 1010
            AFLHGELDREIYMNQP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYS+
Sbjct: 881  AFLHGELDREIYMNQPMGFQSQGHPKYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSI 940

Query: 1011 AHADSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLG 1070
              ADSSLF+K   G L IVL Y                  ENLS+RF+MKELG+LKHFLG
Sbjct: 941  TPADSSLFVKANGGKLAIVLAY------------------ENLSVRFEMKELGQLKHFLG 1000

Query: 1071 LEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQL 1130
            LEVDRT EG+FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YRQL
Sbjct: 1001 LEVDRTHEGIFLCQQKYAKDLLKKFGMLECKPISTPMEPNAKMCEHEGKDLKDATMYRQL 1060

Query: 1131 VGSLIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCK 1190
            VGSL+YLT TR DISYAVGVMSRYMQ+PKKPHL+A RRILR++KGTIDYGLLYK+ +D K
Sbjct: 1061 VGSLLYLTFTRTDISYAVGVMSRYMQNPKKPHLEAVRRILRHVKGTIDYGLLYKKGEDYK 1120

Query: 1191 LVGYCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQES 1250
            LVGYCDADY GDHDTRRSTTGYVF  GS  ISWCSKRQPTVSLST E EYRAA  AAQES
Sbjct: 1121 LVGYCDADYVGDHDTRRSTTGYVFMLGSRAISWCSKRQPTVSLSTMEXEYRAAPMAAQES 1156

Query: 1251 TWLKLLMEDLHQKIDYPISLLCDNQSAIRLAEN 1283
            TWL  LM DLHQ +DY   L CDNQSA+RLAEN
Sbjct: 1181 TWLIRLMNDLHQXVDYAXPLYCDNQSAVRLAEN 1156

BLAST of CmaCh19G005910 vs. NCBI nr
Match: gi|47824985|gb|AAT38758.1| (Putative gag-pol polyprotein, identical [Solanum demissum])

HSP 1 Score: 870.2 bits (2247), Expect = 4.9e-249
Identity = 511/1349 (37.88%), Postives = 778/1349 (57.67%), Query Frame = 1

Query: 21   NYNTWATCMMSYLQGQDLWEIVGGCETTPPEEDSNDALRKWRIKAGKAMFALKTTIGEEM 80
            NY  W+  M +  + Q+LW+IV   ET  PE ++N  +R+ R +  KA+F ++  + +E+
Sbjct: 21   NYQFWSLKMKTLFKSQELWDIV---ETGIPEGNANQ-MREHRKRDSKALFTIQQALDDEI 80

Query: 81   LEHIWDDKTPKEAWDTFVMLF---SKKNDTRLQLLENELLSISQRDMTIAQ-YFHKVKSI 140
               I   +T K+AW+     +    K    +LQ L  +  ++   +    Q Y  +  +I
Sbjct: 81   FPRISAVETSKQAWEILKQEYFGDDKVITVKLQTLRRDFETLFMNENESVQGYLSRTSAI 140

Query: 141  CREITELDPK--SAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPSLVEFENLLASQEA 200
               +     K  + IV S+    ++  L  ++   + A++      S   F+ L++S  A
Sbjct: 141  VNRMRSYGEKIDNQIVVSK----VLRSLTTKFEHVVTAIEE-SKDLSTYSFDELMSSLLA 200

Query: 201  MAKQMG---------GFTLKGEEALYTSESQSNNRPSTRRGYNGDKRRSHQGIAQPERAQ 260
               ++           F +KGE   ++ + ++ N      G    + R   G  +     
Sbjct: 201  HEDRLNRSREKVQEKAFQVKGE---FSYKGKAENSAGRGHGRGNFRGRGRGGSGRG---- 260

Query: 261  KNDNKSFQRTRFGGICYNCGKKGHMSRDCWSRKKTIENNVAISKKKIEDEWDAEVLCPIE 320
            +N    F++ +    C  C K GH   DCW+++K             +++ DA     +E
Sbjct: 261  RNQVGEFRQYKSNIQCRYCKKFGHKEVDCWTKQK-------------DEQKDANFTQNVE 320

Query: 321  EDELALMATMEDHINYENDWIVDSGCSNHMTGDQRKLQDKKEYKGSRMVRTANNTQLSIA 380
            E+    MA+ +   +    W +DSGCSNHM+  +   +D  E + S  VR  ++ Q+ I 
Sbjct: 321  EESKLFMASSQITESANAVWFIDSGCSNHMSSSKSLFRDLDESQKSE-VRLGDDKQVHIE 380

Query: 381  QIGNTTIMPGNKSDTVSLHNVYHVPGIKKNLLSVSQLTTSGSYVLFGPE--DVKVYQDVK 440
              G   I    + +   L++V +VP +  NLLSV QL TSG  V+F     D+K  +  +
Sbjct: 381  GKGTVEIKTV-QGNVKFLYDVQYVPTLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGR 440

Query: 441  IIGKPTIEGRRVESVYVLSAESAYVDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKG 500
             I +  +   ++  + + +  ++ +    KNET +LWH R GH++ + LKL+++K M+ G
Sbjct: 441  TIARVPMTQNKMFPLDISNVGNSALVVKEKNET-NLWHLRYGHLNVNWLKLLVQKDMVIG 500

Query: 501  LPQLEVKTDVVCAGCQYGKAHQLPYKES-SFKAKKPLELVHSDLFGPVKQASISGMRYMV 560
            LP   +K   +C GC YGK  +  +    S++A   LELVH+DL GP+K  S+ G RY +
Sbjct: 501  LPN--IKELDLCEGCIYGKQTRKSFPVGKSWRATTCLELVHADLCGPMKMESLGGSRYFL 560

Query: 561  TFIDDYSRYVWIFFMKEKSDTFSKFQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYL 620
             F DDYSR+ W++F+K KS+TF  F++FK  VE + G KI+ LR+D GGE+ S++F+ + 
Sbjct: 561  MFTDDYSRFSWVYFLKFKSETFETFKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFC 620

Query: 621  HECGIRRQFTCANTPQQNGVAERKNRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKL 680
             E GIRR+ T   TP+QNGVAERKNR + E  RS L AK +P  FW EA+ T  + +N  
Sbjct: 621  EENGIRRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGEAVATVVYFLNIS 680

Query: 681  PQPKLGFVSPFEILWDMKPTISYFRVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRK 740
            P   +   +P E     KP +S+ R+FGC+ Y  V  H  SK D+K+ KC+FVGY  Q K
Sbjct: 681  PTKDVWNTTPLEAWNGKKPRVSHLRIFGCIAYALVNFH--SKLDEKSTKCIFVGYSLQSK 740

Query: 741  GWRCCDPTSGKYYTSRDVVFDEASTWWSSEKKVLSDSNIEEILQQKLGEQTTQIQSNFDA 800
             +R  +P SGK   SR+VVF+E  +W  +   ++S  NI+ +         T  +S  D 
Sbjct: 741  AYRLYNPISGKVIISRNVVFNEDVSWNFNSGNMMS--NIQLL--------PTDEESAVDF 800

Query: 801  SENPSDIDIDKQEVTQSSES-----DKNETTHQQLRRSNRIRRPNPKYANAAIVEDR--- 860
              +P+   +     +  + S     D++      LRRS R ++PNPKY+N      +   
Sbjct: 801  GNSPNSSPVSSSVSSPIAPSTTVAPDESSVEPIPLRRSTREKKPNPKYSNTVNTSCQFAL 860

Query: 861  -VYEPETYEEASQNSVWQKAMEEEIIALEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDG 920
             V +P  YEEA + S W+ AM EEI A+E N TWELV        +  KWV++ K   DG
Sbjct: 861  LVSDPICYEEAVEQSEWKNAMIEEIQAIERNSTWELVDAPEGKNVIGLKWVFRTKYNADG 920

Query: 921  SIERYKARLVARGFSQQYGLDYDETFSPVAKITTVRVLLALAASKDWKLWQMDVKNAFLH 980
            SI+++KARLVA+G+SQQ G+D+DETFSPVA+  TVRV+LALAA     ++Q DVK+AFL+
Sbjct: 921  SIQKHKARLVAKGYSQQQGVDFDETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLN 980

Query: 981  GELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHAD 1040
            G+L+ E+Y++QP+GF    N N V KLRKALYGLKQAPRAWY KI  F   SG+  +  +
Sbjct: 981  GDLEEEVYVSQPQGFMITGNENKVYKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNE 1040

Query: 1041 SSLFIKER-EGNLTIVLVYVDDLIITGDDEREIYQTRENLSIRFQMKELGELKHFLGLEV 1100
             +L++K++      +V +YVDD+I  G  +  +   + N+   F+M +LG LK+FLGLEV
Sbjct: 1041 PTLYLKKQGTDEFLLVCLYVDDMIYIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEV 1100

Query: 1101 DRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMEINAKICAHEGKELNDETTYRQLVGS 1160
             +  +G+F+ Q+KY  D+L+KF M+ C+  +TPM IN K+   +G E  +   +R LVG 
Sbjct: 1101 IQDKDGIFISQKKYAEDLLKKFQMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGG 1160

Query: 1161 LIYLTLTRPDISYAVGVMSRYMQSPKKPHLDAARRILRYIKGTIDYGLLYKRSKDCKLVG 1220
            L YLT TRPDI+++V V+SR++QSP K H  AA+R+LRY+ GT D+G+ Y ++ + +LVG
Sbjct: 1161 LNYLTHTRPDIAFSVSVVSRFLQSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVG 1220

Query: 1221 YCDADYAGDHDTRRSTTGYVFKFGSGTISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWL 1280
            + D+DYAG  D R+ST+G  F FGSG ++W SK+Q TV+LST+EAEY AA+ AA+++ WL
Sbjct: 1221 FTDSDYAGCLDDRKSTSGSCFSFGSGVVTWSSKKQETVALSTSEAEYTAASLAARQALWL 1280

Query: 1281 KLLMEDLHQKIDYPISLLCDNQSAIRLAENPVFHARTKHVEVHYHFIREKVLKEEIEMQQ 1340
            + L+ED   +      +  D++SAI +A+NP FH RTKH++V YHFIR  V    I ++ 
Sbjct: 1281 RKLLEDFSYEQKESTEIFSDSKSAIAMAKNPSFHGRTKHIDVQYHFIRTLVADGRIVLKF 1323

Query: 1341 IKTDDQVADLFTKGLNTSKHESFRCQLNM 1342
              T++Q AD+FTK L  +KHE FR QL +
Sbjct: 1341 CSTNEQAADIFTKSLPQAKHEYFRLQLGV 1323

BLAST of CmaCh19G005910 vs. NCBI nr
Match: gi|651219311|gb|AIC77183.1| (polyprotein [Gossypium barbadense])

HSP 1 Score: 852.4 bits (2201), Expect = 1.1e-243
Identity = 503/1394 (36.08%), Postives = 773/1394 (55.45%), Query Frame = 1

Query: 14   IKKLNNNNYNTWATCMMSYLQGQDLWEIVGGCETTPPE--------EDSNDALRKWRIKA 73
            + +L   NY  W+  M + L  QD WEIV      P +         D+  ALR+ R K 
Sbjct: 9    VPQLTKTNYGNWSIRMKALLGSQDCWEIVEKGYIEPGDAATEAALSNDAKKALREARKKD 68

Query: 74   GKAMFALKTTIGEEMLEHIWDDKTPKEAWDTFVMLFS---KKNDTRLQLLENELLSISQR 133
             KA+ ++   + E   E I D K  K AW+     F    K    RLQ L  E   +  +
Sbjct: 69   QKALNSIFQGMDESTFEKISDVKNAKNAWEILQKSFQGVEKAKKVRLQSLRAEFEMLKMK 128

Query: 134  DM-TIAQYFHKVKSICREITELDPKSAIVESRMKRIIIHGLRPEYRSFIAAVQGWPTQPS 193
                I  Y ++VKS+  E+        + E R+   I+  L  ++   + A++      S
Sbjct: 129  SSENIDDYANRVKSVVNEMKRNG--ETLDEVRVMEKILRSLTRKFEYVVVAIEE-SKDLS 188

Query: 194  LVEFENLLASQEAMAKQMG--------GFTLKGEEALYTSESQSN------NRPSTRRGY 253
             +  E L+ S +A  ++M            L  + ++   E+ +N      NR   R GY
Sbjct: 189  KMSLEELVGSLQAHEQKMKLNEDSENLNQALHSKLSIDDGETSNNFSQGRGNRRGYRGGY 248

Query: 254  NGDKR--RSHQGIAQPERAQKNDNKSFQRTRFGG-----------------ICYNCGKKG 313
             G  R  R  +G       +  +NK +Q +  G                   CYNC K G
Sbjct: 249  RGGNRGGRGSRGRGNQSYGRYQENKDYQTSNRGRGSRGRGRGRFQENKSQVQCYNCNKYG 308

Query: 314  HMSRDCWSRKKTIENNVAISKKKIEDEWDAEVLCPIEEDELALMATMEDHINYENDWIVD 373
            H S +C S  K  E N      +  ++ ++ V     E+E    +           W +D
Sbjct: 309  HFSYECRSTHKVDERNHVAVAAEGNEKVESSVFLTYGENEDRKRSV----------WYLD 368

Query: 374  SGCSNHMTGDQRKLQDKKEYKGSRMVRTANNTQLSIAQIGNTTIMPGNKSDTVSLHNVYH 433
            +G SNHM G +++L  + +      +   +N+   I   G   I   N  +   + +VY+
Sbjct: 369  NGASNHMCG-RKELFTELDETVHGQITFGDNSHAEIKGKGKVVITQRN-GEKKYISDVYY 428

Query: 434  VPGIKKNLLSVSQLTTSGSYVLFGPEDVKVY-QDVKIIGKPTIEGRRVESVYVLSAESAY 493
            VP +K NL+S+ QL   G  V      + +  +  +++ +  +   R+ ++ + S E   
Sbjct: 429  VPALKSNLISLGQLLEKGYEVHMKDRSLAIRNKSGELVVRVDMTRNRLFTLDIESGEVKC 488

Query: 494  VDKTRKNETTDLWHARLGHISYHKLKLIMEKSMLKGLPQLEVKTDVVCAGCQYGKAHQLP 553
            +    KNE+  LWH R GH+ +  LKL+ + +M+ GLP +    D +C  C  GK H+  
Sbjct: 489  MKTDLKNESW-LWHLRYGHLGFSGLKLLSKTNMVNGLPSIN-HPDQLCEACVKGKQHRQK 548

Query: 554  YKES-SFKAKKPLELVHSDLFGPVKQASISGMRYMVTFIDDYSRYVWIFFMKEKSDTFSK 613
            ++   S +A++PLE+VH+D+ GP    S+ G RY +TFIDDYSR  W++F+K KS+   K
Sbjct: 549  FEVGKSRRARRPLEIVHTDISGPYDIESLGGNRYYLTFIDDYSRKCWVYFLKAKSEALEK 608

Query: 614  FQEFKMMVEGEVGAKIRCLRSDNGGEYTSDEFDQYLHECGIRRQFTCANTPQQNGVAERK 673
            F+EFK MVE + G  ++ LRSD GGEYT+  ++ +  + GI  Q T   TPQQNGVAERK
Sbjct: 609  FKEFKAMVEKQSGRYLKILRSDRGGEYTAKLYESFCKDHGIIHQLTARRTPQQNGVAERK 668

Query: 674  NRHLAETCRSMLHAKNVPGRFWAEAMRTAAHVINKLPQPKLGFVSPFEILWDMKPTISYF 733
            NR + +  RSM+  K++P  FWAEA+  A +++N+ P   +   +P E     KP + + 
Sbjct: 669  NRTILDMARSMIKGKHLPRTFWAEAVECAVYLLNQCPTKSVRHKTPEEAWSGHKPRVGHL 728

Query: 734  RVFGCVCYVFVPDHLRSKFDKKAVKCVFVGYDNQRKGWRCCDPTSGKYYTSRDVVFDEAS 793
            ++FGC+ Y  VP+  R K D +  KC+F+GYD + K +R  +P + K   SRDV FDEA 
Sbjct: 729  KIFGCIAYAHVPEQQRKKLDDRGEKCIFIGYDKRSKAYRLYNPLTKKLIISRDVEFDEAD 788

Query: 794  TW-WSSEKKVLSDS--NIEEILQQKLGEQT---TQIQSNFDASENPSDIDIDKQEVTQSS 853
             W WS E+K +     N ++  Q++ G+     T   S+  +S   S +D +    T+S 
Sbjct: 789  YWRWSEEEKKVEGLFFNEDDNNQEEQGDDQSPGTTAPSSPTSSSGSSSLD-EAPTRTRSL 848

Query: 854  ESDKNETTHQQLRRSNRIRRPNPKYANAAIVEDRVYEPETYEEASQNSVWQKAMEEEIIA 913
                N T   + +           Y+   ++ +   +P TYEEA +N+ W+KAM+EEI A
Sbjct: 849  NDIYNSTEPVETQFD---------YSLFCLMTE--CDPVTYEEAIENNKWKKAMDEEIAA 908

Query: 914  LEHNQTWELVPRLGDIKPVSCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFS 973
            +  N TWEL        P+  KWVYK K   +G +E+YKARLVA+G+ Q+ G+DYDE F+
Sbjct: 909  IRRNDTWELTSLPEGHSPIGVKWVYKTKTNKEGKVEKYKARLVAKGYKQRQGVDYDEIFA 968

Query: 974  PVAKITTVRVLLALAASKDWKLWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKL 1033
            PVA+I T+R+L+A+AA   WK++QMDVK+AFL+G L+ E+Y+ QP G+      + V +L
Sbjct: 969  PVARIDTIRLLIAVAAQYKWKIYQMDVKSAFLNGYLEEEVYIEQPPGYSIQGKEDKVYRL 1028

Query: 1034 RKALYGLKQAPRAWYGKIAEFLTQSGYSVAHADSSLFIKERE-GNLTIVLVYVDDLIITG 1093
            +K+LYGLKQAPRAW  +I E+  ++G+  +  + +L+ K+   G++ IV +YVDD+I TG
Sbjct: 1029 KKSLYGLKQAPRAWNTRIDEYFRRNGFIKSPHEHTLYTKKNGYGDIMIVCLYVDDMIFTG 1088

Query: 1094 DDEREIYQTRENLSIRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLEC 1153
            ++       ++ ++  F+M ++GE+ +FLG+EV +  +G+F+ Q+KY   +L KF M +C
Sbjct: 1089 NNPGMSDDFKKAMTKEFEMTDIGEMSYFLGVEVKQMQDGIFVSQKKYAEQILNKFKMKDC 1148

Query: 1154 KQVSTPMEINAKICAHEGKELNDETTYRQLVGSLIYLTLTRPDISYAVGVMSRYMQSPKK 1213
            K V TP +   K+     +E  + T ++ LVGSL YLT+TRPDI+YAVG++SR+M+ PK+
Sbjct: 1149 KPVVTPADPGMKLSVDSTRESINPTLFKSLVGSLRYLTITRPDITYAVGLVSRFMEKPKQ 1208

Query: 1214 PHLDAARRILRYIKGTIDYGLLYKRSKDCKLVGYCDADYAGDHDTRRSTTGYVFKFGSGT 1273
             HL AA+RILRYIKGT+++GL Y  S+D KLVGY D+DY GD D R+ST+GY F   S  
Sbjct: 1209 DHLIAAKRILRYIKGTMNHGLFYTHSQDSKLVGYSDSDYGGDLDDRKSTSGYAFHISSAV 1268

Query: 1274 ISWCSKRQPTVSLSTTEAEYRAAAGAAQESTWLKLLMEDLHQKIDYPISLLCDNQSAIRL 1333
             SW SK+Q T++LST EAEY AAA    ++ WLK ++ ++    + PI++  DN+SAI L
Sbjct: 1269 FSWSSKKQQTIALSTCEAEYMAAATCTCQAMWLKNILGEIGVSNEGPITIYVDNKSAISL 1328

Query: 1334 AENPVFHARTKHVEVHYHFIREKVLKEEIEMQQIKTDDQVADLFTKGLNTSKHESFRCQL 1354
            A+NPV H+R+KH++  YHFIRE+V  + +E+   +T+DQ+AD+FTK L       F+ +L
Sbjct: 1329 AKNPVSHSRSKHIDTKYHFIREQVKNKNVELVHCRTEDQLADIFTKPLKVETFNKFKEKL 1369

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.7e-21435.61Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME1.5e-10136.36Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
M810_ARATH1.3e-4744.20Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YCH4_YEAST2.0e-4533.55Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YJ41B_YEAST2.2e-3125.32Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A5AKW8_VITVI0.0e+0067.73Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1[more]
I1J3P8_BRADI0.0e+0063.49Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1[more]
I1J0P4_BRADI0.0e+0061.95Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1[more]
I1IA27_BRADI0.0e+0062.06Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1[more]
I1H466_BRADI0.0e+0061.85Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.11.0e-11639.79 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.17.2e-4944.20ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.14.5e-1942.37ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00240.12.2e-1339.74ATMG00240.1 Gag-Pol-related retrotransposon family protein[more]
ATMG00300.12.9e-1033.98ATMG00300.1 Gag-Pol-related retrotransposon family protein[more]
Match NameE-valueIdentityDescription
gi|147794801|emb|CAN71427.1|0.0e+0067.73hypothetical protein VITISV_027864 [Vitis vinifera][more]
gi|147810137|emb|CAN73532.1|2.3e-29958.47hypothetical protein VITISV_012827 [Vitis vinifera][more]
gi|147817226|emb|CAN75363.1|5.0e-26262.68hypothetical protein VITISV_026292 [Vitis vinifera][more]
gi|47824985|gb|AAT38758.1|4.9e-24937.88Putative gag-pol polyprotein, identical [Solanum demissum][more]
gi|651219311|gb|AIC77183.1|1.1e-24336.08polyprotein [Gossypium barbadense][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR001878Znf_CCHC
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025314DUF4219
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO:0008270zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh19G005910.1CmaCh19G005910.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 516..632
score: 2.6
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 514..680
score: 23
IPR001878Zinc finger, CCHC-typeGENE3DG3DSA:4.10.60.10coord: 252..278
score: 1.
IPR001878Zinc finger, CCHC-typePFAMPF00098zf-CCHCcoord: 260..274
score: 1.
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 260..276
score: 1.
IPR001878Zinc finger, CCHC-typePROFILEPS50158ZF_CCHCcoord: 261..274
score: 11
IPR001878Zinc finger, CCHC-typeunknownSSF57756Retrovirus zinc finger-like domainscoord: 253..283
score: 1.3
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 512..673
score: 6.0
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 517..689
score: 1.64
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 864..1107
score: 8.8E
IPR025314Domain of unknown function DUF4219PFAMPF13961DUF4219coord: 17..42
score: 2.8
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 451..503
score: 1.7
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 319..1266
score: 0.0coord: 16..300
score:
NoneNo IPR availablePANTHERPTHR11439:SF164SUBFAMILY NOT NAMEDcoord: 319..1266
score: 0.0coord: 16..300
score:
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 61..189
score: 5.2
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 1116..1297
score: 6.08E-38coord: 863..1086
score: 6.08

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CmaCh19G005910Wax gourdcmawgoB0640
CmaCh19G005910Wax gourdcmawgoB0642
CmaCh19G005910Wax gourdcmawgoB0656
CmaCh19G005910Wax gourdcmawgoB0664
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB149
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB340
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB415
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB457
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB458
CmaCh19G005910Cucurbita maxima (Rimu)cmacmaB462
CmaCh19G005910Cucumber (Gy14) v1cgycmaB0624
CmaCh19G005910Cucumber (Gy14) v1cgycmaB0686
CmaCh19G005910Cucurbita moschata (Rifu)cmacmoB494
CmaCh19G005910Cucurbita moschata (Rifu)cmacmoB502
CmaCh19G005910Cucurbita moschata (Rifu)cmacmoB508
CmaCh19G005910Cucurbita moschata (Rifu)cmacmoB530
CmaCh19G005910Cucurbita moschata (Rifu)cmacmoB534
CmaCh19G005910Wild cucumber (PI 183967)cmacpiB513
CmaCh19G005910Wild cucumber (PI 183967)cmacpiB520
CmaCh19G005910Wild cucumber (PI 183967)cmacpiB534
CmaCh19G005910Cucumber (Chinese Long) v2cmacuB513
CmaCh19G005910Cucumber (Chinese Long) v2cmacuB528
CmaCh19G005910Melon (DHL92) v3.5.1cmameB463
CmaCh19G005910Melon (DHL92) v3.5.1cmameB466
CmaCh19G005910Melon (DHL92) v3.5.1cmameB481
CmaCh19G005910Melon (DHL92) v3.5.1cmameB485
CmaCh19G005910Watermelon (Charleston Gray)cmawcgB457
CmaCh19G005910Watermelon (Charleston Gray)cmawcgB460
CmaCh19G005910Watermelon (Charleston Gray)cmawcgB465
CmaCh19G005910Watermelon (97103) v1cmawmB484
CmaCh19G005910Watermelon (97103) v1cmawmB496
CmaCh19G005910Watermelon (97103) v1cmawmB497
CmaCh19G005910Watermelon (97103) v1cmawmB503
CmaCh19G005910Cucurbita pepo (Zucchini)cmacpeB522
CmaCh19G005910Cucurbita pepo (Zucchini)cmacpeB531
CmaCh19G005910Cucurbita pepo (Zucchini)cmacpeB532
CmaCh19G005910Cucurbita pepo (Zucchini)cmacpeB539
CmaCh19G005910Bottle gourd (USVL1VR-Ls)cmalsiB475
CmaCh19G005910Bottle gourd (USVL1VR-Ls)cmalsiB472
CmaCh19G005910Bottle gourd (USVL1VR-Ls)cmalsiB484
CmaCh19G005910Bottle gourd (USVL1VR-Ls)cmalsiB493
CmaCh19G005910Bottle gourd (USVL1VR-Ls)cmalsiB498
CmaCh19G005910Cucumber (Gy14) v2cgybcmaB210
CmaCh19G005910Cucumber (Gy14) v2cgybcmaB379
CmaCh19G005910Cucumber (Gy14) v2cgybcmaB945
CmaCh19G005910Melon (DHL92) v3.6.1cmamedB535
CmaCh19G005910Melon (DHL92) v3.6.1cmamedB540
CmaCh19G005910Melon (DHL92) v3.6.1cmamedB560
CmaCh19G005910Melon (DHL92) v3.6.1cmamedB561
CmaCh19G005910Melon (DHL92) v3.6.1cmamedB565
CmaCh19G005910Silver-seed gourdcarcmaB0253
CmaCh19G005910Silver-seed gourdcarcmaB0288
CmaCh19G005910Silver-seed gourdcarcmaB0407
CmaCh19G005910Silver-seed gourdcarcmaB0528
CmaCh19G005910Silver-seed gourdcarcmaB1114
CmaCh19G005910Silver-seed gourdcarcmaB1118
CmaCh19G005910Cucumber (Chinese Long) v3cmacucB0635
CmaCh19G005910Cucumber (Chinese Long) v3cmacucB0607
CmaCh19G005910Cucumber (Chinese Long) v3cmacucB0618
CmaCh19G005910Cucumber (Chinese Long) v3cmacucB0627
CmaCh19G005910Cucumber (Chinese Long) v3cmacucB0639
CmaCh19G005910Watermelon (97103) v2cmawmbB521
CmaCh19G005910Watermelon (97103) v2cmawmbB531
CmaCh19G005910Watermelon (97103) v2cmawmbB536
CmaCh19G005910Watermelon (97103) v2cmawmbB546