Cp4.1LG02g02800.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG02g02800.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTranslation initiation factor IF-2
LocationCp4.1LG02 : 3547250 .. 3554525 (+)
Sequence length3704
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACGGAAGAAACTGTGGATGGAAATCCAAAATAGGAGAGCCTTATCTCCTCGAGCGTTAATCGATAGGCTGGGAAAGAGCCACACACGCTCACACCCCCATCTCCCCGAAGCCCACCACCAAATTTCTAGCTTCATTGCCTTCAGAACCATATGTTTTCATAAACATCCTCATCCACTGGCGCCTCTTCAAACTCTCCATTCATTTCAAGTCGCATGCGTCAAGAGGTAAGCCATTTCCCACTGCTAATTACCCACTCCTTGTATGTTCTTTTCTACTTCGCTTTTGATTCTTGGAAATTCCATTTTCTTTGCGTTAGATTGAATTCTGGGGTTTCGAAGCAATTTTGTTTGATGGGTTTGTTTAGAGCCATTGATTGATTAAGTAGGACGAGATTTTAGTTGTGAGTATGCAAGGAACAGGAACAATGGCGTCTGTGGCTTCCTTGTTCAACTTATCTGGTGTTGGAGTGATTGGATCATCAGAAAAGCCTCGCTCTCAGTTTCGTGGAGTGTGTTTATCGAGGAGAGGGTTTAAAGGAAGCAATAAATGGTATTATGTTCCATTTCCTCTCTGTAAATATTCAGCTACAACAACAACTGATTTCATTGCCGATCAAGGCAATGCCATCTCTGTTGATTCTAATAGTTCTTATATTAGAAGTAAAGATAATGATGGCACAGATTTTCTTCTTAAGCCAGCCCCTAAACCAGTGTTGAAAGCTGCAGAATCTAAGCCTCCCCTTGTAGGTTTGAATAAAGTAGCTTGGGAATCTCCGAAAAGTAATGGCGATTCGAACGGTAGCCGAAAATTGTTGGATAATGAAGAGGAAAGGAGTAAGGTGATTGAGTCTCTTGGGGAGGTATTGGAAAAGGCTGAAAAGTTGGAGACTCCGAAGTTGGGTGTGAATAAACCAGGAAGAGGTGTAGATAAGCCAACGACATCAGGGTCGAGCTCTAGTTCGAATTCGAAATCAAATCGCTCGAGAAGTGTTTGGCGTAAGGGGGATGCAGTTGCACCAGTGCAGAAGGTTGTAGCGGAACCGTCTAAGGCTAGTGGTAAAGTTGAAGCTAAACCGGGGGGAGCTAGTACGGTAGAGCCTCAATCACAAGCTGGTTTTCGAACTCCTCAACCACTTGTGAAACCACAATCAAGGTTACAAGCAAAGCCTTTGGCTGCGCCTCGGCCTGTGTTAAAGAAACCAATTCTGAAGGATGTCGGGGCAGCGACTGTGACAGCAAATGATGAAAGTAATGCAGCGGCGAAGACAAAAGAGAGAAAGCCGATTCTGATTGACAAATATGCTTCGAAGAAACCTGTAGCTAAACCGTTTGACTCTGAAGCCGTCATGGCCCCGAAAAAACCTGGAAAAGCCCCCTCTCCTGGAAAGTTCAAGGACGAGTACCGTAAGAAGAATGTTGCATCGGGAGGTCCAAGAAGGAGAATGGTTGATGATGTGGAAATCCCGGACGATGTTTCCATTCCTAGTGTGACTACAGCACGAAAAGGAAGGAAATGGAGCAAGGCAAGCCGTAAGGCTGCTAGAATCCAGGCTTCTAAAGATGCAGCTCCGGTGAAAGTCGAAATTCTAGAAGTTGAAGAAAGTGGTATGTTACTTGAGGAATTGGCCTTCAACTTGGCTATCAGTGAAGGCGAAATTCTCGGGTATTTATATTCAAAGGGGATTAAGCCTGATGGTGTGCAAACTTTAGACAAAGATATCGTGAAGATGATTTGCAAAGAGTACGATGTGGAGACCATAGATGTTGATCCGGTTAAAGTTGAAGAATTGGCTAAAAAGAAGGATATTTTTGATGAAGAAGATCTCGATAAACTTCAAAGCAGGCCTCCAGTCATCACCATTATGGGACATGTCGACCATGGCAAGGTAATTCGACGTCTTTCATTCTCCCTTTAGCTTTCGGGGTTTTCTTCGAGTCCTAATTTTTGTTTCAATCCATTCTAATCATTCCTGGAATTCGAAAAGGCAAGCTTTTATTTATTTTTATTTTTAAAATCTAATGAATCTTATTTTTTCTCCTCTCTTGGCAGACGACACTTTTGGACTACATCCGCAGGAGCAAGGTACTCGTGCAAGCTTAAACGTCTTTCTTGATGCTTTGCCTTTCTGGAATAGAGTTATCAGCTAAATTTGCAACAATTTAGATCTTTTGCTAATGATTCTAGCGTTTTAATTATGTCACGTCTAAAGTTCGAGTGCTACTCATCCATAGGTAGCTGCATCGGAAGCTGGTGGTATTACTCAAGGAATTGGTGCTTATAAGGTGATGGTACCACTAGATGGCAAGTTGCAGCCTTGTGTGTTCCTTGACACTCCAGGTCACGAGGTACCTAAAACCGTGCATCGTCTTTCTAAAAATGTAGCATTTACATGATATTTTCTTATTTTTTGGCGTATTCACTGTCATTGGATGAATTTTTGGGCAACGTTTTATCCGTTGGATTCTTTGTGGCATAATTTTCTCAGACAGGAACTGTCATTTGTATATACCATCCCTGTGTTCAGATGATGTATAGTGTCCATTTCCTGTAGGCGTTTGGAGCCATGAGAGCTCGTGGTGCTCGGGTGACAGACATTGCCATCATTGTGGTGGCTGCTGACGATGGAATCCGGCCTCAGACAAATGAGGCTATAGCTCATGCCAGAGCAGCTGGAGTGCCAATAGTGATTGCGATAAATAAGGCATGTAAACAGTAAAAGATCTCTACGAACTTCTCTACCGTACAAGCGTATAAACGTTTATAGAAAAATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAACCACCATTTATAAGGGTGTGGAAACCTTCCCCCTAGCAGACGCGTTTTAAAGCTTTGAGGGGAAGCCCGAAACGGAAAGCCCAAAGAGGACAATATCTGCTAGCGGTGGATCTGGGTCATTACAAAAAAATAGATGTCCTTGTATAAACGTGTTATATATCCTCGTAAAAGCTTCGAAACTATTGATTAATTTCTCATTTCGTTCACATTTTTCCAGATCGACAAAGATGGAGCTAATGCAGACAGAGTGATGCAGGAGCTTTCCTCTATCGGTTTAATGCCAGAAGACTGGGGCGGTGACGTCCCTATGGTGCAGGTAAATATATCTCAAACGCAAAAATTGACCTTGCATATCTCGGCTTTATCTATTTTCGAACTTATGCTCGTAATATTTTCCAACAGATTAGTGCTCTTAAAGGGCAGAATGTGGATGACCTGTTGGAAACAGTAATGCTTCTTGCTGAGGTAAGTGAAACATTGCTAGTACGTCGTGTTGATTTTAGTATTTTTACATCTTGGAAATCATCCAATTATGGTGGCTCTGGCTGTTCGTCGTCGTTGTTGCTTAATGCTTAAGGTCTGAGAGAATCATAATCCTCTAAATAAAGAACAACATATCTTGTGGATAACTAATATTTATCGTGCCCAACTCGTTAGAGAGGTAATCTATTCGTGCTCGAGTTAAAGCCTTTGTGAAGATATCTTCTAATTGCTCTCCAGTCTTCACATATCTTGTGGATACCAAACCTTTTGTATTTTTCACGTAAAAAATGATACTCGACATCAATGTGTTTTAGTTCGCTCATAAAAGACCGGATTAGATGCGATATGAAGAGCTGTTTGATTATCATATCACAATTTGGTCGGTGTTGTGATATCAAATCTTCGTCAACTCAGTCAATAATCTTTTTTTTCCTCTCCCTATCGGTCGAAATGTGGAGATACCTTAATTTCGGCCGGAGTCTCCCTATACCAAACCAACTCGCCTTCAACTCAATCGGTGAAAAGATATCTGAGTTAGGTTTGGACCCTTTGCTTAGAAGTGGACTCGACCGATGTTTCCTTTCCCGAACTAAAGGAATTTCAAGTTTTCCAATAGCTATTTGACTCATTCACAATAAGTGGATCGAGGCTGAGATATTGCGGGAGTTAACGGATTGTCGTGGACTTCCGTCTGTCCATTTTTTTCTCATTATCAATTTGGTCTAGTTTTGTCGCATCGAGAGGAAATCCCCATCCCACATTTGAGAAAGTTTAGCTATGCCCACCGGTCTTAAATTGGACTCCGGTTCTTAGTTAATTTTGAGATTCCTTTCGTGGGGCTGAGTTTTCGTACGCCTTAGTATATTCTTTTCATTTTATTCCCTCACATAAACTTTTCAACGTATGTTATCTAATTCTTTTTCAATGTTCAGTTGCAAGAACTGAAGGCTAATCCTGATAGAAGTGCGAAGGGGACAGTTATCGAGGCAGGTCTTGACAAAGCAAAAGGACCGTTCGCTACGTTTATTGTACAGAATGGAACCCTTAAACGGGGTGACGTTTTAGTATGCGGGGAGGCCTTCGGGAAGGTCGGTACTGCTTTTTGTTTGTACATTTGTTACTATGTGGCGCAAATAATCGTGAAGTTATTTTGCGTTCGTTTGATTATGTTATTATTTTTGGTGATTGTTGTTGATGTCGACCATTTCTAAGGAGTTCGTTTACCTGTATAACTATTTTAGGTTCGAGCTTTATTTGATGATGGCGGAAAACGGGTCGACGAAGCTGGACCTTCCTTACCAGTGCAGGTTTGTGGATCATTACTTGTTGTTTTTGTTTATGTCAAAGCAGATTTAAGCTGCATCCATAAGGAGTTTCACTTTGTTATATATTCTCTGATAGTCTAATTCTCTGGTTTTCTTAATCTGTCCACAGGTCATTGGGTTAAATATTGTGCCAATTGCTGGTGATGAATTCGAGGTTGTCGATTCCCTCGATACAGCTCGTGAGAAGGCCGAGTTACGCGCTGAGGCCTTGTGGTCTCAACGAATATCAGACAAGGCTGGAGATGGGAAGGTTACCCTTTCTTCTTTAGCTTCGGCAGTTTCATCGGGAAAGCGATCCGGGTTAGACTTGCACCAACTGAACATAATCTTGAAGGTCGACGTTCAGGTACATCGGGACGTATATCTTTCTTACACTTGTAATCTACAATGAACTTGTTGAGATGAAAATTTGTGCACGTTATTCACTTGTTAATATTGTCGATGACTAGGGATCGATCGAGGCTATTAGACAAGCCCTTCAGGTGCTCCCGCAAGATAACGTCTCCTTGAAGTTTCTTTTGCAAGCAACTGGGGATGTGACCTCCAGTGATATTGATCTTGCAGTTGCAAGCAAAGCCATAATTCTAGGATTTAATGTTAAAGCACCAGGTTCTGTTAAGAGTTATGCAGAAAATAAAGCTGTTGAGATCAGGCTCTATAGAGTCATTTACGAGCTTATCGATGATGTACGGAACGCAATGGAAGGACTTCTAGATCCTGTTGAGGTCAGTACTTAAAGTTTTACATTTCCAAGTTATAACGACATGGTACACGAGTTTGTGAAAGTAAGTTTGAGATCCCACATCGGTTGGAGAGGAAAACGAAGCATTCCTTATAAGGGCGTGGAAACCTCTCCCTAGCAGATGCGTTTTAAAACCGTGAGGTTGATGGTGATACGTAACGGGCCAAAACGGACAATATCTATAGCGGGGGGCTTGGGCTGTTACAAATGGTATCAGAGTTAGACACCGGGTGGTGTGCCAGCGAGGACGTTGGGCCCCCAAGGGGGGTGGACTGTGGGATCTCACACCGGTTGGAGAGGGGAACGAAGTATTCCTTATAAGGGTGTGGAAACCTCTCTCCAGTAGACACGTTTTAAAACCGTGAGGCTAACGGCGATACGTAACGAGCCCGGGCGCAAGCGAACAATATCTATTAGCGTTTGGAGTTGGACTATATTTTTCCTTTGTATTTACAAGAGAGGCTCACACTCGGATACGCCTTAATCCTTAGTCCAGATTTTTCTTTTTTTTTTTTCATTTTTGTCGTTTAACTCGATCGATCAACTTCGATCAACTTGTATGCTCTTCTAGTCAGTCCCTGTGACTATATCTTTGTTTGGTTAAATTGTTGGCTTGAAGCTTCTACACTTTCCTCATTGAAGATTAACATATATAGCTGAGATCTGTCATTTGTCTTTGAATAGGAACAAATACCAATGGGGTCAGCAGAAGTCCGTGCAGTATTCAGTAGTGGCAGTGGTCTTGTTGCTGGATGCATGGTAGTGGATGGAAAGCTGGCTAAGGGCTGTGGCATCCGGGTTCTGCGAAAGGGTAAAACGGTATATAATGGCCAACTTGATTCGCTGCGACGAGTTAAAGAAATCGTGAAAGAGGTACGTGATTATCCTTCCCTTTGATTGGGAACATGCCTAGCTTGAGCTTAGTTCCCTATGTAGATTGATATTAAGTTCCAGCTTAGATGTAGCACCGTTGGCAGATATTCTCCTCTTTGAGTTTCCCTTTTAGGCTTTCCCTCAAGATTTTATAAAACACGTATGTTAGGGAGAGATTTCCACGCCCTTATAAAGGGTGTTCCGTTCTCCTCCCTAACTGATGTGGAATCTCACAGTAGACGCGTTTTAAAAAAACTTCGAGGGAAAGCCAGAAAGTGAAAGTCCAAAGAGGACAATATCTGCTAGCGGTGTGCTTGGGCTATTACATTAGATTTGTGGATTAGAGTAGACATTAACCCGAGCTAAGTACATAACTAGTGACTGCAAAAATCAACACCATAATGTCTCTTGGTCACTGCTCGTTTTATGCGTCGGGTTTTGTTCTGAAAGATGTTGGATCGATAATACTGTTTTTAAGGAGAATGAATGGTCTGTTTTGTAGGTGAATGCTGGCCTAGAGTGTGGCGTTGGGATGGAGGAGTACAATGATTGGGAGGTGGGCGATGTAATAGAGGCCTACTACACCGTTCAGAAGAAGCGGACGCTTGAGGAGGCCTCAGCTTCAATGGAAGCTGCACTGGAGGAAGCTGGAATTGGCATGTAGACAAAGAAACCCAGTAACTTGTCAGGCCAGCCTAGGCCATACTTACCTTTATGATCAGATGGCTATGCCTTCTGCAATGCTTTGGCCATTTTTTGTCCAGTCAATGTTCGACGATACCAAAGACGAGGACGTGTGTCGTTGCTTTGTGATTCACTTGAGGAGCAAGGTGACGTCGAGAAGCTGAAGGCTTTGTGTTGCTGTAAATATTCAATACATGTATTAGATCTCCGATTCTTATTGGTAAGAGGAAAACTGTTATTATTTGGTTCTCAATGCTCGAGCGTTCAATGCGATGTAAACAGCAAACTGTGAACACAGCATGAAAATGGAAGAAATGTCAACAAATTTTCAATGATGTTCAAATTTTGCATCATTTTACTGT

mRNA sequence

AACGGAAGAAACTGTGGATGGAAATCCAAAATAGGAGAGCCTTATCTCCTCGAGCGTTAATCGATAGGCTGGGAAAGAGCCACACACGCTCACACCCCCATCTCCCCGAAGCCCACCACCAAATTTCTAGCTTCATTGCCTTCAGAACCATATGTTTTCATAAACATCCTCATCCACTGGCGCCTCTTCAAACTCTCCATTCATTTCAAGTCGCATGCGTCAAGAGATTGAATTCTGGGGTTTCGAAGCAATTTTGTTTGATGGGTTTGTTTAGAGCCATTGATTGATTAAGTAGGACGAGATTTTAGTTGTGAGTATGCAAGGAACAGGAACAATGGCGTCTGTGGCTTCCTTGTTCAACTTATCTGGTGTTGGAGTGATTGGATCATCAGAAAAGCCTCGCTCTCAGTTTCGTGGAGTGTGTTTATCGAGGAGAGGGTTTAAAGGAAGCAATAAATGGTATTATGTTCCATTTCCTCTCTGTAAATATTCAGCTACAACAACAACTGATTTCATTGCCGATCAAGGCAATGCCATCTCTGTTGATTCTAATAGTTCTTATATTAGAAGTAAAGATAATGATGGCACAGATTTTCTTCTTAAGCCAGCCCCTAAACCAGTGTTGAAAGCTGCAGAATCTAAGCCTCCCCTTGTAGGTTTGAATAAAGTAGCTTGGGAATCTCCGAAAAGTAATGGCGATTCGAACGGTAGCCGAAAATTGTTGGATAATGAAGAGGAAAGGAGTAAGGTGATTGAGTCTCTTGGGGAGGTATTGGAAAAGGCTGAAAAGTTGGAGACTCCGAAGTTGGGTGTGAATAAACCAGGAAGAGGTGTAGATAAGCCAACGACATCAGGGTCGAGCTCTAGTTCGAATTCGAAATCAAATCGCTCGAGAAGTGTTTGGCGTAAGGGGGATGCAGTTGCACCAGTGCAGAAGGTTGTAGCGGAACCGTCTAAGGCTAGTGGTAAAGTTGAAGCTAAACCGGGGGGAGCTAGTACGGTAGAGCCTCAATCACAAGCTGGTTTTCGAACTCCTCAACCACTTGTGAAACCACAATCAAGGTTACAAGCAAAGCCTTTGGCTGCGCCTCGGCCTGTGTTAAAGAAACCAATTCTGAAGGATGTCGGGGCAGCGACTGTGACAGCAAATGATGAAAGTAATGCAGCGGCGAAGACAAAAGAGAGAAAGCCGATTCTGATTGACAAATATGCTTCGAAGAAACCTGTAGCTAAACCGTTTGACTCTGAAGCCGTCATGGCCCCGAAAAAACCTGGAAAAGCCCCCTCTCCTGGAAAGTTCAAGGACGAGTACCGTAAGAAGAATGTTGCATCGGGAGGTCCAAGAAGGAGAATGGTTGATGATGTGGAAATCCCGGACGATGTTTCCATTCCTAGTGTGACTACAGCACGAAAAGGAAGGAAATGGAGCAAGGCAAGCCGTAAGGCTGCTAGAATCCAGGCTTCTAAAGATGCAGCTCCGGTGAAAGTCGAAATTCTAGAAGTTGAAGAAAGTGGTATGTTACTTGAGGAATTGGCCTTCAACTTGGCTATCAGTGAAGGCGAAATTCTCGGGTATTTATATTCAAAGGGGATTAAGCCTGATGGTGTGCAAACTTTAGACAAAGATATCGTGAAGATGATTTGCAAAGAGTACGATGTGGAGACCATAGATGTTGATCCGGTTAAAGTTGAAGAATTGGCTAAAAAGAAGGATATTTTTGATGAAGAAGATCTCGATAAACTTCAAAGCAGGCCTCCAGTCATCACCATTATGGGACATGTCGACCATGGCAAGACGACACTTTTGGACTACATCCGCAGGAGCAAGGTAGCTGCATCGGAAGCTGGTGGTATTACTCAAGGAATTGGTGCTTATAAGGTGATGGTACCACTAGATGGCAAGTTGCAGCCTTGTGTGTTCCTTGACACTCCAGGTCACGAGGCGTTTGGAGCCATGAGAGCTCGTGGTGCTCGGGTGACAGACATTGCCATCATTGTGGTGGCTGCTGACGATGGAATCCGGCCTCAGACAAATGAGGCTATAGCTCATGCCAGAGCAGCTGGAGTGCCAATAATCGACAAAGATGGAGCTAATGCAGACAGAGTGATGCAGGAGCTTTCCTCTATCGGTTTAATGCCAGAAGACTGGGGCGGTGACGTCCCTATGGTGCAGATTAGTGCTCTTAAAGGGCAGAATGTGGATGACCTGTTGGAAACAGTAATGCTTCTTGCTGAGTTGCAAGAACTGAAGGCTAATCCTGATAGAAGTGCGAAGGGGACAGTTATCGAGGCAGGTCTTGACAAAGCAAAAGGACCGTTCGCTACGTTTATTGTACAGAATGGAACCCTTAAACGGGGTGACGTTTTAGTATGCGGGGAGGCCTTCGGGAAGGTTCGAGCTTTATTTGATGATGGCGGAAAACGGGTCGACGAAGCTGGACCTTCCTTACCAGTGCAGGTCATTGGGTTAAATATTGTGCCAATTGCTGGTGATGAATTCGAGGTTGTCGATTCCCTCGATACAGCTCGTGAGAAGGCCGAGTTACGCGCTGAGGCCTTGTGGTCTCAACGAATATCAGACAAGGCTGGAGATGGGAAGGTTACCCTTTCTTCTTTAGCTTCGGCAGTTTCATCGGGAAAGCGATCCGGGTTAGACTTGCACCAACTGAACATAATCTTGAAGGTCGACGTTCAGGGATCGATCGAGGCTATTAGACAAGCCCTTCAGGTGCTCCCGCAAGATAACGTCTCCTTGAAGTTTCTTTTGCAAGCAACTGGGGATGTGACCTCCAGTGATATTGATCTTGCAGTTGCAAGCAAAGCCATAATTCTAGGATTTAATGTTAAAGCACCAGGTTCTGTTAAGAGTTATGCAGAAAATAAAGCTGTTGAGATCAGGCTCTATAGAGTCATTTACGAGCTTATCGATGATGTACGGAACGCAATGGAAGGACTTCTAGATCCTGTTGAGGAACAAATACCAATGGGGTCAGCAGAAGTCCGTGCAGTATTCAGTAGTGGCAGTGGTCTTGTTGCTGGATGCATGGTAGTGGATGGAAAGCTGGCTAAGGGCTGTGGCATCCGGGTTCTGCGAAAGGGTAAAACGGTATATAATGGCCAACTTGATTCGCTGCGACGAGTTAAAGAAATCGTGAAAGAGGTGAATGCTGGCCTAGAGTGTGGCGTTGGGATGGAGGAGTACAATGATTGGGAGGTGGGCGATGTAATAGAGGCCTACTACACCGTTCAGAAGAAGCGGACGCTTGAGGAGGCCTCAGCTTCAATGGAAGCTGCACTGGAGGAAGCTGGAATTGGCATGTAGACAAAGAAACCCAGTAACTTGTCAGGCCAGCCTAGGCCATACTTACCTTTATGATCAGATGGCTATGCCTTCTGCAATGCTTTGGCCATTTTTTGTCCAGTCAATGTTCGACGATACCAAAGACGAGGACGTGTGTCGTTGCTTTGTGATTCACTTGAGGAGCAAGGTGACGTCGAGAAGCTGAAGGCTTTGTGTTGCTGTAAATATTCAATACATGTATTAGATCTCCGATTCTTATTGGTAAGAGGAAAACTGTTATTATTTGGTTCTCAATGCTCGAGCGTTCAATGCGATGTAAACAGCAAACTGTGAACACAGCATGAAAATGGAAGAAATGTCAACAAATTTTCAATGATGTTCAAATTTTGCATCATTTTACTGT

Coding sequence (CDS)

ATGCAAGGAACAGGAACAATGGCGTCTGTGGCTTCCTTGTTCAACTTATCTGGTGTTGGAGTGATTGGATCATCAGAAAAGCCTCGCTCTCAGTTTCGTGGAGTGTGTTTATCGAGGAGAGGGTTTAAAGGAAGCAATAAATGGTATTATGTTCCATTTCCTCTCTGTAAATATTCAGCTACAACAACAACTGATTTCATTGCCGATCAAGGCAATGCCATCTCTGTTGATTCTAATAGTTCTTATATTAGAAGTAAAGATAATGATGGCACAGATTTTCTTCTTAAGCCAGCCCCTAAACCAGTGTTGAAAGCTGCAGAATCTAAGCCTCCCCTTGTAGGTTTGAATAAAGTAGCTTGGGAATCTCCGAAAAGTAATGGCGATTCGAACGGTAGCCGAAAATTGTTGGATAATGAAGAGGAAAGGAGTAAGGTGATTGAGTCTCTTGGGGAGGTATTGGAAAAGGCTGAAAAGTTGGAGACTCCGAAGTTGGGTGTGAATAAACCAGGAAGAGGTGTAGATAAGCCAACGACATCAGGGTCGAGCTCTAGTTCGAATTCGAAATCAAATCGCTCGAGAAGTGTTTGGCGTAAGGGGGATGCAGTTGCACCAGTGCAGAAGGTTGTAGCGGAACCGTCTAAGGCTAGTGGTAAAGTTGAAGCTAAACCGGGGGGAGCTAGTACGGTAGAGCCTCAATCACAAGCTGGTTTTCGAACTCCTCAACCACTTGTGAAACCACAATCAAGGTTACAAGCAAAGCCTTTGGCTGCGCCTCGGCCTGTGTTAAAGAAACCAATTCTGAAGGATGTCGGGGCAGCGACTGTGACAGCAAATGATGAAAGTAATGCAGCGGCGAAGACAAAAGAGAGAAAGCCGATTCTGATTGACAAATATGCTTCGAAGAAACCTGTAGCTAAACCGTTTGACTCTGAAGCCGTCATGGCCCCGAAAAAACCTGGAAAAGCCCCCTCTCCTGGAAAGTTCAAGGACGAGTACCGTAAGAAGAATGTTGCATCGGGAGGTCCAAGAAGGAGAATGGTTGATGATGTGGAAATCCCGGACGATGTTTCCATTCCTAGTGTGACTACAGCACGAAAAGGAAGGAAATGGAGCAAGGCAAGCCGTAAGGCTGCTAGAATCCAGGCTTCTAAAGATGCAGCTCCGGTGAAAGTCGAAATTCTAGAAGTTGAAGAAAGTGGTATGTTACTTGAGGAATTGGCCTTCAACTTGGCTATCAGTGAAGGCGAAATTCTCGGGTATTTATATTCAAAGGGGATTAAGCCTGATGGTGTGCAAACTTTAGACAAAGATATCGTGAAGATGATTTGCAAAGAGTACGATGTGGAGACCATAGATGTTGATCCGGTTAAAGTTGAAGAATTGGCTAAAAAGAAGGATATTTTTGATGAAGAAGATCTCGATAAACTTCAAAGCAGGCCTCCAGTCATCACCATTATGGGACATGTCGACCATGGCAAGACGACACTTTTGGACTACATCCGCAGGAGCAAGGTAGCTGCATCGGAAGCTGGTGGTATTACTCAAGGAATTGGTGCTTATAAGGTGATGGTACCACTAGATGGCAAGTTGCAGCCTTGTGTGTTCCTTGACACTCCAGGTCACGAGGCGTTTGGAGCCATGAGAGCTCGTGGTGCTCGGGTGACAGACATTGCCATCATTGTGGTGGCTGCTGACGATGGAATCCGGCCTCAGACAAATGAGGCTATAGCTCATGCCAGAGCAGCTGGAGTGCCAATAATCGACAAAGATGGAGCTAATGCAGACAGAGTGATGCAGGAGCTTTCCTCTATCGGTTTAATGCCAGAAGACTGGGGCGGTGACGTCCCTATGGTGCAGATTAGTGCTCTTAAAGGGCAGAATGTGGATGACCTGTTGGAAACAGTAATGCTTCTTGCTGAGTTGCAAGAACTGAAGGCTAATCCTGATAGAAGTGCGAAGGGGACAGTTATCGAGGCAGGTCTTGACAAAGCAAAAGGACCGTTCGCTACGTTTATTGTACAGAATGGAACCCTTAAACGGGGTGACGTTTTAGTATGCGGGGAGGCCTTCGGGAAGGTTCGAGCTTTATTTGATGATGGCGGAAAACGGGTCGACGAAGCTGGACCTTCCTTACCAGTGCAGGTCATTGGGTTAAATATTGTGCCAATTGCTGGTGATGAATTCGAGGTTGTCGATTCCCTCGATACAGCTCGTGAGAAGGCCGAGTTACGCGCTGAGGCCTTGTGGTCTCAACGAATATCAGACAAGGCTGGAGATGGGAAGGTTACCCTTTCTTCTTTAGCTTCGGCAGTTTCATCGGGAAAGCGATCCGGGTTAGACTTGCACCAACTGAACATAATCTTGAAGGTCGACGTTCAGGGATCGATCGAGGCTATTAGACAAGCCCTTCAGGTGCTCCCGCAAGATAACGTCTCCTTGAAGTTTCTTTTGCAAGCAACTGGGGATGTGACCTCCAGTGATATTGATCTTGCAGTTGCAAGCAAAGCCATAATTCTAGGATTTAATGTTAAAGCACCAGGTTCTGTTAAGAGTTATGCAGAAAATAAAGCTGTTGAGATCAGGCTCTATAGAGTCATTTACGAGCTTATCGATGATGTACGGAACGCAATGGAAGGACTTCTAGATCCTGTTGAGGAACAAATACCAATGGGGTCAGCAGAAGTCCGTGCAGTATTCAGTAGTGGCAGTGGTCTTGTTGCTGGATGCATGGTAGTGGATGGAAAGCTGGCTAAGGGCTGTGGCATCCGGGTTCTGCGAAAGGGTAAAACGGTATATAATGGCCAACTTGATTCGCTGCGACGAGTTAAAGAAATCGTGAAAGAGGTGAATGCTGGCCTAGAGTGTGGCGTTGGGATGGAGGAGTACAATGATTGGGAGGTGGGCGATGTAATAGAGGCCTACTACACCGTTCAGAAGAAGCGGACGCTTGAGGAGGCCTCAGCTTCAATGGAAGCTGCACTGGAGGAAGCTGGAATTGGCATGTAG

Protein sequence

MQGTGTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTTDFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAWESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGVDKPTTSGSSSSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKVEAKPGGASTVEPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPILKDVGAATVTANDESNAAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMVDDVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPIIDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAALEEAGIGM
BLAST of Cp4.1LG02g02800.1 vs. Swiss-Prot
Match: IF2C_ARATH (Translation initiation factor IF-2, chloroplastic OS=Arabidopsis thaliana GN=At1g17220 PE=2 SV=2)

HSP 1 Score: 1248.8 bits (3230), Expect = 0.0e+00
Identity = 706/1035 (68.21%), Postives = 813/1035 (78.55%), Query Frame = 1

Query: 5    GTMASVASLFNLSG----VGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSA 64
            GTM S+ASL +L G    V    SS+   +  + V LSRR  KG+ KW      LC+YS 
Sbjct: 9    GTMPSLASLVSLGGACASVSGTSSSDASYALVKRVSLSRRSVKGTKKW------LCRYSV 68

Query: 65   -----TTTTDFIADQGN-AISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPL-V 124
                 TTT DFIADQ N ++S+DSNS +  SKD D ++ +LK  PKPVLK   ++    +
Sbjct: 69   SSSTTTTTADFIADQNNNSVSIDSNS-FRGSKDGDDSEVVLKQTPKPVLKPPVARVERGL 128

Query: 125  GLNKVAWESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGV 184
            G+N   W    SNG         D EEER+KVIESLGEVL+KAEKLE PK G  + G  V
Sbjct: 129  GVNTAPWSKDLSNGGK------FDGEEERNKVIESLGEVLDKAEKLEIPKPGNKEGGEAV 188

Query: 185  --DKPTTSGSSSSSNSKSNRS----------RSVWRKGDAVAPVQKVVAEPSKASGK-VE 244
               +P+ + S+S + S +N S          +SVWRKGDAVA VQKVV E  K   + V+
Sbjct: 189  KPSQPSANSSNSRNGSYANASDGGTRKTKTMKSVWRKGDAVAAVQKVVKESPKIFNRGVQ 248

Query: 245  AKPGGASTVEPQSQAG---------FRTPQPLVKPQSRLQAKPLAAPRPVLKKPILKDVG 304
             +P      E  ++AG         FR PQP V+PQ  LQ KP+ AP PV K PILKD+G
Sbjct: 249  TEPRTREEGEVNAKAGTPLAPPQPPFR-PQPPVRPQPMLQGKPMVAP-PVKKSPILKDLG 308

Query: 305  -AATVTANDESNAAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKD 364
             AA    ++E +++ K+KERKPIL+DK+ASKK    P  S+AV+AP KPGK P   KF+ 
Sbjct: 309  MAAKPLVSEEVDSSVKSKERKPILVDKFASKKKGVDPAASQAVLAPTKPGKGPPSNKFRV 368

Query: 365  EYRKKNVASGGPRRRMVDDVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVK 424
            E+R K  AS  PRRR+V + +  DD SI    + RKGRKWSKASRKA R+QA+KDAAPVK
Sbjct: 369  EHRNKKNASASPRRRIVAEDDGDDDASISR--SGRKGRKWSKASRKAVRLQAAKDAAPVK 428

Query: 425  VEILEVEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVET 484
             EILEVEE GM +E+LA+NLAI EG+ILGYLYSKGI+PDGV TLD+++VKMIC++YDVE 
Sbjct: 429  AEILEVEEEGMSIEDLAYNLAIGEGDILGYLYSKGIRPDGVHTLDREMVKMICRDYDVEV 488

Query: 485  IDVDPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEA 544
            +D D VKVEE+AKK+  FDEEDLDKL+ RPPVITIMGHVDHGKTTLLDYIR+SKVAASEA
Sbjct: 489  LDADSVKVEEMAKKRQTFDEEDLDKLEDRPPVITIMGHVDHGKTTLLDYIRKSKVAASEA 548

Query: 545  GGITQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP 604
            GGITQGIGAYKV VP+DGKLQ CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP
Sbjct: 549  GGITQGIGAYKVSVPVDGKLQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP 608

Query: 605  QTNEAIAHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALK 664
            QTNEAIAHA+AA VPI      IDK+GA+ DRVMQELSSIGLMPEDWGGDVPMVQISALK
Sbjct: 609  QTNEAIAHAKAAAVPIVIAINKIDKEGASPDRVMQELSSIGLMPEDWGGDVPMVQISALK 668

Query: 665  GQNVDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVL 724
            G+NVDDLLETVML+AELQELKANP R+AKG VIEAGLDKAKGPFATFIVQ GTLKRGDV+
Sbjct: 669  GENVDDLLETVMLVAELQELKANPHRNAKGIVIEAGLDKAKGPFATFIVQKGTLKRGDVV 728

Query: 725  VCGEAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELR 784
            VCGEAFGKVRALFD  G+RVDEAGPS+PVQVIGLN VPIAGDEFE+V SLD ARE AE R
Sbjct: 729  VCGEAFGKVRALFDHSGERVDEAGPSIPVQVIGLNNVPIAGDEFEIVSSLDVAREMAEAR 788

Query: 785  AEALWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQ 844
            A +L  +RIS KAGDGKVTLSSLASAVS+ K SGLDLHQLNIILKVDVQGSIEA+RQALQ
Sbjct: 789  AVSLRDERISAKAGDGKVTLSSLASAVSAKKMSGLDLHQLNIILKVDVQGSIEAVRQALQ 848

Query: 845  VLPQDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYR 904
            VLPQ+NV+LKFLLQATGDV++SD+DLA AS+AI+ GFNVKA GSVK  AENK VEIRLYR
Sbjct: 849  VLPQENVTLKFLLQATGDVSNSDVDLASASEAIVFGFNVKASGSVKKAAENKGVEIRLYR 908

Query: 905  VIYELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRV 964
            VIYELIDDVRNAMEGLL+ VEEQIP+GSAEVRA FSSGSG VAGCMV +GK  K CGIRV
Sbjct: 909  VIYELIDDVRNAMEGLLESVEEQIPIGSAEVRATFSSGSGRVAGCMVNEGKFVKDCGIRV 968

Query: 965  LRKGKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLE 1000
            +RKGKTV+ G LDSL+RVKE VKEV+AGLECG+GM++Y+DW  GD+IEA+  VQK+RTLE
Sbjct: 969  VRKGKTVHVGVLDSLKRVKENVKEVSAGLECGIGMDDYDDWIEGDIIEAFNAVQKRRTLE 1026

BLAST of Cp4.1LG02g02800.1 vs. Swiss-Prot
Match: IF2C_PHAVU (Translation initiation factor IF-2, chloroplastic OS=Phaseolus vulgaris GN=IF2CP PE=2 SV=1)

HSP 1 Score: 1216.4 bits (3146), Expect = 0.0e+00
Identity = 683/1017 (67.16%), Postives = 799/1017 (78.56%), Query Frame = 1

Query: 5    GTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 64
            GTM+S+AS  +L  +  + SS +  S  R V  SR   KG  +W+ +   +C+YS TTT 
Sbjct: 10   GTMSSLASPVSLGSLMGVSSSGRSHSGVRRVSFSRGNCKGRKRWHCLSLSVCRYSVTTT- 69

Query: 65   DFIADQGNAISVDSNS---SYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAWE 124
            DFIADQGN++S+DSNS   S  +S  +DGT F+LKP PKPVLKA +++   +G ++    
Sbjct: 70   DFIADQGNSVSLDSNSNSSSSSKSGGDDGTGFVLKPPPKPVLKAPDNRMTHLGPSR---- 129

Query: 125  SPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGVDKPTTSGS 184
               + GD           EER+KVIESLGEVLEKAEKL + K+  +K    V+KP  + +
Sbjct: 130  ---TTGDV----------EERNKVIESLGEVLEKAEKLGSSKVNGDKNNGSVNKPVRNNA 189

Query: 185  SSSSNS----------KSNRSRSVWRKGDAVAPVQKVVAEPSKAS-GKVEAKP----GGA 244
            ++S  +          KS   +SVWRKGD+VA VQKVV E  K S  K E +     GG 
Sbjct: 190  NASPRTERPVNSAASLKSKTLKSVWRKGDSVASVQKVVKEVPKPSYNKNEEEKSQTRGGE 249

Query: 245  STV------EPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPILKDVGAATVTANDE 304
              V      +P S+     PQ   KPQ  L +KP  AP PV K  +L+D GAA       
Sbjct: 250  KVVSQTRAPQPPSKPQPLKPQQPSKPQPALLSKPSIAPPPVKKPVVLRDKGAA------- 309

Query: 305  SNAAAKTKERK-PILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKN-VA 364
               + K+KE+K PILIDK+ASKKPV  P  ++AV+AP KPGKAPSPGKFKD++RKK  +A
Sbjct: 310  -ETSVKSKEKKSPILIDKFASKKPVVDPLIAQAVLAPPKPGKAPSPGKFKDDFRKKGALA 369

Query: 365  SGGPRRRMVDDVEIPDD-----VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEI 424
             GG RRR++DD ++  D     VSIP   TARKGRKWSKASRKAAR+QA++DAAPVKVEI
Sbjct: 370  GGGRRRRILDDEDVIQDASELNVSIPGAATARKGRKWSKASRKAARLQAARDAAPVKVEI 429

Query: 425  LEVEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDV 484
            LEV +SGML+EELA+ LA SEGEILGYLYSKGIKPDGVQT+DKD+VKMICKEYDVE ID 
Sbjct: 430  LEVGDSGMLVEELAYCLATSEGEILGYLYSKGIKPDGVQTIDKDMVKMICKEYDVEVIDA 489

Query: 485  DPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGI 544
            DPVKVE L KK++I DE+DLDKL+ RPPVITIMGHVDHGKTTLLDYIR+SKVAASEAGGI
Sbjct: 490  DPVKVEGLVKKREILDEDDLDKLKDRPPVITIMGHVDHGKTTLLDYIRKSKVAASEAGGI 549

Query: 545  TQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTN 604
            TQGIGAYKV VP DGK  PCVFLDTPGHEAFGAMRARGA VTDIA+IVVAADDGIR QTN
Sbjct: 550  TQGIGAYKVQVPFDGKTLPCVFLDTPGHEAFGAMRARGASVTDIAVIVVAADDGIRSQTN 609

Query: 605  EAIAHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQN 664
            EAIAHA+AAGVPI      IDKDGAN +RVMQELSSIGLMPEDWGG+ PMV ISALKG+N
Sbjct: 610  EAIAHAKAAGVPIVIAINKIDKDGANPERVMQELSSIGLMPEDWGGNTPMVPISALKGKN 669

Query: 665  VDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCG 724
            VDDLLETVML+AELQELKANPDRSAKGTVIEAGLDK+KGP ATFIVQNG+L+RGD++VC 
Sbjct: 670  VDDLLETVMLVAELQELKANPDRSAKGTVIEAGLDKSKGPLATFIVQNGSLRRGDIVVCW 729

Query: 725  EAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEA 784
             +F K RALFDDGGKRVDEA PS+PVQVIGLN VPIAGD FEVV+SLD ARE+AE RAE+
Sbjct: 730  RSFWKGRALFDDGGKRVDEATPSIPVQVIGLNNVPIAGDVFEVVESLDAARERAETRAES 789

Query: 785  LWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLP 844
            L ++RIS KAGDGK+TLSSLASAVSSGK SGLDLHQLNIILKVD+QGSIEA+R+ALQVLP
Sbjct: 790  LRNERISAKAGDGKITLSSLASAVSSGKLSGLDLHQLNIILKVDLQGSIEAVRKALQVLP 849

Query: 845  QDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIY 904
            Q+NV+LKFLL+ATGDV +SD+DLAVASKAII+GFN   PGSVKSYA+NKAVEIRLYRVIY
Sbjct: 850  QENVTLKFLLEATGDVNTSDVDLAVASKAIIMGFNAXTPGSVKSYADNKAVEIRLYRVIY 909

Query: 905  ELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRK 964
            ELIDDVR AMEGLL+PVEEQ+ +GSA VRAVFSSGSG VAGCMV +GK+ K CGIRV RK
Sbjct: 910  ELIDDVRKAMEGLLEPVEEQLTIGSAVVRAVFSSGSGRVAGCMVTEGKVLKDCGIRVKRK 969

Query: 965  GKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLE 985
            GK V+ G +DSLRRVKEIVKEVNAGLECG+G+E+++DWE GD+IE    ++++  L+
Sbjct: 970  GKIVHVGIIDSLRRVKEIVKEVNAGLECGLGLEDFDDWEEGDIIEPSTQLRRRGPLK 1000

BLAST of Cp4.1LG02g02800.1 vs. Swiss-Prot
Match: IF2_ANAVT (Translation initiation factor IF-2 OS=Anabaena variabilis (strain ATCC 29413 / PCC 7937) GN=infB PE=3 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 7.9e-172
Identity = 334/634 (52.68%), Postives = 443/634 (69.87%), Query Frame = 1

Query: 358  IPSVTTARKGRKWSKASRKAARIQAS--KDAAPVKVEILEVEESGMLLEELAFNLAISEG 417
            I S TT  +G++ S  +R   R Q +  K   P KV +       M ++ELA  LA+++ 
Sbjct: 418  ISSPTT--RGKRSSHNNRDQNRRQETEVKRERPEKVAVT----GPMTVQELADLLAVADT 477

Query: 418  EILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIFDEEDLDK 477
            EI+  L+ KG+     Q LD   + ++ KE ++E    +P    E  K  ++ +  DL+ 
Sbjct: 478  EIVKILFMKGMAVSITQNLDIPTITLVGKELEIEVETAEPEA--EARKVTEMIEVGDLEH 537

Query: 478  LQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDGKLQPCVF 537
            L  RPPV+TIMGHVDHGKTTLLD IR++KVAA EAGGITQ IGAY V +  DGK Q  VF
Sbjct: 538  LLRRPPVVTIMGHVDHGKTTLLDSIRKTKVAAGEAGGITQHIGAYHVDIVHDGKEQQIVF 597

Query: 538  LDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI------IDK 597
            LDTPGHEAF AMRARGARVTDIA++VVAADDG+RPQT EAI+HA+AAGVPI      IDK
Sbjct: 598  LDTPGHEAFTAMRARGARVTDIAVLVVAADDGVRPQTVEAISHAQAAGVPIVVAINKIDK 657

Query: 598  DGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQELKANPD 657
            +GA  DRV QEL+  GL PE+WGG+  MV +SA+KG+N+D LLE ++L+AE+ EL ANPD
Sbjct: 658  EGAQPDRVKQELTQYGLTPEEWGGETIMVPVSAIKGENLDTLLEMILLVAEVGELSANPD 717

Query: 658  RSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGKRVDEAGP 717
            R+A+GTVIEA LDKAKG  AT ++QNGTL  GD+L+ G AFGKVRA+ DD G+RVD AGP
Sbjct: 718  RNARGTVIEAHLDKAKGAVATLLIQNGTLHVGDILLAGSAFGKVRAMVDDRGRRVDIAGP 777

Query: 718  SLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKVTLSSLAS 777
            S  V+V+GL+ VP AGDEFEV D+   AR  A  RA+    QR+S +   G+VTL++L++
Sbjct: 778  SFAVEVLGLSDVPAAGDEFEVFDNEKEARALASDRAD---KQRLS-RLLQGRVTLTTLSA 837

Query: 778  AVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGDVTSSDID 837
                G     +L +LN+ILK DVQGS+EAI  +L+ +PQ+ V ++ LL A G++T +DID
Sbjct: 838  QAQEG-----ELKELNLILKGDVQGSVEAIVGSLKQIPQNEVQIRMLLTAAGEITETDID 897

Query: 838  LAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLDPVEEQIP 897
            LA AS A+I+GFN       +  A+   V++R Y +IY+LI+D++ A+EGLL+P   + P
Sbjct: 898  LAAASGAVIIGFNTTFASGARQAADEAGVDVREYNIIYKLIEDIQGALEGLLEPELVEEP 957

Query: 898  MGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRVKEIVKEV 957
            +G  EVRAVF  G G VAGC V  GKL + C +RV R GK +Y G LDSL+R+K+  +EV
Sbjct: 958  LGQTEVRAVFPVGRGAVAGCYVQSGKLVRNCKVRVRRAGKVIYEGVLDSLKRMKDDAREV 1017

Query: 958  NAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTL 984
            NAG ECG+G+++++DW  GD+IE+Y  V K+RTL
Sbjct: 1018 NAGYECGIGVDKFHDWAEGDIIESYQMVTKRRTL 1034

BLAST of Cp4.1LG02g02800.1 vs. Swiss-Prot
Match: IF2_PROM5 (Translation initiation factor IF-2 OS=Prochlorococcus marinus (strain MIT 9515) GN=infB PE=3 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 1.5e-170
Identity = 323/639 (50.55%), Postives = 442/639 (69.17%), Query Frame = 1

Query: 360  SVTTARKGRKWSKAS------RKAARIQASKDAAPVKVEILEVEESGMLLEELAFNLAIS 419
            S  T R+ +K +K +      R+A  ++A+KDA  V+ E++ V E  + ++ELA  L++ 
Sbjct: 531  SQKTTRQFKKKNKETTRQRQKRRAMELRAAKDAKQVRPEMIIVPEDNLTVQELADKLSLE 590

Query: 420  EGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKK-DIFDEED 479
              EI+  L+ KGI     Q+LD   ++ + +E+ V  +  D   VEE AKK  D+ + +D
Sbjct: 591  SSEIIKSLFFKGITATVTQSLDLATIETVAEEFGVPVLQDD---VEEAAKKTVDMIETDD 650

Query: 480  LDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDGKLQP 539
            ++ L  RPPVIT+MGHVDHGKT+LLD IR S+VA+ EAGGITQ IGAY+V    + K + 
Sbjct: 651  IESLIKRPPVITVMGHVDHGKTSLLDSIRESRVASGEAGGITQHIGAYQVEFEHESKKKK 710

Query: 540  CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI------ 599
              FLDTPGHEAF AMRARG +VTD+A++VVAADDG RPQT EAI+HARAA VPI      
Sbjct: 711  LTFLDTPGHEAFTAMRARGTKVTDVAVLVVAADDGCRPQTLEAISHARAAKVPIVVAINK 770

Query: 600  IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQELKA 659
            IDK+GA+ DRV QELS   L+ EDWGGDV MV +SA+K QN+D LLE ++L++E+++L+A
Sbjct: 771  IDKEGASPDRVKQELSEKDLIAEDWGGDVVMVPVSAIKKQNIDKLLEMILLVSEVEDLQA 830

Query: 660  NPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGKRVDE 719
            NP+R AKGTVIEA LDKAKGP AT +VQNGTLK GDVL  G   GK+RA+ D+ G R+ E
Sbjct: 831  NPERLAKGTVIEAHLDKAKGPVATLLVQNGTLKAGDVLAAGSVLGKIRAMVDEHGNRIKE 890

Query: 720  AGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKVTLSS 779
            AGPS PV+ +G + VP AGDEFEV     TAR     RA    + +++ +    +V+LSS
Sbjct: 891  AGPSCPVEALGFSEVPTAGDEFEVYPDEKTARGIVGERATDARATKLAQQMASRRVSLSS 950

Query: 780  LASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGDVTSS 839
            L++  + G     +L +LN+ILK DVQGS+EAI  +L+ LP++ V ++ LL A G++T +
Sbjct: 951  LSTQANDG-----ELKELNLILKADVQGSVEAILGSLEQLPKNEVQVRVLLSAPGEITET 1010

Query: 840  DIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLDPVEE 899
            DIDLA AS ++I+GFN       K  A++  V+IR Y VIY+L++D+++AMEGLL+P   
Sbjct: 1011 DIDLAAASGSVIIGFNTSLASGAKRAADSNNVDIREYEVIYKLLEDIQSAMEGLLEPDLV 1070

Query: 900  QIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRVKEIV 959
            +  +G AEVRA F+ G G +AGC +  GKL + C +RVLR  K ++ G LDSL+R K+ V
Sbjct: 1071 EESLGQAEVRATFAVGKGAIAGCYIQSGKLQRNCSLRVLRSDKVIFEGNLDSLKRSKDDV 1130

Query: 960  KEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEE 986
            KEVN G ECGVG ++++ W  GD+IEA+  V KKRTL +
Sbjct: 1131 KEVNTGFECGVGCDKFSTWNEGDIIEAFKFVTKKRTLNK 1161

BLAST of Cp4.1LG02g02800.1 vs. Swiss-Prot
Match: IF2_SYNP2 (Translation initiation factor IF-2 OS=Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) GN=infB PE=3 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 7.4e-170
Identity = 378/836 (45.22%), Postives = 501/836 (59.93%), Query Frame = 1

Query: 176 PTTSGSSSSSNSKSNRSRSVWRKGDAVAPVQKVVAEPS-KASGKVEAKPGGASTVEPQSQ 235
           P T+ +SS +     R++S+  K  A AP +     P  +  G V  KP     +EP+  
Sbjct: 179 PATAETSSGATQADVRAKSL-SKNTAEAPAKAPKLRPKPQIVGTVSKKPTPVQAIEPELD 238

Query: 236 A----------GFRTPQPLVKPQSRLQAKPLAAPRPVLKKPILKDVGAATVTANDESNAA 295
                         TP+ L+ P  R  AKP  A  P   KP  + V         ES   
Sbjct: 239 EEPDTNNVEGDDDATPEVLLAPPKRPAAKPKKAIGP---KPSKRKVWEDEEEDESESKKT 298

Query: 296 AKTK-ERKPILIDK------YASKKPVAKPFDSEAVMAPKKPGKA---PSPGKFKDEYRK 355
             +K +R+P++ID         +      P  S ++  P KP  A   PSP K      K
Sbjct: 299 KTSKLKRRPVVIDDDDDDFGTTTNNNAEVPSVSLSIARPPKPKSASSSPSPSKPSPSKPK 358

Query: 356 KNVASGGPRRRMVDDVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEIL 415
           K  A                          R G   S  S+K  R    +D   VK    
Sbjct: 359 KPAAK-------------------------RSGSGGSGQSQKEQR----RDRPDVKTPPA 418

Query: 416 EVE-ESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDV 475
           E+     M L E+A  L I+E +I+  L+SKGI  +  QTLD D  +M+ +E+DV+ I  
Sbjct: 419 EITLTETMTLREMADILCIAETDIIRRLFSKGIAINITQTLDYDTAQMVAEEFDVKVIAP 478

Query: 476 DPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGI 535
           +     E  K  ++ D  DL+ LQ RPPV+TIMGHVDHGKTTLLD IR +KVA  EAGGI
Sbjct: 479 EVKSAAE--KSTEMLDVADLEHLQHRPPVVTIMGHVDHGKTTLLDSIRETKVAQGEAGGI 538

Query: 536 TQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTN 595
           TQ IGAY V +  +GK    VFLDTPGHEAF AMRARGA+VTDIAI+VVAADDG+RPQT 
Sbjct: 539 TQHIGAYHVDIEHNGKPGQIVFLDTPGHEAFTAMRARGAKVTDIAILVVAADDGVRPQTL 598

Query: 596 EAIAHARAAGVPII------DKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQN 655
           EAI HA+AA VPI+      DK GA  DRV QELS  GL+PE+WGG+  MV +SALKG+N
Sbjct: 599 EAIRHAQAAKVPIVVAINKMDKLGAEPDRVKQELSEQGLVPEEWGGETIMVPVSALKGEN 658

Query: 656 VDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCG 715
           +D LLE ++L++E++EL ANPDR A+GT+IEA LD+A+GP AT +VQNGTL+ GD++V G
Sbjct: 659 LDTLLEMILLVSEIEELSANPDRLARGTIIEAHLDRARGPVATLLVQNGTLRVGDIIVAG 718

Query: 716 EAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEA 775
              GK+RA+  D G++V +A PS  V+++GL+ VP AGDEFEV  S   AR  A+ RAE 
Sbjct: 719 SVMGKIRAMISDRGEKVTDATPSFAVEILGLSEVPAAGDEFEVYSSEKEARAIADERAEG 778

Query: 776 LWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLP 835
               R+       +V+LSSL++    G     +L +LN++LK DVQGS+EAI  +LQ LP
Sbjct: 779 KRQSRLQQAMSSRRVSLSSLSAQAQEG-----ELKELNLVLKADVQGSVEAILGSLQQLP 838

Query: 836 QDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIY 895
           QD V ++ LL A G+++ +D+DLA AS AII+GFN       +  AE + V+IR Y VIY
Sbjct: 839 QDEVQIRVLLSAPGEISETDVDLAAASGAIIIGFNTTLAPGARQAAEQEGVDIREYNVIY 898

Query: 896 ELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRK 955
             +++++ AMEGLLDP E + P+G AEVRAVF  G G VAGC V  GK+ +   IRV R 
Sbjct: 899 RFLEEIQGAMEGLLDPEEVEEPLGRAEVRAVFPVGRGSVAGCYVQSGKVVRNRMIRVRRG 958

Query: 956 GKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTL 984
              VY+G LDSL+RV+E V+EVN+G ECG+G+++++ W+ GD+IEAY  V K+RTL
Sbjct: 959 DVVVYDGSLDSLKRVREDVREVNSGYECGIGVDKFSTWKEGDIIEAYEMVFKRRTL 974

BLAST of Cp4.1LG02g02800.1 vs. TrEMBL
Match: A0A0A0L611_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G675260 PE=3 SV=1)

HSP 1 Score: 1682.9 bits (4357), Expect = 0.0e+00
Identity = 896/1019 (87.93%), Postives = 950/1019 (93.23%), Query Frame = 1

Query: 1    MQGTGTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSA 60
            MQGTGTMASVASLFNLSGVGV+GSSEKPRSQFRGVCLSRRGFKGSN+WYYV FPLCKYSA
Sbjct: 8    MQGTGTMASVASLFNLSGVGVVGSSEKPRSQFRGVCLSRRGFKGSNRWYYVSFPLCKYSA 67

Query: 61   TTTTDFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAW 120
            TTT DF+ADQGNAISVDSNS Y RSK++D TDFLLKPAPKPVLKAAESKP LVGLNKV W
Sbjct: 68   TTT-DFVADQGNAISVDSNS-YRRSKEDDNTDFLLKPAPKPVLKAAESKP-LVGLNKVTW 127

Query: 121  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGVDKPTTSG 180
            ESPK+NGDSN +RKLLD+EEERSK+IESLGEVLEKAEKLETPKLG  KPGRGVD PTTS 
Sbjct: 128  ESPKTNGDSNSNRKLLDDEEERSKMIESLGEVLEKAEKLETPKLGNRKPGRGVDTPTTSS 187

Query: 181  SSSSS---NSKSNRS----RSVWRKGDAVAPVQKVVAEPSKASGKVEAKPGGASTVEPQS 240
              S+S   NS +NR     +SVWRKGD VA VQK+VAEPSK   +VEAKP G S VEPQS
Sbjct: 188  LGSNSKPVNSMANRKYKTLKSVWRKGDTVASVQKIVAEPSKPKDEVEAKPRGTSKVEPQS 247

Query: 241  QAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTANDESNAAAKTKERKP 300
            +A F+ PQP VKPQ +LQ KPLAA  P+LKKP+ LKDVGAAT+TA+DE+N AAKTKERKP
Sbjct: 248  RAAFQPPQPPVKPQPKLQEKPLAATPPILKKPVVLKDVGAATMTADDETNTAAKTKERKP 307

Query: 301  ILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMV----D 360
            ILIDKYASKKPV  PF S+A++AP KP KAP PGKFKD+YRK++VASGGPRR+MV    D
Sbjct: 308  ILIDKYASKKPVVDPFISDAILAPTKPVKAPPPGKFKDDYRKRSVASGGPRRKMVGDGKD 367

Query: 361  DVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAF 420
            DVEIPDDVSIPSV+TARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELA+
Sbjct: 368  DVEIPDDVSIPSVSTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAY 427

Query: 421  NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIF 480
            NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETID+DPVKVEELAKK+DIF
Sbjct: 428  NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDIDPVKVEELAKKRDIF 487

Query: 481  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDG 540
            DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAY+V+VPLDG
Sbjct: 488  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYRVLVPLDG 547

Query: 541  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI-- 600
            KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI  
Sbjct: 548  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPIVI 607

Query: 601  ----IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQ 660
                IDKDGANADRVMQELSSIGLMPEDWGGD+PMVQISALKG NVDDLLETVMLLAELQ
Sbjct: 608  AINKIDKDGANADRVMQELSSIGLMPEDWGGDIPMVQISALKGLNVDDLLETVMLLAELQ 667

Query: 661  ELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGK 720
            ELKANPDRSAKGTVIEAGLDK+KGPFATFIVQNGTLKRGDV+VCGEAFGKVRALFDD GK
Sbjct: 668  ELKANPDRSAKGTVIEAGLDKSKGPFATFIVQNGTLKRGDVVVCGEAFGKVRALFDDSGK 727

Query: 721  RVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV 780
            RVDEAGPSLPVQVIGLNIVPIAGD FEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV
Sbjct: 728  RVDEAGPSLPVQVIGLNIVPIAGDVFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV 787

Query: 781  TLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGD 840
            TLSSLASAVSSGK+SGLDLHQLNII+KVDVQGSIEAIRQALQVLPQ+NVSLKFLLQATGD
Sbjct: 788  TLSSLASAVSSGKQSGLDLHQLNIIMKVDVQGSIEAIRQALQVLPQENVSLKFLLQATGD 847

Query: 841  VTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLD 900
            V+SSDIDLAVASKAI+LGFNVKAPGSVKSYAENK VEIRLYRVIYELIDDVRNAMEGLL+
Sbjct: 848  VSSSDIDLAVASKAIVLGFNVKAPGSVKSYAENKGVEIRLYRVIYELIDDVRNAMEGLLE 907

Query: 901  PVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRV 960
            PVEE++P+GSAEVRAVFSSGSGLVAGCMVV+GKL KGCGI+VLRKGK  Y GQLDSLRRV
Sbjct: 908  PVEEKVPIGSAEVRAVFSSGSGLVAGCMVVEGKLVKGCGIQVLRKGKIAYTGQLDSLRRV 967

Query: 961  KEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAALEEAGIGM 1002
            KEIVKEVNAGLECGVGME+Y+DWEVGD IEA+ TVQKKRTLEEASASM  ALE+AGI +
Sbjct: 968  KEIVKEVNAGLECGVGMEDYDDWEVGDAIEAFDTVQKKRTLEEASASMATALEKAGIDL 1023

BLAST of Cp4.1LG02g02800.1 vs. TrEMBL
Match: M5X751_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000701mg PE=3 SV=1)

HSP 1 Score: 1326.6 bits (3432), Expect = 0.0e+00
Identity = 727/1029 (70.65%), Postives = 844/1029 (82.02%), Query Frame = 1

Query: 5    GTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 64
            GTMASVASL +L  V ++GSSE+ RS  R V LS+   KGS +W+ V   +CK S TTT 
Sbjct: 10   GTMASVASLVSLGSVTLLGSSERSRSLVRKVSLSKASLKGSRRWHCVRLSVCKCSVTTT- 69

Query: 65   DFIADQGNAISVDSNSSYIRSKD--NDGTDFLLKPAPKPVLKAA--ESKPPLVGLNKVAW 124
            DF+A QGN +S+DSN+ Y  S D  N   DF+LKP+PKPVLK++   +  PLVG++   W
Sbjct: 70   DFVAKQGNEVSLDSNN-YRGSTDVSNANADFVLKPSPKPVLKSSGGSNNEPLVGIDAADW 129

Query: 125  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPK---LGVNKPGRGVDKPT 184
            +  + +GDS+      D +EER+KVIESLGEVLEKAEKLET +   LG  K    V+KP 
Sbjct: 130  DPSRISGDSDEE----DGDEERNKVIESLGEVLEKAEKLETSRAGELGTKKDSSSVNKPA 189

Query: 185  TSGSS----------SSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKV---EAKPG 244
             S +S          S + SKS   +SVWRKGD VA VQKVV E  K +  +   E K G
Sbjct: 190  PSNASTNLRNAKPVNSETTSKSKTLKSVWRKGDTVANVQKVVKESPKLNNTIPEEELKTG 249

Query: 245  GASTVEPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTAN-DESN 304
            G    + Q  A  R PQP ++PQ +LQAKP AAP P++KKP+ LKDVGAA  ++  DE++
Sbjct: 250  GGLKADSQPHASLRPPQPPLRPQPKLQAKPSAAPPPMVKKPVVLKDVGAAPKSSGIDETD 309

Query: 305  AAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGP 364
            ++ +TKERKPILIDK+ASKKP      S+AV+AP KPGK P PG+FKD YRKKN   G  
Sbjct: 310  SSTQTKERKPILIDKFASKKPAVDSVISQAVLAPSKPGKGPPPGRFKDGYRKKNDPGG-- 369

Query: 365  RRRMVDDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEV 424
            RRR VDD EIPD+      VSIP    ARKGRKWSKASRKAAR+QA+K+AAPVKVEILEV
Sbjct: 370  RRRKVDD-EIPDEEASELNVSIPGA--ARKGRKWSKASRKAARLQAAKEAAPVKVEILEV 429

Query: 425  EESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPV 484
             E GML+++LA+ LAI+E +ILG LY+KGIKPDGVQTLDKD+VKMICKE+DVE ID DPV
Sbjct: 430  GEDGMLIDDLAYYLAINESQILGSLYAKGIKPDGVQTLDKDMVKMICKEHDVEVIDADPV 489

Query: 485  KVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQG 544
            KVEE+AKKK+I DE+DLDKL+ RPPV+TIMGHVDHGKTTLLDYIR+SKVAASEAGGITQG
Sbjct: 490  KVEEMAKKKEILDEDDLDKLEDRPPVLTIMGHVDHGKTTLLDYIRKSKVAASEAGGITQG 549

Query: 545  IGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAI 604
            IGAYKV+VP+DGK+Q CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQT EAI
Sbjct: 550  IGAYKVLVPIDGKVQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTKEAI 609

Query: 605  AHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDD 664
            AHA+AAGVPI      IDKDGAN DRVMQELSSIGLMPEDWGGDVPMVQISALKG+N+D+
Sbjct: 610  AHAKAAGVPIVIAINKIDKDGANPDRVMQELSSIGLMPEDWGGDVPMVQISALKGKNIDE 669

Query: 665  LLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAF 724
            LLETVML+AELQ+LKANP RSAKGTVIEAGL K+KGP  T IVQNGTL+RGD++VCG AF
Sbjct: 670  LLETVMLVAELQDLKANPHRSAKGTVIEAGLHKSKGPLVTLIVQNGTLRRGDIIVCGGAF 729

Query: 725  GKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWS 784
            GKVRALFDDGG RVDEAGPS+PVQV+GLN VP+AGDEF+VV SLD AREKAE RAE+L S
Sbjct: 730  GKVRALFDDGGNRVDEAGPSIPVQVLGLNNVPVAGDEFDVVGSLDVAREKAESRAESLRS 789

Query: 785  QRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDN 844
            +RIS KAGDG+VTLSSLASAVSSGK SGLDLHQLNIILKVD+QGSIEA+RQALQVLPQDN
Sbjct: 790  ERISAKAGDGRVTLSSLASAVSSGKLSGLDLHQLNIILKVDLQGSIEAVRQALQVLPQDN 849

Query: 845  VSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELI 904
            V+LKFLL+ATGDV++SD+DLA ASKAI+ GFNVK PGSVKSY ENK VEIRLYRVIYELI
Sbjct: 850  VTLKFLLEATGDVSTSDVDLAAASKAIVFGFNVKVPGSVKSYGENKGVEIRLYRVIYELI 909

Query: 905  DDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKT 964
            DDVRNAMEGLL+PVEEQ+ +GSAEVRAVFSSGSG VAGCM+ +GK+ KGCG++V+R+GK 
Sbjct: 910  DDVRNAMEGLLEPVEEQVTIGSAEVRAVFSSGSGRVAGCMINEGKVVKGCGVQVIRRGKV 969

Query: 965  VYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASM 1000
            V+ G LDSL+RVKEIVKEVNAGLECG+G+E+Y+DWE GD++EA+ TVQKKRTLEEASASM
Sbjct: 970  VHVGLLDSLKRVKEIVKEVNAGLECGIGVEDYDDWEEGDILEAFNTVQKKRTLEEASASM 1027

BLAST of Cp4.1LG02g02800.1 vs. TrEMBL
Match: W9QCR7_9ROSA (Translation initiation factor IF-2 OS=Morus notabilis GN=L484_007400 PE=3 SV=1)

HSP 1 Score: 1302.7 bits (3370), Expect = 0.0e+00
Identity = 743/1040 (71.44%), Postives = 845/1040 (81.25%), Query Frame = 1

Query: 7    MASVASLFNLSGVGVIGSSE-KPRSQFRGVCLSRR-GFKGSNK-WYYVPFPLCKYSATTT 66
            MAS+ASL +L  V V+G SE   RS  R V LSRR  F+ +NK W+ V   +CKYS TTT
Sbjct: 1    MASMASLVSLGSVMVVGPSEISSRSLVRRVALSRRTSFRPNNKTWHCVSVSVCKYSVTTT 60

Query: 67   TDFIADQ----GNAISVDSNSSYIRSKDNDGTD----FLLKPAPKPVLKAAESKP-PLVG 126
             DF+A       NA+S+DSN+++     ND T+    F+LKP  KPVLK   SK  PL G
Sbjct: 61   -DFVASSDLGNANAVSLDSNTTFNNRPSNDSTNDQAGFVLKPPRKPVLKPPGSKDEPLSG 120

Query: 127  LNKVAWESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLE---TPKLGVNKPGR 186
            ++   W+S    GDS       D+EEERSKVIESLGEVLEKAEKLE   +  L   + G 
Sbjct: 121  MSSAGWDSSGIRGDS-------DDEEERSKVIESLGEVLEKAEKLEISTSGDLASIRNGG 180

Query: 187  GVDKPTTSGSSSSS------NSKSNRS----RSVWRKGDAVAPVQKVVAEPS--KASGKV 246
             V+KP TS SSS+S      NS +NR     +SVWRKGD+VA V+KVV +PS  K   +V
Sbjct: 181  SVNKPATSTSSSNSGNAEPLNSTTNRKAKTLKSVWRKGDSVA-VRKVVKDPSNSKPDKRV 240

Query: 247  EAKPGGASTVEPQSQAGFRT---PQPLVKPQSRLQAKPLAAPRPVLKKP-ILKDVGAATV 306
            E +       EP+SQ        PQP ++PQ +LQAKP  AP P LKKP ILKDVGAA  
Sbjct: 241  ERE-------EPKSQTPTSLRPHPQPSLRPQPKLQAKPSVAPPPTLKKPVILKDVGAAPK 300

Query: 307  T-ANDESNAAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRK 366
            +   DES    + KERKPILIDK+ASKKPV  P   EAV+AP KP K+P PGKFKDEYRK
Sbjct: 301  SQGTDES---VRKKERKPILIDKFASKKPVVDPL-IEAVLAPTKPVKSPPPGKFKDEYRK 360

Query: 367  KNVASGGPRRRMV-DDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAA 426
            KNV +GG RRRMV DDVEIPD+      VSIP    ARKGRKWSKASRKAAR+QA++DAA
Sbjct: 361  KNVPAGGSRRRMVRDDVEIPDEDSSELNVSIPGA--ARKGRKWSKASRKAARLQAARDAA 420

Query: 427  PVKVEILEVEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYD 486
            PVKVEILEV E GML+EELA++LAISEGEILGYLYSKGIKPDGVQTLD+DIVKM+CKEYD
Sbjct: 421  PVKVEILEVGEKGMLIEELAYDLAISEGEILGYLYSKGIKPDGVQTLDRDIVKMVCKEYD 480

Query: 487  VETIDVDPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAA 546
            VE ID DPVKVEE+A+KK+  D+EDLDKL+ RPPV+TIMGHVDHGKTTLLD IR+SKVA+
Sbjct: 481  VEVIDADPVKVEEMARKKEFLDDEDLDKLEDRPPVLTIMGHVDHGKTTLLDCIRKSKVAS 540

Query: 547  SEAGGITQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDG 606
            SEAGGITQGIGAYKV+VP+DGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADD 
Sbjct: 541  SEAGGITQGIGAYKVLVPIDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDS 600

Query: 607  IRPQTNEAIAHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQIS 666
            IRPQTNEAIAHA+AAGVPI      ID++GAN +RVMQELSSIGLMPEDWGGD+PMVQIS
Sbjct: 601  IRPQTNEAIAHAKAAGVPIVIAINKIDREGANPERVMQELSSIGLMPEDWGGDIPMVQIS 660

Query: 667  ALKGQNVDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRG 726
            ALKG+NV++LLETVML+AELQELKANP RSAKGTVIEAGL K+KGP  T IVQNGTLKRG
Sbjct: 661  ALKGENVNELLETVMLVAELQELKANPHRSAKGTVIEAGLHKSKGPVVTLIVQNGTLKRG 720

Query: 727  DVLVCGEAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKA 786
            D++VCGEAFGKVRALFDD G RV+EAGPS+PVQVIGLN VP++GDEFEVV SLD AREKA
Sbjct: 721  DIVVCGEAFGKVRALFDDDGNRVNEAGPSIPVQVIGLNNVPMSGDEFEVVGSLDIAREKA 780

Query: 787  ELRAEALWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQ 846
            E RAE+LW +RIS KAGDGKVTLSSLASAV++GK SGLDLHQLNII+KVDVQGSIEA+RQ
Sbjct: 781  ESRAESLWQERISAKAGDGKVTLSSLASAVAAGKLSGLDLHQLNIIMKVDVQGSIEAVRQ 840

Query: 847  ALQVLPQDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIR 906
            ALQ LPQDNV+LKFLL+ATGDV+SSD+DLAVASKAIILGFN KAPGSVKSYAENK VEIR
Sbjct: 841  ALQTLPQDNVTLKFLLEATGDVSSSDVDLAVASKAIILGFNNKAPGSVKSYAENKGVEIR 900

Query: 907  LYRVIYELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCG 966
            LYRVIYELIDDVRNAMEGLL+PVEEQ+ +GSAEVR VFSSGSG VAGCMV++GK+  GCG
Sbjct: 901  LYRVIYELIDDVRNAMEGLLEPVEEQVSIGSAEVRVVFSSGSGRVAGCMVMEGKVVTGCG 960

Query: 967  IRVLRKGKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKR 1002
            IRVLRKGK V+ G LDSLRRVKEIVKEV+ GLECG+G+E++NDWE GD IEA+ TV+K+R
Sbjct: 961  IRVLRKGKVVHVGVLDSLRRVKEIVKEVSTGLECGIGVEDFNDWEEGDTIEAFNTVEKRR 1017

BLAST of Cp4.1LG02g02800.1 vs. TrEMBL
Match: F6H4W1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00130 PE=3 SV=1)

HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 721/1026 (70.27%), Postives = 833/1026 (81.19%), Query Frame = 1

Query: 7    MASVASLFNLSGVGVIGSSEKPRSQF--RGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 66
            MAS+ASL +L   G   S     S    R V LSRR F G  +W  V   +CKYS T T 
Sbjct: 1    MASLASLVSLGSAGASSSGHFEGSLLLQRRVSLSRRNFGGGKRWGLVS--VCKYSGTMT- 60

Query: 67   DFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAWES-P 126
            + IA++GNA+SVDS++     KD D    +LKPAPKPVLK   S         V+W++  
Sbjct: 61   NVIAEEGNAVSVDSSTYRGGGKDEDN-GLVLKPAPKPVLKPVNSV--------VSWDAGS 120

Query: 127  KSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLG----------VNKPGRGV 186
            K +GDS+   KL +N +ER+KVIESLGEVLEKAEKLET +LG          V+K   G 
Sbjct: 121  KISGDSDDDEKL-ENVDERNKVIESLGEVLEKAEKLETGRLGELGSKRESGSVDKSPPGT 180

Query: 187  DKPTTSG---SSSSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKVEAK-PGGASTV 246
            +  +T G   ++S+++ KS   +SVWRKG+ VA V+KVV + S      E + P     V
Sbjct: 181  NDNSTVGRTVNNSNASKKSKTLKSVWRKGNPVATVEKVVKDASNNITNTEREGPEVGRKV 240

Query: 247  EPQSQAGFRTPQPLVKPQSRLQAKP-LAAPRPVLKKP-ILKDVGAATVTANDESNAAAKT 306
            E Q +   R  QP ++ Q +LQAKP +A P PVLKKP ILKDVGAA  ++  +   + KT
Sbjct: 241  ETQPRIPLRPTQPPLRAQPKLQAKPSVAPPPPVLKKPVILKDVGAAPKSSGIDETDSGKT 300

Query: 307  KERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMV 366
            +ERKPILIDK+ASK+PV  P  ++AV+AP KPGK P PGKFKD+YRKKN ++GG RRRMV
Sbjct: 301  RERKPILIDKFASKRPVVDPMIAQAVLAPPKPGKGPVPGKFKDDYRKKNASTGGSRRRMV 360

Query: 367  --DDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEES 426
              +D+EIPDD      VSIP   TARKGRKWSKASRKAAR+QA+KDAAPVKVEILEV E 
Sbjct: 361  AANDMEIPDDETSELNVSIPGAATARKGRKWSKASRKAARLQAAKDAAPVKVEILEVGEE 420

Query: 427  GMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVE 486
            GML E+LA+NLAISEGEILG+LYSKGIKPDGVQTLDKD+VKMICKEY+VE ID   VKVE
Sbjct: 421  GMLTEDLAYNLAISEGEILGFLYSKGIKPDGVQTLDKDMVKMICKEYEVEVIDAAGVKVE 480

Query: 487  ELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGA 546
            E+A+KK+I DEEDLDKL++RPPV+TIMGHVDHGKTTLLD+IR+SKV ASEAGGITQGIGA
Sbjct: 481  EMARKKEILDEEDLDKLENRPPVLTIMGHVDHGKTTLLDHIRKSKVTASEAGGITQGIGA 540

Query: 547  YKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHA 606
            YKV+VP+DGK Q CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHA
Sbjct: 541  YKVLVPIDGKPQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHA 600

Query: 607  RAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLE 666
            +AAGVPI      IDKDGAN +RVMQELSSIGLMPEDWGGD+PMVQISALKG+NVDDLLE
Sbjct: 601  KAAGVPIVIAINKIDKDGANPERVMQELSSIGLMPEDWGGDIPMVQISALKGENVDDLLE 660

Query: 667  TVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKV 726
            T+ML+AELQELKANPDR+AKGTVIEAGLDK+KGP ATFIVQNGTLKRGD++VCG AFGKV
Sbjct: 661  TIMLVAELQELKANPDRNAKGTVIEAGLDKSKGPVATFIVQNGTLKRGDIVVCGGAFGKV 720

Query: 727  RALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRI 786
            RALFDDGGKRVD AGPS+PVQVIGLN VPIAGDEFEVV SLD ARE+AE RAE+L  +RI
Sbjct: 721  RALFDDGGKRVDAAGPSIPVQVIGLNNVPIAGDEFEVVGSLDIARERAEARAESLRQERI 780

Query: 787  SDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSL 846
            S KAGDGKVTLSS ASAVS G +SGLDLHQLNII+KVDVQGSIEA+RQALQVLPQDNV+L
Sbjct: 781  SSKAGDGKVTLSSFASAVSGGSQSGLDLHQLNIIMKVDVQGSIEAVRQALQVLPQDNVAL 840

Query: 847  KFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDV 906
            KFLLQATGD+++SDIDLAVASKAI++GFNV+APGSVKSYA+ K VEIRLY+VIY+LIDDV
Sbjct: 841  KFLLQATGDISASDIDLAVASKAIVIGFNVRAPGSVKSYADTKGVEIRLYKVIYDLIDDV 900

Query: 907  RNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYN 966
            RNAMEGLLD VEE+I +G+AEVRA F+SGSG +AGCMV +GK+ KGCGIRV+R G+ VY 
Sbjct: 901  RNAMEGLLDAVEEEITIGTAEVRATFTSGSGRIAGCMVKEGKVEKGCGIRVVRDGRAVYV 960

Query: 967  GQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAA 1000
            G LDSLRRVKEIVKEVNAGLECG+GME+YNDWEVGD+++A+   QKKRTLEEASASM AA
Sbjct: 961  GTLDSLRRVKEIVKEVNAGLECGMGMEDYNDWEVGDIVQAFNKKQKKRTLEEASASMTAA 1013

BLAST of Cp4.1LG02g02800.1 vs. TrEMBL
Match: A5B2X2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024584 PE=3 SV=1)

HSP 1 Score: 1284.6 bits (3323), Expect = 0.0e+00
Identity = 716/1025 (69.85%), Postives = 830/1025 (80.98%), Query Frame = 1

Query: 7    MASVASLFNLSGVGVIGSSEKPRSQF--RGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 66
            MAS+ASL +L   G   S     S    R V L RR F G  +W  V   +CKYS T T 
Sbjct: 1    MASLASLVSLGSAGASSSGHFEGSLLLQRRVSLLRRNFGGGKRWGLVS--VCKYSGTMT- 60

Query: 67   DFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAWES-P 126
            + IA++GNA+SVDS++     KD D    +LKPAPKPVLK   S         V+W++  
Sbjct: 61   NVIAEEGNAVSVDSSTYRGGGKDEDN-GLVLKPAPKPVLKPVNSV--------VSWDAGS 120

Query: 127  KSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLG----------VNKPGRGV 186
            K +GDS+   KL +N +ER+KVIESLGEVLEKAEKLET +LG          V+K   G 
Sbjct: 121  KISGDSDDDEKL-ENVDERNKVIESLGEVLEKAEKLETGRLGELGSKRESGSVDKSPPGT 180

Query: 187  DKPTTSG---SSSSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKVEAKPGGASTVE 246
            +  +T G   ++S+++ KS   +SVWRKG+ VA V+KVV +   AS  +       +T  
Sbjct: 181  NDNSTVGRTVNNSNASKKSKTLKSVWRKGNPVATVEKVVKD---ASNNI------TNTER 240

Query: 247  PQSQAGFRTPQPLVKPQSRLQAKP-LAAPRPVLKKP-ILKDVGAATVTANDESNAAAKTK 306
               +   R  QP ++ Q +LQAKP +A P PVLKKP ILKDVGAA  ++  +   + KT+
Sbjct: 241  EGPEIPLRPTQPPLRAQPKLQAKPSVAPPPPVLKKPVILKDVGAAPKSSGIDETDSGKTR 300

Query: 307  ERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMV- 366
            ERKPILIDK+ASK+PV  P  ++AV+AP KPGK P PGKFKD+YRKKN ++GG RRRMV 
Sbjct: 301  ERKPILIDKFASKRPVVDPMIAQAVLAPPKPGKGPVPGKFKDDYRKKNASTGGSRRRMVA 360

Query: 367  -DDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESG 426
             +D+EIPDD      VSIP   TARKGRKWSKASRKAAR+QA+KDAAPVKVEILEV E G
Sbjct: 361  ANDMEIPDDETSELNVSIPGAATARKGRKWSKASRKAARLQAAKDAAPVKVEILEVGEEG 420

Query: 427  MLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEE 486
            ML E+LA+NLAISEGEILG+LYSKGIKPDGVQTLDKD+VKMICKEY+VE ID   VKVEE
Sbjct: 421  MLTEDLAYNLAISEGEILGFLYSKGIKPDGVQTLDKDMVKMICKEYEVEVIDAAGVKVEE 480

Query: 487  LAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAY 546
            +A+KK+I DEEDLDKL++RPPV+TIMGHVDHGKTTLLD+IR+SKV ASEAGGITQGIGAY
Sbjct: 481  MARKKEILDEEDLDKLENRPPVLTIMGHVDHGKTTLLDHIRKSKVTASEAGGITQGIGAY 540

Query: 547  KVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHAR 606
            KV+VP+DGK Q CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHA+
Sbjct: 541  KVLVPIDGKPQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHAK 600

Query: 607  AAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLET 666
            AAGVPI      IDKDGAN +RVMQELSSIGLMPEDWGGD+PMVQISALKG+NVDDLLET
Sbjct: 601  AAGVPIVIAINKIDKDGANPERVMQELSSIGLMPEDWGGDIPMVQISALKGENVDDLLET 660

Query: 667  VMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVR 726
            +ML+AELQELKANPDR+AKGTVIEAGLDK+KGP ATFIVQNGTLKRGD++VCG AFGKVR
Sbjct: 661  IMLVAELQELKANPDRNAKGTVIEAGLDKSKGPVATFIVQNGTLKRGDIVVCGGAFGKVR 720

Query: 727  ALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRIS 786
            ALFDDGGKRVD AGPS+PVQVIGLN VPIAGDEFEVV SLD ARE+AE RAE+L  +RIS
Sbjct: 721  ALFDDGGKRVDAAGPSIPVQVIGLNNVPIAGDEFEVVGSLDIARERAEARAESLRQERIS 780

Query: 787  DKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLK 846
             KAGDGKVTLSS ASAVS G +SGLDLHQLNII+KVDVQGSIEA+RQALQVLPQDNV+LK
Sbjct: 781  AKAGDGKVTLSSFASAVSGGSQSGLDLHQLNIIMKVDVQGSIEAVRQALQVLPQDNVALK 840

Query: 847  FLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVR 906
            FLLQATGD+++SDIDLAVASKAI++GFNV+APGSVKSYA+ K VEIRLY+VIY+LIDDVR
Sbjct: 841  FLLQATGDISASDIDLAVASKAIVIGFNVRAPGSVKSYADTKGVEIRLYKVIYDLIDDVR 900

Query: 907  NAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNG 966
            NAMEGLLD VEE+I +G+AEVRA F+SGSG +AGCMV +GK+ KGCGIRV+R G+ VY G
Sbjct: 901  NAMEGLLDAVEEEITIGTAEVRATFTSGSGRIAGCMVKEGKVEKGCGIRVVRDGRAVYVG 960

Query: 967  QLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAAL 1000
             LDSLRRVKE+VKEVNAGLECG+GME+YNDWEVGD+++A+   QKKRTLEEASASM AAL
Sbjct: 961  TLDSLRRVKEMVKEVNAGLECGMGMEDYNDWEVGDIVQAFNKKQKKRTLEEASASMTAAL 1003

BLAST of Cp4.1LG02g02800.1 vs. TAIR10
Match: AT1G17220.1 (AT1G17220.1 Translation initiation factor 2, small GTP-binding protein)

HSP 1 Score: 1248.8 bits (3230), Expect = 0.0e+00
Identity = 706/1035 (68.21%), Postives = 813/1035 (78.55%), Query Frame = 1

Query: 5    GTMASVASLFNLSG----VGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSA 64
            GTM S+ASL +L G    V    SS+   +  + V LSRR  KG+ KW      LC+YS 
Sbjct: 9    GTMPSLASLVSLGGACASVSGTSSSDASYALVKRVSLSRRSVKGTKKW------LCRYSV 68

Query: 65   -----TTTTDFIADQGN-AISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPL-V 124
                 TTT DFIADQ N ++S+DSNS +  SKD D ++ +LK  PKPVLK   ++    +
Sbjct: 69   SSSTTTTTADFIADQNNNSVSIDSNS-FRGSKDGDDSEVVLKQTPKPVLKPPVARVERGL 128

Query: 125  GLNKVAWESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGV 184
            G+N   W    SNG         D EEER+KVIESLGEVL+KAEKLE PK G  + G  V
Sbjct: 129  GVNTAPWSKDLSNGGK------FDGEEERNKVIESLGEVLDKAEKLEIPKPGNKEGGEAV 188

Query: 185  --DKPTTSGSSSSSNSKSNRS----------RSVWRKGDAVAPVQKVVAEPSKASGK-VE 244
               +P+ + S+S + S +N S          +SVWRKGDAVA VQKVV E  K   + V+
Sbjct: 189  KPSQPSANSSNSRNGSYANASDGGTRKTKTMKSVWRKGDAVAAVQKVVKESPKIFNRGVQ 248

Query: 245  AKPGGASTVEPQSQAG---------FRTPQPLVKPQSRLQAKPLAAPRPVLKKPILKDVG 304
             +P      E  ++AG         FR PQP V+PQ  LQ KP+ AP PV K PILKD+G
Sbjct: 249  TEPRTREEGEVNAKAGTPLAPPQPPFR-PQPPVRPQPMLQGKPMVAP-PVKKSPILKDLG 308

Query: 305  -AATVTANDESNAAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKD 364
             AA    ++E +++ K+KERKPIL+DK+ASKK    P  S+AV+AP KPGK P   KF+ 
Sbjct: 309  MAAKPLVSEEVDSSVKSKERKPILVDKFASKKKGVDPAASQAVLAPTKPGKGPPSNKFRV 368

Query: 365  EYRKKNVASGGPRRRMVDDVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVK 424
            E+R K  AS  PRRR+V + +  DD SI    + RKGRKWSKASRKA R+QA+KDAAPVK
Sbjct: 369  EHRNKKNASASPRRRIVAEDDGDDDASISR--SGRKGRKWSKASRKAVRLQAAKDAAPVK 428

Query: 425  VEILEVEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVET 484
             EILEVEE GM +E+LA+NLAI EG+ILGYLYSKGI+PDGV TLD+++VKMIC++YDVE 
Sbjct: 429  AEILEVEEEGMSIEDLAYNLAIGEGDILGYLYSKGIRPDGVHTLDREMVKMICRDYDVEV 488

Query: 485  IDVDPVKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEA 544
            +D D VKVEE+AKK+  FDEEDLDKL+ RPPVITIMGHVDHGKTTLLDYIR+SKVAASEA
Sbjct: 489  LDADSVKVEEMAKKRQTFDEEDLDKLEDRPPVITIMGHVDHGKTTLLDYIRKSKVAASEA 548

Query: 545  GGITQGIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP 604
            GGITQGIGAYKV VP+DGKLQ CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP
Sbjct: 549  GGITQGIGAYKVSVPVDGKLQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRP 608

Query: 605  QTNEAIAHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALK 664
            QTNEAIAHA+AA VPI      IDK+GA+ DRVMQELSSIGLMPEDWGGDVPMVQISALK
Sbjct: 609  QTNEAIAHAKAAAVPIVIAINKIDKEGASPDRVMQELSSIGLMPEDWGGDVPMVQISALK 668

Query: 665  GQNVDDLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVL 724
            G+NVDDLLETVML+AELQELKANP R+AKG VIEAGLDKAKGPFATFIVQ GTLKRGDV+
Sbjct: 669  GENVDDLLETVMLVAELQELKANPHRNAKGIVIEAGLDKAKGPFATFIVQKGTLKRGDVV 728

Query: 725  VCGEAFGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELR 784
            VCGEAFGKVRALFD  G+RVDEAGPS+PVQVIGLN VPIAGDEFE+V SLD ARE AE R
Sbjct: 729  VCGEAFGKVRALFDHSGERVDEAGPSIPVQVIGLNNVPIAGDEFEIVSSLDVAREMAEAR 788

Query: 785  AEALWSQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQ 844
            A +L  +RIS KAGDGKVTLSSLASAVS+ K SGLDLHQLNIILKVDVQGSIEA+RQALQ
Sbjct: 789  AVSLRDERISAKAGDGKVTLSSLASAVSAKKMSGLDLHQLNIILKVDVQGSIEAVRQALQ 848

Query: 845  VLPQDNVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYR 904
            VLPQ+NV+LKFLLQATGDV++SD+DLA AS+AI+ GFNVKA GSVK  AENK VEIRLYR
Sbjct: 849  VLPQENVTLKFLLQATGDVSNSDVDLASASEAIVFGFNVKASGSVKKAAENKGVEIRLYR 908

Query: 905  VIYELIDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRV 964
            VIYELIDDVRNAMEGLL+ VEEQIP+GSAEVRA FSSGSG VAGCMV +GK  K CGIRV
Sbjct: 909  VIYELIDDVRNAMEGLLESVEEQIPIGSAEVRATFSSGSGRVAGCMVNEGKFVKDCGIRV 968

Query: 965  LRKGKTVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLE 1000
            +RKGKTV+ G LDSL+RVKE VKEV+AGLECG+GM++Y+DW  GD+IEA+  VQK+RTLE
Sbjct: 969  VRKGKTVHVGVLDSLKRVKENVKEVSAGLECGIGMDDYDDWIEGDIIEAFNAVQKRRTLE 1026

BLAST of Cp4.1LG02g02800.1 vs. TAIR10
Match: AT4G11160.1 (AT4G11160.1 Translation initiation factor 2, small GTP-binding protein)

HSP 1 Score: 377.9 bits (969), Expect = 2.0e-104
Identity = 246/634 (38.80%), Postives = 365/634 (57.57%), Query Frame = 1

Query: 362 TTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAFNLAISEGEILGYL 421
           T   KG K+SK  +K  +        P +++ L     G  ++       I EG  L  L
Sbjct: 108 TVKTKG-KFSKREKKTDKPPVEAPYVPPRLKRLAKGLPGKTVD-------IFEGMTLLEL 167

Query: 422 YSK-GIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIFDEEDLDKLQSRP 481
             + G     +Q++  ++ +    E+D  ++DV  +   E+          +  ++  RP
Sbjct: 168 SKRTGESVAVLQSILINVGETFSSEFDTISVDVAELLAMEIGINVRRQHSTEGSEILPRP 227

Query: 482 PVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDGKLQPCVFLDTPG 541
           PV+T+MGHVDHGKT+LLD +R + VAA EAGGITQ +GA+ V +P  G      FLDTPG
Sbjct: 228 PVVTVMGHVDHGKTSLLDALRNTSVAAREAGGITQHVGAFVVGMPDSGT--SITFLDTPG 287

Query: 542 HEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPII------DKDGANA 601
           H AF  MRARGA VTDI ++VVAADDG+ PQT EAIAHAR+A VP++      DK GAN 
Sbjct: 288 HAAFSEMRARGAAVTDIVVLVVAADDGVMPQTLEAIAHARSANVPVVVAINKCDKPGANP 347

Query: 602 DRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQELKANPDRSAKG 661
           ++V  +L+S G+  ED GG+V  V++SA K   +D L E ++L A   +LKA  D  A+ 
Sbjct: 348 EKVKYQLTSEGIELEDIGGNVQAVEVSAAKSTGLDKLEEALLLQAVDMDLKARVDGPAQA 407

Query: 662 TVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGKRVDEAGPSLPVQ 721
            V+EA LDK +GP AT IV+ GTL RG  +V G  +G++RA+ D  GK  D A P++PV+
Sbjct: 408 YVVEARLDKGRGPLATIIVKAGTLVRGQHVVIGCQWGRLRAIRDMIGKTTDRATPAMPVE 467

Query: 722 VIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKVTLSSLASAVSSG 781
           + GL  +P+AGD+  VV+S + AR  +E R       R+  KA + +  L   A+ + + 
Sbjct: 468 IEGLKGLPMAGDDVIVVESEERARMLSEGRKRKYEKDRLL-KAEEAR--LEEAAARLETE 527

Query: 782 KRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGDVTSSDIDLAVAS 841
              G    +L I++K DVQG+ +A+  AL+ L    VS+  +    G ++ SD+DLA A 
Sbjct: 528 SEEGFVRVELPIVVKSDVQGTAQAVADALRTLNSPQVSVNIVHSGVGAISHSDLDLAQAC 587

Query: 842 KAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLDPVEEQIPMGSAE 901
            A I+GFNVK   +    A   +V++  +RVIY L++D+ N +      V E    G AE
Sbjct: 588 GACIVGFNVKGGSTGNLSAAQGSVKVFHHRVIYHLLEDIGNLIVEKAPGVSEMEVSGEAE 647

Query: 902 VRAVF-------SSGSGL-VAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRVKEIV 961
           V ++F       +   G+ +AGC V+DG++ +   +R+LR G+ V+ G   SL+R K+ V
Sbjct: 648 VLSIFKILGKRRTEEDGVNIAGCKVMDGRVCRSGLMRLLRSGEVVFEGSCASLKREKQDV 707

Query: 962 KEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKK 981
           ++V  G ECG+   ++ND+ VGDVI+    V +K
Sbjct: 708 EQVGKGNECGLVFGDWNDFRVGDVIQCMEPVIRK 728

BLAST of Cp4.1LG02g02800.1 vs. TAIR10
Match: AT1G76820.1 (AT1G76820.1 eukaryotic translation initiation factor 2 (eIF-2) family protein)

HSP 1 Score: 92.0 bits (227), Expect = 2.2e-18
Identity = 64/162 (39.51%), Postives = 89/162 (54.94%), Query Frame = 1

Query: 427 KPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIFDEEDLDKLQSRPPVITIM 486
           KP G+  L+   VK I +  D  T      K + LA  +  F EE  +KL+S   +  IM
Sbjct: 528 KPAGMSKLETAAVKAISEVEDAATQTKRAKKGKCLAPNE--FIEEGGEKLRSI--ICCIM 587

Query: 487 GHVDHGKTTLLDYIRRSKVAASEAGGITQGIGA------------YKVMVPLDGKLQPCV 546
           GHVD GKT LLD IR + V   EAGGITQ IGA             ++      K+   +
Sbjct: 588 GHVDSGKTKLLDCIRGTNVQEGEAGGITQQIGATYFPAKNIRERTRELKADAKLKVPGLL 647

Query: 547 FLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAI 577
            +DTPGHE+F  +R+RG+ + D+AI+VV    G++PQT E++
Sbjct: 648 VIDTPGHESFTNLRSRGSSLCDLAILVVDITHGLQPQTIESL 685

BLAST of Cp4.1LG02g02800.1 vs. TAIR10
Match: AT1G76810.1 (AT1G76810.1 eukaryotic translation initiation factor 2 (eIF-2) family protein)

HSP 1 Score: 88.6 bits (218), Expect = 2.4e-17
Identity = 55/132 (41.67%), Postives = 75/132 (56.82%), Query Frame = 1

Query: 457 KVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQG 516
           K + LA  + I  EE+L     R P+  IMGHVD GKT LLD IR + V   EAGGITQ 
Sbjct: 687 KGKGLAPSESIEGEENL-----RSPICCIMGHVDTGKTKLLDCIRGTNVQEGEAGGITQQ 746

Query: 517 IGA------------YKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAA 576
           IGA             ++      K+   + +DTPGHE+F  +R+RG+ + D+AI+VV  
Sbjct: 747 IGATYFPAENIRERTKELKADAKLKVPGLLVIDTPGHESFTNLRSRGSSLCDLAILVVDI 806

BLAST of Cp4.1LG02g02800.1 vs. TAIR10
Match: AT1G21160.1 (AT1G21160.1 eukaryotic translation initiation factor 2 (eIF-2) family protein)

HSP 1 Score: 87.8 bits (216), Expect = 4.1e-17
Identity = 57/152 (37.50%), Postives = 82/152 (53.95%), Query Frame = 1

Query: 457 KVEELAKKKDIFDEED---LDKLQS------RPPVITIMGHVDHGKTTLLDYIRRSKVAA 516
           KV++  + KD   +++   L++L        R P+  IMGHVD GKT LLD IR + V  
Sbjct: 465 KVDDTTRPKDTSKKDEGLVLNELAKEVEENLRSPICCIMGHVDSGKTKLLDCIRGTNVQE 524

Query: 517 SEAGGITQGIGA------------YKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVT 576
            EAGGITQ IGA             ++      K+   + +DTPGHE+F  +R+RG+ + 
Sbjct: 525 GEAGGITQQIGATFFPAENIRERTKELQANAKLKVPGILVIDTPGHESFTNLRSRGSNLC 584

Query: 577 DIAIIVVAADDGIRPQTNEAIAHARAAGVPII 588
           D+AI+VV    G+ PQT E++   R   V  I
Sbjct: 585 DLAILVVDIMRGLEPQTIESLNLLRRRNVKFI 616

BLAST of Cp4.1LG02g02800.1 vs. NCBI nr
Match: gi|659087091|ref|XP_008444270.1| (PREDICTED: translation initiation factor IF-2, chloroplastic [Cucumis melo])

HSP 1 Score: 1689.1 bits (4373), Expect = 0.0e+00
Identity = 901/1019 (88.42%), Postives = 951/1019 (93.33%), Query Frame = 1

Query: 1    MQGTGTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSA 60
            MQGTGTMASVASLFNL+GVGV+GSSEKPRSQFRGVCLS+RGFKGSN+WYYV FPLCKYSA
Sbjct: 8    MQGTGTMASVASLFNLAGVGVVGSSEKPRSQFRGVCLSKRGFKGSNRWYYVSFPLCKYSA 67

Query: 61   TTTTDFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAW 120
            TTTTDF+ADQGNAISVDSNS Y RSK++D TDFLLKPAPKPVLKAAESKP LVGLNKV W
Sbjct: 68   TTTTDFVADQGNAISVDSNS-YRRSKEDDNTDFLLKPAPKPVLKAAESKP-LVGLNKVTW 127

Query: 121  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGVDKPTTSG 180
            ESPK+NG+S+ + KLLDNEEERSKVIESLGEVLEKAEKLETPKLG  KPGRGVD PTTS 
Sbjct: 128  ESPKTNGNSSSNSKLLDNEEERSKVIESLGEVLEKAEKLETPKLGNRKPGRGVDTPTTSS 187

Query: 181  SSSSS---NSKSNRS----RSVWRKGDAVAPVQKVVAEPSKASGKVEAKPGGASTVEPQS 240
            S S+S   NS +NR     +SVWRKGD VA VQK VAEPSK  G+VEAKP GAS VEPQS
Sbjct: 188  SGSNSKPVNSMANRKYKTLKSVWRKGDTVASVQKTVAEPSKPKGEVEAKPRGASRVEPQS 247

Query: 241  QAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTANDESNAAAKTKERKP 300
            +A FR+PQP VKPQ +LQ KPLAA  P+LKKP+ LKDVGAATVTA+DE+N AAKTKERKP
Sbjct: 248  RAAFRSPQPPVKPQPKLQEKPLAATPPILKKPVVLKDVGAATVTADDETNTAAKTKERKP 307

Query: 301  ILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMV----D 360
            ILIDKYASKKPV  PF SEAV+AP KP K P PGKFKD+YRKK+VASGGPRRRMV    D
Sbjct: 308  ILIDKYASKKPVVDPFISEAVLAPTKPAKVPPPGKFKDDYRKKSVASGGPRRRMVGDDKD 367

Query: 361  DVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAF 420
            DVEIPDDVSIPSV+TARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELA+
Sbjct: 368  DVEIPDDVSIPSVSTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAY 427

Query: 421  NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIF 480
            +LAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKK DIF
Sbjct: 428  SLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKSDIF 487

Query: 481  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDG 540
            DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAY+V+VPLDG
Sbjct: 488  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYRVLVPLDG 547

Query: 541  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI-- 600
            KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHA+AAGVPI  
Sbjct: 548  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHAKAAGVPIVI 607

Query: 601  ----IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQ 660
                IDKDGANADRVMQELSSIGLMPEDWGGD+PMVQISALKG NVDDLLETVMLLAELQ
Sbjct: 608  AINKIDKDGANADRVMQELSSIGLMPEDWGGDIPMVQISALKGLNVDDLLETVMLLAELQ 667

Query: 661  ELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGK 720
            ELKANPDRSAKGTVIEAGLDK+KGPFATFIVQNGTLKRGDV+VCGEAFGKVRALFDD GK
Sbjct: 668  ELKANPDRSAKGTVIEAGLDKSKGPFATFIVQNGTLKRGDVVVCGEAFGKVRALFDDSGK 727

Query: 721  RVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV 780
            RVDEAGPSLPVQVIGLNIVPIAGD FEVVDSLDTAREKAELRAEAL SQRISDKAGDGKV
Sbjct: 728  RVDEAGPSLPVQVIGLNIVPIAGDVFEVVDSLDTAREKAELRAEALRSQRISDKAGDGKV 787

Query: 781  TLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGD 840
            TLSSLASAVSSGK+SGLDLHQLNII+KVDVQGSIEAIRQALQVLPQ+NVSLKFLLQATGD
Sbjct: 788  TLSSLASAVSSGKQSGLDLHQLNIIMKVDVQGSIEAIRQALQVLPQENVSLKFLLQATGD 847

Query: 841  VTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLD 900
            V+SSDIDLAVASKAI+LGFNVKA GSVKSYAENK VEIRLYRVIYELIDDVRNAMEGLL+
Sbjct: 848  VSSSDIDLAVASKAIVLGFNVKASGSVKSYAENKGVEIRLYRVIYELIDDVRNAMEGLLE 907

Query: 901  PVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRV 960
            PVEE++P+GSAEVRAVFSSGSGLVAGCMVV+GKL KGCGIRVLRKGKT Y GQLDSLRRV
Sbjct: 908  PVEEKVPIGSAEVRAVFSSGSGLVAGCMVVEGKLVKGCGIRVLRKGKTAYTGQLDSLRRV 967

Query: 961  KEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAALEEAGIGM 1002
            KEIVKEVNAGLECGVGME+Y+DWEVGDV+EA+ TVQKKRTLEEASASM  ALE+AGI +
Sbjct: 968  KEIVKEVNAGLECGVGMEDYDDWEVGDVLEAFDTVQKKRTLEEASASMATALEKAGIDL 1024

BLAST of Cp4.1LG02g02800.1 vs. NCBI nr
Match: gi|449466939|ref|XP_004151183.1| (PREDICTED: translation initiation factor IF-2, chloroplastic [Cucumis sativus])

HSP 1 Score: 1682.9 bits (4357), Expect = 0.0e+00
Identity = 896/1019 (87.93%), Postives = 950/1019 (93.23%), Query Frame = 1

Query: 1    MQGTGTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSA 60
            MQGTGTMASVASLFNLSGVGV+GSSEKPRSQFRGVCLSRRGFKGSN+WYYV FPLCKYSA
Sbjct: 8    MQGTGTMASVASLFNLSGVGVVGSSEKPRSQFRGVCLSRRGFKGSNRWYYVSFPLCKYSA 67

Query: 61   TTTTDFIADQGNAISVDSNSSYIRSKDNDGTDFLLKPAPKPVLKAAESKPPLVGLNKVAW 120
            TTT DF+ADQGNAISVDSNS Y RSK++D TDFLLKPAPKPVLKAAESKP LVGLNKV W
Sbjct: 68   TTT-DFVADQGNAISVDSNS-YRRSKEDDNTDFLLKPAPKPVLKAAESKP-LVGLNKVTW 127

Query: 121  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPKLGVNKPGRGVDKPTTSG 180
            ESPK+NGDSN +RKLLD+EEERSK+IESLGEVLEKAEKLETPKLG  KPGRGVD PTTS 
Sbjct: 128  ESPKTNGDSNSNRKLLDDEEERSKMIESLGEVLEKAEKLETPKLGNRKPGRGVDTPTTSS 187

Query: 181  SSSSS---NSKSNRS----RSVWRKGDAVAPVQKVVAEPSKASGKVEAKPGGASTVEPQS 240
              S+S   NS +NR     +SVWRKGD VA VQK+VAEPSK   +VEAKP G S VEPQS
Sbjct: 188  LGSNSKPVNSMANRKYKTLKSVWRKGDTVASVQKIVAEPSKPKDEVEAKPRGTSKVEPQS 247

Query: 241  QAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTANDESNAAAKTKERKP 300
            +A F+ PQP VKPQ +LQ KPLAA  P+LKKP+ LKDVGAAT+TA+DE+N AAKTKERKP
Sbjct: 248  RAAFQPPQPPVKPQPKLQEKPLAATPPILKKPVVLKDVGAATMTADDETNTAAKTKERKP 307

Query: 301  ILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPRRRMV----D 360
            ILIDKYASKKPV  PF S+A++AP KP KAP PGKFKD+YRK++VASGGPRR+MV    D
Sbjct: 308  ILIDKYASKKPVVDPFISDAILAPTKPVKAPPPGKFKDDYRKRSVASGGPRRKMVGDGKD 367

Query: 361  DVEIPDDVSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAF 420
            DVEIPDDVSIPSV+TARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELA+
Sbjct: 368  DVEIPDDVSIPSVSTARKGRKWSKASRKAARIQASKDAAPVKVEILEVEESGMLLEELAY 427

Query: 421  NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPVKVEELAKKKDIF 480
            NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETID+DPVKVEELAKK+DIF
Sbjct: 428  NLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDIDPVKVEELAKKRDIF 487

Query: 481  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYKVMVPLDG 540
            DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAY+V+VPLDG
Sbjct: 488  DEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQGIGAYRVLVPLDG 547

Query: 541  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI-- 600
            KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPI  
Sbjct: 548  KLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAIAHARAAGVPIVI 607

Query: 601  ----IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDDLLETVMLLAELQ 660
                IDKDGANADRVMQELSSIGLMPEDWGGD+PMVQISALKG NVDDLLETVMLLAELQ
Sbjct: 608  AINKIDKDGANADRVMQELSSIGLMPEDWGGDIPMVQISALKGLNVDDLLETVMLLAELQ 667

Query: 661  ELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAFGKVRALFDDGGK 720
            ELKANPDRSAKGTVIEAGLDK+KGPFATFIVQNGTLKRGDV+VCGEAFGKVRALFDD GK
Sbjct: 668  ELKANPDRSAKGTVIEAGLDKSKGPFATFIVQNGTLKRGDVVVCGEAFGKVRALFDDSGK 727

Query: 721  RVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV 780
            RVDEAGPSLPVQVIGLNIVPIAGD FEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV
Sbjct: 728  RVDEAGPSLPVQVIGLNIVPIAGDVFEVVDSLDTAREKAELRAEALWSQRISDKAGDGKV 787

Query: 781  TLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDNVSLKFLLQATGD 840
            TLSSLASAVSSGK+SGLDLHQLNII+KVDVQGSIEAIRQALQVLPQ+NVSLKFLLQATGD
Sbjct: 788  TLSSLASAVSSGKQSGLDLHQLNIIMKVDVQGSIEAIRQALQVLPQENVSLKFLLQATGD 847

Query: 841  VTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELIDDVRNAMEGLLD 900
            V+SSDIDLAVASKAI+LGFNVKAPGSVKSYAENK VEIRLYRVIYELIDDVRNAMEGLL+
Sbjct: 848  VSSSDIDLAVASKAIVLGFNVKAPGSVKSYAENKGVEIRLYRVIYELIDDVRNAMEGLLE 907

Query: 901  PVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKTVYNGQLDSLRRV 960
            PVEE++P+GSAEVRAVFSSGSGLVAGCMVV+GKL KGCGI+VLRKGK  Y GQLDSLRRV
Sbjct: 908  PVEEKVPIGSAEVRAVFSSGSGLVAGCMVVEGKLVKGCGIQVLRKGKIAYTGQLDSLRRV 967

Query: 961  KEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASMEAALEEAGIGM 1002
            KEIVKEVNAGLECGVGME+Y+DWEVGD IEA+ TVQKKRTLEEASASM  ALE+AGI +
Sbjct: 968  KEIVKEVNAGLECGVGMEDYDDWEVGDAIEAFDTVQKKRTLEEASASMATALEKAGIDL 1023

BLAST of Cp4.1LG02g02800.1 vs. NCBI nr
Match: gi|1009141746|ref|XP_015888355.1| (PREDICTED: translation initiation factor IF-2, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 1343.9 bits (3477), Expect = 0.0e+00
Identity = 737/1032 (71.41%), Postives = 852/1032 (82.56%), Query Frame = 1

Query: 5    GTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 64
            GTMAS+ SL +L  V V+GSSE+ RS  R V LS+R F+G  +W+YV F +CKYS TTT 
Sbjct: 10   GTMASLTSLVSLGSVMVVGSSERSRSLVRSVSLSKRSFRGKRRWHYVSFSVCKYSVTTT- 69

Query: 65   DFIADQGNAISVDSNSSYIRSKD-NDGTDFLLKPAPKPVLKAAESKP-PLVGLNKVAWES 124
            DF+A+ GNA+S+DSN+ Y   KD N G DF+LKPAPKP+LK+A S   PL+ +N +AW+ 
Sbjct: 70   DFVAEHGNAVSLDSNT-YSGGKDTNGGADFVLKPAPKPLLKSAGSNAEPLLDMNPMAWDP 129

Query: 125  PKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPK---LGVNKPGRGVDKPTTS 184
             + +GDS+     L  EEERSKVIE+LGEVLEKAEKLET K   +G  K     +K + S
Sbjct: 130  SRISGDSDDEE--LGTEEERSKVIETLGEVLEKAEKLETSKPSQVGSKKDSVSENKHSPS 189

Query: 185  GSSSSS-----------NSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKV---EAKPGG 244
             +SS++           N KS   +SVWRKGD VA VQKVV E SK  GK+   E   G 
Sbjct: 190  NNSSTNSRNLEPVNSARNRKSKTLKSVWRKGDTVANVQKVVRESSKPIGKIDKEEPNSGE 249

Query: 245  ASTVEPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKP-ILKDVGAATVTAN-DESNA 304
               VE Q+    + PQ  ++PQ +LQ KP  AP PV+KKP ILKDVGA+  +A   ++++
Sbjct: 250  KLKVESQAVGPLKPPQQPLRPQPKLQGKPSIAPPPVIKKPVILKDVGASPRSAVVRKTDS 309

Query: 305  AAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGPR 364
            ++KT+ERKPILIDK+A KKPV  P  ++AV+AP KP K P  GKFKDEYRKKNV +G  R
Sbjct: 310  SSKTQERKPILIDKFAPKKPVVDPLIAQAVLAPTKPAKGPPHGKFKDEYRKKNVPAGS-R 369

Query: 365  RRMVDD--VEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILE 424
            RR+VDD  VEI D+      VSI    TARKGRKWSKASRKAAR+QA+K+AAPVKVEILE
Sbjct: 370  RRLVDDDDVEIHDEETSELNVSIRGAATARKGRKWSKASRKAARLQAAKEAAPVKVEILE 429

Query: 425  VEESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDP 484
            V E GML+EELA+NL ISEGEILG LYSKGIKPDGVQTLD+DIVKM+CKEYDVE +D D 
Sbjct: 430  VGEKGMLIEELAYNLTISEGEILGLLYSKGIKPDGVQTLDRDIVKMVCKEYDVEVLDADS 489

Query: 485  VKVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQ 544
            VKVEE+A+KK+I DE+DLDKL+ RPPVITIMGHVDHGKTTLLDYIR++KVAASEAGGITQ
Sbjct: 490  VKVEEMARKKEILDEDDLDKLEDRPPVITIMGHVDHGKTTLLDYIRKTKVAASEAGGITQ 549

Query: 545  GIGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEA 604
            GIGAYKV+VP+DGKL+PCVFLDTPGH+AFGAMRARGARVTDIAIIVVAADDGIRPQTNEA
Sbjct: 550  GIGAYKVLVPIDGKLRPCVFLDTPGHQAFGAMRARGARVTDIAIIVVAADDGIRPQTNEA 609

Query: 605  IAHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVD 664
            IAHA+AAGVPI      ID+DGAN +RVMQELSSIGLMPEDWGG+ PMVQISALKGQN++
Sbjct: 610  IAHAKAAGVPIVIAINKIDRDGANPERVMQELSSIGLMPEDWGGNTPMVQISALKGQNIN 669

Query: 665  DLLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEA 724
            DLLET+ML+AELQELKANPDR+AKGTVIEAGL K+KGP ATFIVQNGTLKRGD++VCGEA
Sbjct: 670  DLLETIMLVAELQELKANPDRTAKGTVIEAGLHKSKGPLATFIVQNGTLKRGDIVVCGEA 729

Query: 725  FGKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALW 784
            FGKVRALFDDGG RVDEAGPS+PVQVIGLN VPIAGDEFEVVDSLD AREKAE  A++L 
Sbjct: 730  FGKVRALFDDGGNRVDEAGPSIPVQVIGLNNVPIAGDEFEVVDSLDVAREKAESCAQSLR 789

Query: 785  SQRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQD 844
            + RIS KAGDGKVTLSSLASAVS+GK SGLDLHQLNII+KVD+QGSIEA+RQALQ LPQD
Sbjct: 790  NARISAKAGDGKVTLSSLASAVSAGKLSGLDLHQLNIIMKVDLQGSIEAVRQALQELPQD 849

Query: 845  NVSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYEL 904
            NV+LKFLL+ATGDV++SD+DLA ASKAIILGFNVKAPGSVKSYA+ K +EIRLYRVIYEL
Sbjct: 850  NVTLKFLLEATGDVSTSDVDLAAASKAIILGFNVKAPGSVKSYADKKGIEIRLYRVIYEL 909

Query: 905  IDDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGK 964
            IDDVRNAMEGLL+PVEEQ  +GSAEVRAVFSSGSG VAGCMV +GK+ KGCG RV+RKGK
Sbjct: 910  IDDVRNAMEGLLEPVEEQETIGSAEVRAVFSSGSGRVAGCMVTEGKVVKGCGFRVIRKGK 969

Query: 965  TVYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASAS 1002
             ++ G +DSLRRVKEIVKEVN GLECG+G E+YNDWE GD+IEA+  VQK+RTLEEASAS
Sbjct: 970  VIHTGTIDSLRRVKEIVKEVNTGLECGIGSEDYNDWEEGDMIEAFEMVQKRRTLEEASAS 1029

BLAST of Cp4.1LG02g02800.1 vs. NCBI nr
Match: gi|595894839|ref|XP_007213706.1| (hypothetical protein PRUPE_ppa000701mg [Prunus persica])

HSP 1 Score: 1326.6 bits (3432), Expect = 0.0e+00
Identity = 727/1029 (70.65%), Postives = 844/1029 (82.02%), Query Frame = 1

Query: 5    GTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 64
            GTMASVASL +L  V ++GSSE+ RS  R V LS+   KGS +W+ V   +CK S TTT 
Sbjct: 10   GTMASVASLVSLGSVTLLGSSERSRSLVRKVSLSKASLKGSRRWHCVRLSVCKCSVTTT- 69

Query: 65   DFIADQGNAISVDSNSSYIRSKD--NDGTDFLLKPAPKPVLKAA--ESKPPLVGLNKVAW 124
            DF+A QGN +S+DSN+ Y  S D  N   DF+LKP+PKPVLK++   +  PLVG++   W
Sbjct: 70   DFVAKQGNEVSLDSNN-YRGSTDVSNANADFVLKPSPKPVLKSSGGSNNEPLVGIDAADW 129

Query: 125  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPK---LGVNKPGRGVDKPT 184
            +  + +GDS+      D +EER+KVIESLGEVLEKAEKLET +   LG  K    V+KP 
Sbjct: 130  DPSRISGDSDEE----DGDEERNKVIESLGEVLEKAEKLETSRAGELGTKKDSSSVNKPA 189

Query: 185  TSGSS----------SSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASGKV---EAKPG 244
             S +S          S + SKS   +SVWRKGD VA VQKVV E  K +  +   E K G
Sbjct: 190  PSNASTNLRNAKPVNSETTSKSKTLKSVWRKGDTVANVQKVVKESPKLNNTIPEEELKTG 249

Query: 245  GASTVEPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTAN-DESN 304
            G    + Q  A  R PQP ++PQ +LQAKP AAP P++KKP+ LKDVGAA  ++  DE++
Sbjct: 250  GGLKADSQPHASLRPPQPPLRPQPKLQAKPSAAPPPMVKKPVVLKDVGAAPKSSGIDETD 309

Query: 305  AAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGP 364
            ++ +TKERKPILIDK+ASKKP      S+AV+AP KPGK P PG+FKD YRKKN   G  
Sbjct: 310  SSTQTKERKPILIDKFASKKPAVDSVISQAVLAPSKPGKGPPPGRFKDGYRKKNDPGG-- 369

Query: 365  RRRMVDDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEV 424
            RRR VDD EIPD+      VSIP    ARKGRKWSKASRKAAR+QA+K+AAPVKVEILEV
Sbjct: 370  RRRKVDD-EIPDEEASELNVSIPGA--ARKGRKWSKASRKAARLQAAKEAAPVKVEILEV 429

Query: 425  EESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPV 484
             E GML+++LA+ LAI+E +ILG LY+KGIKPDGVQTLDKD+VKMICKE+DVE ID DPV
Sbjct: 430  GEDGMLIDDLAYYLAINESQILGSLYAKGIKPDGVQTLDKDMVKMICKEHDVEVIDADPV 489

Query: 485  KVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQG 544
            KVEE+AKKK+I DE+DLDKL+ RPPV+TIMGHVDHGKTTLLDYIR+SKVAASEAGGITQG
Sbjct: 490  KVEEMAKKKEILDEDDLDKLEDRPPVLTIMGHVDHGKTTLLDYIRKSKVAASEAGGITQG 549

Query: 545  IGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAI 604
            IGAYKV+VP+DGK+Q CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQT EAI
Sbjct: 550  IGAYKVLVPIDGKVQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTKEAI 609

Query: 605  AHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDD 664
            AHA+AAGVPI      IDKDGAN DRVMQELSSIGLMPEDWGGDVPMVQISALKG+N+D+
Sbjct: 610  AHAKAAGVPIVIAINKIDKDGANPDRVMQELSSIGLMPEDWGGDVPMVQISALKGKNIDE 669

Query: 665  LLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAF 724
            LLETVML+AELQ+LKANP RSAKGTVIEAGL K+KGP  T IVQNGTL+RGD++VCG AF
Sbjct: 670  LLETVMLVAELQDLKANPHRSAKGTVIEAGLHKSKGPLVTLIVQNGTLRRGDIIVCGGAF 729

Query: 725  GKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWS 784
            GKVRALFDDGG RVDEAGPS+PVQV+GLN VP+AGDEF+VV SLD AREKAE RAE+L S
Sbjct: 730  GKVRALFDDGGNRVDEAGPSIPVQVLGLNNVPVAGDEFDVVGSLDVAREKAESRAESLRS 789

Query: 785  QRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDN 844
            +RIS KAGDG+VTLSSLASAVSSGK SGLDLHQLNIILKVD+QGSIEA+RQALQVLPQDN
Sbjct: 790  ERISAKAGDGRVTLSSLASAVSSGKLSGLDLHQLNIILKVDLQGSIEAVRQALQVLPQDN 849

Query: 845  VSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELI 904
            V+LKFLL+ATGDV++SD+DLA ASKAI+ GFNVK PGSVKSY ENK VEIRLYRVIYELI
Sbjct: 850  VTLKFLLEATGDVSTSDVDLAAASKAIVFGFNVKVPGSVKSYGENKGVEIRLYRVIYELI 909

Query: 905  DDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKT 964
            DDVRNAMEGLL+PVEEQ+ +GSAEVRAVFSSGSG VAGCM+ +GK+ KGCG++V+R+GK 
Sbjct: 910  DDVRNAMEGLLEPVEEQVTIGSAEVRAVFSSGSGRVAGCMINEGKVVKGCGVQVIRRGKV 969

Query: 965  VYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASM 1000
            V+ G LDSL+RVKEIVKEVNAGLECG+G+E+Y+DWE GD++EA+ TVQKKRTLEEASASM
Sbjct: 970  VHVGLLDSLKRVKEIVKEVNAGLECGIGVEDYDDWEEGDILEAFNTVQKKRTLEEASASM 1027

BLAST of Cp4.1LG02g02800.1 vs. NCBI nr
Match: gi|645239749|ref|XP_008226278.1| (PREDICTED: translation initiation factor IF-2, chloroplastic [Prunus mume])

HSP 1 Score: 1325.1 bits (3428), Expect = 0.0e+00
Identity = 726/1029 (70.55%), Postives = 844/1029 (82.02%), Query Frame = 1

Query: 5    GTMASVASLFNLSGVGVIGSSEKPRSQFRGVCLSRRGFKGSNKWYYVPFPLCKYSATTTT 64
            GTMASVASL +L  V ++GSSE+ RS  R V LS+   KGS +W+ V   +CK S TTT 
Sbjct: 10   GTMASVASLVSLGSVTLLGSSERSRSLVRKVSLSKASLKGSRRWHCVRLSVCKCSVTTT- 69

Query: 65   DFIADQGNAISVDSNSSYIRSKD--NDGTDFLLKPAPKPVLKAA--ESKPPLVGLNKVAW 124
            DF+A QGN +S+DSN+ YI S D  N   DF+LKP+PKPVLK++   +  PLVG++   W
Sbjct: 70   DFVAKQGNEVSLDSNN-YIGSTDVSNANADFVLKPSPKPVLKSSGGSNNEPLVGIDAADW 129

Query: 125  ESPKSNGDSNGSRKLLDNEEERSKVIESLGEVLEKAEKLETPK---LGVNKPGRGVDKPT 184
            +  + +GDS+      + EE+R+KVIESLGEVLEKAEKLET +   LG  K    V+KP 
Sbjct: 130  DPSRISGDSDEE----EGEEDRNKVIESLGEVLEKAEKLETSRAGELGTKKDSSSVNKPA 189

Query: 185  TSGSS----------SSSNSKSNRSRSVWRKGDAVAPVQKVVAEPSKASG---KVEAKPG 244
             S +S          S ++SKS   +SVWRKGD V+ VQKVV E  K +    K E K G
Sbjct: 190  PSNASTNLQNAKPVNSETSSKSKTLKSVWRKGDTVSTVQKVVKESPKLNNTIPKEELKTG 249

Query: 245  GASTVEPQSQAGFRTPQPLVKPQSRLQAKPLAAPRPVLKKPI-LKDVGAATVTAN-DESN 304
            G    E Q  A  R PQP ++PQ +LQAKP AAP P +KKP+ LKDVGAA  ++  DE++
Sbjct: 250  GGLKAEAQPHASLRPPQPPLRPQPKLQAKPSAAPPPTVKKPVVLKDVGAAPKSSGIDETD 309

Query: 305  AAAKTKERKPILIDKYASKKPVAKPFDSEAVMAPKKPGKAPSPGKFKDEYRKKNVASGGP 364
            ++ +TKERKPILIDK+ASKKP      S+AV+AP KPGK P PG+FKD YRKKN   G  
Sbjct: 310  SSTQTKERKPILIDKFASKKPAVDSVISQAVLAPSKPGKGPPPGRFKDGYRKKNDPGG-- 369

Query: 365  RRRMVDDVEIPDD------VSIPSVTTARKGRKWSKASRKAARIQASKDAAPVKVEILEV 424
            RRR VDD EIPD+      VSIP    ARKGRKWSKASRKAAR+QA+K+AAPVKVEILEV
Sbjct: 370  RRRKVDD-EIPDEEASELNVSIPGA--ARKGRKWSKASRKAARLQAAKEAAPVKVEILEV 429

Query: 425  EESGMLLEELAFNLAISEGEILGYLYSKGIKPDGVQTLDKDIVKMICKEYDVETIDVDPV 484
             E GML+++LA+ LAI+E +ILG LY+KGIKPDGVQTLDKD+VKMICKE+DVE ID DPV
Sbjct: 430  GEDGMLIDDLAYYLAINESQILGSLYAKGIKPDGVQTLDKDMVKMICKEHDVEVIDADPV 489

Query: 485  KVEELAKKKDIFDEEDLDKLQSRPPVITIMGHVDHGKTTLLDYIRRSKVAASEAGGITQG 544
            KVEE+AKKK+I DE+DLDKL+ RPPV+TIMGHVDHGKTTLLDYIR+SKVAASEAGGITQG
Sbjct: 490  KVEEMAKKKEILDEDDLDKLEDRPPVLTIMGHVDHGKTTLLDYIRKSKVAASEAGGITQG 549

Query: 545  IGAYKVMVPLDGKLQPCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTNEAI 604
            IGAYKV+VP+DGK+Q CVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQT EAI
Sbjct: 550  IGAYKVLVPIDGKVQSCVFLDTPGHEAFGAMRARGARVTDIAIIVVAADDGIRPQTKEAI 609

Query: 605  AHARAAGVPI------IDKDGANADRVMQELSSIGLMPEDWGGDVPMVQISALKGQNVDD 664
            AHA+AAGVPI      IDKDGAN DRVMQELSSIGLMPEDWGGDVPMVQISALKG+N+D+
Sbjct: 610  AHAKAAGVPIVIAINKIDKDGANPDRVMQELSSIGLMPEDWGGDVPMVQISALKGKNIDE 669

Query: 665  LLETVMLLAELQELKANPDRSAKGTVIEAGLDKAKGPFATFIVQNGTLKRGDVLVCGEAF 724
            LLETVML+AELQ+LKANP RSAKGTVIEAGL K+KGP  T IVQNGTL+RGD++VCG +F
Sbjct: 670  LLETVMLVAELQDLKANPHRSAKGTVIEAGLHKSKGPLVTLIVQNGTLRRGDIIVCGGSF 729

Query: 725  GKVRALFDDGGKRVDEAGPSLPVQVIGLNIVPIAGDEFEVVDSLDTAREKAELRAEALWS 784
            GKVRALFDDGG RVDEAGPS+PVQV+GLN VP+AGDEF+VV SLD AREKAE RAE+L +
Sbjct: 730  GKVRALFDDGGNRVDEAGPSMPVQVLGLNNVPVAGDEFDVVGSLDVAREKAESRAESLRN 789

Query: 785  QRISDKAGDGKVTLSSLASAVSSGKRSGLDLHQLNIILKVDVQGSIEAIRQALQVLPQDN 844
            +RIS KAGDG+VTLSSLASAVSSGK SGLDLHQLNIILKVD+QGSIEA+RQALQVLPQDN
Sbjct: 790  ERISAKAGDGRVTLSSLASAVSSGKLSGLDLHQLNIILKVDLQGSIEAVRQALQVLPQDN 849

Query: 845  VSLKFLLQATGDVTSSDIDLAVASKAIILGFNVKAPGSVKSYAENKAVEIRLYRVIYELI 904
            V+LKFLL+ATGDV++SD+DLA ASKAII GFNVK PGSVKSY ENK VEIRLYRVIYELI
Sbjct: 850  VTLKFLLEATGDVSTSDVDLAAASKAIIFGFNVKVPGSVKSYGENKGVEIRLYRVIYELI 909

Query: 905  DDVRNAMEGLLDPVEEQIPMGSAEVRAVFSSGSGLVAGCMVVDGKLAKGCGIRVLRKGKT 964
            DDVRNAMEGLL+PVEEQ+ +GSAEVRAVFSSGSG VAGCM+ +GK+ KGCG++V+R+GK 
Sbjct: 910  DDVRNAMEGLLEPVEEQVTIGSAEVRAVFSSGSGRVAGCMINEGKVVKGCGVQVIRRGKV 969

Query: 965  VYNGQLDSLRRVKEIVKEVNAGLECGVGMEEYNDWEVGDVIEAYYTVQKKRTLEEASASM 1000
            V+ G LDSL+RVKEIVKEVN GLECG+G+E+Y+DWE GD++EA+ TVQKKRTLEEASASM
Sbjct: 970  VHVGLLDSLKRVKEIVKEVNTGLECGIGVEDYDDWEEGDILEAFNTVQKKRTLEEASASM 1027

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
IF2C_ARATH0.0e+0068.21Translation initiation factor IF-2, chloroplastic OS=Arabidopsis thaliana GN=At1... [more]
IF2C_PHAVU0.0e+0067.16Translation initiation factor IF-2, chloroplastic OS=Phaseolus vulgaris GN=IF2CP... [more]
IF2_ANAVT7.9e-17252.68Translation initiation factor IF-2 OS=Anabaena variabilis (strain ATCC 29413 / P... [more]
IF2_PROM51.5e-17050.55Translation initiation factor IF-2 OS=Prochlorococcus marinus (strain MIT 9515) ... [more]
IF2_SYNP27.4e-17045.22Translation initiation factor IF-2 OS=Synechococcus sp. (strain ATCC 27264 / PCC... [more]
Match NameE-valueIdentityDescription
A0A0A0L611_CUCSA0.0e+0087.93Uncharacterized protein OS=Cucumis sativus GN=Csa_4G675260 PE=3 SV=1[more]
M5X751_PRUPE0.0e+0070.65Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000701mg PE=3 SV=1[more]
W9QCR7_9ROSA0.0e+0071.44Translation initiation factor IF-2 OS=Morus notabilis GN=L484_007400 PE=3 SV=1[more]
F6H4W1_VITVI0.0e+0070.27Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00130 PE=3 SV=... [more]
A5B2X2_VITVI0.0e+0069.85Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024584 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G17220.10.0e+0068.21 Translation initiation factor 2, small GTP-binding protein[more]
AT4G11160.12.0e-10438.80 Translation initiation factor 2, small GTP-binding protein[more]
AT1G76820.12.2e-1839.51 eukaryotic translation initiation factor 2 (eIF-2) family protein[more]
AT1G76810.12.4e-1741.67 eukaryotic translation initiation factor 2 (eIF-2) family protein[more]
AT1G21160.14.1e-1737.50 eukaryotic translation initiation factor 2 (eIF-2) family protein[more]
Match NameE-valueIdentityDescription
gi|659087091|ref|XP_008444270.1|0.0e+0088.42PREDICTED: translation initiation factor IF-2, chloroplastic [Cucumis melo][more]
gi|449466939|ref|XP_004151183.1|0.0e+0087.93PREDICTED: translation initiation factor IF-2, chloroplastic [Cucumis sativus][more]
gi|1009141746|ref|XP_015888355.1|0.0e+0071.41PREDICTED: translation initiation factor IF-2, chloroplastic [Ziziphus jujuba][more]
gi|595894839|ref|XP_007213706.1|0.0e+0070.65hypothetical protein PRUPE_ppa000701mg [Prunus persica][more]
gi|645239749|ref|XP_008226278.1|0.0e+0070.55PREDICTED: translation initiation factor IF-2, chloroplastic [Prunus mume][more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0003924GTPase activity
GO:0005525GTP binding
GO:0003743translation initiation factor activity
Vocabulary: Biological Process
TermDefinition
GO:0006413translational initiation
Vocabulary: Cellular Component
TermDefinition
GO:0005622intracellular
Vocabulary: INTERPRO
TermDefinition
IPR027417P-loop_NTPase
IPR023115TIF_IF2_dom3
IPR009000Transl_B-barrel_sf
IPR005225Small_GTP-bd_dom
IPR000795TF_GTP-bd_dom
IPR000178TF_IF2_bacterial-like
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006413 translational initiation
biological_process GO:0051234 establishment of localization
biological_process GO:0006796 phosphate-containing compound metabolic process
biological_process GO:0009657 plastid organization
biological_process GO:0006446 regulation of translational initiation
biological_process GO:0016070 RNA metabolic process
biological_process GO:0044711 single-organism biosynthetic process
biological_process GO:0044763 single-organism cellular process
cellular_component GO:0005622 intracellular
cellular_component GO:0044434 chloroplast part
cellular_component GO:0005840 ribosome
molecular_function GO:0003924 GTPase activity
molecular_function GO:0005525 GTP binding
molecular_function GO:0003743 translation initiation factor activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG02g02800Cp4.1LG02g02800gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g02800.1:five_prime_utr:002Cp4.1LG02g02800.1:five_prime_utr:002five_prime_UTR
Cp4.1LG02g02800.1:five_prime_utr:001Cp4.1LG02g02800.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g02800.1:cds:012Cp4.1LG02g02800.1:cds:012CDS
Cp4.1LG02g02800.1:cds:011Cp4.1LG02g02800.1:cds:011CDS
Cp4.1LG02g02800.1:cds:010Cp4.1LG02g02800.1:cds:010CDS
Cp4.1LG02g02800.1:cds:009Cp4.1LG02g02800.1:cds:009CDS
Cp4.1LG02g02800.1:cds:008Cp4.1LG02g02800.1:cds:008CDS
Cp4.1LG02g02800.1:cds:007Cp4.1LG02g02800.1:cds:007CDS
Cp4.1LG02g02800.1:cds:006Cp4.1LG02g02800.1:cds:006CDS
Cp4.1LG02g02800.1:cds:005Cp4.1LG02g02800.1:cds:005CDS
Cp4.1LG02g02800.1:cds:004Cp4.1LG02g02800.1:cds:004CDS
Cp4.1LG02g02800.1:cds:003Cp4.1LG02g02800.1:cds:003CDS
Cp4.1LG02g02800.1:cds:002Cp4.1LG02g02800.1:cds:002CDS
Cp4.1LG02g02800.1:cds:001Cp4.1LG02g02800.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g02800.1:three_prime_utr:001Cp4.1LG02g02800.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG02g02800.1Cp4.1LG02g02800.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000178Translation initiation factor aIF-2, bacterial-likeHAMAPMF_00100_BIF_2_Bcoord: 4..981
score: 15
IPR000178Translation initiation factor aIF-2, bacterial-likeTIGRFAMsTIGR00487TIGR00487coord: 402..983
score: 1.9E
IPR000178Translation initiation factor aIF-2, bacterial-likePROSITEPS01176IF2coord: 934..956
scor
IPR000795Transcription factor, GTP-binding domainPRINTSPR00315ELONGATNFCTcoord: 482..495
score: 5.2E-6coord: 550..561
score: 5.2E-6coord: 534..544
score: 5.
IPR000795Transcription factor, GTP-binding domainPROFILEPS51722G_TR_2coord: 478..645
score: 35
IPR005225Small GTP-binding protein domainTIGRFAMsTIGR00231TIGR00231coord: 480..635
score: 2.3
IPR009000Translation protein, beta-barrel domainunknownSSF50447Translation proteinscoord: 648..731
score: 2.5E-25coord: 889..981
score: 6.03
IPR023115Translation initiation factor IF- 2, domain 3GENE3DG3DSA:3.40.50.10050coord: 784..863
score: 4.7
IPR023115Translation initiation factor IF- 2, domain 3PFAMPF11987IF-2coord: 782..873
score: 2.0
IPR023115Translation initiation factor IF- 2, domain 3unknownSSF52156Initiation factor IF2/eIF5b, domain 3coord: 779..891
score: 8.63
IPR027417P-loop containing nucleoside triphosphate hydrolaseGENE3DG3DSA:3.40.50.300coord: 472..636
score: 5.7
IPR027417P-loop containing nucleoside triphosphate hydrolaseunknownSSF52540P-loop containing nucleoside triphosphate hydrolasescoord: 468..648
score: 5.57
NoneNo IPR availableGENE3DG3DSA:2.40.30.10coord: 647..739
score: 1.1E-20coord: 889..982
score: 8.5
NoneNo IPR availablePANTHERPTHR23115TRANSLATION FACTORcoord: 456..998
score:
NoneNo IPR availablePANTHERPTHR23115:SF138TRANSLATION INITIATION FACTOR IF-2, CHLOROPLASTICcoord: 456..998
score:
NoneNo IPR availablePFAMPF00009GTP_EFTUcoord: 483..637
score: 1.1