Cp4.1LG03g05780 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g05780
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionAAR2 protein family
LocationCp4.1LG03 : 4528073 .. 4544308 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATAGTTTCGAGAGTCTTCTCGACTTCAAAACAAGGAAGAACCCTGAATGTCTTCTATTCCTCCTCCAGGGGAGAGCGTTTAAGCGAAGAAATCAGTGTTCCGCGATAAATTTCAGTAGAAACTAGAGGAATTTGAACTTCAATTTTCGTGTCTGCTGTGTTTTTTTGAAGCTGAGATGGATCCTGAAACTGCCCTACAGCTTGTAAAGCACGGTGCGACGATTCTCCTCCTCGACGTTCCTCAGTACACGCTTATTGGAATTGATACTCAGGTCAGTCAGATAAAGGCATCTTCTGTTTTTTTGTTTGTTTTTCTTTCTCTGAAGAAGTTTCGAGGTACTGTTGGGGAAGGAATAGATGTTTAATCATTCTAGCTAAGGACAAACTGCTATTGCTGCTACTGGCTGAAATTTGTGACTGCAATGTTAGGTTCTGTGGAACTTGAACGAAAAATTGATTCTCTGTTACACGTTACTTCTCTATTTCCAGTAATTTTTATCTGAAATTGGTCAAAATCTGTACGCGCTTCTTGTTCGGTTGGCTGAAGAGCCACCTTATTTCATAGTTTTAGACGCTAATACGTCAATATTTTTTGCCGCGTATTGTTTTCAGTGTTTTGTTTTTGTGGTGGATAGTAGTCCTCCATATTGCTGGCTGAAAAGTCATGTTACGGCCTGATTTCATAGTTTTAACTGCTAATACTTCAGAATTTTCGATTAAATATTGTTTCACGTTACTCTTACTCCTCCCCTTCCAATTAATGTGAGAAGAAAATGGCCAAAACCACCAAAAGTGTTTTGTTGGGCTAGTAGAAAAGCCAAATCATGAGTCGATTTCATAGCCATAGCAGCTAATTCTTGGAATTTTTGATTAAATCTTGTATTCAACGTTTTGGGCAATGTTTCTTCTCACCGTTTATGTCACTCCAGTATTCCTATCATTTTTAGTTGAAAACTTGCAAAAACCACTATACACTTTTTGACTTCTTTGCAAGAGCTCAATTCATGGTTTTTTTAGCATCTGATAATTATTATATTACTTATTAATATTGTTTTCAATGTTTATTTTTGTTATGGGCGTGGTTCCCACGGCATATCATACTGCATTTCCCAGCATTGTTAGAATAAATTGGCTGAAACCACTGCATACTTGTTATTTGGCTGGCAGGAAAGCCACATTAGGAGCTGGTTTCATAGTTCTAGCCACCAATATTTATATTCTGACTAAAATTTGTTTCCATGTTTATTTTGGCTGCTTTATTCTCACCACGAGTGAAGTTTTTAGACATTGCTTTTGAGCATTCAGTGGTAGGCTAAATTCTATTATAAAATGAATCTCAATGAGATTTGAATAGCTTCACACTTAGTTTGAGTAATTTGCAGTAATTATTGCACTAAAGTAGCTTAATTTAATTTATTTAGAGTGACTAATAGTTGTTTGCCTTCTTAAGATGTTCTCTGTAGGGCCTTCTTTCAAAGGTATAAAGATGATTCCTCCAGGACCACATTTTCTTTATTACAGCTCATCGAGCAGGTGTGATATCATTAATCAGGTTATGTAGATATATCTATTGTATTGTTGCAGACTATGACTTACATGATGTGACGCATTTGATCAGTTCTTATAAAGATGTCATACCATGCATCATACATTTTATTCTTACTGCTCATGTCCTCCATGACCTGTGTAGCCTTGCCTTACATTCTGCATCACTTTGATAATTGTTTGTCTAATTCTAACTTTATATGACATCTTAGCAACCTAATGCTTTATACGTTGTTGTATCATTAATCAGTTCATAGATACTATATCTTTTCAACAGTACAAACTATCAGATGCTATATTTCTCCTTCTTCTTCTTCTTTTATTCATTCATTTTTATCAAGAAAACCTACGTGCGTGCGATATTTCCTTCAACTGCCTATGTAAAGATGATGCATCACATTGATAATTGCTTGTCTAAGTCCATCTTTACCTTATGTTTATTTTGCAACATCATATGCTATGATGTTAATCGAGCTCTATAGATACTTATATGTAGTTAATTGTGACAGGCTTATAACTAGATATATCTAGTTAATTGGTTCATAATAACTTTACACGACTCATCTATTTAATTGTAAAAAGGAGATGACATATGACATCAATTATTGACCCTTGTGTTTATGTATTGTAGTTAACTCTTCATTTATTAATCCATTATGTTTAATGAAAAGGAAAGGACTAAAAGCATGTGAGAGATCAAAAAATTTATAGAAAGGCCATCCAATTGGTCCAACATGAAAGAAACAATCCTATTTTGCTCCGATGAAAGATTTTTGGAAAAAGAAATACTCTTTTAGTCCCTTTGAATTTTAAAATATTACACTTTACTCCTGAGATTTGAGTTTATTTTCCATTTGGGTGATAGGCTTCAAAGTGTTGCATTTTACCCTTTTATTTTGAGTTTAGTTTCTGTTTGGTTCCTACATTTCAAATTGTTGCAGCACCATCCCATTTCTTTACATTTTAAAAAATCCATTGATTGAAATTATCAGAATAAGGGAGTTCAAATTCAAAAAATTCATAACAATTCTAATTAAAAAAAGAAAAGAAAAGAAAGAATGCCATTTCCAAGCTCATATTTTCACATACAGTCAAAGGATGGTTAAGGGAAAAAATGAAAACTAATTGGGTGGGTTCCAATTATTTCTTGGCCCTTTTCAAGATCCTGGTGAACTGGAAGTTCTTCTCGGAAGGCAGTAGACATACCCTTATCCAGTCTGTCTTGAGTTGGATTCCAACCTATTTCTTAGCCCTTTTTAGGATCCAATGTCGGTGAGTAAAACTTTCAAGTAGGGTAGGAGGTATTTCCTCTGGAAAGAGGGTTGAAGACGGGAAAAGATGCACACTTAATAAAGAGGGGGATGGTTGCAAAGCATGTGGACTTTGGGGTTCTAGGGATTGGAAATGTGAGAGCCTGCAACAAGGCTCTTTTCACTCGGTTGTGGCAGTTTCATCAGAAGTCCAATTCCTTTTGGCATAAGGTTAAACTGAGAAAATTCGGCACGGCCCTATCTGAGTGGACCTTGAATGAAGTCAGAGTACAGTTATTCTATTAAAAAAAGAAAAGAAAAAAAGAAGGCATAAGAAAAAGTGACAATACCTATAATTGAATACAGTATCAAATTGTATTACAAAGAGTCAAATGCCCCTTGCTATACTGAAGCACCTCCAACAGTCTGAGTTTCTTAGGAATCTTGTCAATCACAAACTACCAAAAAGAATTTTGTTTTGGTTACCAAGTGGACACTCAAGGTAAAGGAATGGCGAAGATTCAACTTTGCTAGTCAAGGAGGCATGTTTTGGAAGAATCTTAGAGGGAGGAGATCTTCCTTATTACGTGATTTCATCTAGCCCAAGTAACAAATTTTGTTTAGTAAATTTAGAGACATACAAGTACCCTGAGAGATTTGACATAACCAAAAATAGATGGTGGCACATATATTCAAAGATTAGACAAATAGGAGCTCAGATTGATTTGAATGCATTGGTGAAGAGAAGTTAGGATTGTCCACAATCAACTAATAGAAGGAACAATTATTAATTTCAAATGTGTGATAGATAAACTATTATGGAGAGTTTGGAGAAACAATTGCCTTCAATTATACATGAATTTGGGAAGAGAAGGAAGGGGAGAACTTAATTTGAAATGCAAGAAAGCTAGGAATAACATTTATGAGAATATATAATAACTAGGATTCATGTATCAAAATCTTCCATCATTTGAGAAAGGCTTAATTGAAATGCACGAAAGAACGAAACAAGGTAGGTTTGTGAGAATAGGTTGATTTAAGGTCCATGCATTAAATCCTCTATCAATTAAAAGTTTCAGTTTTCCTGTTCAAAAAACAAAGAAGTTATCCATAACTTTTTTATGAGAACTATTTTATCTATATCCTTTGTTCTAAAGATAGTTTTCCTGACTGTGTATAAATTTATTCTATAATATTTATGTGTAGAGAAGGCAGAGAGTTTTCACCAATTACTGGCTTTTTTGTAGATGCTGGTTCCTCTGAGGTTTGACTTCTCATATACTTCAATTACTGGCAGAGAGTTTTCACCAATTACTGGCTTTTTTGTAAATGATGTTCACATGTAAATGATGTTCAAGTCCTATTAACATTTGAGTGGATTGCTTGCTTCTACCCCGCTGGGTATCGTTCTGGTTGATCAAATTATAAAAAAGATAAAAGAATAGCAAAAGGAAAGGAAAAAAGTGTACCAGTTTTATTTTATTTTTTAAATAATGAAAGCCAATGACTAGCGTGGAACAGGAACAATGTATTAGGCTTTAAATGGCTTCTCTCTCTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNACCTAAAATTTTCCACCACACTTCCAGTTTCAGTCTTTATGATTTGTTTGCTTTTGAAACCCTGCATTGCTAAAATGTGTTGATGGTTCATTTACGAATTTTTAGTCTTGGAAATTTCCCATGCGTGCATTTGAGGTGGAACTTCCTATTTTTGGTTGAAATAGGTTATTGTTCGTAAGTGGGATCAGAGGGAGGAGCGACTTGTTAAAGTATCAGAAGAAGAGGTTTGCATTTTATTTATTTATTTATTTATTCAAGATTCATTAGTATTTAGTAGTTAGAAGCTATAAAAATTCATTAAGGAATTGAGTCTTTTTGCAAGAGTTTTATCTTAGAGTTTATGAAGTGATGTAGCAAGTTTGATTTGTTAAAAGTTAGAAACTTTAGGAGCTAGGCCCAGCAAGGTGTATGTTCACCAACAATGGCCTTCAATTTTTCTGCCTCAAAACAGACCGAACAAAGTCCTCTGATTGATATTTTACCTATTGATATTTTTATCCCTCAAATAGGAGCAGCGATTTGGGGAAGCAGTTAGACAACTAGAGTTCGACAGACAACTTGGTCCGTATAATTTGGGCCAATATGGAGAATGGAAGCGAATATCTAACCACATCAACTGTACCACAATTAAACGACTAGGCATGCACCCTTTTCTATTTCAGTGTCTCGTACGTGTACAGTTTACTTTTTGCTGCATTTTTGCGTGTAAAGTTTCTTTTACTTTTTGTGGCCTTGGCTATGATTATTTGGCAAAGATTGGAGTAAAAAGTATTATTATTGATGATAAGTCTCGTGATAATTCTACATAGTTTGTTTTGATTGAGTTAGGCAATTTTATGCATGATTTTGGCCACGATAATTGTATCATAATTTACCACCTTATCGAAGTACTAGCTGTGCAAGAATTTGAAAAATTATCATAACCCCAAATATTCATTTCTATTTGTTTCTTTAGTTACTTCCTCTGTCTTTTCTCATTCCCTTTTTATATTTTGCTAAGCCTTTCCCCTCGCCGTTCTACATATTGTTTTTTTTTTTCTCTTTTTTTTTTTTCTAGCATCTTGATTGGTTTGTTTGTTTCATTGTGGGGTTCATAGCTCCAAAAAGCTGTAATGTTTCTGTTAAAGTTTGAGAAGGAAGACTGTAATTCATTAGAAAGTTCAACATTTCCTATTGCTGTCCTGATTAGTATAGGTGTTGACATTCGTATGTTGAAACATTAGTTGTATTTTATCTTATTATTTATTTTTATCAAAAGTATTATAATGCTCAAGGAGGTAAGCCACATTCAAGCTACCATGTTCGTTTGCCATTTTGACAAATAGAAAATGAGAGAAGGTTTAGCTTGGTAGAAAAAGAACTCCAAGGAAGAAAACACGAAGTATGGTTATAGGCTGAGTTTGGCCAATGGGCTAAACTTATGGGCTTATCCCACCCTTTTAGGGTTGTCTCCTATAGGATCTCAACAAAGCTCTAAATTGGACTTAAATCTCGATTCTGCACCTGGTTTGTAGCCGATCGGGGTGTGATAATGTTACCCATACAGCAGGTCATATTCTTGTGGATTGGTGTTATCTCTTAGTTAGTGTTGTGACTCTTTTTGAGGGGCTGTTTCTCTCATGACATGTATTTTCTTTCATATTTTTTTAATGAAATCTTGGTTTCTTATTAGAGAAAAAAGGTTTTTATGTGTCGTGAAGAGAAGAGATGGTTTGTTAATTATTTAGCTTTTTTTCTTTCTTAGTATCGTAACTTTTTGAAAAAGAAAAATACATACCCCCCCCCCCCCCCCAAAAAAAANATGTATATATAGAGAGGGGGAGGTCTTTATTTTGTAGCTTTTGAAACTCACGTATTGTCTTGTCACGCTTAGTTACAGATGTAACCTGCATATGAAAAGTTCATTTTGTTTCTTGTTAAAAATAATAATAATTATATTCATATTTCCAGACAATGTATTTAATGTACAGAACCTATTGGAGGTGACATTAGCGTGGCTTGTGAACCTGGAATTTCTCAAAGCACTTCCAAGTCTGCAATTGAGAAAGTCCTGGATGATCAGTTGAAGGCTAGTAAGTTTGCAATGCATGTTGATTCGTCTCAGAGGAGAAAATGTTATTACACAGAAATTCCCCATGTTATCAAACAGAGAGGAGTTCATGGGCAAGAACTTACTAATTTGAATCTTGATAAGGTAATTAGTACTGGTATGAATTCTTCAACATATGACCTTGCTGAAAAATCATTTTGTTGCAACAGTTGTTTTTGGGAATTGTTTCTTAATTTGGTTATGATTGTTATGACTTTTGTAGACTTTACTACTCGAAAAGTTACTGAAAAAGGATTTTGGAGGTTCAGAGGACTTACTCCTTGGGGAGCTACAGTTTGCATTCGTTGTATTTTTGGTTAGTATTGCCGTGGACAAGAGTTTTCTCCTTTTGTTGGCAGATCTTTGTAACGTCTAACGTTGTGCAGATGGGACAATCACTTGAAGGATTCCTACAGTGGAAATCATTAGTTAGCCTGTTTTTTGAGTGTACAGAAGCTGTAAGTATTCCATTCCAACTATGACGTGTAACTTTTGGGTAACTTTTTTTTTTCTGATTGAACTTTTGTATAAGAGTTCCACCTTTTCTTAGACTATTTTGTTCAAGAGAAACTTTCTTCTACCATTCCTCATATTTGGAGCAATTTTTTATTTGTTTGTAATGTAAGCTTGTTTCATTGATTGTAATTAGTACTTAGTGTACATGATTACAACTAAGAAACATATTTACAACCAACCATTCTACTACTACTACCATATGCATTTTTAAAAATGGTGAAAACAAATAATGCAATTCTATTTTCTACTAAATCCTCTGTTTTTCTTTTTTTTACAAGAGACAATTTCATTGATGATTGGAATTTACAAAAGGAATGTAATATCCATGATGTTTATAAAAGACATTTCTAATTTACAACGTAGGAGGTATATCTATAGGAAGCAAAAATATTAGACTCTTAGCATTGGGTTGGAATGTGAATTCTCTTTCCATATTTCCTCTTTTACTGGTTCTTTTCTAGATGGAGTTACTTTTTAGGCATAAGGATGTCTTGATTTTTTGACTGATATTTCTTCCATTTTATTGGCTAAATTTCTATTTCTATTTCTGTTTTTTTATTTTATTTATTTTTATTTATAAAAAAGACCATATTGCAGACGATCCATGAATTCCTATTCATTTTCTCCCTTGACTTTGCTGAAAATCCTTTGCTTTATGTTCTATCTTTTTGTTCCGTGGGAGTTCAAAATTTTTGGGACATGGTTAGGCTACAATTTTTTTCCTGCCCAGGTTATTTCTTTTTGTTGCGACATATCTCTCTGCATGCCAGATTGTACCCTTAGTTTGTGAGGTTATCTGAACTTGGATGAAAAAGAATAGGGGCATAGTTTGCTTTTCTTGTTCTAGCTGAGTGGTATTTTGTCAATCCACATTTTAAGAATTCACATTTTGTTGCAGCCTTTTTGCACAAGGAGTCAACTATTTACAAAGGTGTGTTCGGTGACTTTTAATTATTTTGAATGTTATTGTTTTGTTCCCTTCTTGTGTATTGCTGTCATGTTGCACTGTTCACGGTGGAGGTAGATGCACAACTTTTAGAAACTGTGGGATGTCACTCTTCAAATAAAGGAGCTATCTTGATTATCTAATGAGGCCTTATCCCAGCTTATAAAAAAAGTTTATCTTCTTTTCTTGCGTTGACACATGGGAAAAACTGACACTTTCGAATTCCATAATGGAGGACCAATCACTTGCGGTATTAATTGTTTTTTTATTTTTTATTATGCCAGGAAGGGTGCATGAGGAATTTTATGTACTTTTGGCTTTCTTTTGCTGAAAATTGTAAATCATCTTTCTAGATGCTTTAGGTTGGCATGCCTTACTAATCCTACACATTGAATGAGAACTCACCAGTTCTTTAACAAATATCAGAGCTGAAAATGCGCTCACCACTTTTGGTAGCCTTCATTATTTGTGTTAATTGTTCCTTTCAGATTGCTTTCTTATATCCTATTATTGGGTACTTCTATTCACAACCGTCTAATTTAGACTCCAACAGTCTTCATGACCTAACTAGCATAGAACACACATGGAAGCTTTAAAAGAAAATTTGGAAGGCTTCTTTTCAAGATAATCTACTATGTATTGATACCACTTTTGTGCATAATCCAGACAAGAATACCAATCTTTCTTGGGGTTTTGGTCTTCCAAATAGCTCTGAATAATTTGGTTGGCATAGAAGGGGGGAGGGATTTTGGTCTATCCAGAGGCATCCTCCACCCTCAAGAATTTTCAGAATTACTATAAGAAACTAATTCCAGCATGTATAAAAGGGATCTAGCTCATTAGTCTCGACTTCTTATGAAGGTCTTGTAATTCAAGAACCATGGACACCGACACGACAATCACTTGTTAGTTTCTAAAGCAGTAGTATTGGACATGGATACATTGAGACATAATTTTTAAAGAGAAAAAATACTCCTTTGGTCCATAAAATTTAAGGTTGGCATCTATTTGGTTTTTGGTGTTTGGAAAGTGGTTCTAAATGATCTCTGAGCTTACTTGATTGACGAAAAGATAATGCGGCCATTTTGTGGTGATTGGACTAGATTTAGGGGGCAAAACATATTCATCTTGTATTCTTTGTTTGGGAGATTAGACTATAAGGTAACAAAATGCGGTGACATCTTCTTCCCCATTTTCTTTCCTTTTCTTTCCTTTTACATCTTCTGGATGAAACTTACTACTATTGCGCCCCTTTCTTTCTTGCGATTGTAGTCATATCTCATTGAGGCTTGTCCTCCTTGCTTGGCCTTGTTGGTTGGTATGGCCTAGAAGTGCGATCTAGTGTTCCTGATCTAGTTTACAGCCCCAAAACGTTGTTCATGTCATCTAACGTTGCCTAGAAGTGCGAGCTAGATAGTTGTAGATCTTCGATCAGTTAATTCTCTGGCCCAACTAGCATTAATGTACAGTTCTACCTGTCTTTTACGATTTTTTTCAATAATAGTCCATGACTTAGTGTCTTTCAAATATCTCAAGACCTAGTTTACAGCCCCAAGATGACATTCATTTGGATTGTTCATATACTGGCTAACAATGTTATACAAAGTTTGCAATGTTTGGCCTGGTATGTGAAAGATAAATTAGCCTTTGATACACGCCCTTGTCAATTGGAATAAATTAGCCAAGATGCCATTCATTTGGATTGTTCATATACTGGCTAACAATGTTATACAAAGTTTGCAATGTTTGGCCTGGTATGTGAAAGATAAATTAGCCTTTGATACACGCCCTTGTCAATTGGAATAAATTAGCCAAGATGCCATTCATTTGGATTGTTCATATACTGGCTAACAATGTTATACAAAGTTTGCAATGTTTGGCCTGGTATGTGAAAGATAAATTAGCCTTTGATACACGCCCTTGTCAATTGGAATAAATTAGCCAAGATGCCATTCATTTGGATTGTTCATATACTGGCTAACAATGTTATACAAAGTTTGCAATGTTTGGCCTGGTATGTGAAAGATAAATTAGCCTTTGATACACGCCCTTGTCAATTGGAATAAATTAGCCAAGATGCCATTCATTTGGATTGTTCATATACTGGCTAACAATGTTATACAAAGTTTGCAATGTTTGGCCTGGTATGTGAAAGATAAATTAGCCTTTGATACACGCCCTTGTCAATTGGAATAACCTCTTCACTTTGATGTAGAGCTAAATTTAAATCCGTAGATGTTTCACTAGATCTACACCCAAGACTTCTGGTATCCTTTAACGAATCTAGAATATAGTTTCAGAGAAATTACAATACCATTCTTGGATTGTGCCACTTCCATAGCCAAAAAAATATATCAAACTTTCCTAGTCTTTGAATTCAAACTCAGTTGATAGGCATCCTTTTAATGTTGAGAATCTCTTCTAGGTCATTCCGTGCAATGATAATATCATCCACGTATACAATCAAAATTGCAGTTTTGTCATTTGAGAATTTCACGAACAAGGTATGATTGACTTGACCTTGATAATAACCACTTTTATTCTGTGTTTTAGTGAATCTATCAAACCATGCGGGTCGGGACTGATTTATTCCATATAAAAACTTTCTCAACATGTACACCAAGTTTCCATTAGCCTTATCCTTCATTCCAAGGATAATTTGCATATAAACTTCTTCTTGAGGAGAGGAATCGAGAGGTTAAGAGGAGCTTCCAAACCTAATACTTTCTAACCCGTGTTTCTTCATTTGTGATTGCTGAAAACCAAGGTTGAAGTGGCTGCTTAAACCTTGTAGAAGACCACACACCAAAAACCTTAAAAGAAATCAGTGAACTGAGACCCACATACGAGAAAAAATTACCTATGAAAAATAACTAGAGCTTGAATGCACGACTCAAGCAAGGGAATTGATGGTCATCATCAACGAATGACCCAAACAACCATGAACAGCCACAGACACGAATGAAATAAGGGTTAGAACAGAATGCCATGAATCAATCCATAACATGAACAAAAACATTGTGCAAACATAACCAGTGTGTGACTAGGTTGGTGACAAAAGGACGAGAAGTAGAAAGAATGCAATGAATGAACGGATTTGGTGGAGGCAAGCGAATACAATTTTCTTGTCCATCCAAGTAAATTCATAGGTGTAATTAGGCTAGTAGTTTAAGAATATTATTATATACGGATTAGAAATTGTTACGGAATCTCTCTTTTCATTAGTAATATTTTTGTTCTTTAGTTTTTAGGGTTGGCCTACATAGGCTAGGCTTGAATTTTAGTGTCAACCATGTCTAAGCATGTTCTGCTGTGTGCAAGTTGTGTATAACCCTAATTAATTTCTAATAAATGGAACGTTTTAGCATAAATGCTGGTGAAATCTTGCTTGCACTTTCTTCTGTTACGATTAACTTTGTTCTGTTCTGTTTATATGTCTGACATTTTCAAGAATTATTGCAGTTCATTAAGGTCATCTACCATCAATTGAAATTTGGATTAGAGAAAGATCATTCTAATGACAAGGGTCGATCATCAACAATTTTAGATGAATCGTGGTTTTCTGCTGATAGTTTCTTATATCATCTATGTAAGGTAAGGTAGTATCATGATCATTTTAAAACCCCTTGCTTACCCAAAAGTTAAAAGTTTATAGACGGGATTTATAATTCCTTATGCATATTCTTAACGTTCCTCCTCATTTGTGGGCTTGGAAATTAACACAAAGCTTAACAAGTGATCGCTAATGTTAATTGGTCAGAAAATGATTCAAACACGTAATCTCTTGTGCAAGCTAATTTATTCTATCTATTTAATATTATATCCATATTCTATCACACCTATCATTCATGTATTTTGAAATTATCCTTATGTTACACTCACTCCAGTTATCTATGGTGAGTGAGTATTCAATTATTTGAGATCCATATATGTTACATTTTGATTCTCTGACTTGTTTATTGCAGGTTAATGCTGAATTAATTTTTTTTAATGGGTGATGTTATTGAGATTTACATTTGCCTATTACCTAAAATATGATCAATATCATGCAGGATTTCTTCTCATTGGTGCTGGAGGCTCCAGTTGTTGATGGGGATCTTCTGACATGGGTATGTGCATCGAACTTTTTTCATATAATTATTCTGTTGTGGGTATGTGCATCGAGCATATTTCATATAATTTTTACCTGGTTGTTCTAACCATTTCCAAGATGCAGGTTTGAAATTTAACAAAATCTCAAAAGATTTTATGGGCATACTCCAAATTTTTATACGGTGGTATTATATTGTGAATTCAAGTGTTCAAGCCTACACGAATTTGGTATAGGCCCTCAGATTTATTCATGTTCTAGTACGTTTAAAGCCTCCCAAGAAAATGAAAATTTCCATGTCCATCTTTAAGAACTTATAGGCAGTAAACTTTCTCCCCCACCACCTGTGTTTTCTTTTTGTGCATTGGCCTGCTATGTCTTCTCTCTATGAATTATTACTCATGGATGAGGCAAGTTCATGGTTTCTTTTGGCTCTATGATTCCTCCAGGGAGTTGTAAATGGTTTTCTCAACCATATCCTCGTTTCTTGCCTGAGCACCAGTTTCTTATTTCTTTCTTGGCATCATCCAATTTAAGCGATGTTCTATCAAGTGCTTTAGATTCTTGAGGGCTCATTTTATAATGGAGAAGGTGAGGCAGTATTGTGCCAAGGATGAAGTCAATCTGGTGCGGCACGTTCCTCAAAGGATCAAAGATGGATATTTCGTAAGTAAACATTCCTCCTCGTTAGTATGAAATTGTTGGGTTGCAGGGACATGTCTCCTTTTACTACCAAACCCCATCTTCTCCTTGGGAGATGAACTGTTTCACTCATAGTACCGTGAATTTGTCCCCCCCTCCACCCCCCCAAAAAAAAAAGTTCTCTGCTTTTTTCCGGATTCTTACTTCAGAGTCCTCATCTGATTTTCATCTGAGAACGGATTCAATATCGTTTCTTCCTTAGCCTTCAACCGATTCTTCTTATAGTCATCAATCTAGTACATCAGTTTTTCAGCTTTTGGTTCTGGATCTAATGAGTTGTGTCAACCATAGATGCTTTTGTACCTGGGTCAAGAAGGGTATTTTCTTTATTGAAGACATGGCCAACAAGCTGGTGATCCCCCCTTTTAATCTCATTTTCGTTGGTTCAAAAGTTCTTTGTGTGGATTTGCTTCATTATCCTGTGCAAATGAGTTTGGTCTATTCGATTATCAAAACGAAAATCTTATTTGAGTTGGTGTTTAGAATGTGCTGCTTGGCCTTCTTCTAGAGGTAGGAAATTGCTCAAAGTTCGAATGTTTTCGTAGGAGTTTGTTCAAGGACACATGTTCATCTTCAGTCAGGAGTGTTAGGGGACACCATGTTCTTTTTCCTTCTAGTATAGACTATGAATATCTTTATTCGACTCTTGAAGAGTGCTGGTCAATGATACTAGAGTTTTGGATGAAAGGATCATACAGTACAATCTTTGACATAGTTAAAGGTTTCATCGATTTCTGGCAAAGACGGTCCATCATTAGTTTTCCTTGGATTGCTGACATTTGCACCCTTTTATATTTGATACATAGTCAAGGAAGATACATTTGATAGTCATAACTTTACGGTTTTTTGGTTGATTATTTTGAACAAAGAATTTATCCGTAGGGGAATAAGAGAAAAGATCATTGTATCTAGAAAGTGTTTTAGAAGTTTTTGTAAAGGAGTTGGAAAATTGAGGGTTTAAGAAGGCATCCTATTTATGAGGGAGGTAGCTGTTTCTTGAAAAACTACTAATGTTCAAAAGTACAAGCTTCCCCACTGGGGATATAAAATGAAAGACAAAAAGCATAAAAAGCAGAATTAATTATCCCTTGAAATTATATTATTCTCTTTGAATTTCTATGCTGGAGTTTTTGTACCTCTATCTACTACTTTGTAACTCCCATGGTGATTTAATAATTATGGAAAGTGCTTAATTAATGCACTAAATTCCTCCGCCACAGACAAGGAAACTCAAGGAACTGCTAGAGAACAGCCTGGACTGGAAATTCCAAAACAACGCTACAAGTGATGGAATTTCTTTCGATGAAGATGATGAGGTCAGTCAATGTTTACCTTGCTATAATCTCTCTCTCTCTCTCTCTCACATATATGCCTATTTACTTTTGCAGTTTGCTCCTGTAGTTGTAGATTAGAGGCTGGATGATTCTAAGTCGTCATGAGGTCTCAAGTTATATATTTTAGTCTATAATGTTTTTTTTTCTTTTTCTTTTTTCCCCTTCAATTTAGCTTCTTATGCTTTATGATGAACGTTAAAATTTGTGGATAATGGTGGGTTTGACACAAATCTAGAAGAGTACTTGGCTTTTACAAATCTAAACATTAAGGGTTATAATATTTATCTAGCCTAGAACCTTGCCTCACTGTTCTACTGAACTTCTAGAGATCCTTTTCTGTGATCTAGACCATACACACCAAAAGAAAGAAAGAAAGAAAGAAGAGGACATTCTCATCCGTTAGAGAGAACAGTTAAATTAACTCCTAGGTTGTTTAATCAAACCTGAAATTCTGTGTACATATAGATGATACCTCTAGAATTCTTTTGCCATTTCTTACGGCCTTTAGTTAGTGCTTACGAGGGAGCCTCGATATATTTGTTTATCCACTTTGTTCTGGATGAATTGGGTTCATGAAATTCTATTATATTTTGGATTCTAAATTTTAAATAAATGGTTTAGGCTCGAAGTGATAATTGTCTTTTTCGTTTTTAGTTTTGATTTTTGAAATTTATGCTTGTTTTCTCCCACAATTTTTTTTACCTTAGTTTTTTTTTTAATTTTTTTTTTTACATAGAAAAGTTTGAATTTTTAGCCTAATTCTAGAAAGAAAAAAATAAATAGTTAGTTTTTTAAAATATTGATAGGAAGTGAATAACAAAACATTTTATGAATTTAATTTTCAAAGATTAAATAGTTTAATATATCTTTCTGTTTGTATAGGATGTTTTCTAGTTAGAGCTTTCATTCATAATCATAGCTCCAAAGATATTTGAATTATTTCAAACTAGTTGCAATTAGTTGTTTTTCTTCCATTTGGATGAAAACTTAATTTGTTTTGATCAGGTTGATTGTGAGAAATTTAAAATTGTAAATTACTTAACATAATTAAATCTCTTTGTGATAATTTAACTATATAAATGGTAAAATAAAAAAGTTTATGTACAAATTTAATTTTAGAGCGTTTGTATGATACAAACAATAAAGAGACTCTACCACCGGTCTGTTGCTTGTCAAACACACTCTAACATCTAAGTTAAATAAGGTTTGTATTATCATAGCTAGAATTATGATTTAGGTAGCATTCATTGTATTTGAGCCTCTTTTTTATTGACATGGATTTGATCTAAGGTTTTCAGTTTACTCTACTAATGATGTTGTACTCAAGACCCTTAGGCTTCGAGTTAGAACCATTTGAGTCTTTTAGACTAATCAACTAGACCAAATAAGGGATTGACCATTTTGCTCGATAATTGAGCTTATAAACTTAGGTGCCCAACCCCAATTGGACCTTGGGTCACGAAAAGCCTAATGAGTTAGGGCCTATCTCGTGCCCTTAGGGCTACGATGGCCTTCACTTGCATTAGTGTTCATAGGTTTGACCACTTGTTCTCTTGCAACTCTCACAAATTTCCGTCTAGATTAGGGAACATTTGATTTTTACATATCTCATCTGGCTTCATTCGACTCCAAAGGAGCTTGTCATTGGCAATATGATGATATTATCCCTTCCTATATTGTTTGCGTCTAAATCATTATATAGATGTTTTGGTGTTCTTTGCATCGACTAGAGGAAGTTTCTCCTACACCATATATAAAAAATCCTCAAGATTTGCATTATTTGTGGGCAACATTTCTCCTTGTTGCATTCCATTTAAACTATGATATCTTTTATTTGATCTTTCACAACCTTTTCGGGCTAAGTTGTAGCAACATTATTATTCTCTTTTTCCGCTAAGATTAGTATATTACGAAAAAAGAGGAACCTAGGTCAAAACATTAACTAGACAGAATAACTTACAATCTGAACCAAGCAAACCACGAGAAAAGATTTTGAGGGGTGAAGATCTAAACGATTCCAGTATTGAAGATGCTTTCGATTAGACAAAGTTATCAAATGATTCTTATTATTACATGAAATGCAGAAATGAGTATAGCTGAACGGCCTGTGAATTGAACTTGAAAAAGAAAAAATGCCCAA

mRNA sequence

AATAGTTTCGAGAGTCTTCTCGACTTCAAAACAAGGAAGAACCCTGAATGTCTTCTATTCCTCCTCCAGGGGAGAGCGTTTAAGCGAAGAAATCAGTGTTCCGCGATAAATTTCAGTAGAAACTAGAGGAATTTGAACTTCAATTTTCGTGTCTGCTGTGTTTTTTTGAAGCTGAGATGGATCCTGAAACTGCCCTACAGCTTGTAAAGCACGGTGCGACGATTCTCCTCCTCGACGTTCCTCAGTACACGCTTATTGGAATTGATACTCAGATGTTCTCTGTAGGGCCTTCTTTCAAAGGTATAAAGATGATTCCTCCAGGACCACATTTTCTTTATTACAGCTCATCGAGCAGAGAAGGCAGAGAGTTTTCACCAATTACTGGCTTTTTTGTAGATGCTGGTTCCTCTGAGGTTATTGTTCGTAAGTGGGATCAGAGGGAGGAGCGACTTGTTAAAGTATCAGAAGAAGAGGAGCAGCGATTTGGGGAAGCAGTTAGACAACTAGAGTTCGACAGACAACTTGGTCCGTATAATTTGGGCCAATATGGAGAATGGAAGCGAATATCTAACCACATCAACTGTACCACAATTAAACGACTAGGCATGCACCCTTTTCTATTTCAGTGTCTCCTCCAAAAAGCTGTAATGTTTCTGTTAAAGTTTGAGAAGGAAGACTGTAATTCATTAGAAAGTTCAACATTTCCTATTGCTGTCCTGATTAGTATAGGTGTTGACATTCAACCTATTGGAGGTGACATTAGCGTGGCTTGTGAACCTGGAATTTCTCAAAGCACTTCCAAGTCTGCAATTGAGAAAGTCCTGGATGATCAGTTGAAGGCTAGTAAGTTTGCAATGCATGTTGATTCGTCTCAGAGGAGAAAATGTTATTACACAGAAATTCCCCATGTTATCAAACAGAGAGGAGTTCATGGGCAAGAACTTACTAATTTGAATCTTGATAAGACTTTACTACTCGAAAAGTTACTGAAAAAGGATTTTGGAGGTTCAGAGGACTTACTCCTTGGGGAGCTACAGTTTGCATTCGTTGTATTTTTGATGGGACAATCACTTGAAGGATTCCTACAGTGGAAATCATTAGTTAGCCTGTTTTTTGAGTGTACAGAAGCTCCTTTTTGCACAAGGAGTCAACTATTTACAAAGGATTTCTTCTCATTGGTGCTGGAGGCTCCAGTTGTTGATGGGGATCTTCTGACATGGGTGAGGCAGTATTGTGCCAAGGATGAAGTCAATCTGGTGCGGCACGTTCCTCAAAGGATCAAAGATGGATATTTCACAAGGAAACTCAAGGAACTGCTAGAGAACAGCCTGGACTGGAAATTCCAAAACAACGCTACAAGTGATGGAATTTCTTTCGATGAAGATGATGAGTTTGCTCCTGTAGTTGTAGATTAGAGGCTGGATGATTCTAAGTCGTCATGAGGTCTCAAGTTATATATTTTAGTCTATAATGTTTTTTTTTCTTTTTCTTTTTTCCCCTTCAATTTAGCTTCTTATGCTTTATGATGAACGTTAAAATTTGTGGATAATGGTGGGTTTGACACAAATCTAGAAGAGTACTTGGCTTTTACAAATCTAAACATTAAGGGTTATAATATTTATCTAGCCTAGAACCTTGCCTCACTGTTCTACTGAACTTCTAGAGATCCTTTTCTGTGATCTAGACCATACACACCAAAAGAAAGAAAGAAAGAAAGAAGAGGACATTCTCATCCGTTAGAGAGAACAGTTAAATTAACTCCTAGGTTGTTTAATCAAACCTGAAATTCTGTGTACATATAGATGATACCTCTAGAATTCTTTTGCCATTTCTTACGGCCTTTAGTTAGTGCTTACGAGGGAGCCTCGATATATTTGTTTATCCACTTTGTTCTGGATGAATTGGGTTCATGAAATTCTATTATATTTTGGATTCTAAATTTTAAATAAATGGTTTAGGCTCGAAGTGATAATTGTCTTTTTCGTTTTTAGTTTTGATTTTTGAAATTTATGCTTGTTTTCTCCCACAATTTTTTTTACCTTAGTTTTTTTTTTAATTTTTTTTTTTACATAGAAAAGTTTGAATTTTTAGCCTAATTCTAGAAAGAAAAAAATAAATAGTTAGTTTTTTAAAATATTGATAGGAAGTGAATAACAAAACATTTTATGAATTTAATTTTCAAAGATTAAATAGTTTAATATATCTTTCTGTTTGTATAGGATGTTTTCTAGTTAGAGCTTTCATTCATAATCATAGCTCCAAAGATATTTGAATTATTTCAAACTAGTTGCAATTAGTTGTTTTTCTTCCATTTGGATGAAAACTTAATTTGTTTTGATCAGGTTGATTGTGAGAAATTTAAAATTGTAAATTACTTAACATAATTAAATCTCTTTGTGATAATTTAACTATATAAATGGTAAAATAAAAAAGTTTATGTACAAATTTAATTTTAGAGCGTTTGTATGATACAAACAATAAAGAGACTCTACCACCGGTCTGTTGCTTGTCAAACACACTCTAACATCTAAGTTAAATAAGGTTTGTATTATCATAGCTAGAATTATGATTTAGGTAGCATTCATTGTATTTGAGCCTCTTTTTTATTGACATGGATTTGATCTAAGGTTTTCAGTTTACTCTACTAATGATGTTGTACTCAAGACCCTTAGGCTTCGAGTTAGAACCATTTGAGTCTTTTAGACTAATCAACTAGACCAAATAAGGGATTGACCATTTTGCTCGATAATTGAGCTTATAAACTTAGGTGCCCAACCCCAATTGGACCTTGGGTCACGAAAAGCCTAATGAGTTAGGGCCTATCTCGTGCCCTTAGGGCTACGATGGCCTTCACTTGCATTAGTGTTCATAGGTTTGACCACTTGTTCTCTTGCAACTCTCACAAATTTCCGTCTAGATTAGGGAACATTTGATTTTTACATATCTCATCTGGCTTCATTCGACTCCAAAGGAGCTTGTCATTGGCAATATGATGATATTATCCCTTCCTATATTGTTTGCGTCTAAATCATTATATAGATGTTTTGGTGTTCTTTGCATCGACTAGAGGAAGTTTCTCCTACACCATATATAAAAAATCCTCAAGATTTGCATTATTTGTGGGCAACATTTCTCCTTGTTGCATTCCATTTAAACTATGATATCTTTTATTTGATCTTTCACAACCTTTTCGGGCTAAGTTGTAGCAACATTATTATTCTCTTTTTCCGCTAAGATTAGTATATTACGAAAAAAGAGGAACCTAGGTCAAAACATTAACTAGACAGAATAACTTACAATCTGAACCAAGCAAACCACGAGAAAAGATTTTGAGGGGTGAAGATCTAAACGATTCCAGTATTGAAGATGCTTTCGATTAGACAAAGTTATCAAATGATTCTTATTATTACATGAAATGCAGAAATGAGTATAGCTGAACGGCCTGTGAATTGAACTTGAAAAAGAAAAAATGCCCAA

Coding sequence (CDS)

ATGGATCCTGAAACTGCCCTACAGCTTGTAAAGCACGGTGCGACGATTCTCCTCCTCGACGTTCCTCAGTACACGCTTATTGGAATTGATACTCAGATGTTCTCTGTAGGGCCTTCTTTCAAAGGTATAAAGATGATTCCTCCAGGACCACATTTTCTTTATTACAGCTCATCGAGCAGAGAAGGCAGAGAGTTTTCACCAATTACTGGCTTTTTTGTAGATGCTGGTTCCTCTGAGGTTATTGTTCGTAAGTGGGATCAGAGGGAGGAGCGACTTGTTAAAGTATCAGAAGAAGAGGAGCAGCGATTTGGGGAAGCAGTTAGACAACTAGAGTTCGACAGACAACTTGGTCCGTATAATTTGGGCCAATATGGAGAATGGAAGCGAATATCTAACCACATCAACTGTACCACAATTAAACGACTAGGCATGCACCCTTTTCTATTTCAGTGTCTCCTCCAAAAAGCTGTAATGTTTCTGTTAAAGTTTGAGAAGGAAGACTGTAATTCATTAGAAAGTTCAACATTTCCTATTGCTGTCCTGATTAGTATAGGTGTTGACATTCAACCTATTGGAGGTGACATTAGCGTGGCTTGTGAACCTGGAATTTCTCAAAGCACTTCCAAGTCTGCAATTGAGAAAGTCCTGGATGATCAGTTGAAGGCTAGTAAGTTTGCAATGCATGTTGATTCGTCTCAGAGGAGAAAATGTTATTACACAGAAATTCCCCATGTTATCAAACAGAGAGGAGTTCATGGGCAAGAACTTACTAATTTGAATCTTGATAAGACTTTACTACTCGAAAAGTTACTGAAAAAGGATTTTGGAGGTTCAGAGGACTTACTCCTTGGGGAGCTACAGTTTGCATTCGTTGTATTTTTGATGGGACAATCACTTGAAGGATTCCTACAGTGGAAATCATTAGTTAGCCTGTTTTTTGAGTGTACAGAAGCTCCTTTTTGCACAAGGAGTCAACTATTTACAAAGGATTTCTTCTCATTGGTGCTGGAGGCTCCAGTTGTTGATGGGGATCTTCTGACATGGGTGAGGCAGTATTGTGCCAAGGATGAAGTCAATCTGGTGCGGCACGTTCCTCAAAGGATCAAAGATGGATATTTCACAAGGAAACTCAAGGAACTGCTAGAGAACAGCCTGGACTGGAAATTCCAAAACAACGCTACAAGTGATGGAATTTCTTTCGATGAAGATGATGAGTTTGCTCCTGTAGTTGTAGATTAG

Protein sequence

MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSREGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYNLGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAVLISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYTEIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLEGFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLVLEAPVVDGDLLTWVRQYCAKDEVNLVRHVPQRIKDGYFTRKLKELLENSLDWKFQNNATSDGISFDEDDEFAPVVVD
BLAST of Cp4.1LG03g05780 vs. Swiss-Prot
Match: AAR2_HUMAN (Protein AAR2 homolog OS=Homo sapiens GN=AAR2 PE=1 SV=2)

HSP 1 Score: 114.8 bits (286), Expect = 2.3e-24
Identity = 95/313 (30.35%), Postives = 150/313 (47.92%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPE A +L   GAT+++L++P+ T  GID   + VGP F+G+KMIPPG HFL+YSS  +
Sbjct: 6   MDPELAKRLFFEGATVVILNMPKGTEFGIDYNSWEVGPKFRGVKMIPPGIHFLHYSSVDK 65

Query: 61  EG-REFSPITGFFVDAGSSEVIVRKWDQ-REERLVKVSEEEEQRFGEAVRQLEFDRQLGP 120
              +E  P  GFF+      + V +W   REE  +  + E E     A  Q E D+ LGP
Sbjct: 66  ANPKEVGPRMGFFLSLHQRGLTVLRWSTLREEVDLSPAPESEVEAMRANLQ-ELDQFLGP 125

Query: 121 YNLGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPI 180
           Y                 T  K + +  F+ +  ++K    L    ++ C    S   P+
Sbjct: 126 YPYA--------------TLKKWISLTNFISEATVEK----LQPENRQICAF--SDVLPV 185

Query: 181 AVLISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCY 240
              +S+      +G ++      GI   + +  + ++ + + +A             +  
Sbjct: 186 ---LSMKHTKDRVGQNLPRC---GIECKSYQEGLARLPEMKPRAGT-----------EIR 245

Query: 241 YTEIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQS 300
           ++E+P  +   G    E+T  ++D +  LE +L K F  S   +LGELQFAFV FL+G  
Sbjct: 246 FSELPTQMFPEGATPAEITKHSMDLSYALETVLNKQFPSSPQDVLGELQFAFVCFLLGNV 280

Query: 301 LEGFLQWKSLVSL 312
            E F  WK L++L
Sbjct: 306 YEAFEHWKRLLNL 280

BLAST of Cp4.1LG03g05780 vs. Swiss-Prot
Match: AAR2_MOUSE (Protein AAR2 homolog OS=Mus musculus GN=Aar2 PE=1 SV=3)

HSP 1 Score: 99.4 bits (246), Expect = 9.9e-20
Identity = 54/143 (37.76%), Postives = 80/143 (55.94%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPE A QL   GAT+++L++P+ T  GID   + VGP F+G+KMIPPG HFLYYSS  +
Sbjct: 6   MDPELAKQLFFEGATVVILNMPKGTEFGIDYNSWEVGPKFRGVKMIPPGIHFLYYSSVDK 65

Query: 61  EG-REFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPY 120
              RE  P  GFF+      + V +W+  +E +      E +         + D+ LGPY
Sbjct: 66  ANPREVGPRMGFFLSLKQRGLTVLRWNAVQEEVDLSPAPEAEVEAMRANLPDLDQFLGPY 125

Query: 121 NLGQYGEWKRISNHINCTTIKRL 143
                 +W  ++N I+  T+++L
Sbjct: 126 PYATLKKWISLTNFISEATMEKL 148

BLAST of Cp4.1LG03g05780 vs. Swiss-Prot
Match: AAR2_MACFA (Protein AAR2 homolog OS=Macaca fascicularis GN=AAR2 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 9.9e-20
Identity = 58/144 (40.28%), Postives = 83/144 (57.64%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPE A +L   GAT+++L++P+ T  GID   + VGP F+G+KMIPPG HFLYYSS  +
Sbjct: 6   MDPELAKRLFFEGATVVILNMPKGTEFGIDCNSWEVGPKFRGVKMIPPGIHFLYYSSVDK 65

Query: 61  EG-REFSPITGFFVDAGSSEVIVRKWDQ-REERLVKVSEEEEQRFGEAVRQLEFDRQLGP 120
              +E  P  GFF+      + V +W   REE  +  + E E     A  Q E D+ LGP
Sbjct: 66  ANPKEVGPRMGFFLSLYQRGLTVLRWSTLREEVDLSPAPESEVEAMRANLQ-ELDQFLGP 125

Query: 121 YNLGQYGEWKRISNHINCTTIKRL 143
           Y      +W  ++N I+  T+++L
Sbjct: 126 YPYATLKKWISLTNFISEATVEKL 148

BLAST of Cp4.1LG03g05780 vs. Swiss-Prot
Match: AAR2_BOVIN (Protein AAR2 homolog OS=Bos taurus GN=AAR2 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 9.9e-20
Identity = 59/144 (40.97%), Postives = 85/144 (59.03%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPE A +L   GAT+++L++P+ T  GID   + VGP F+G+KMIPPG HFL+YSS  +
Sbjct: 6   MDPELARRLFFEGATVVILNMPKGTEFGIDYNSWEVGPKFRGVKMIPPGIHFLHYSSVDK 65

Query: 61  EG-REFSPITGFFVDAGSSEVIVRKWD-QREERLVKVSEEEEQRFGEAVRQLEFDRQLGP 120
              RE  P  GFF++     + V +WD  REE  +  + E E     A  Q E D+ LGP
Sbjct: 66  ANPREVGPRMGFFLNLQQRGLKVLRWDAAREEVDLSPAPEAEVEAMRANLQ-ELDQFLGP 125

Query: 121 YNLGQYGEWKRISNHINCTTIKRL 143
           Y      +W  ++N I+  T+++L
Sbjct: 126 YPYTTLKKWISLTNFISEATVEKL 148

BLAST of Cp4.1LG03g05780 vs. Swiss-Prot
Match: AAR2_PONAB (Protein AAR2 homolog OS=Pongo abelii GN=AAR2 PE=2 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 1.9e-18
Identity = 56/144 (38.89%), Postives = 82/144 (56.94%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPE A +L   GAT+++L++P+ T  GID   + VGP F+G+K IPPG HFL+YSS  +
Sbjct: 6   MDPELAKRLFFEGATVVILNMPRGTEFGIDYNSWEVGPKFRGVKTIPPGIHFLHYSSVDK 65

Query: 61  EG-REFSPITGFFVDAGSSEVIVRKWDQ-REERLVKVSEEEEQRFGEAVRQLEFDRQLGP 120
              +E  P  GFF+      + V +W   REE  +  + E E     A  Q E D+ LGP
Sbjct: 66  ANPKEVGPRMGFFLSLHQRGLTVLRWSTLREEVDLSPAPESEVEAMRANLQ-ELDQFLGP 125

Query: 121 YNLGQYGEWKRISNHINCTTIKRL 143
           Y      +W  ++N I+  T+++L
Sbjct: 126 YPYATLKKWISLTNFISEATVEKL 148

BLAST of Cp4.1LG03g05780 vs. TrEMBL
Match: A0A0A0M0D1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G633380 PE=4 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 9.4e-110
Identity = 225/329 (68.39%), Postives = 243/329 (73.86%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPET+L+LVKHG T+LLLDVPQYTL+GIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR
Sbjct: 1   MDPETSLELVKHGVTVLLLDVPQYTLLGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +GREFSPITGFFVDAG SEVIVR+WDQREERLVKV EEE                     
Sbjct: 61  DGREFSPITGFFVDAGPSEVIVRRWDQREERLVKVLEEE--------------------- 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
            GQ+ E            I+RL     L    L +   +     K   N + S+T     
Sbjct: 121 EGQFRE-----------AIRRLEFDRQLGPYNLGQYGEW-----KRMSNHINSTTIK--- 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
                  ++PIGGDI+V CEPGISQSTSKSA+EKVLDDQLK SKFA  VDSSQ R CYY 
Sbjct: 181 ------RLEPIGGDITVVCEPGISQSTSKSAVEKVLDDQLKGSKFATPVDSSQSRGCYYA 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
           +IPHVIKQRGVHGQELT LNLDKTLLLE  LKK FGGSEDLLLGELQFAFVVFLMGQSLE
Sbjct: 241 KIPHVIKQRGVHGQELTYLNLDKTLLLENQLKKYFGGSEDLLLGELQFAFVVFLMGQSLE 283

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTK 330
           GFLQWKSLV+L FEC EAPFCTRSQLFTK
Sbjct: 301 GFLQWKSLVTLLFECREAPFCTRSQLFTK 283

BLAST of Cp4.1LG03g05780 vs. TrEMBL
Match: A0A0S3RKW3_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G090800 PE=4 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 2.5e-94
Identity = 212/433 (48.96%), Postives = 270/433 (62.36%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MD E AL+LVKHG T+L LDVPQYT++ IDTQMF VGP+F GIKMIPPG HF+YYSSSSR
Sbjct: 1   MDSEKALELVKHGVTLLFLDVPQYTMVAIDTQMFYVGPAFNGIKMIPPGTHFVYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +G+EFSPI GFF++AG+SEVIVRKWDQ+EERLVKVSE                       
Sbjct: 61  DGKEFSPIVGFFINAGTSEVIVRKWDQQEERLVKVSE----------------------- 120

Query: 121 LGQYGEWKRISNHI-NCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIA 180
                E +R S  + N    ++LG +        ++   F+ K      N +E       
Sbjct: 121 ----EEEERYSQAVKNLEFDRQLGPYNISHYEDWKRLSNFITK------NVIER------ 180

Query: 181 VLISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYY 240
                   ++PIGG+I+V CE  I ++++K  +E+  + QLK    A  V  SQR+ CYY
Sbjct: 181 --------LEPIGGEITVECENEIVRNSTKIPMEEAPEKQLKVGNSASSVGKSQRKGCYY 240

Query: 241 TEIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSL 300
           T IP V+K +G+ GQELT+LNLDKT LLE LL KD+GGSED+LLGELQF FVVFLMGQSL
Sbjct: 241 TSIPRVVKCKGISGQELTSLNLDKTQLLETLLAKDYGGSEDMLLGELQFTFVVFLMGQSL 300

Query: 301 EGFLQWKSLVSLFFECTEAPFCTRSQLFTK-------------------DFFSLVLEAPV 360
           E FLQWKSLVSL F CTEAPF TR+QLFTK                   +  S VL+  +
Sbjct: 301 EAFLQWKSLVSLLFGCTEAPFRTRTQLFTKFIKVIHNQLKYGLLKDHMGETGSAVLDDSL 360

Query: 361 VDGDLLTWVRQYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATSDG 411
           +  D  +++   C KD  + +  +   + DG    +TRK KELLE+SL W+FQ  +  DG
Sbjct: 361 ISAD--SFLHHVC-KDFFSSL--LDGSVVDGDLLKWTRKFKELLESSLGWEFQLGSAVDG 381

BLAST of Cp4.1LG03g05780 vs. TrEMBL
Match: B9T7Q0_RICCO (Protein C20orf4, putative OS=Ricinus communis GN=RCOM_0140520 PE=4 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.6e-93
Identity = 215/438 (49.09%), Postives = 275/438 (62.79%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPETAL  VK GAT+LLLDVPQYTL GIDTQ+F+VGP+FKG+KMIPPG HF+YYSSSSR
Sbjct: 4   MDPETALDFVKQGATLLLLDVPQYTLFGIDTQVFTVGPAFKGVKMIPPGTHFVYYSSSSR 63

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +G++FSPI GFFVDAG SEVIVRKW ++EERLVKVSEEEE+RF +AV+ LEFDR LGPYN
Sbjct: 64  DGKDFSPIIGFFVDAGPSEVIVRKWVRQEERLVKVSEEEEERFSQAVKSLEFDRNLGPYN 123

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
           L QYGEWKR+SN++    I+R+            + +   +  E E  + +  S+   A+
Sbjct: 124 LNQYGEWKRLSNYVRKNVIERI------------EPIGGEITIESE--SGITRSSPKTAM 183

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
             ++   ++     +S + +    +    ++I  V+                +RR  Y  
Sbjct: 184 EKALDEQLRNSKCSVSASVDKAEKRGCYYTSIPHVI----------------KRRGIYSA 243

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
                         ELT+LNLDKT LLE +L KD+GGSEDLL+GELQFAF+ FLMGQSLE
Sbjct: 244 --------------ELTSLNLDKTELLENILVKDYGGSEDLLIGELQFAFIAFLMGQSLE 303

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLVL---------------EAPVVDGDL 360
            F QWKSLVSL   CTEAP  TRS+LFTK F  ++                +A V    L
Sbjct: 304 AFFQWKSLVSLLLGCTEAPLRTRSRLFTK-FIKVIYYQLKYGLQKDKAETNDAGVGVSTL 363

Query: 361 L--------TWVRQYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKF-QNNA 412
           L        +++ Q C KD   LV+     + DG    +TRKLKELLE+SL W+F QN+A
Sbjct: 364 LDESWFSADSFLHQLC-KDFFLLVQDA--SVVDGDLLTWTRKLKELLESSLGWEFQQNSA 393

BLAST of Cp4.1LG03g05780 vs. TrEMBL
Match: K7KAL2_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G247400 PE=4 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 3.7e-90
Identity = 210/434 (48.39%), Postives = 267/434 (61.52%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDP TAL+LVKHG T+L LDVPQYTL+ + TQMFSVGP+FKGIKMIPPG HF+YYSSSSR
Sbjct: 1   MDPGTALELVKHGVTLLFLDVPQYTLVAVGTQMFSVGPTFKGIKMIPPGIHFVYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQRE---ERLVKVSEEEEQRFGEAVRQLEFDRQLG 120
           +G+EFSPI GFF+DAG SEVIVRKW   +   ERL+K+SE                    
Sbjct: 61  DGKEFSPIIGFFIDAGPSEVIVRKW---DQQDERLIKLSE-------------------- 120

Query: 121 PYNLGQYGEWKRISNHI-NCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTF 180
                   E +R S  + N    ++LG +        ++   F+ K              
Sbjct: 121 -------EEEERYSQAVKNLEFDRQLGPYNLSHYEDWKRLSNFITK-------------- 180

Query: 181 PIAVLISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRK 240
                 S+   ++PIGG+I+V CE  I ++T+K  +E+ LD QLK    A  V  S+R+ 
Sbjct: 181 ------SVIERLEPIGGEITVECENEIVRNTTKMPMEEALDKQLKVGNSATSVGKSRRKG 240

Query: 241 CYYTEIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMG 300
           CYYT IPHV+K +G+ GQELT+LNLDKT LLE LL KD+G SEDLLLGELQFAF+ FLMG
Sbjct: 241 CYYTSIPHVVKCKGISGQELTSLNLDKTHLLETLLTKDYGDSEDLLLGELQFAFIAFLMG 300

Query: 301 QSLEGFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLV-------LEAPVVD--GDLL- 360
           QSLE FLQWKSLVSL F CTEAPF TR+ LFTK F  ++       L+   +D  G  L 
Sbjct: 301 QSLEAFLQWKSLVSLLFGCTEAPFRTRTHLFTK-FIKVIYNQLKYGLQKDHMDETGSALL 360

Query: 361 --TWVR-----QYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATSD 411
             +W+       +  KD  + +  +   + DG    +TRK KELLE +L W+FQ  +  D
Sbjct: 361 DDSWLSADSFLHHLCKDFFSSL--LDGSVVDGDLLNWTRKFKELLERNLGWEFQQGSAVD 381

BLAST of Cp4.1LG03g05780 vs. TrEMBL
Match: A0A0B2QI06_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_009238 PE=4 SV=1)

HSP 1 Score: 337.4 bits (864), Expect = 2.4e-89
Identity = 205/429 (47.79%), Postives = 259/429 (60.37%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPETAL+LVKHG T+LLLDVPQYTL+ +DTQMFSVGP+FKGIKMIPPG HF+YYSSSSR
Sbjct: 1   MDPETALELVKHGVTLLLLDVPQYTLVAVDTQMFSVGPAFKGIKMIPPGVHFVYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +G+EFS I GFF+DAG SEVIVRK            +++E+R                  
Sbjct: 61  DGKEFSSIIGFFIDAGPSEVIVRK-----------WDQQEERL---------------IK 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
           L +  E +      N    ++LG +                    ED   L +      +
Sbjct: 121 LSEEEEERYSQAVKNLEFDRQLGPYNLSHY---------------EDWKQLSNF-----I 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
             S+   ++PIGG+I+V CE  I ++ +K  +E  L  QLK    A  V  SQR+ CYYT
Sbjct: 181 TKSVIERLEPIGGEITVECENEIVRNATKMPMEDALGKQLKVGNSATSVGKSQRKGCYYT 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
            IPHV+K +G+ GQELT+LNLDKT LLE LL KD+GGSEDLLLGELQFAFV FLMGQSLE
Sbjct: 241 SIPHVVKCKGISGQELTSLNLDKTQLLETLLAKDYGGSEDLLLGELQFAFVAFLMGQSLE 300

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTKDF--------FSLVLEAPVVDGDLL---TWV 360
            FLQWKSLVSL F CTEAPF TR+ LFTK          + L  +     G  L   +W+
Sbjct: 301 AFLQWKSLVSLLFGCTEAPFRTRTHLFTKFIKVIYNQLKYGLQKDHMGETGSALLDDSWI 360

Query: 361 R-----QYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATSDGISFD 411
                  +  KD  + +  +   + DG    +TRK KELLE +L W+FQ ++  DG+ F+
Sbjct: 361 SADSFLHHLCKDFFSSL--LDGSVVDGDLLKWTRKFKELLERNLGWEFQQSSAVDGMYFE 381

BLAST of Cp4.1LG03g05780 vs. TAIR10
Match: AT1G66510.1 (AT1G66510.1 AAR2 protein family)

HSP 1 Score: 212.2 bits (539), Expect = 5.9e-55
Identity = 97/141 (68.79%), Postives = 121/141 (85.82%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MD E AL+LVKHGAT+L LDVPQYTL+GIDTQ+F+VGP+FKGIKMIPPG HF++YSSS+R
Sbjct: 1   MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +GREFSP  GFFVD   S+VIVRKW+Q++E L KVSEEEE+R+ +AVR LEFD+ LGPYN
Sbjct: 61  DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 120

Query: 121 LGQYGEWKRISNHINCTTIKR 142
           L QYGEW+ +SN+I    +++
Sbjct: 121 LKQYGEWRHLSNYITKDVVEK 141

BLAST of Cp4.1LG03g05780 vs. NCBI nr
Match: gi|778663924|ref|XP_011660184.1| (PREDICTED: protein AAR2 homolog [Cucumis sativus])

HSP 1 Score: 452.2 bits (1162), Expect = 9.6e-124
Identity = 269/436 (61.70%), Postives = 296/436 (67.89%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPET+L+LVKHG T+LLLDVPQYTL+GIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR
Sbjct: 1   MDPETSLELVKHGVTVLLLDVPQYTLLGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +GREFSPITGFFVDAG SEVIVR+WDQREERLVKV EEE                     
Sbjct: 61  DGREFSPITGFFVDAGPSEVIVRRWDQREERLVKVLEEE--------------------- 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
            GQ+ E            I+RL     L    L +   +     K   N + S+T     
Sbjct: 121 EGQFRE-----------AIRRLEFDRQLGPYNLGQYGEW-----KRMSNHINSTTIK--- 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
                  ++PIGGDI+V CEPGISQSTSKSA+EKVLDDQLK SKFA  VDSSQ R CYY 
Sbjct: 181 ------RLEPIGGDITVVCEPGISQSTSKSAVEKVLDDQLKGSKFATPVDSSQSRGCYYA 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
           +IPHVIKQRGVHGQELT LNLDKTLLLE  LKK FGGSEDLLLGELQFAFVVFLMGQSLE
Sbjct: 241 KIPHVIKQRGVHGQELTYLNLDKTLLLENQLKKYFGGSEDLLLGELQFAFVVFLMGQSLE 300

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTK----------------------DFFSLVLEA 360
           GFLQWKSLV+L FEC EAPFCTRSQLFTK                         S++L+ 
Sbjct: 301 GFLQWKSLVTLLFECREAPFCTRSQLFTKFIKVIYHQLKFGLEKDRSNDKAGSSSILLDE 360

Query: 361 PVVDGDLLTWVRQYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATS 412
                D  +++   C KD  +LV   P  + DG    +TRKLKELLEN L WKFQN A  
Sbjct: 361 SWFSAD--SFLHHLC-KDFFSLVLEAP--VVDGDLLTWTRKLKELLENRLGWKFQNIAI- 384

BLAST of Cp4.1LG03g05780 vs. NCBI nr
Match: gi|700211491|gb|KGN66587.1| (hypothetical protein Csa_1G633380 [Cucumis sativus])

HSP 1 Score: 405.2 bits (1040), Expect = 1.4e-109
Identity = 225/329 (68.39%), Postives = 243/329 (73.86%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPET+L+LVKHG T+LLLDVPQYTL+GIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR
Sbjct: 1   MDPETSLELVKHGVTVLLLDVPQYTLLGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +GREFSPITGFFVDAG SEVIVR+WDQREERLVKV EEE                     
Sbjct: 61  DGREFSPITGFFVDAGPSEVIVRRWDQREERLVKVLEEE--------------------- 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
            GQ+ E            I+RL     L    L +   +     K   N + S+T     
Sbjct: 121 EGQFRE-----------AIRRLEFDRQLGPYNLGQYGEW-----KRMSNHINSTTIK--- 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
                  ++PIGGDI+V CEPGISQSTSKSA+EKVLDDQLK SKFA  VDSSQ R CYY 
Sbjct: 181 ------RLEPIGGDITVVCEPGISQSTSKSAVEKVLDDQLKGSKFATPVDSSQSRGCYYA 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
           +IPHVIKQRGVHGQELT LNLDKTLLLE  LKK FGGSEDLLLGELQFAFVVFLMGQSLE
Sbjct: 241 KIPHVIKQRGVHGQELTYLNLDKTLLLENQLKKYFGGSEDLLLGELQFAFVVFLMGQSLE 283

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTK 330
           GFLQWKSLV+L FEC EAPFCTRSQLFTK
Sbjct: 301 GFLQWKSLVTLLFECREAPFCTRSQLFTK 283

BLAST of Cp4.1LG03g05780 vs. NCBI nr
Match: gi|743821223|ref|XP_011021359.1| (PREDICTED: protein AAR2 homolog [Populus euphratica])

HSP 1 Score: 390.2 bits (1001), Expect = 4.5e-105
Identity = 233/438 (53.20%), Postives = 283/438 (64.61%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           MDPETAL+LVK GAT+LLLDVPQYT++GIDTQMFSVGP+FKGIKMIPPGPHF+YYSSSS+
Sbjct: 1   MDPETALELVKQGATLLLLDVPQYTIVGIDTQMFSVGPAFKGIKMIPPGPHFVYYSSSSK 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           +G++FSPI GFFVDA  SEVIVRKW+Q+EERLVKV E+EE+RF +AV+ LEFDR LGPYN
Sbjct: 61  DGKQFSPIVGFFVDAAPSEVIVRKWNQQEERLVKVPEDEEERFCQAVKSLEFDRYLGPYN 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKE-DCNSLESSTFPIA 180
           L QYGEWKR+S+++  T IKR+   P              +  E E D NS ++S     
Sbjct: 121 LSQYGEWKRLSSYLTKTIIKRI--EPI--------GGEITVACESEMDKNSPKTS----- 180

Query: 181 VLISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYY 240
             I   +D Q   G  S +                              VD S++R CYY
Sbjct: 181 --IERALDAQLGTGKFSASAS----------------------------VDRSKKRGCYY 240

Query: 241 TEIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSL 300
           T IP VIK+RG+ G+ELT+LNLDKT LLE +L KD+GGSEDLLLGELQFA++ FLMGQSL
Sbjct: 241 TTIPRVIKRRGMEGKELTSLNLDKTELLESVLIKDYGGSEDLLLGELQFAYIAFLMGQSL 300

Query: 301 EGFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLVL---------------EAPVVDGD 360
           E F QWKSLVSL   C +APF TRS LFTK F  ++                 A +    
Sbjct: 301 EAFFQWKSLVSLLLSCIDAPFRTRSHLFTK-FIKVIFYQLKYGLQKDRKESNGAGIAVSS 360

Query: 361 LL--------TWVRQYCAKDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNA 411
           LL        +++ + C KD   LV+     + DG    +TRKLKELLEN L W+FQ N+
Sbjct: 361 LLDESWFSADSFLHRLC-KDFFLLVQDA--TVVDGDLLTWTRKLKELLENILGWEFQQNS 389

BLAST of Cp4.1LG03g05780 vs. NCBI nr
Match: gi|697150054|ref|XP_009629239.1| (PREDICTED: protein AAR2 homolog [Nicotiana tomentosiformis])

HSP 1 Score: 359.8 bits (922), Expect = 6.5e-96
Identity = 210/434 (48.39%), Postives = 266/434 (61.29%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           M+PE AL+ VK GAT+LLLDVPQ TLIGIDT MFS GP+FKG+KMIPPG HF+YYSSS+R
Sbjct: 1   MEPEAALEFVKQGATMLLLDVPQNTLIGIDTHMFSTGPNFKGVKMIPPGVHFIYYSSSNR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           EG +FSPI GFFVDA  SEVIVRKWD ++ER +K+SEEEE+R+ +AV+ LEF        
Sbjct: 61  EGSQFSPIVGFFVDASPSEVIVRKWDSKDERFIKLSEEEEERYAQAVKNLEF-------- 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
                        +    + R G    L   + +              N++ES       
Sbjct: 121 ----------DRQLGPYALDRYGDWKRLSNYITK--------------NTIES------- 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
                  I+PIGG+I+V  E  + ++  K+A+EKVL +QLK+SKF+  V+ S  + CYYT
Sbjct: 181 -------IEPIGGEITVISESEVVENVPKTAMEKVLAEQLKSSKFSKPVEKSPSKGCYYT 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
            IP VIKQ+G  GQELTN+NLDKT +LE +L K  GGS+D LLGELQF FV FLMGQSLE
Sbjct: 241 SIPRVIKQKGASGQELTNMNLDKTQILETILMKQHGGSDDSLLGELQFTFVAFLMGQSLE 300

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLVLEAPV-----------VDGDLLTWV 360
            FLQWK LVSL   CTEAP  TR+QLFTK   ++  +  V            +   +T +
Sbjct: 301 AFLQWKLLVSLLLGCTEAPLHTRTQLFTKFMKAIYYQLKVGFQKDSKDTGRAEKGAMTLL 360

Query: 361 RQYCA----------KDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATSD 411
            +             KD  +LV   P  + DG    +TRKL+ELLE +L W FQ N+  D
Sbjct: 361 DESLLSADNFLRHLCKDFFSLVLDAP--MVDGDLLTWTRKLRELLEQTLGWDFQQNSAVD 386

BLAST of Cp4.1LG03g05780 vs. NCBI nr
Match: gi|698515871|ref|XP_009802816.1| (PREDICTED: protein AAR2 homolog isoform X1 [Nicotiana sylvestris])

HSP 1 Score: 357.8 bits (917), Expect = 2.5e-95
Identity = 209/434 (48.16%), Postives = 267/434 (61.52%), Query Frame = 1

Query: 1   MDPETALQLVKHGATILLLDVPQYTLIGIDTQMFSVGPSFKGIKMIPPGPHFLYYSSSSR 60
           M+PE AL+ VK GAT+LLLDVPQ TLIG+DT MFS GP+FKG+KMIPPG HF+YYSSS+R
Sbjct: 1   MEPEAALEFVKQGATMLLLDVPQNTLIGVDTHMFSTGPNFKGVKMIPPGVHFIYYSSSNR 60

Query: 61  EGREFSPITGFFVDAGSSEVIVRKWDQREERLVKVSEEEEQRFGEAVRQLEFDRQLGPYN 120
           EG EFSPI GFFVDA  SEVIVRKWD ++ER VK+SEEEE+R+ +AV+ LEF        
Sbjct: 61  EGNEFSPIVGFFVDASPSEVIVRKWDSKDERFVKLSEEEEERYAQAVKNLEF-------- 120

Query: 121 LGQYGEWKRISNHINCTTIKRLGMHPFLFQCLLQKAVMFLLKFEKEDCNSLESSTFPIAV 180
                        +    ++R G    L   + +              N++ES       
Sbjct: 121 ----------DRQLGPYALERYGDWKRLSNYITK--------------NTIES------- 180

Query: 181 LISIGVDIQPIGGDISVACEPGISQSTSKSAIEKVLDDQLKASKFAMHVDSSQRRKCYYT 240
                  I+PIGG+I+V  E  + ++  K+A+EKVL +QLK+SKF+  V+ S  + CYYT
Sbjct: 181 -------IEPIGGEITVISESEVVENVPKTAMEKVLAEQLKSSKFSKPVEKSPSKGCYYT 240

Query: 241 EIPHVIKQRGVHGQELTNLNLDKTLLLEKLLKKDFGGSEDLLLGELQFAFVVFLMGQSLE 300
            IP VIKQ+G  G ELT++NLDKT +LE +L K  GGS+D LLGELQFAFV FLMGQSLE
Sbjct: 241 SIPRVIKQKGASGPELTSMNLDKTQILETILMKQHGGSDDSLLGELQFAFVAFLMGQSLE 300

Query: 301 GFLQWKSLVSLFFECTEAPFCTRSQLFTKDFFSLVLEAPV-----------VDGDLLTWV 360
            FLQWK LVSL   CTEAP  TR+QLFTK   ++  +  V            +   +T +
Sbjct: 301 AFLQWKLLVSLLLGCTEAPLHTRTQLFTKFVKAIYYQLKVGFQKDSKDTGRAEKGAMTLL 360

Query: 361 RQYCA----------KDEVNLVRHVPQRIKDG---YFTRKLKELLENSLDWKFQNNATSD 411
            +             KD  +LV   P  + DG    +TRKL+ELLE +L W FQ N+  D
Sbjct: 361 DESLLSADNFLCHLCKDFFSLVLDAP--MVDGDLLTWTRKLRELLEQTLGWDFQQNSAVD 386

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AAR2_HUMAN2.3e-2430.35Protein AAR2 homolog OS=Homo sapiens GN=AAR2 PE=1 SV=2[more]
AAR2_MOUSE9.9e-2037.76Protein AAR2 homolog OS=Mus musculus GN=Aar2 PE=1 SV=3[more]
AAR2_MACFA9.9e-2040.28Protein AAR2 homolog OS=Macaca fascicularis GN=AAR2 PE=2 SV=1[more]
AAR2_BOVIN9.9e-2040.97Protein AAR2 homolog OS=Bos taurus GN=AAR2 PE=2 SV=1[more]
AAR2_PONAB1.9e-1838.89Protein AAR2 homolog OS=Pongo abelii GN=AAR2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0M0D1_CUCSA9.4e-11068.39Uncharacterized protein OS=Cucumis sativus GN=Csa_1G633380 PE=4 SV=1[more]
A0A0S3RKW3_PHAAN2.5e-9448.96Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G090800 PE=... [more]
B9T7Q0_RICCO1.6e-9349.09Protein C20orf4, putative OS=Ricinus communis GN=RCOM_0140520 PE=4 SV=1[more]
K7KAL2_SOYBN3.7e-9048.39Uncharacterized protein OS=Glycine max GN=GLYMA_02G247400 PE=4 SV=1[more]
A0A0B2QI06_GLYSO2.4e-8947.79Uncharacterized protein OS=Glycine soja GN=glysoja_009238 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G66510.15.9e-5568.79 AAR2 protein family[more]
Match NameE-valueIdentityDescription
gi|778663924|ref|XP_011660184.1|9.6e-12461.70PREDICTED: protein AAR2 homolog [Cucumis sativus][more]
gi|700211491|gb|KGN66587.1|1.4e-10968.39hypothetical protein Csa_1G633380 [Cucumis sativus][more]
gi|743821223|ref|XP_011021359.1|4.5e-10553.20PREDICTED: protein AAR2 homolog [Populus euphratica][more]
gi|697150054|ref|XP_009629239.1|6.5e-9648.39PREDICTED: protein AAR2 homolog [Nicotiana tomentosiformis][more]
gi|698515871|ref|XP_009802816.1|2.5e-9548.16PREDICTED: protein AAR2 homolog isoform X1 [Nicotiana sylvestris][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR007946AAR2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0000278 mitotic cell cycle
biological_process GO:0006457 protein folding
biological_process GO:0009408 response to heat
biological_process GO:0009644 response to high light intensity
biological_process GO:0042542 response to hydrogen peroxide
biological_process GO:0006396 RNA processing
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g05780.1Cp4.1LG03g05780.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007946A1 cistron-splicing factor, AAR2PANTHERPTHR12689A1 CISTRON SPLICING FACTOR AAR2-RELATEDcoord: 1..165
score: 3.1E-103coord: 207..412
score: 3.1E
IPR007946A1 cistron-splicing factor, AAR2PFAMPF05282AAR2coord: 13..384
score: 1.3

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG03g05780Wax gourdcpewgoB0786
Cp4.1LG03g05780Cucurbita pepo (Zucchini)cpecpeB449
Cp4.1LG03g05780Cucurbita maxima (Rimu)cmacpeB494
Cp4.1LG03g05780Cucurbita moschata (Rifu)cmocpeB453
Cp4.1LG03g05780Watermelon (97103) v1cpewmB614
Cp4.1LG03g05780Melon (DHL92) v3.5.1cpemeB586
Cp4.1LG03g05780Silver-seed gourdcarcpeB1073