Cp4.1LG07g07930 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG07g07930
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG07: 7025867 .. 7047893 (-)
RNA-Seq ExpressionCp4.1LG07g07930
SyntenyCp4.1LG07g07930
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCAAAGGAGAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAGATCCACGCCCATGTTCTTGTTAGCGGCCTCCGCCATCACGTCGCCATTAACAACAAGCTTTTGAACTTCTGTGCCATCTCCGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCTACAAACCGAAGCCTGGAACTCCATCATCAGAGGCTTTGCGCAGAGTTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGTGCCTCTTTCTCTTCCCCTGATACATTCACTTTCTCATTTGTGCTCAAAGCATGTGAAAGACTCAAGGCTGAGCGTAAGTGTAAAGAAATTCATGGCACTATAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGTACCAATCTTGTCAAATGCTATGCTGCAATGGGGTCCGTTTGTATTGCCCAACAGGTGTTCGACGAAATGCCTGTGAGAGACTTGGTTGCTTGGAATGCTATGATTTCGTGCTTTTCCCAACAGGGTTTGCACGGGGAGGCATTGCAAGTGTACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACCCTTGTTGGGTTGATTTCTTCATGTGCTCATCTTGGAGCCTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTAGAGAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGTAGTTTAGATCAAGCCATTTTTATCTTTGATAGAATGCATAGGAAGGACATATTCACTTGGAACTCAATGATTGTTGGGTACGGAGTACACGGTCGAGGGACTGAAGCTATATTTTGCTTCGAACGGATGTTAGAAGCAAGAATGCAACCGAACTCCATCACGTTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTAGTTCAAGAAGGTGTTAAATATTTCAATTTGATGAGCTCCAAGTTTAGGCTAAGACCTGAGGTTAAACACTACGGATGCCTCGTGGATTTATACGGTCGAGCTGGGAAGCTCGAAAAGGCACTTGAAACTATACAGAATTCATCACCGAATGATCCGGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAAATCCATAAAAATGTGGGTGTAGGAGAAATTGCCATGAACAATCTCTCTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGCTGGCTACGATCTATGCCGGAGTGAACGATACAGCTGGTGTTGCGAGTATGAGAAAAACGATCAAGAGCCAAGGGATAAAGACTAGCCCAGGTTGGAGTTGGATTGAAATTGGGGAACAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTGATTCCATTGAAGTTTATGAAAAGTTGAGGGAAGTTCTTCATCAAGCTTCCTTGTTTGGATATGTAAGAGATGCAGAGACTTTGAAGACTTCCTCTACATATCATAGTGAGAAACTTGCCATTGCATTTGGATTGGCAAGAACTGCAGATGGGACACCGATACGTATCGTTAAAAACCTTAGAGTTTGCAGAGATTGTCATTCGTTCATGAAGGCTGTCTCGGTGGCGTTCGACAGAGAAATAATCGTTCGAGATCGGGTTCGGTTCCACCATTTCAAGGGTGGCCAATGTTCTTGCAATGACTACTGGTGAAAAGTGAACAATGTACTTCCAGTTCTGCAACCATTCACAAACCTGCTGCTGCATGATGAGTTTATGTAATTACTTGGGCGTTCTTGGAAATAGATATGGTGTTCTTAATCACCATGAAGACTGCATAGCAAGCACAAACTTACCACAACGAGGACGCTTTACAGGTTTGAATTAAATTGGTTATATCATAATCGGGATTGTGATGTCCCACGTTGGTTGAGGAGGAGAACAAAACACCCTTTATAAGGGTGTGGAAACCTTCCTCTGACAAACACGTTTTAAAGCTTTGAGGGGAAGCCCGAAAAGCCAGACACCGAATGATGTGCCAGCTTTCTCGCTGTTCCTCGAAGGGGGTAAGCACGAGGCGGTGTGCCAATAAGGACGCTGGCCCCAAAGAGGGTGGATTTGGTAGAGTCCCACGTCGATTAGAGAAAGGAACGAGTGCTAACGAGAACGCTGGCCCTGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAAACACCCTTTATAAGAGTGTGAAAACCTTCCCTAGTAGACGCGTTTTAAAGCCGTGAGGGGAAGTTCGGAAGGGAAAGCCCAAAGAGGATAATATCTGCTAGCGATAGTCTGGGCCGTTATAGTGATCTTTGAAGTTTGTGTCTATATGGTCTATGAAGTTTCAAAATTTTCTAATGAGTTCCTAAACTTTCAATTTTGTTTATGCTATATGTCACAATCTGCGAGACAACGATAAAGATAGAGGAGCAAGAACAAGGATGGGCAACGAAGGTCAATGAACGTTCAATGATGAAGAAGATGAAAGAATAACGAGAACTGGCAATGAGTTTGTGAAAGTGTGAAAATTTAGGTTATTTTAATTTTTTTTAACAAACGTTAGAAAATGTTTTAATAATCATCTCAACGTTCTTTTTACGTCTAACATTAATGTTTATGGAACAAAAGAAACAAAACATATCAATAGCAAAATATACTCGAAAAACTTCGAAAATGAAGTAAATAGACATTGCATACAAATATTTATAACTGATAATAAAGTATTTATTTTATTTAAATGACCTGTGTCGTTGAGAGTTTGAATCTCTTCAATCGAGAACGTCTACAACAGACCAGGATGCTCTTGGACGAGCTAACATGAAATAAAAAAACACATTCGATAAGGTATTGTAGAATCATAGGTTAAAAAATGGATTAAAAATTTAAATGTTTAGATGAAAATAAAATAAAAATAAAAAAATAAAAAAAAATAAAATGCTTAATAATTTTAGTGGGAATAAATAGCGGGTATTAAAAAACATTCTCCTAACGTCCAGTTTGGCCAATCTTACCGCTCCTCCAATCTTCATCTTCCATGCCCGATCCGACGTCGGACTTCAGATCGATTCGCTCGTTTCAAGTTCTTCTTCCTCTGCATCTCTAATTCTTTCCCTCTCGATTCGTTTCGAAATTCAGCACTTTCGAAGCTCTGAGGGAGAAACTCTTTTGGCAGTTCCTGAAAATTCAGCACAGGGAGGGGAGCGCCACGGAGACGGGCGGAAGTGGTGCCAATCGCGTTTGGTTAGTGAATTTAATCATAATTTTGTTTTGTTCGTTTCTATGCCATTTTTTTAATTATGAGTTGAAAGACGATTGGAAATGGGAGAAGTATCTGTGATGGAGCTGTTCTTCTCTCGTTCTCTTTCTTTCTTCCCTTTTTTTTTTCTTTTGTATTTGGTGAGATTATAGTTGGGTGTGATTCTGTCGCAGGAGGAAATAGGGAGGTCCAGATTCGATCAGGGTTTAATGAAATTCGTGTCTGATTGGAGGAACGTTGAGTGTGATTCGAGGAGAGTGACGAAGCGGGAGTTGGAAATCACGTGGCGCTATTGGACCAGTGAATGCATTTCGACCCCATAATTGCTATTGAAATTTACCCATTTAGGGAGCTGTATAATTTTTAATCAGGCAAGGTTCCCTCTTCTTTCGATTTGTTGAAGGAGGAGGAGGAGGTGAAAGAAGAGAATCGAAGATGTTTAAGTTCTTAAAGGGAGTGGTGGGTGGATCTGGGTCTGGACTCAAAGATCTGCCCTACAACATTGGCGATCCGTACCCATCAGCTTGGGGCTCCTGGACTCATTTTCGCGGCACCTCCAAGGTATGTTTTTTAAGTGGTCGCTTGGACATGAACATGATTTAACTTTGGCATTCCGGATTGCTGAATTGTTGTTTGTTCAATGCTAAATGATTCGTACTGAATTTTGGAATTGACGAGGCTTGTTCTTGTATTGGGGAATTGTCCATCCTTTTCATAGTTGGCTTCTAACTGTTCAGGTCGCTGGTGTGTGGTAGCCTATTTATGGGATGAATTTGCATTGGTTTACATTTGACAAATTTGAGTTTTCTTGTGTTTTGGTTTTCATTGCTCAACCTTTCCAGCTTGTGCGTTACGAGGTAGCGAAATGAACTATCAAAATTCTTCTACGTTGAAATGAAACCATGCTAGCTGAGGTTCATACGCTTAAAAGGAAATGAAATGTAAATCTGGCTGTACATTAACTTGCCTTGCATGATGTTGCCAAACTCTTCTTGTTCTTCCCTTTTCTTTGGAATCTTAAACACGTTAGTTAATTATGAAATGCCCTTTATATAGTTCCTATATAAGAAAAATTTAAATAAGACAGTCTTAGTAAAATAGTGACTTACCAGATCCCAGCGTGGGTTATGGGTACGACTTTATTTGAATAGGATTTGTTCTATTGGTTGTAGTAGAACTGTATCTTTGAGTTCTAGGAAGGAGGTGGGATAATGAATTCTTATGGAAATGGTTATATGAGGTTCTCATATTTTTGTAACAACGCGGAGGCTTTCAAAAGTAGTGGTTTTAGGATATAAGTGTTTTATTTTAACTGGTCATCTGCTATTCTTTTTCTTCATAGAGTAACAATTTTTTTAGATTCGTGAAATTAAGAAGGAGAAGCTAATGACAATCAGCCAAAGAGGAGTGTGTCTGCAATGAAAGAATCAAGATATATGAATCCTTCCACACAATTGGCGATGTAAAAATGAAAACATCCCAGATTCTTACATTGCTTTTTCTTTAGATATTGCAGCCTGGTTGGTAAAACATTTGGAGCAATAAGGAAAGGGTGATGTCAGAACAGCTTTTGAAGTTGATAGAAAATTATTTATTCTTGGTTTCCCTAAAATCCTGCCATGTCTTCCTTGTGCATGGCAATCTCCAATCATGTTTGCTGTTTAGTCTGATGTGAGCCAATAGCAGCCTGGAACTGTATACTTGCTTTTGTTAGGATCTGGAGCTGGGATTTTTTTGCGGAGTTTCAATGAGAAATCCTGTGGAGGATTCATAATAAAAAGGTTGAAGGGAACAAACTCATTCTTTTTTCTAAAGACAGACATCAGGATGGGAAGATCTTTGAGTCTTTGGGATAAAGTGCTATTTTTCACTTTGTGGTGGTCAAAGAGGGTTAAAATTTTAGATTTGTACTTACATTTGCTTCCATTTATGCTGAATGGAGATTTTTTTTAGTTAGAGGATTTTTCATCATTTTTTCTTGTAATGATTCTTTTTATATGATTTTTTTTGTCCAAAAAGAGGCTTGCTGTATTCTCATATTCTCCTATCCAAGGGTTATTCATCCTATGCATATTAGAGATCGTTTGCTAGCTGGTTTAGATGACATCTGTAGAGATGATATTTGAATCTATTTTCGAATTTTGTTGGTCTAACCAATTGCATTTTTGTTCAATTTAAAACTCCTATTTGAGGTGTATTTTGGACATTTTCCTTGTCAAGCGGTTTGCAGCTTATATGTATATTTTATGTCATCTTTATAATCTTATGAAATTTACATTTAAAGTGAAGTTGATGTAAGATTGGTTTCACTTTCAGGATGATGGGTCTCCAGTATCTATATTTTCTCTATCAGGTAGTGATGCACAAGATGGACATTTGGCTGCAGGTCGCAACGGTGTGAAGCGGCTACGAACTGTAAGCTGCAATGCCTATTGTTATGGATCTAGTGCATACAAATGAGTTTTTTTTTCCTTTTTTTCTTTTTTCTTTTACTAAAAAACCAAGCTTTCATTGAGATCAAATGAAATAATACAAGGGGGCTACAAAAAGATCCACCCTCAAGAGGAGCCCAATTAACACAAATGAGATTATTTTGTGTCTATTTATAATTTGGATGGCTTTTGTGCTATATAAACTCTATTTTTGTGGATTTTGATCCTTCTTTTTAAACAGTTGGTGATATTGTTGCAACTTTGCTCATTAATAATGTAATGTTATACCAAAAATGTCATGTGTGTGGTTGAGTCTCAGTTGTATGGGCGAAAAAGTAATTTTCAATAACAAGCAATCTACTTCCCTCTGTCTGCTTCAACTTACGTGTTATGTTCCTAGTTTTACTTCATTTTCCTCTTCATTTATACTAGGGATTGATAATTTCTCTTGGTTCATCCATAGAATCTAACATGTGACTTTTTAAAGAAACGTCGACAGAATGATATCGACATTAATAGTCAAAATCTGCAATTTTATTCAGGTTAGGCATCCAAATATTTTATCCTTTCTTCACAGTACTGAGGCTGAAACTATTGATGGTTCTGCTTCCAAGATTACAATTTATATTGTTACAGAACCTGTTATGCCATTGTCTGAAAAGATCAAGGAGTTGGGTCTAGAAGGTACCCAAAGGTATAAGCAATCCAACCAGCAATATCCAATTTTGGTTTTTACATTCGGTTAACATGTTTTTTTGTTCTGTTTTGTCATTCATTTTTCAGAAGGGGTAAAAATTTTTGTTTTGAGCTTCTAAAATGATCTGCCTTTAGTAGTTTTATGCCTTGGAAATGAATGCTAGTTATTGATCTTTAAGAAGTTCAAAATGTGACAACGACAAGGTATTTTTCATGTTATGAAATGCATCGTTTGTGGAACATATTTGTCTATTTATCATCAAAATAAGATTGCCCAACTCTTGCTCAAGGAGCCCTCTGAAAATTTCTATTGGAGGATTGAAAATCTTATTATATTTGCTGGAAATCACTTATGGCACAAACTGTCTTCCTTCAAGGAAATCTCCACTACTATTTTAATAATGTAAAATTTTGAGCTTGATAGCCATAATATAGATTTTTACCAGTAGCAAACTTTTCATCTTGATACTGCTGCTATCTAGATTCAAAACTCTTATGCTACAAGATTTCATATTTGGAATATGGTATTGGCTTTCACTTTTTGTAAATGCAAGTGTGGTATATTTTAGTCATTGACATTGTTCTGAATTTAAAATGTGAACTTATTTAATATCAATCATGATAACATAAAAATTATTCAATGTATTGAACAATATATTTTTAAAATTTTAATAAAGGAAAGATACACTTTTAGTTCTTGAGTTTTGACTTAGGTTACATTTGAGCGATTGTGGCTTCAAATTCAATATTCTTCCCCTTGAAGCTTGTGATAATCTCAATTTTAGTTTTTCCCGTTAGCATTTTTGCTCACTGTTGTAAATTTTGGTTATGTTGGGCTGATTGGACAATTTGTGTATTATTTTTAAGCCAGCTAAACATTATATATATAAATATATCCTCACAGCCTAAGAAGCATGAATATGGATACGACATGACACAATGTGTCAATTTCAAAAATACTAGGTGGCTCAAACAAGGTAGCAAGAATCTTGGAGGCTCGAACCAACCCTGAAAACTTGAAAAACACAAGAAAAGTAAAAAGCCTTGAAACATAACAAGAGCGAAATTGTCAGCAAAACACATGGGAACCTTTGCAGACTGCAATCAATTGAGAGCCAAGAGATCATAAACTTTAACCAATGCTGAAAAGATTAATCAGGCAACCAACCAATCAGGGCTGAAAAACATGAAAGACGAACCAAAACATCAAAACCTCAGCGAAGACAAAACAAAGACCATATTCTTTAAGAAATGATGCGATTTGGAGGGTGGAAGAGACTTTTTGTCCCAGTCCTAAAATAATATGTCAAAGGCTTTAGAATAAGAATCTTTGAAAGCACCATCGTCCTTGTTGAAATCCAAAAATGAAGGTTCTCGCATTGCTCATGCTTACCTTGGAGTATGCATTAGAGGTGGCGATGTCACCACGTAAAGGAGTAATAAATGCCCAGTCAACCCTACATAACTAAAAGTTTACGGCAATGACCAAAAATGTTGACGATGAGACTAGAATGGAGATCATCCCAAACCCCAGTGGTAAATATGTCGAGTTTGGAAACGAGGAAAGGGCCTATGTGAACTTCTTTGATAATGTTCTTTACCCTGCTCTATCTTGTTGTATATTGAATTCTCTCTTTCATTCCTATAACTTTTCTTTCTTTTTTCCAATTGGAGAAGTTTATTGTAACTCCTTTAAATTGGGGTCTTTCTTTTATTCTTTTATTTATTATTATTATTATTTTTTTGTGTATACTTAATACATCAATGAAATTGTTTCGTATAACAAAATGTTGAGTTTGAAACCTTATAGACCCAAATGCAATCGAAGTCAGACCTTAAGGACCAAAAGTGTACATTTGTTTTTTAATAAATTTGTCTCTCAAAGTCCGTTCAAATGCTTTTAGGAACATGTCATGTCTGTTCTGTGTTCATATCCATACTTCTTAGGAGATAGTCTATCTTTTTCAGTTTTTGTGTTTGATGTTGCTGCAAAATAGTCTTCAAGTATATTAATGATGGGTTTGAAGCCGTCTGTTAGTTTTCTTCTTCCATGGGCTCTCATCACAGCAGTTTTATCGAGAAATGGTTTGATTAGTACTTTGGACTATGGTATTTTTCAAGTTCCACATTGAAGATTATCCATATTTGGCTTCATTGCAGTTCGTATAAGGTCTGGTAGGGATGGAATTCTCCAGCTGTTCTTTGGTTGGTTGTTTTGTTAGTTCCAAGTTTTGGGTTGTATATTTTTCTAGTAATTGTTATGCAACTAAGCCACCATTGCCTATACAATTTGAATCTTGGAAGAAGAGACCTAATAANCTTCACTGAAGTAGTGAGAAATCATACTGTTATTTATCTTATCATGAGGACTAACCATCATTAGACCAGCTGCATAAGGTTTATGAATGTCATCAAGATATAGTGTCTCGATATCGGCAACCATGAATGATTTCAGCTTTCGCCTATTTTCTCGATTCTGGATTGAGTTTTTTTTTTTGTTGACTCAGTGTATTTTTTTTATTCAATCAATTAAAATTTGTTCTTTTCTAAACAAAACAAAAACAAAAACATTCCTGACAAACTCATTCTTCCTCCGGAATACATCCGTTACTGGCAGGGATGAGTATTATGCTTGGGGTCTGCATCAGATAGCTAAAGCTGTGAGCTTCTTAAACAATGACTGTAAACTTGTAAGTATTTTGATATTTTGCATCTGGATATTAATTATGCTAATATTCTATTTGGTCAAACCATCTCTTCATTCTCACATTTGAATTGGCCCTGCCCCAGTCTGCCTCCCCTGCTCCCTTTCAAGTTTCAACTTTTTAACTAGCTAAATCTCCAACAGGTTCATGGTAATGTTTGCTTAGCCAGTGTAGTTGTAACTCCAACCTTGGATTGGAAGCTCCATGCTTTTGACGTGCTTTCTGAGTTTGACGGGAACAATGAATCTTCCAGTGGGCAAATGCTGGTAATAGTATCTCTTGAAGTTTGGAGTTTTGTGCGTGGATGTTTCCTTATCTTTTCTATTGTGTTCCCCATGTTATGTTTATGAGCTGGAATTAAGAAGCAAAAGGTGAAAATTAATACTGAAAGCCTCCTTAGTTGTGAGATGTTTTAAAGAACTGAACATTGACTTTTAGTAGATGATATTTTATAATATAAATGGTGGACTGTCAAATTGAATGCTACTGAGGTCTCAGTCTCAAAACTGAAGTCTGTTTAATTGACACTGTACCCTAGTCAATCGGTAATGGATGCTTTCTCCATCAAAATAGATGTGGCATGAATCGATGGCCATTATTGACATCTTGTGTCTTTGTTCTGCCTTTTGGCTTTCGATTACGTAGGAATGTACAAGGGGTTTATTTTAGATTTTCTCTTTAATTCGTAACCAAAAAGCTATTGTTTGTACAGAAGGTACAACAATTATAATAATATGTTAAGAGGGTAAGATGTTCCCCGTACTGTTGATGTAAAAGCAGATTACAGAAAAAATTCCGCCTAGAATATTTGGAAGTGAAACAAAGATTAAATATCAAAGGGCTCTCTCTCTACTAGACCAACTAAAAGTGAGGAATTTAACTCTAGGCGTACCAGTCCTAACAATTTAAAGCTAGGAAATCTTTCATTGTGGCACTTGCAACAGATAAAGATAGTCATCCTTTGCTTTCTTTTAGGTCCTCTTATTGCATTCAATCTTGGATTTCTGTGTTGGTAACTTGGGTATTTTCTTCAGCAATATGCCTGGCTCATTGGATCCCAATATAAACCGATGGAACTGGTGAAGTCTGACTTGTCTGTTATTAGAAAATCCCCCCCATGGGCCGTTGATTCTTGGGGCTTGGGTAAGTATTGGTTATTCCTACTGTTTGGGTTGCTTATTTATTGGTTTTTTTCAATGAAGGAATTATAGTATCCTGTTGTGCTTTAGTAACCTTGTAGGAATTCTTGCTTTGTTAGTTATTGACAATATATGCAATTTAAAGGGAAGAGTCCTTGATGCTTGAAAATTTAACATTTGCAAACAAATTAGCAAACTTTGATTAGGAGTATGAACAAAAATCTCCTGCTTGGGAGCGTGTTCGATTCTCATTATTGACATCTTGAGTATGAGCACAATCCTCTGCAACAGAACGTCAACAAATAAAATATATTGTGAATATGTTTTTTATTTCCTTCAGAATACTGAATTTTTGTACTGGCAATGGCATAATACCTACATGCTAACCACGTTCTTGTTTCAATATTTTGCAATCTGACGGCTCCCTGGATTCCTATTCTAATATAATGGATGCTTCTCAATCAGTTTGCACTCCTGGTGTTGTAACCAATAGAAGTTATAAATTTAAATCCTTTGGTATCAGGAAAAATCTATATGAACTTGTGCAAAGGGGAAATGGTATTGGAAACAATATTGGCTAATGTAGTGGTTCTGATAAATGTAGAAGATATTTTGACACTATTATCATACATACATATCTTCTCACATTGTTATCATACGTTTACTTTCACTTGCAGTAATGTTACTTTATTTTTGTTGATAGGTTGTCTCATCTATGAACTTTTTTCTGGTTTAAAGTTGGGCAAAACAGAGGAGCTGCGAAATACTGCTTCCATTCCCAAGGTTTCTATTCTTATTTGAAAAATGCTCATGGGATGTTGATTTTTCTCTTATTCTTTTTTCTCAAAGATTTTCATTCCTCACAATCAACTGTATGCTATACTTTAGCTTTAAACAATTTGTTCTCATGTTTTCTCTCTTTCGATTCTATGTCTGTGCACCAAAATACTTTGGCAAGGTGAAAAACAATGTTGTTTAGGAGGAAACGCTGATGAAGAAGTCTCTTTCCTCAATAGATGTGTAATGGAATGATCTAGGGAGTTGAAAGGTTGCTTCTAACACCTATAACTGAATCCCTTGTACTTCGACATTGTCTTTTTAATAAAAATTGAGGTTTCGTTGAAAAGTAAATGGAAAAAAATGCAAGGGGCATACAAAAATAAACCCATCCCTCAAAAGGAGCCAAGGTAAAGACGAGGGCTCGAATGAAGCAAGATCAAATGCAACCAGTAATTACAAAATTCCTTATGCATTGTTTTGGTTTTCATGATCAAACATAACCACCTATGATTCTAGAATGTTTATGTTACTTCCTCGGGAACATTAATTGGGATCTTTCAAAAGCAAGTCAGTTTTTGCTAGTATGTGTCTTTGTGTCCTCTGCTCTATTTGGTATACAAATTCATGTTGCTTTTTAACTTGCAATCTCACTTGATTGCTTTTGAATATTGAGGGATTTGTGGCATACAAATTCATTTTACTTTTTAATTTGCTCTTACTTGATTGCTTCTGAATCTTGAGTGATTTGATTTCATTCGTGGATGTGTCTGCCAGTCTTTACTTCCAGATTATCAACGGCTATTGAGCTCTATGCCTTCTCGCAGGTTGAATACATCCAAGCTTATAGAAAACAGTGGTGAGCTTTCTTATTAACATGTAATTATATATTTGAATTACCAGAATGCACTTTGCATCATTATGGCCTAATTAAAATCGTTTTGTAAATACTTGGATATTAAAGGCCTATGGGAATGTGAGCTGTAGAAAGGTTAGAAATATCATTTTCTATTTTTAAAAATGAAAAATGCTTTAAATTTATTCAATAATTTTCTTTAGGAAGCTTGTGACGCATCAAAAAAGGAGGGGGTTGAAAAAATCCATGGTAGCATTGGACATTAGCATCCTAAATCATAATTGGTATTAAAAAAACTCAAGTACTTAATTTATTTCTTAATATTTAATCAGGTGTTTTAAAATTTATTTTGAAGCAATTTATTAAGCAAAATGTTGTTAAATTTAATTTAATTTTATACAATTAATTAATTTCTAAAAAAGACTAGATTTTTATGATAAATAAATAAAAAATAAAAATTTCTTTTTATATTTCTCATTCCAACATCTAATACAATTATTATCTTCTTTCCAAATATCTTATTCTCAAACTTCATATTTACTTTTCCATCTGTTAAATACTCTATGCTTTCTGAACACTTTTATCTGAGGTCTTTTTAGTAGCTATTTTTAGATGTCACTAGACTGTACCACTTAGTGTATAAACTGAACTTATGATTTGTTTAAAGAAGAAAAAAGAAAAGAAAAAGAGAGAACTATTGAAAGAGAAGTACAAATAAATCCCTAATAGTTATCTAAAATGATATGTCATGGTTTTTTCTGGATGAGACAGAGATATAACATCCAAAAAGGGTCCAAAATTCGGGCCAACAACTGAGAAAGTACTCAAGAATATGGGCCCCTTATTCACAGGGAAAGGTTAAACACAAACCTTCCAATATAAGTGGATGGTAAAAGTAGAATAAAGTCTTAAGAACTTGTGAAGAGCTCTCCAACTTTGGGGCTGATTCTGTCTCCTTTTATTAAAGAGAAACAAGAATTTCTCCATTATTAGAAACTTACGAGAAGAACATATAGTAATAACATCCTTTCCTTATAAAACTGCCAAAATGAATTATTAAAATGGCCATCCAGTTCCAACTACAAAGTTGTTCAGGTAATCAAAATAACAAAGAATTGATCAGTAGATCATCATGCATGCCTCTCCCACACTTCTCCTTGTACTTTTCAGTTTATTTTTTGTATTTAGGTTTAAAATTATATATCATGTATATTTTATTGCACTCGTGTCTCATCATTTGAAAGTTATTTTCTTTTCAGTCATTTTTCTTTTTACATTTCTTATTAGTCTATCTTCTATTTTTTTTTTCTTTCATTTTAACTTTCTGCTCTTGCGTTTATTGATTAATTTGCTATCTATGAATGATAGAATATTTTCAAAATAAGTTGGTCGACACTATACACTTCATGGAAATTCTTAGCCTAAAGGATAGTGTTGAGAAGGATACCTTCTTCCGCAAGCTCCCAATTCTAGCTGAACAACTTCCTCGTCAAATAGTACTGAAAAAGGTGACTTCCAAACTACAGTATTTGTTTTCTTGATGTGCATCGTAAGTATCTCATTCTTTTGTTATTTTACAGTTGCTTCCGTTATTAGCTTCTGCCCTTGAATTTGGTTCAGCTACTGCCCCTGCCTTGACCGCACTGTTAAAAATGGGTTCTTGGCTTTCAACTGAAGAGTTCAATGCTAAGGTTAAACCAAAGTTGAATTATTATTTCGTTTAAGTTTTTGTTTGTTGATCTTTTACTTTTATCTGTCATGAGTTTGATTCTGGTCCCTTACTTTTTTGTTGACTCGGGTTTGGCCTCTATGATGTTCAGGTTCTACCTACGATTGTGAAATTATTTGCTTCCAATGACCGAGCTATCAGAACTGGACTCTTACAACATATTGATCAATTTGGAGAATCATTGTCTTCCCAAATGGTTGATGAACAGGTATGACATCAGTGATAATTATTGCATCTACAGATAAGGAAACTAGTTTATGTTGTTGAGGAGAAAATGATGTTGGATGAGTATTAGATGATTTTTCCACTACCAATTGAAAATATATTAATTTACATTTGAGCTTGATTTCAAAAACAAAACTAAGATTCTGTATAGGGATGCAATCCTTTCTCTCTCTTTTTCTGAAAATTTGCTAGGAATGAGCAGAAGGCAGAAGAGTTTTTTACTATAAGGTGAATAGTTTTCCTATACATTTTGGATGCGATTGTTGTTGAAATTTCCTATCAACCTATTGTTAGTTGTTACTAAGAGCTTCGTAAGAACTTTTTACTTTTTATTTTAATCAATTAGAGATCAGTCTTTTTCAAGTCCTTCGGACATGGATGAAATTAAATCTGCTTCTTTTATATCTTTCGTATTTTCACGTTACTTTGTATCTCTATGCGTACATGTATAGAATAACACACGATGGTGGGCACCTGCTCTCACACTATGAATGCTTCAATGGGGGTTGATTAGATGCTTATGTGATGCAGGGAGGGAAATATTGTATTTTATATTTCTGAGTTTGGCTGGTTATGTTAATGGCATGGCATATTTCATTTTTACAATCTCACCTTGTTTTACTTAGGTCTATCCTCATGTTGCCAATGGGTTCAACGACACATCTGCTTTTCTTCGTGAATTAACTCTTAAATCCATGCTTGTTTTGGCTCCCAAGGTATGCATGTCATTTTTTACTGTTCTTAATGTTTCTCAATAATATTTTAAAATGATAATAATTTCTTTTTTCTTTCGGTAAGGAACTTGAGACTTCAGAAATGAGCAATCAAACAAAAGAAACAACCTATGGCCACATGGACCTGATGTCTTTATACTAGGAACAAGGCTTCTCGTTCTTTTTAGGAATCACACTTGTATGAAGAAACAATGAAAGAATACAAGAACAAACAAGAGAACGAGTGAAACTAACATTAAAAGAGGCTTCAATTGAGCAAAGTAAGAACTAAAAGGTAATTACTAAAATCTTTGTACGCTGAGGCCTAAAGAGAGGTTTTAAACCTTAGAAGGGACCAAACCTCCACCTTCTCAGTTTCTAGTATAGCCCATAAAAGTTGCTGCCAAACAACCATGCCTTTCCTCTTCGTAAGTCTCTCTCACCCTACACCAATTCCCCCACCTCATGTTACCTAACCATGTGACTCAGTGGAGCTCGACCATCTTCAGGTAGCATAGACCAATGACACCCCACTTCCTCTTTGTTCTTCTCAATCAATTGTAAGGCTTTTTCCATCTATGGAAATCAAGGACTTCACATCATTGAATCATTGAGAGACTGTTCCTTCACCCTTACCCTAAAATTTGACTCTCACTCTTGGCTGATCTCTTGTTTCATATCTCTCATCGACTTCCCCTTAACCTAAAGTACTCCGCAGAGAGCCAAATTGATGAATATGTGCTTTCGGTTGAAAAAGTACCCAATCGTAAAGGGCATTGTGTTGAGATAGCCAAACTGGTGTGAATTGAGACTTAAATAAGATTATCCCACCAATTGACGAAGTCAAATCTGGTTGGAAGATGTTTTAAACCTTTTTTTAAAACCCTCCACCACCACCATCCCCCATTAATGCCAAAGGTTTGTACAAGGTGGCGTTAAAGGCTAATCATTGTTTGAAGATTCCTCAGACCTCTACCCCACCATAAATTTTGGCGACCGAACTCCCATCAAAGAATACTCCTTTGTCTGATTTTTGCTAAATTTTATCTGGAAAGGGCAATCATTCTTTATCAGCAGCACCTACATGATGACTCGTATGAGATTGTTAGAGCTTTGCAACACGTGGCCACTGATCTTGGCTCCCTTAGCCCACTCCAACCAAATAAATTTCTCCTAGATGTGAAAACAAGTAACAAGGATGTATTTTGTGCAATATTAAGGGCCAGTACAAGGTGGGTTCATATTAGGTGAAATTTGAACCTTGGAGCACAGAGGCCTTCCATAGAGACCAGATTCCATCATATGGGGATGGATAGGGTGCAAAACCTCCCATTCGACAAATTGTCCTTGGAGATCTTCAAGCAAATGGGTGATTTATGTGGTGGTTTTGTGGAACATCTTAGAAGACACTATCGAGAATGGACATGATGGATTTTTGTTTGAAGATTTAAAAAAAAAAAAAAAAAAAAAATCTATGGCTTTGTGCTTGTCGTTTTTCATTTTCTGTTGTCCTCATTTGCCCATGTGGTGGTGCATATTGACCTGTTTGTCAACTGCGACCTTCACATCAGATTTGATGCTGGAATTCATGGGCAAATTCCCAACTGACACGTCTTCTCCCCTTACGGTTTAGATCAGATGAGAGGATCTTTATTTAATCAAGTTGTGAACCATTTACTAATAGAAACTAGTATCTATATTACCTACCTAGCTTACGGAAACCAACTATTTTACCCTCTAATGGCTTGCTTTTAACTAACTAATGTTACGGCCTACTAACCATTGTTCCTTTAATGGCTAGGTTACAACCTTCTAATTGCAAAATAACTAACATGTCCACCCACATCATTTCATCCATCTAAAATGTAATTACACCTATACAAAGTGGTCTGAGTTTACCATAGCATGCACTTGGAATGCCCATGAGGTGTTGTAAGGTGTGTTGAAGGTTATGAGTTAACGGAGGTCAGTTGCTCCGCCCAATGCCAAAACAGTGAAAACGTACTGCTTTCTTCAAAAACATAAAACTTAATAATTAGATACCAAATGGCTCCTAAGTTTTCTTAAAGAAAGACGCCTGTGTTCTTATCTGAATTCTAGTTGAATTACTAAAAGTAGGGAAGAATATATGCCACCGAATCCAGAAAATTTGAGGCCAGGGTATTTCTTGAGGACACTTATTGTTTTTCTGTCATCTTGAATCGTTGGATGTGCGTTAATATTTTGGCTTATCTTTCTTTTTTAATGTCTTCATGATGGTTTCAATCAATAATTGTCTTATTTCTTGTTATTTATGGAGTTTTGAGATTATCTGAATTGGTATGGCAGCTTTCTCAACGGACTATTTCCGGGTCATTATTGAAGTACCTTTCAAAGTTACAGGTACTGCTTGAGCCATTTGTTGCTAGAATTGTTTTTTTCTTCCGCTATTCTGGTTGGTGTATGTGCAACTATCTATTTTTCTTGACGAGCAGTCCCAAAGAACCCTTTACATTAAAATGGAAAATATAAATAAATCTCTTGGCCAAAACACCACTAACTAACTGAATATCCTATACAGCTTTTCTATCTTTATCCTTGTCCTCACGGGACAGAATACTATTCTGTTTGTAACATAAAAAATAGGTCCTACCTTTACAAGCTTGGACACACAAAATATGCTTCTTTTATTTCCTGGATAACTAAATCCATTAAATAGGCTTAGATTCCTCTCAGCACACATAAACCGAGTTGGAGGTTTGACTTTCCTACCACTCCAGTATTGTTATTCTTTTGTATTATTAACTGAGAACGTTTCTGAGTAAATATAGGTTTTGGGGTCTTCCTTTCCTTTTGTTTTATAAATTTCATACATCAATGAAATTGTTTCTTACAAAAATAATAAATTATTTACTGACAACATTTCTGAGTCCTACAGGTTGATGAAGAAGCAGCAATCCGAACAAATACGACCATATTACTTGGGAACATTGCAAGTTACTTGAATGAAGGGGTGGGTATCTCTGTAAAGCTGCTTGCTGATTCACTCAAAACCATCAGTCAACAATCTTGATGTACTTAATTTTTAGTGATATTTTTCTTTCTTACAGCTATGTTTGGTAATAGTACTAGTGTTTCTTGGTATTTAAAATGCATGCTCTCTTTGTCCTTTATTTCGGATCACTTTTCCTCTTGTTCTTTGTTATTTGATGATCATCTTGATATGATTTTGATTTTTCTATATTTCAACAATGCTCATGTTGTATCTTTTTTTTTTTAACACTTAAACGTATATTAGTCTTTTACTGGAATAGCATCTTTGCTTTGGGTATCCAGGAAGACTTAAGAGATTCATTTCATTCGCTTTTTCTAATCATAAGAAACTGGGATTCATGGTCATAAAGTTTGGGATATTCTGGTTTGGGCTTTATTTAATGCTGCCAGTTGTTTTTTAAATGTACAACTGATCCACTCTTCATAGTCGTATCATCATTTTGCAGATGTTGTAATCCAGTGGTGAACGATTTTTTTTTTTTTTTTTTTTTTTTTTCTATTGAAATAAAATCTGATTGAGTTAAAATCTAAATGCAGACAAGGAAGAGAGTTTTAATTAACGCTTTCACTGTCCGTGCACTGCGTGATACATTTTCTCCAGCCCGTGGTGCAGGTATAAAATTTGTAGCCCCTTTTATTCTCTTTTCTGTTTGCCATATTAGGTCCATCCAAGTGTCTCGATTAAAGGACTTCCTTACATATTATACTTTAATTTCTCAGAGATTTAGTGTTGACTGCCCAATAAATGACGTTATCATGACAGGCATCATGGCATTATGTGCTACAAGTGGATATTATGACAGTGCAGAGATTGCAACTAGGATTCTTCCTAATATTATTGTGCTTACTGTAGATCCTGACAGGTGTGTTATGATGTTCAACATTTTCAATACATTTTTATCTTCCCACCCCCACCAAAAAAATAAAAAAATAATAATGCACATCTCGCCATGCACTTGAAATATCCTGCTTTTTTACATGCCTTGTGTTGATTTTATCAACACTCTTGAATAATTCGGTGTTTTGCATTAAGAAGCCACAAACTTGGTTAGATCATATACTTATTGCTTGAGGTGAATCAGGAGCTTGCATAGCGATATTTAGAAACAATGTTCTTGTGTTATATAAATTATTTTCTTTTTCTTTCATTTTTTTTATATCGAAAGCTGAGCCTTCATTTTGAGAAAAGAAAGAATATAAAGAACTAAAATGTCCAAACACAACTATAAACATAAAGGAGTTGAAAAAAAGTTAATCAACTACTTGAAATAACTTCCTTCATTCCTAGTTCAGTGATGATTTTATATTTCTTTGGAAAATATTTTCACCCTTCTGACTTGTTATACATTATATAGCACGGCATAGCTGACACCTTATGTAGTGGAGAACATGAGAGACAGATGGCTCTTCTAATAAATTTTATTATTGGACTAGTACATGTGGACCATTTCTACTTGTTTCTCAAAATTGGGGATTTTGTTAATTTCCTCTTTTTCCTTTGTATCTGGAATAAATCGTCTCATTTTGTTATTGACTTATAATCATAATATCTATCATTTTACATCATTTCTATATTCATTAATATCTTTCATTTCGTACTTCTCTGGTTGATAGTGATGTTCGATTGAAGTCCTTTCAAGCAGTTGATCAATTCTTACAGATATTAAAGCAAAACAATGAGGTGTGTTAAATACACATTTAGCTCATGACCTCAAATCTTTGCTTTGTTAAGTATTTTATCGTTTCTTGTTACTTAGGATTTTACATTCCTGGGCATAGATAGTTTGATACTAATAACTTCATAGATAGTCTGATACCATTTGCTTTCATATATTTGCTCAATTATGTATGGAAAACCAGTTGAGCTGCTACCAGTGAGAGAGCTAGTGTTTTTTGTTTTCTTATAACCCAAAATATATTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACTCACAAATTCCTGTACATTTTCTGTCTTTGTAATTATTAGATGCAAGAATTAAAGTTTTCTATTTAAGCAGAATACAGTGACACATAGCTTGCCTATTCCTGGAAAAGGAAAANGTGTTAGTCAAATACTGGTCAACTCTCTAATAATGGAATGTGTTTTGTTGACATGTCTCCTATGGTTTATTCTAGGAAATGTCGGGAGATACAGCTGCATTAGGTTTGAACATCCCGTCTCTACCAGGAAATGCTAGTTTGCTCGGGTGAGTTCGTTTGAATATTACAATTGAAAATAATTATAATCATTTGTCTCTGGATTTCTGATGTGGTGTGTAAAGCATACGGTCAAGTCACTTTTATGGTTTGTTTTTATGCTATTGTTGGATTTTCTATTCAAAGTTTCTTGTATCTAATATGTCATTTATTGAAAACCTTTGGTTATGCATAGAAAACACTTTTTACTGATATTGACTGTCCTATTGTGATTAGGATAATACGCCACCTGTGAGATCCCACATCGGTTGGGAAGGAGAACGCAACCACATCGGTTGGGGAGGAGAACGCAACACCCTTTATAAGGGTGTGGAAACCTCTCCCTATTGTGATTAGAAAACACTTTATAAGGGCGTGGAAACCTCTCCCTAGCAGACGTGTTTTAAAAACCTTGAGGGAAAGCCCGAAAGGGAATGCCCAAAGAGGACAATATCTGCTAGCGGTGGACTTGGGTCCTTACACCACTGAAGTGGTAATTAGTAACACTATTGAAGTAGATCGATGTTCTCGTTGCCACAGATTTTTGCAATTTCAAATTTATTGGCCATAGATACACCGCGAAATATGTCATCTCTGCTCAAAATCTGACCCCACCAATCTATTATCCACCCTGATCTCGTATATGTATGCAGATGGGCGATGAGCTCCTTAACTCTAAAAGGAAAACCCTCTGAGCATGGTTCTAGCGCTCCTGTAAGCTCTAATGCACCTTTGGCTGCTACAAGCTCTGATTCCACATTAGGTACATTATATATCTTGAAACTTTTCTTTGAATTTATGCATACGTCTTTCATATTCCTCAGCCATGCTAGTGTTTTTTTCATGGTGGCTGGTGATGGATATGCCTCTCATATTTGTAATTTCTTTGAACTCTATGAGAAAGGGCGTACATCTATTTTGAAGACATTTAGAATGGTGTTTTTCTTATATTTTTTATAATGATAATATAGAATGGTGCTTTATAATGATAATATAGACTTATTGGTGTATTGTTGTAATATCCCACGTTGGTTGGGAGGAGAACGAAACACCCTTTTATAAGGGCTAGGAAACCTCTCCCTAGCAGATGCGTTTTAAAAACCTTGAGGGGAATCTCGAAAGAGAAAGCCCAAAAAGGACAATATCTGCTAGCTGTGGAAAACATAGCAAGGATGAGTATCTGGATGAGTACCGGTTCCGCTGTGTAACTTTGAAATACGCGTTTTAAAGCCTTGTGGGGAAACTTGTAGGGGAAAGCCCAAAGAGGACAATATCTGCTAGCGGTGGGCTTCGACTGTTACAATTGTAATAAATTTTTTGTTGATTTTATTTGCTTTCAGTTGAAAACGCTCCAACTACCTCACCTATAAGGGTAAGCTCGAGTTTCGATTTAACTGAACAACACGCAACTGAATCCCCTACATCGACTGATGGTTGGGGCGAAGTTGAAAATGGAATTCATGATGAAGATGAAAATGAGAAGGATGGGTGGGACGAGTTGGAACCGCTCGAGGAGTCTAAACCGTCTCCAGCTCTTGCCAACATTCAGGCTGCTCAAAAACGACCTGTATCTCAAATTGTGTCACAAACAAAACGACCAAGTAATGCCATTTTCATTCTTGCTTTCTAGATCATAGCTCGACTTCATAATAATAACTTATTTCAGTTCATTCAGATTCAAGTTCGGGTTCAAGAAGTACACCCAGGCCAGCTAAAGAAGACGACAATCTGTGGGGTTCCATAGCTGCCCCTGCTCCAAGAACTGGTTCAAAATCATTGAAAGTAAAAGCAAGCACAACTATTGACGACGACGATCCTTGGGCTGCCATCGCCGCTCCCGCACCAACGACTCGAGCTAAGCCATTGTCAGCTGGTGGGGGAAGAGGAAACAAACCCGCTGCTCCAAAACTAGGCGCACAACGGATAAACCGAACATCGTCGACAGGTATGTGATTACATATAGTACAAATCAAAAAAGAAAAAAAAAAGATTTCAAGATTCAAATTGCCTGCAAATTGGTCAATCCCCTCCATGAGCTGTTTCATCATAGTTTGATATGCAATAGCTTTCTTTTTCCTTGTTCAAGGAATATGATTTTGACATTTTCAAGATGTAATAAATTTATTATTCTTTTTATTAGATTGTAATAAATCTTTCATTTCATACAAATATATACTCAAAATAGATCAAATAAGCAATGAAATTAGATTCGTCTCAACTATAG

mRNA sequence

ATGTCAAAGGAGAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAGATCCACGCCCATGTTCTTGTTAGCGGCCTCCGCCATCACGTCGCCATTAACAACAAGCTTTTGAACTTCTGTGCCATCTCCGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCTACAAACCGAAGCCTGGAACTCCATCATCAGAGGCTTTGCGCAGAGTTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGTGCCTCTTTCTCTTCCCCTGATACATTCACTTTCTCATTTGTGCTCAAAGCATGTGAAAGACTCAAGGCTGAGCGTAAGTGTAAAGAAATTCATGGCACTATAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGTACCAATCTTGTCAAATGCTATGCTGCAATGGGGTCCGTTTGTATTGCCCAACAGGTGTTCGACGAAATGCCTGTGAGAGACTTGGTTGCTTGGAATGCTATGATTTCGTGCTTTTCCCAACAGGGTTTGCACGGGGAGGCATTGCAAGTGTACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACCCTTGTTGGTTTGGCCAATCTTACCGCTCCTCCAATCTTCATCTTCCATGCCCGATCCGACGTCGGACTTCAGATCGATTCGCTCGTTTCAAGTTCTTCTTCCTCTGCATCTCTAATTCTTTCCCTCTCGATTCGTTTCGAAATTCAGCACTTTCGAAGCTCTGAGGGAGAAACTCTTTTGGCAGTTCCTGAAAATTCAGCACAGGGAGGGGAGCGCCACGGAGACGGGCGGAAGTGGTGCCAATCGCGTTTGGAGGAAATAGGGAGGTCCAGATTCGATCAGGGTTTAATGAAATTCGTGTCTGATTGGAGGAACGTTGAGTGTGATTCGAGGAGAGTGACGAAGCGGGAGTTGGAAATCACGTGGCGCTATTGGACCAGTGAATGCATTTCGACCCCATAATTGCTATTGAAATTTACCCATTTAGGGAGCTGTATAATTTTTAATCAGGCAAGGTTCCCTCTTCTTTCGATTTGTTGAAGGAGGAGGAGGAGGTGAAAGAAGAGAATCGAAGATGTTTAAGTTCTTAAAGGGAGTGGTGGGTGGATCTGGGTCTGGACTCAAAGATCTGCCCTACAACATTGGCGATCCGTACCCATCAGCTTGGGGCTCCTGGACTCATTTTCGCGGCACCTCCAAGGATGATGGGTCTCCAGTATCTATATTTTCTCTATCAGGTAGTGATGCACAAGATGGACATTTGGCTGCAGGTCGCAACGGTGTGAAGCGGCTACGAACTGTTAGGCATCCAAATATTTTATCCTTTCTTCACAGTACTGAGGCTGAAACTATTGATGGTTCTGCTTCCAAGATTACAATTTATATTGTTACAGAACCTGTTATGCCATTGTCTGAAAAGATCAAGGAGTTGGGTCTAGAAGGTACCCAAAGGGATGAGTATTATGCTTGGGGTCTGCATCAGATAGCTAAAGCTGTGAGCTTCTTAAACAATGACTGTAAACTTGTTCATGGTAATGTTTGCTTAGCCAGTGTAGTTGTAACTCCAACCTTGGATTGGAAGCTCCATGCTTTTGACGTGCTTTCTGAGTTTGACGGGAACAATGAATCTTCCAGTGGGCAAATGCTGCAATATGCCTGGCTCATTGGATCCCAATATAAACCGATGGAACTGGTGAAGTCTGACTTGTCTGTTATTAGAAAATCCCCCCCATGGGCCGTTGATTCTTGGGGCTTGGGTTGTCTCATCTATGAACTTTTTTCTGGTTTAAAGTTGGGCAAAACAGAGGAGCTGCGAAATACTGCTTCCATTCCCAAGTCTTTACTTCCAGATTATCAACGGCTATTGAGCTCTATGCCTTCTCGCAGGTTGAATACATCCAAGCTTATAGAAAACAGTGAATATTTTCAAAATAAGTTGGTCGACACTATACACTTCATGGAAATTCTTAGCCTAAAGGATAGTGTTGAGAAGGATACCTTCTTCCGCAAGCTCCCAATTCTAGCTGAACAACTTCCTCGTCAAATAGTACTGAAAAAGTTGCTTCCGTTATTAGCTTCTGCCCTTGAATTTGGTTCAGCTACTGCCCCTGCCTTGACCGCACTGTTAAAAATGGGTTCTTGGCTTTCAACTGAAGAGTTCAATGCTAAGGTTCTACCTACGATTGTGAAATTATTTGCTTCCAATGACCGAGCTATCAGAACTGGACTCTTACAACATATTGATCAATTTGGAGAATCATTGTCTTCCCAAATGGTTGATGAACAGGTCTATCCTCATGTTGCCAATGGGTTCAACGACACATCTGCTTTTCTTCGTGAATTAACTCTTAAATCCATGCTTGTTTTGGCTCCCAAGCTTTCTCAACGGACTATTTCCGGGTCATTATTGAAGTACCTTTCAAAGTTACAGGTTGATGAAGAAGCAGCAATCCGAACAAATACGACCATATTACTTGGGAACATTGCAAGTTACTTGAATGAAGGGACAAGGAAGAGAGTTTTAATTAACGCTTTCACTGTCCGTGCACTGCGTGATACATTTTCTCCAGCCCGTGGTGCAGGCATCATGGCATTATGTGCTACAAGTGGATATTATGACAGTGCAGAGATTGCAACTAGGATTCTTCCTAATATTATTGTGCTTACTGTAGATCCTGACAGTGATGTTCGATTGAAGTCCTTTCAAGCAGTTGATCAATTCTTACAGATATTAAAGCAAAACAATGAGGAAATGTCGGGAGATACAGCTGCATTAGGTTTGAACATCCCGTCTCTACCAGGAAATGCTAGTTTGCTCGGATGGGCGATGAGCTCCTTAACTCTAAAAGGAAAACCCTCTGAGCATGGTTCTAGCGCTCCTGTAAGCTCTAATGCACCTTTGGCTGCTACAAGCTCTGATTCCACATTAGTTGAAAACGCTCCAACTACCTCACCTATAAGGGTAAGCTCGAGTTTCGATTTAACTGAACAACACGCAACTGAATCCCCTACATCGACTGATGGTTGGGGCGAAGTTGAAAATGGAATTCATGATGAAGATGAAAATGAGAAGGATGGGTGGGACGAGTTGGAACCGCTCGAGGAGTCTAAACCGTCTCCAGCTCTTGCCAACATTCAGGCTGCTCAAAAACGACCTGTATCTCAAATTGTGTCACAAACAAAACGACCAAATTCAAGTTCGGGTTCAAGAAGTACACCCAGGCCAGCTAAAGAAGACGACAATCTGTGGGGTTCCATAGCTGCCCCTGCTCCAAGAACTGGTTCAAAATCATTGAAAGTAAAAGCAAGCACAACTATTGACGACGACGATCCTTGGGCTGCCATCGCCGCTCCCGCACCAACGACTCGAGCTAAGCCATTGTCAGCTGGTGGGGGAAGAGGAAACAAACCCGCTGCTCCAAAACTAGGCGCACAACGGATAAACCGAACATCGTCGACAGGTATGTGATTACATATAGTACAAATCAAAAAAGAAAAAAAAAAGATTTCAAGATTCAAATTGCCTGCAAATTGGTCAATCCCCTCCATGAGCTGTTTCATCATAGTTTGATATGCAATAGCTTTCTTTTTCCTTGTTCAAGGAATATGATTTTGACATTTTCAAGATGTAATAAATTTATTATTCTTTTTATTAGATTGTAATAAATCTTTCATTTCATACAAATATATACTCAAAATAGATCAAATAAGCAATGAAATTAGATTCGTCTCAACTATAG

Coding sequence (CDS)

ATGTCAAAGGAGAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAGATCCACGCCCATGTTCTTGTTAGCGGCCTCCGCCATCACGTCGCCATTAACAACAAGCTTTTGAACTTCTGTGCCATCTCCGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCTACAAACCGAAGCCTGGAACTCCATCATCAGAGGCTTTGCGCAGAGTTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGTGCCTCTTTCTCTTCCCCTGATACATTCACTTTCTCATTTGTGCTCAAAGCATGTGAAAGACTCAAGGCTGAGCGTAAGTGTAAAGAAATTCATGGCACTATAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGTACCAATCTTGTCAAATGCTATGCTGCAATGGGGTCCGTTTGTATTGCCCAACAGGTGTTCGACGAAATGCCTGTGAGAGACTTGGTTGCTTGGAATGCTATGATTTCGTGCTTTTCCCAACAGGGTTTGCACGGGGAGGCATTGCAAGTGTACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACCCTTGTTGGTTTGGCCAATCTTACCGCTCCTCCAATCTTCATCTTCCATGCCCGATCCGACGTCGGACTTCAGATCGATTCGCTCGTTTCAAGTTCTTCTTCCTCTGCATCTCTAATTCTTTCCCTCTCGATTCGTTTCGAAATTCAGCACTTTCGAAGCTCTGAGGGAGAAACTCTTTTGGCAGTTCCTGAAAATTCAGCACAGGGAGGGGAGCGCCACGGAGACGGGCGGAAGTGGTGCCAATCGCGTTTGGAGGAAATAGGGAGGTCCAGATTCGATCAGGGTTTAATGAAATTCGTGTCTGATTGGAGGAACGTTGAGTGTGATTCGAGGAGAGTGACGAAGCGGGAGTTGGAAATCACGTGGCGCTATTGGACCAGTGAATGCATTTCGACCCCATAA

Protein sequence

MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLLFHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCFSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTAPPIFIFHARSDVGLQIDSLVSSSSSSASLILSLSIRFEIQHFRSSEGETLLAVPENSAQGGERHGDGRKWCQSRLEEIGRSRFDQGLMKFVSDWRNVECDSRRVTKRELEITWRYWTSECISTP
Homology
BLAST of Cp4.1LG07g07930 vs. ExPASy Swiss-Prot
Match: Q9LXY5 (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 1.2e-66
Identity = 120/216 (55.56%), Postives = 165/216 (76.39%), Query Frame = 0

Query: 3   KEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLLFH 62
           K + I+ +LQGCNS+ KLRKIH+HV+++GL+HH +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-LQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAE 122
             +    T  WN +IRGF+ SSSP++++++YN+M+ +S S PD FTF+F LK+CER+K+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCF 182
            KC EIHG++IR G+  D I+ T+LV+CY+A GSV IA +VFDEMPVRDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 218
           S  GLH +AL +Y +M +E V  D +TLV L +  A
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCA 219

BLAST of Cp4.1LG07g07930 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 134.0 bits (336), Expect = 3.1e-30
Identity = 80/219 (36.53%), Postives = 127/219 (57.99%), Query Frame = 0

Query: 8   ITLLQ--GCNSLNKLRKIHAHVLVSGLRHHVAINN----KLLNFCAISVSG--SLAYAQL 67
           I LLQ  G +S+ KLR+IHA      +RH V+I++    K L F  +S+     ++YA  
Sbjct: 19  INLLQTYGVSSITKLRQIHAF----SIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHK 78

Query: 68  LFHQME-CLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERL 127
           +F ++E  +    WN++IRG+A+  + I A   Y +M  +    PDT T+ F++KA   +
Sbjct: 79  VFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 138

Query: 128 KAERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMI 187
              R  + IH  +IR G+   + +  +L+  YA  G V  A +VFD+MP +DLVAWN++I
Sbjct: 139 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVI 198

Query: 188 SCFSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 218
           + F++ G   EAL +Y +M S+ +  DGFT+V L +  A
Sbjct: 199 NGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACA 233

BLAST of Cp4.1LG07g07930 vs. ExPASy Swiss-Prot
Match: Q9FX24 (Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H68 PE=2 SV=2)

HSP 1 Score: 133.7 bits (335), Expect = 4.1e-30
Identity = 74/207 (35.75%), Postives = 114/207 (55.07%), Query Frame = 0

Query: 9   TLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLLFHQMECLQ 68
           T++Q C S ++++++ +H L +G      + ++LL  CAIS  G L++A  +F  +    
Sbjct: 8   TMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIPKPL 67

Query: 69  TEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSP-----DTFTFSFVLKACERLKAERK 128
           T  WN+IIRGFA SS P  A  +Y  M+  S SS      D  T SF LKAC R      
Sbjct: 68  TNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALCSSA 127

Query: 129 CKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCFSQ 188
             ++H  I R G   D ++CT L+  Y+  G +  A ++FDEMPVRD+ +WNA+I+    
Sbjct: 128 MDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVS 187

Query: 189 QGLHGEALQVYNQMRSENVDVDGFTLV 211
                EA+++Y +M +E +     T+V
Sbjct: 188 GNRASEAMELYKRMETEGIRRSEVTVV 214

BLAST of Cp4.1LG07g07930 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 4.5e-29
Identity = 80/242 (33.06%), Postives = 129/242 (53.31%), Query Frame = 0

Query: 8   ITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVS-GSLAYAQLLFHQMEC 67
           ++LL  C +L  LR IHA ++  GL +     +KL+ FC +S     L YA  +F  ++ 
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 68  LQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAERKCKE 127
                WN++ RG A SS P+ A+  Y  M+      P+++TF FVLK+C + KA ++ ++
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGL-LPNSYTFPFVLKSCAKSKAFKEGQQ 156

Query: 128 IHGTIIRCGYD-------------------------------GDVIICTNLVKCYAAMGS 187
           IHG +++ G D                                DV+  T L+K YA+ G 
Sbjct: 157 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 216

Query: 188 VCIAQQVFDEMPVRDLVAWNAMISCFSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANL 218
           +  AQ++FDE+PV+D+V+WNAMIS +++ G + EAL+++  M   NV  D  T+V + + 
Sbjct: 217 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 276

BLAST of Cp4.1LG07g07930 vs. ExPASy Swiss-Prot
Match: Q9MA95 (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E82 PE=3 SV=2)

HSP 1 Score: 127.1 bits (318), Expect = 3.8e-28
Identity = 64/209 (30.62%), Postives = 124/209 (59.33%), Query Frame = 0

Query: 5   KAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFC-AISVSGSLAYAQLLFHQ 64
           K I++ L+ C SL +L ++H  ++ S +  +V   ++L++FC     + +L+YA+ +F  
Sbjct: 7   KPILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFES 66

Query: 65  MECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAERK 124
           ++C     WNS+IRG++ S +P  A+++Y +M+   + SPD FTF +VLKAC  L+  + 
Sbjct: 67  IDCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGY-SPDYFTFPYVLKACSGLRDIQF 126

Query: 125 CKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCFSQ 184
              +HG +++ G++ ++ + T L+  Y   G V    +VF+++P  ++VAW ++IS F  
Sbjct: 127 GSCVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVN 186

Query: 185 QGLHGEALQVYNQMRSENVDVDGFTLVGL 213
                +A++ + +M+S  V  +   +V L
Sbjct: 187 NNRFSDAIEAFREMQSNGVKANETIMVDL 214

BLAST of Cp4.1LG07g07930 vs. NCBI nr
Match: XP_023537237.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 432 bits (1110), Expect = 3.33e-145
Identity = 213/217 (98.16%), Postives = 214/217 (98.62%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENVDVDGFTLVGL +  A
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. NCBI nr
Match: XP_022937704.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita moschata])

HSP 1 Score: 429 bits (1104), Expect = 2.69e-144
Identity = 211/217 (97.24%), Postives = 213/217 (98.16%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIA QVFDEMPVRDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENVDVDGFTLVGL +  A
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. NCBI nr
Match: XP_022965478.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita maxima])

HSP 1 Score: 424 bits (1091), Expect = 3.08e-142
Identity = 208/217 (95.85%), Postives = 212/217 (97.70%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 8   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 67

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAV YYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 68  FHQMECLQTEAWNSIIRGFAQSSSPIDAVAYYNQMVCASFSSPDTFTFSFVLKACERLKA 127

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGT+IRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFD+MPVRDLVAWNA+ISC
Sbjct: 128 ERKCKEIHGTVIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDKMPVRDLVAWNALISC 187

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENV VDGFTLVGL +  A
Sbjct: 188 FSQQGLHGEALQVYNQMRSENVGVDGFTLVGLISSCA 224

BLAST of Cp4.1LG07g07930 vs. NCBI nr
Match: KAG6586142.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 430 bits (1105), Expect = 7.91e-136
Identity = 212/217 (97.70%), Postives = 213/217 (98.16%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIA QVFDEMPVRDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENVDVDGFTLVGL +  A
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. NCBI nr
Match: XP_038890323.1 (pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890324.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890325.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida])

HSP 1 Score: 403 bits (1035), Expect = 9.97e-134
Identity = 192/217 (88.48%), Postives = 207/217 (95.39%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSLN+LRKIHAHV+VSGLRHHVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLNRLRKIHAHVIVSGLRHHVAIGNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPIDA+++YNQMV ASFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIIFYNQMVWASFSSPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKC E+HG++IRCGYDGDVI+CTNLVKCY+AMGS+CIAQQVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCNEVHGSVIRCGYDGDVIVCTNLVKCYSAMGSICIAQQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLH EALQ YNQMRSENVDVDGFTLVGL +  A
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDVDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. ExPASy TrEMBL
Match: A0A6J1FBZ0 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3662 GN=LOC111444025 PE=3 SV=1)

HSP 1 Score: 429 bits (1104), Expect = 1.30e-144
Identity = 211/217 (97.24%), Postives = 213/217 (98.16%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIA QVFDEMPVRDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENVDVDGFTLVGL +  A
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. ExPASy TrEMBL
Match: A0A6J1HP16 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita maxima OX=3661 GN=LOC111465370 PE=3 SV=1)

HSP 1 Score: 424 bits (1091), Expect = 1.49e-142
Identity = 208/217 (95.85%), Postives = 212/217 (97.70%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL
Sbjct: 8   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 67

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMECLQTEAWNSIIRGFAQSSSPIDAV YYNQMVCASFSSPDTFTFSFVLKACERLKA
Sbjct: 68  FHQMECLQTEAWNSIIRGFAQSSSPIDAVAYYNQMVCASFSSPDTFTFSFVLKACERLKA 127

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKEIHGT+IRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFD+MPVRDLVAWNA+ISC
Sbjct: 128 ERKCKEIHGTVIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDKMPVRDLVAWNALISC 187

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLHGEALQVYNQMRSENV VDGFTLVGL +  A
Sbjct: 188 FSQQGLHGEALQVYNQMRSENVGVDGFTLVGLISSCA 224

BLAST of Cp4.1LG07g07930 vs. ExPASy TrEMBL
Match: A0A0A0LH20 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074120 PE=3 SV=1)

HSP 1 Score: 390 bits (1002), Expect = 1.64e-128
Identity = 186/217 (85.71%), Postives = 201/217 (92.63%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+ LLQGCNSL +LRKIHAHV+VSGL HHV I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPIDA+V+YNQMVC SFS PDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKE+HG++IRCGYD DVI+CTNLVKCY+AMGSVCIA+QVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLH EALQ YNQMRSENVD+DGFTLVGL +  A
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. ExPASy TrEMBL
Match: A0A1S4DU66 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN=LOC103485901 PE=3 SV=1)

HSP 1 Score: 379 bits (974), Expect = 7.60e-125
Identity = 185/217 (85.25%), Postives = 199/217 (91.71%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSL +LRKIHAHV+VSGL HHVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQ E  QTEAWNSIIRGFAQSSSPIDA+V+YNQMV  SFS  DTFTFSFVLKACER+KA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKE+HGT+IRCGYD DVI+CTNLVKCY+AMGSV IA+QVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLH EALQ YNQMRSENVD+DGFTLVGL +  A
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. ExPASy TrEMBL
Match: A0A5A7TXJ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G001800 PE=3 SV=1)

HSP 1 Score: 379 bits (974), Expect = 7.60e-125
Identity = 185/217 (85.25%), Postives = 199/217 (91.71%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSL +LRKIHAHV+VSGL HHVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120
           FHQ E  QTEAWNSIIRGFAQSSSPIDA+V+YNQMV  SFS  DTFTFSFVLKACER+KA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180
           ERKCKE+HGT+IRCGYD DVI+CTNLVKCY+AMGSV IA+QVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 217
           FSQQGLH EALQ YNQMRSENVD+DGFTLVGL +  A
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCA 217

BLAST of Cp4.1LG07g07930 vs. TAIR 10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 255.0 bits (650), Expect = 8.6e-68
Identity = 120/216 (55.56%), Postives = 165/216 (76.39%), Query Frame = 0

Query: 3   KEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLLFH 62
           K + I+ +LQGCNS+ KLRKIH+HV+++GL+HH +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-LQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAE 122
             +    T  WN +IRGF+ SSSP++++++YN+M+ +S S PD FTF+F LK+CER+K+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCF 182
            KC EIHG++IR G+  D I+ T+LV+CY+A GSV IA +VFDEMPVRDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 218
           S  GLH +AL +Y +M +E V  D +TLV L +  A
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCA 219

BLAST of Cp4.1LG07g07930 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 134.0 bits (336), Expect = 2.2e-31
Identity = 80/219 (36.53%), Postives = 127/219 (57.99%), Query Frame = 0

Query: 8   ITLLQ--GCNSLNKLRKIHAHVLVSGLRHHVAINN----KLLNFCAISVSG--SLAYAQL 67
           I LLQ  G +S+ KLR+IHA      +RH V+I++    K L F  +S+     ++YA  
Sbjct: 19  INLLQTYGVSSITKLRQIHAF----SIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHK 78

Query: 68  LFHQME-CLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERL 127
           +F ++E  +    WN++IRG+A+  + I A   Y +M  +    PDT T+ F++KA   +
Sbjct: 79  VFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 138

Query: 128 KAERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMI 187
              R  + IH  +IR G+   + +  +L+  YA  G V  A +VFD+MP +DLVAWN++I
Sbjct: 139 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVI 198

Query: 188 SCFSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANLTA 218
           + F++ G   EAL +Y +M S+ +  DGFT+V L +  A
Sbjct: 199 NGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACA 233

BLAST of Cp4.1LG07g07930 vs. TAIR 10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 133.7 bits (335), Expect = 2.9e-31
Identity = 74/207 (35.75%), Postives = 114/207 (55.07%), Query Frame = 0

Query: 9   TLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLLFHQMECLQ 68
           T++Q C S ++++++ +H L +G      + ++LL  CAIS  G L++A  +F  +    
Sbjct: 8   TMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIPKPL 67

Query: 69  TEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSP-----DTFTFSFVLKACERLKAERK 128
           T  WN+IIRGFA SS P  A  +Y  M+  S SS      D  T SF LKAC R      
Sbjct: 68  TNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALCSSA 127

Query: 129 CKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCFSQ 188
             ++H  I R G   D ++CT L+  Y+  G +  A ++FDEMPVRD+ +WNA+I+    
Sbjct: 128 MDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVS 187

Query: 189 QGLHGEALQVYNQMRSENVDVDGFTLV 211
                EA+++Y +M +E +     T+V
Sbjct: 188 GNRASEAMELYKRMETEGIRRSEVTVV 214

BLAST of Cp4.1LG07g07930 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 130.2 bits (326), Expect = 3.2e-30
Identity = 80/242 (33.06%), Postives = 129/242 (53.31%), Query Frame = 0

Query: 8   ITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVS-GSLAYAQLLFHQMEC 67
           ++LL  C +L  LR IHA ++  GL +     +KL+ FC +S     L YA  +F  ++ 
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 68  LQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAERKCKE 127
                WN++ RG A SS P+ A+  Y  M+      P+++TF FVLK+C + KA ++ ++
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGL-LPNSYTFPFVLKSCAKSKAFKEGQQ 156

Query: 128 IHGTIIRCGYD-------------------------------GDVIICTNLVKCYAAMGS 187
           IHG +++ G D                                DV+  T L+K YA+ G 
Sbjct: 157 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 216

Query: 188 VCIAQQVFDEMPVRDLVAWNAMISCFSQQGLHGEALQVYNQMRSENVDVDGFTLVGLANL 218
           +  AQ++FDE+PV+D+V+WNAMIS +++ G + EAL+++  M   NV  D  T+V + + 
Sbjct: 217 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 276

BLAST of Cp4.1LG07g07930 vs. TAIR 10
Match: AT3G05240.1 (mitochondrial editing factor 19 )

HSP 1 Score: 127.1 bits (318), Expect = 2.7e-29
Identity = 64/209 (30.62%), Postives = 124/209 (59.33%), Query Frame = 0

Query: 5   KAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFC-AISVSGSLAYAQLLFHQ 64
           K I++ L+ C SL +L ++H  ++ S +  +V   ++L++FC     + +L+YA+ +F  
Sbjct: 7   KPILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFES 66

Query: 65  MECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKAERK 124
           ++C     WNS+IRG++ S +P  A+++Y +M+   + SPD FTF +VLKAC  L+  + 
Sbjct: 67  IDCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGY-SPDYFTFPYVLKACSGLRDIQF 126

Query: 125 CKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISCFSQ 184
              +HG +++ G++ ++ + T L+  Y   G V    +VF+++P  ++VAW ++IS F  
Sbjct: 127 GSCVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVN 186

Query: 185 QGLHGEALQVYNQMRSENVDVDGFTLVGL 213
                +A++ + +M+S  V  +   +V L
Sbjct: 187 NNRFSDAIEAFREMQSNGVKANETIMVDL 214

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LXY51.2e-6655.56Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX... [more]
A8MQA33.1e-3036.53Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9FX244.1e-3035.75Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX... [more]
Q9LN014.5e-2933.06Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9MA953.8e-2830.62Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
XP_023537237.13.33e-14598.16pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pep... [more]
XP_022937704.12.69e-14497.24pentatricopeptide repeat-containing protein At3g56550 [Cucurbita moschata][more]
XP_022965478.13.08e-14295.85pentatricopeptide repeat-containing protein At3g56550 [Cucurbita maxima][more]
KAG6586142.17.91e-13697.70Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_038890323.19.97e-13488.48pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_03... [more]
Match NameE-valueIdentityDescription
A0A6J1FBZ01.30e-14497.24pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3... [more]
A0A6J1HP161.49e-14295.85pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita maxima OX=366... [more]
A0A0A0LH201.64e-12885.71DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0741... [more]
A0A1S4DU667.60e-12585.25pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7TXJ97.60e-12585.25Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT3G56550.18.6e-6855.56Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.12.2e-3136.53Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G34160.12.9e-3135.75Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.13.2e-3033.06Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G05240.12.7e-2930.62mitochondrial editing factor 19 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 172..205
e-value: 3.5E-8
score: 31.1
coord: 71..104
e-value: 4.7E-4
score: 18.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 170..208
e-value: 2.3E-8
score: 34.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 71..97
e-value: 0.008
score: 16.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 11.388848
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 4..102
e-value: 1.9E-6
score: 29.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 122..217
e-value: 2.3E-16
score: 61.7
NoneNo IPR availablePANTHERPTHR47925:SF17BNAA09G35950D PROTEINcoord: 4..213
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 4..213

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG07g07930.1Cp4.1LG07g07930.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding