Cp4.1LG11g06770 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG11g06770
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG11: 4686268 .. 4695462 (+)
RNA-Seq ExpressionCp4.1LG11g06770
SyntenyCp4.1LG11g06770
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAACTTGATTAAAATTGATGTTTTGTCTTTACACCGTCGTGGACCTTACACGGTTTTGGGAGGGTAAGAATCAGCCATGGAATAAACGCTTTTTAAATTCACCACTTTCCAACCCGTTGGTCAAAACATTAAGCTCTGAACAACACACAAAAATTAAATAAAAGCTTCCACGGTTTGGGCTATGGTGGATCTCTAGCTTCCAAGAAAGGATCTCCCCCTTTCCTCTCACTCGGATTCAAATTGTGGTGGTGTGCACTTGCAGCAAAGCGAAGCAAAGCAATGCAGATCTGATCTTCCTTTCAATCGACGAGAGATAGATCCCACCCTCTTTAATTCTTCTCCGATTCAGATCCTTGTCTACAAAAAAAAAGCAAGATTCCGCTTAATCTGCGCCAGTTATTTCCCTCTGGAATCACTGATCCGGTTTTCTGAAATGGGTTTGCGCTCAAACCCTACGGGAAACCTCTCCTTTTCTCCTTCTTTGATCCTCCTTCTGCTTCTTGTTTCTCTGTTTGCCTCCGTTCAGGTATTCATCTCGCATTTCTCTACCTTTAATTTTTGGTCTTTTTTGCGTTCTTATGAGGGTTTGGTTTGTAATTTTTGTTCGGTGTTTTCAGGTGTTTTCTGCGGAAGTGGAGAAAGAGGAGTTGGATGGACCTAAAGATCTCGGTCGACGTAGCAAGGTGAGTATGATTCCTTGTCTTTCTTGTTTTACTTTGGTTGAAGCTATTTTATGAAATTATTGGTGGGCTGTGAGGCTGCGAGGCTGTGAGGCTTTGATGCCGGCCATGGCAGGAGGGGAAGGGGTTTGGGGGCTGTTGAGATTGAGATCTTTGCGCTTTTTAACATAATTGTTTTCCTTTTACATTGTAAGTAGAACTAACTATTTTGAGTAATGATTCTCTTGCCTTGTATGGGGTATCTTGTGGAACATAATATGCCATTCTGAAAAGAAATTTGAGATTAAGATGCGTTGTCCCAGTTGATCATTTCCCTTTTTCTTTTCTTCTTTAATCTCTAGTTTTATTTCTTTCATGATGTTATTATCGTCTGTATTTCCTCTCATTAGATTTCCTGGAACAACATTGACACAATTGCTGCAAGAAAGGATGTTGTTGACTCAGAAGATCTTAATCTTGACCTGGACTCCGTTGGTCTTGGAGTTTTTGACGCATTTTTTGCTAGTTTGTCCATGATAATTGTCAGTGAGGTTGGTGTTGTTATTTCTATTCTCATTTATACTTTCTGTGAAAAATTTCACTAGAATTGGGTCCATTCCCGATATGCGTTTGGGTGTATTACGGCGGTATCAGTTCAATTGAATTACCGACATGAGCTGAAGAGACAAAAAGATGGGAAATGGGAAATATGGTGGTTTTGGACCTAACTGCTTGTTTCTTCAAGTTTGATTCCTTTAATTTTTTCTAGAATGAATGTGTGTAGATTGGAGATGAGACGTTTATAATAGCTGCACTTATGGCTATGCGCCACCCGAAGTCCATTGTATTATCTGGTGCGCTCAGTGCCCTGATTGTAATGACAGTAAGTTGATCAGATCTTTCATTTTCATCAGGTTCCAAGTCCAAGTTCTTTCTCTGGTGAATAAGAGTAGAACTATTTCCATGTCAATTAGGTACTCTCGACTGGATTAGGTAGGATAGTGCCGAATTTGATATCAAGGAAACATACCAATAATGCTGCTACAGGTACATACATATTATTTTACACTCTTTGGAGTATATCTATTTGTAGAGTGCTGTATGATCTTATTCATTCTATTTGACTCTGACTATCTTTTCTAATATATATTTCCTGATTAACCGTTACCTCCTGATTCCCCTGCTGTTGTCACTTCATTCATACAAAGGGTTGATAATTTGACAATTTTCTTCTACTACTTCACCGTCTCTATTAGTGTCACTTCTCCCTTCGCTGTAATGAAGATATAAAAGAAATCTTAAAGATCCTCCAAGGTTTAATGGTATCTATACGTATAAAAATAAATAAACAAAACTTATACATATAGAAAGAAAAAGATGAATTGGATACGAAATCTTGACTTTAAAGTTATCTACCTAGGCTTGCTACCAAGAAAAAAAGTCTCCTAAAAGCATCTATACCACTAAGGAAGGAAGGAGTTGAATCGCAGACTTACGAATCACTAATATCCCTACCACCTGTAGCAATAAGAAAGAGTTTGGAGGTGAATCGCAATTGGTGATTTTGTTATCGGAGTCTGACCAAACAATATGGGTTATTGATGTAGCTTCATTCCAACCAAAAAGAATTTACCTATATTAAATTGGTTAATATTTTCAATATCTCGACTTCAATATACATATGTTTGATTTATGTACGAGAAATTTATAGGCACTACTTTTCTGATGAAAAAATGCAATCTAGAGTTTTGGGTTCATGTTTTTTTTCCTCGTAGGAGGTGGTACGACTACTTTAGCCAAACCTCCCATATTTTAAGGTATTTGTAGGAAGGCATTCCACTTCTTTGACTCCTAAGGTACTTAACCAATTTTCTAATGATGGATTTATCTAATTCAATATTTTAAATTAATCAATTAAATAACTGAGCCACAGTGCCATACTGGTTTGACATTAAGGAAAACCTTTAGGAGTGATGAATTTATACCATGGTGAAGATAAAACTTTTAGGAGTGATGAATCACTTAAACAACTTAATTATGAGCGTCGGAAGGTTTTATCTGTTGTCTTTGATCCTGCATTTCACATATTAGATATGACTTGATGCATCACTTAAACAACTTTTGCTTGTCATTTTGGGATGTTGTAATGGTACCTTCTTTCCAGAGGATGAGACGACATTTATTATTTTACCTGTTAAACGTGACTCTTCTGTCTAAGTTGTGTACTTTTGCTAGAGTTTTCATAGTTGTTTGTTTTTTCCTATAGTGATGTCCACCTACATATTTATAATTCTGCAGTTCTGTATGCATTTTTTGGATTGCGGTTACTTTACATTGCTTGGAGATCCAAAGCAGACTCAAAATCTTCAACGAAAAAGGAAATGGAAGAAGTATGATATCTTCAACACCTTTTCGGTTTGATGCTTGTTTAGCTTTATTTTGTTTGATTCTTATCTGCATTTTTCTTTGTTGTCTGGCATAATTTCTTAATCTTGAAGTTTTTTTTATTTCAATTGGAATTTCGTGATAAAAGATTCGGTGATTCACGATCTATGGCATGAAGATGATCATAATACAGATGTCTTTGGTAGTTTATATTCTCCCCCTGTTTTATTACATTTATTTGAGGCTTTTATCTGTGATTTAATAAATGAAGATATATGGTGAAAACGGATAAATAGTTATCTGTTCTTTTTCGATTTGGATTTTTAAATTTTTATTTTCTACACCTTATATTTTTTTTGGAGTCATTCTGAATATTGTTGTTTATGTCTTTTATGTCTTGAGTGATTCAATATACCTATATTTCATCTGTCCCCATATTGTTATCTCATTTTCTAAGAAAACTTGTGGCCGTTTTAGCCTTTCCTTGTGTTAGTTCTTTAAATGGTTGACTTCTAAAAATCACGTGTTCAGGTAGAGGAGAAACTCGAGGCTGGACAATCCAAGACGACCTTTCGCCGCTTCTTTCTCCGATTTTGCACACCCATATTCTTGGAGGTTCACAAATCACTATATCACTGATTTGGCATAAAACTATCAATATTGTTCCTTTCTTTTTTTCTTTCTGACTTCTTCCATTAGTGTAAATATTTGTGTTATTCAGATGCATATAATTTTACATATATGTCATTCCTTAGCTGATTCTCGAAATTACTTTGGCTTTTGCAGTCATTTATTTTAACCTTCCTTGCCGAATGGGGAGATCGAAGTCAGATAGCTACAATTGCTGTAAGTTCTTTCATAAAATTGTATGTTTGGTTTGCTTGATATCATCTAGTTGGTACTGGAAGTTGGACTGAACCATGGTTTCAATAGTTTTTATATTCTAGAAACGAGATTTATAACACGAATCTGAATTGGTTTCGATACTCGAAATAAGTGACTTCAGAATGTTTAAGATATGAAAAATGAAGAAGAATTTAAGGCATACAAAACTAATGGGGGAGGGGCGTGTATTGCTGCAGCTAGCAACACACAAGAATGCACTTGGAGTGGCTGTGGGAGCCATCTTGGGGCATTCAGTATGCACATCAATGGCAGTGATCGGTGGAAGCATGTTGGCATCGAAGATATCACAGGGCACAGTTGCAACAGTTGGAGGCTTGCTCTTCCTTGGCTTCTCCTTGTCTTCCTATTTTTTCCCACCTCTATAAGAAGTTCTAACCCCACAGTATTCACTCTTTTTCTTTTTTGCCTGTCATTTTGTTGTTCTAAATTCTTTCCCAACTGTTCATTTTCTTTTTCTTTTTTTACATTTTTAGTAGCTTTGGAGAAGTCAGATGTTTTGGCAATTTCGTCGATTTCGTGTGTTGTTTGTTCAAAAAGCATACACAAATACATTTATATACCGATTCTAACTCTCTATAGTTCGAAAAGCGAATTCATATATAAAGCCTTTTTGTATTCTCCCAACTCGCACTCTAAGCGAATTTTGTTAATACTGATCTTAATGATTTGAAAATACGAAAGGAGATTTAGAAACTATATCTCGACTTAGGTCTTAAATGATCTTTTTTGGCTTTGTAGGTCGTGTCGTTCTTGTCGAAAGGATTCACTCACTCACCATTGCTTCATTAGCCAACATTTTTAATTTGATATGAACTTTGTTTCCATTTAGAAATTTCCATTCTAAATTCTCATCGACTTACTTGAAAACCCTAAAAAACCCAATCCATTGGATCAAATTTAAATAACCACAAGTATAATCTATTAACTAAATTTAAATATGGTTTCAGAAGAAAAAATTTGAAGCAGCCGGGTTCGAACCTCATTCGTTAACGTCCGTTCCACCTTTATGGAGCGAATTCAGCCATATGAACTCGAAGTAAGTTCGGGTTGGGGGATTTAACTCCTACCCATTGCTTGATATGGCTAGCGATGGGTTTGAATTTTGTTTTTGGGTTGTTTCCTCATTGCTCTAATGCCCTGTTTCGAATTCTATTGGAGAATCGTCACTTTCCTTCGCTTACATTATCATTGAAGAACGTGGTTTAAGATGAAATGGATGAATCTGTGCAGTGGTTACTCTCCTTCTACTGCGTTTCTGAAATTTTCCCGTTCTGTTTCTCAAGTTACAATGGCCCAGAAAATCATACCATTCAACTTCTCTGCGCACCACCTGTTCGAGTCATGTAGCTACCACTCTTCAAATGATTCTTTGCCCAATACCCTTCATGCCATGATGGTCAAGAATGGTTCTATTTTCGAATCGAGGAAGTTTATTTTGAGTTCTTATGTGAAATCTGAGAAATTAAACGATGCACGGAAAGTGTTCGATGAAATGCCTAGCAGAGATGTACTTACATGGACGGTCCTTATATCCGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTGTCCAAATCCTTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAGAGTGGGTGATGTGAAAATGGGTAAGGGAATTCATGGATGGATACTCAGAAGTGGGGTTAACTTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGATTATGTCAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGCACTGCTACTTACAACATATTGCTTGGTGTGCATGTCCGTAGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACTGCAAGTTGGAATACAGTAATATGTGGGCTAATGCAAGGTGGGTATCTGAATGAAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCCGAGTTTAACAAAGTTACTTCTTCCATAGCCTTAAGTGTCGTTTCTTCTTTACTGATTTTCGAGCTAGGTAGACAAGTACATGGTCGAATAGTCAGGTGTGGGTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAGTGTGGAAATTTGGAAAAAGCATCGGTGATATATAGTCAAATGCCTTCGGGTTTTGCAAAAAAACAAGATTTCGACATTGTATGTAGCGACACGATGACAGAAATTGTTTCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACCTTTGTTTCTATGGTTCGTGAACAGGTTTTAATGGATAAATTTACCATTGCAAGCGTTGTTTCCGCTTGTTCTAATGCTGGCGTTTTCGAGCTTGGACGTCAAATCCATGCATATATTCAGAAAACAGGGGAACAGCTTGATGCTCACTTGACTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTCTTGATTGTGCCCGTCAAATTTTCGAGCAAACGACTTACTTAAACGTTGTGATATGGACTTCCATGATCACTGGATGTGCTCTACACGGGCAAGGTAAGGAAGCCATTAGACTGTTTGAAAAGATGAGATATGAGGGAATGATACCAAATGAAGTTACTTTTTTAGGAGTTTTAGCAGCTTGCAGTCATGCTGGGCTACTTGAAGATGGCCGTCTATATTTTAACATGATGAAGGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTTCACTTGTATGGTCGATCTTTACGGTCGAGCTGGACGTCTGAACGAAGTCAAGAAATTCATCTACGAGAATGATTTATCACACCTTAATGCAGTTTGGAAGGCATTCCTGTCATCTTGTCAGCTTTACAAGGACATCGAAATGGGGAATTGGGTTTCTGAAAGATTGTTTAGACTCGAACCACTAGACGAAGGGCCTTACGTTTTACTATCGAACATGTGCTCCAGCAATCAGAAATGGGAAGAAGCTTTCAGAACAAGAAGATCTATGCAACACAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAAAATCGAGTCCATTCTTTTGTTGCGGGAGATCGATCGCACCCTCAACACGCTCAGATATATGAATATCTAGACAAGCTTATTGGAAGATTGAAGGAAATTGGGTATTTGTTTGATGTAAAATTGGTGATGCAAGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAATTGCTTATGGCTTAATCAGTTTGGGCTCTTCCATTCCAATCCGAATCATGAAGAACCTTAGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTATGCAGGGAGATCATTGCACGAGATATTCATCGTTTCCATCGTTTTAACTCAGGTCATTGCTCGTGTGGTGATTATTGGTGAGACGAGATGCAGAATCTGATGCCTGTGAATTTTCATGTATAAACTTACGTTATTCTGATGCCCAAGATAAAAGGAACGAGGGGAAGAATCTACCCCGAGATGTTAGAAGGTGCAAAATCAATGGGTGCTGGAGCTGCTACAATTGCTTCAGTGGGAGCTGTGTCGGTATTAGAAACGTGTTCAGTTTTTTGATCCATTTCATGACGCGTCCAAGGCTTGGATTAGACTTATGAGAATAGCTTTTCTTAAATGGAAGGAGGGTGCATACATAACAGTTCTAGAAAGGAAATCAATATGCAATCACAAAGGATCTAGCAAAGAGGTTTGGCATGAAATATACAATTTTATGTTTCCTTAACACAGACATGGAAGTTACATTGCAGGAACAGAACATGATACAAAATCTACTTGACAAAGTAAGGGAGGTCCAAGACATAGAGGAGGAAGTATGCCCAGATCACGCCGGAGATCCCGCCGAAGAAGAACCCGCCAGTGAATTTCGCCCACCCGTCAGCAGTTTGCAGCTGATCAGGGGTCTTTTTCCTGCCGGTCAAGGTCAATGACGGTGCAGTGGATGGCTCTCCTTCATTGAAGGATGCAATGCCATACATTGTCAAGCAGATGCTCAAAATGGCAATGAGACCACCGGCGGCCAGTGAGCCAGCTCCTCCGGCGTAGGCTGTGTTCCTTAGAGGGCCAGCTTTCACAAATGGACCGACCAGAAGAAAACCATGTGCCAGACCTACTTCAATTCCCCTCAGCAATGGGTTGACTGCTGTTCTGTAGGCTGGGAGGTTTGAGAGGTACCATGCTATCAATGGGCTTGAGGTTACTGGAGTCTCGAGGCTTCCAATGAATGGGTCTCCATTGATTGGCTGGATCACTTGGTACGTTGGCTGCAACAATATCCATAGCTTATTATTATATGGAGAATTGCTTAACAATAAATGCACTCCCAAAACACCTTTTTTTTTTTTTTTATCGGTTAGCAAATATAATCTCCTTTGGACTTTCCTTCAAAATTTTTAAAATGCGTTTGCTAAAGAGAAGTTTCCACTCTTTTATAAAAAAATGTTTTGTTCTCTTCCTCATCCGATGTGGGGTCTCACAATCCACCTCCCTTTGAGGCCTAGCGTCCTCGCTGTCACATGTTCCTTTCTCCAATCGATGTGAGATCTTCTAATCCACTCCCTTTCAGGGACCTGCATCCTTACAAGCACACCGTCTTGTGTCCACCCCTTTCGGGGCTCAGCGTACTCGCTGACACATAGTCTGGTGTATAGCTCTAATATCATTTGTAATGGTCCAAGCCCATCGTTAACAAATATTGTCTTTTTTGGGCTTTCCCTTTCGTGCTTCCCCTCAACGTTTTTAAAACGTGTCTGTGCTAGGGAGAGGTTTCTACACCTTGCAAAGAATGTTTCGTTCTCCTCTCCAACTTGAGATGTCATGAAATAGTATACATTACTTGGCATGAAATATTTCTTAAAAGCTCAAAGCTAGGCAGATGAAATAGAATTGGGAGATGATGGTTGGTTAGATTACCTTATCAGCTTGGATGGCTCTGATAGTGAAGGAGTGTGTTCTTGCAGGTATGCGTCGGAGCGGCGAAACGGAGAGGCCTTTGGGAGCAACCAAAGCTCTGGTGGTGGAAGAAGCAAAGCTGGAACTGAGCTGACTGGCAATGGGGGAGGTTGCAGTGGCCATGGCTGCTGCTGCCTTTGGCTTGTGGAAGAATCAGTGGCAGAGAAGAAGATGAATGAG

mRNA sequence

ATGCAACTTGATTAAAATTGATGTTTTGTCTTTACACCGTCGTGGACCTTACACGGTTTTGGGAGGGTAAGAATCAGCCATGGAATAAACGCTTTTTAAATTCACCACTTTCCAACCCGTTGGTCAAAACATTAAGCTCTGAACAACACACAAAAATTAAATAAAAGCTTCCACGGTTTGGGCTATGGTGGATCTCTAGCTTCCAAGAAAGGATCTCCCCCTTTCCTCTCACTCGGATTCAAATTGTGGTGGTGTGCACTTGCAGCAAAGCGAAGCAAAGCAATGCAGATCTGATCTTCCTTTCAATCGACGAGAGATAGATCCCACCCTCTTTAATTCTTCTCCGATTCAGATCCTTGTCTACAAAAAAAAAGCAAGATTCCGCTTAATCTGCGCCAGTTATTTCCCTCTGGAATCACTGATCCGGTTTTCTGAAATGGGTTTGCGCTCAAACCCTACGGGAAACCTCTCCTTTTCTCCTTCTTTGATCCTCCTTCTGCTTCTTGTTTCTCTGTTTGCCTCCGTTCAGGTGTTTTCTGCGGAAGTGGAGAAAGAGGAGTTGGATGGACCTAAAGATCTCGGTCGACGTAGCAAGATTTCCTGGAACAACATTGACACAATTGCTGCAAGAAAGGATGTTGTTGACTCAGAAGATCTTAATCTTGACCTGGACTCCGTTGGTCTTGGAGTTTTTGACGCATTTTTTGCTAGTTTGTCCATGATAATTGTCAGTGAGATTGGAGATGAGACGTTTATAATAGCTGCACTTATGGCTATGCGCCACCCGAAGTCCATTGTATTATCTGGTGCGCTCAGTGCCCTGATTGTAATGACAGTACTCTCGACTGGATTAGGTAGGATAGTGCCGAATTTGATATCAAGGAAACATACCAATAATGCTGCTACAGTTCTGTATGCATTTTTTGGATTGCGGTTACTTTACATTGCTTGGAGATCCAAAGCAGACTCAAAATCTTCAACGAAAAAGGAAATGGAAGAAGTAGAGGAGAAACTCGAGGCTGGACAATCCAAGACGACCTTTCGCCGCTTCTTTCTCCGATTTTGCACACCCATATTCTTGGAGTCATTTATTTTAACCTTCCTTGCCGAATGGGGAGATCGAAGTCAGATAGCTACAATTGCTCTAGCAACACACAAGAATGCACTTGGAGTGGCTGTGGGAGCCATCTTGGGGCATTCAGTATGCACATCAATGGCAGTGATCGGTGGAAGCATGTTGGCATCGAAGATATCACAGGGCACAGTTGCAACAGTTGGAGGCTTGCTCTTCCTTGGCTTCTCCTTTAGCTTTGGAGAAGTCAGATGTTTTGGCAATTTCGTCGATTTCAAGAAAAAATTTGAAGCAGCCGGGTTCGAACCTCATTCGTTAACGTCCGTTCCACCTTTATGGAGCGAATTCAGCCATATGAACTCGAAAACGTGGTTTAAGATGAAATGGATGAATCTGTGCAGTGGTTACTCTCCTTCTACTGCGTTTCTGAAATTTTCCCGTTCTGTTTCTCAAGTTACAATGGCCCAGAAAATCATACCATTCAACTTCTCTGCGCACCACCTGTTCGAGTCATGTAGCTACCACTCTTCAAATGATTCTTTGCCCAATACCCTTCATGCCATGATGGTCAAGAATGGTTCTATTTTCGAATCGAGGAAGTTTATTTTGAGTTCTTATGTGAAATCTGAGAAATTAAACGATGCACGGAAAGTGTTCGATGAAATGCCTAGCAGAGATGTACTTACATGGACGGTCCTTATATCCGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGATACTGCAAGTTGGAATACAGTAATATGTGGGCTAATGCAAGGTGGGTATCTGAATGAAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCCGAGTTTAACAAAGTTACTTCTTCCATAGCCTTAAGTGTCGTTTCTTCTTTACTGATTTTCGAGCTAGGTAGACAAGTACATGGTCGAATAGTCAGGTGTGGGTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAGTGTGGAAATTTGGAAAAAGCATCGGTGATATATAGTCAAATGCCTTCGGGTTTTGCAAAAAAACAAGATTTCGACATTGTATGTAGCGACACGATGACAGAAATTGTTTCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACCTTTGTTTCTATGGTTCGTGAACAGGTTTTAATGGATAAATTTACCATTGCAAGCGTTGTTTCCGCTTGTTCTAATGCTGGCGTTTTCGAGCTTGGACACATGGAAGTTACATTGCAGGAACAGAACATGATACAAAATCTACTTGACAAAGTAAGGGAGGTCCAAGACATAGAGGAGGAAGTATGCCCAGATCACGCCGGAGATCCCGCCGAAGAAGAACCCGCCAGTGAATTTCGCCCACCCGTCAGCAGTTTGCAGCTGATCAGGGGTCTTTTTCCTGCCGGTCAAGGATGCAATGCCATACATTGTCAAGCAGATGCTCAAAATGGCAATGAGACCACCGGCGGCCAGTATGCGTCGGAGCGGCGAAACGGAGAGGCCTTTGGGAGCAACCAAAGCTCTGGTGGTGGAAGAAGCAAAGCTGGAACTGAGCTGACTGGCAATGGGGGAGGTTGCAGTGGCCATGGCTGCTGCTGCCTTTGGCTTGTGGAAGAATCAGTGGCAGAGAAGAAGATGAATGAG

Coding sequence (CDS)

ATGGGTTTGCGCTCAAACCCTACGGGAAACCTCTCCTTTTCTCCTTCTTTGATCCTCCTTCTGCTTCTTGTTTCTCTGTTTGCCTCCGTTCAGGTGTTTTCTGCGGAAGTGGAGAAAGAGGAGTTGGATGGACCTAAAGATCTCGGTCGACGTAGCAAGATTTCCTGGAACAACATTGACACAATTGCTGCAAGAAAGGATGTTGTTGACTCAGAAGATCTTAATCTTGACCTGGACTCCGTTGGTCTTGGAGTTTTTGACGCATTTTTTGCTAGTTTGTCCATGATAATTGTCAGTGAGATTGGAGATGAGACGTTTATAATAGCTGCACTTATGGCTATGCGCCACCCGAAGTCCATTGTATTATCTGGTGCGCTCAGTGCCCTGATTGTAATGACAGTACTCTCGACTGGATTAGGTAGGATAGTGCCGAATTTGATATCAAGGAAACATACCAATAATGCTGCTACAGTTCTGTATGCATTTTTTGGATTGCGGTTACTTTACATTGCTTGGAGATCCAAAGCAGACTCAAAATCTTCAACGAAAAAGGAAATGGAAGAAGTAGAGGAGAAACTCGAGGCTGGACAATCCAAGACGACCTTTCGCCGCTTCTTTCTCCGATTTTGCACACCCATATTCTTGGAGTCATTTATTTTAACCTTCCTTGCCGAATGGGGAGATCGAAGTCAGATAGCTACAATTGCTCTAGCAACACACAAGAATGCACTTGGAGTGGCTGTGGGAGCCATCTTGGGGCATTCAGTATGCACATCAATGGCAGTGATCGGTGGAAGCATGTTGGCATCGAAGATATCACAGGGCACAGTTGCAACAGTTGGAGGCTTGCTCTTCCTTGGCTTCTCCTTTAGCTTTGGAGAAGTCAGATGTTTTGGCAATTTCGTCGATTTCAAGAAAAAATTTGAAGCAGCCGGGTTCGAACCTCATTCGTTAACGTCCGTTCCACCTTTATGGAGCGAATTCAGCCATATGAACTCGAAAACGTGGTTTAAGATGAAATGGATGAATCTGTGCAGTGGTTACTCTCCTTCTACTGCGTTTCTGAAATTTTCCCGTTCTGTTTCTCAAGTTACAATGGCCCAGAAAATCATACCATTCAACTTCTCTGCGCACCACCTGTTCGAGTCATGTAGCTACCACTCTTCAAATGATTCTTTGCCCAATACCCTTCATGCCATGATGGTCAAGAATGGTTCTATTTTCGAATCGAGGAAGTTTATTTTGAGTTCTTATGTGAAATCTGAGAAATTAAACGATGCACGGAAAGTGTTCGATGAAATGCCTAGCAGAGATGTACTTACATGGACGGTCCTTATATCCGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGATACTGCAAGTTGGAATACAGTAATATGTGGGCTAATGCAAGGTGGGTATCTGAATGAAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCCGAGTTTAACAAAGTTACTTCTTCCATAGCCTTAAGTGTCGTTTCTTCTTTACTGATTTTCGAGCTAGGTAGACAAGTACATGGTCGAATAGTCAGGTGTGGGTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAGTGTGGAAATTTGGAAAAAGCATCGGTGATATATAGTCAAATGCCTTCGGGTTTTGCAAAAAAACAAGATTTCGACATTGTATGTAGCGACACGATGACAGAAATTGTTTCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACCTTTGTTTCTATGGTTCGTGAACAGGTTTTAATGGATAAATTTACCATTGCAAGCGTTGTTTCCGCTTGTTCTAATGCTGGCGTTTTCGAGCTTGGACACATGGAAGTTACATTGCAGGAACAGAACATGATACAAAATCTACTTGACAAAGTAAGGGAGGTCCAAGACATAGAGGAGGAAGTATGCCCAGATCACGCCGGAGATCCCGCCGAAGAAGAACCCGCCAGTGAATTTCGCCCACCCGTCAGCAGTTTGCAGCTGATCAGGGGTCTTTTTCCTGCCGGTCAAGGATGCAATGCCATACATTGTCAAGCAGATGCTCAAAATGGCAATGAGACCACCGGCGGCCAGTATGCGTCGGAGCGGCGAAACGGAGAGGCCTTTGGGAGCAACCAAAGCTCTGGTGGTGGAAGAAGCAAAGCTGGAACTGAGCTGACTGGCAATGGGGGAGGTTGCAGTGGCCATGGCTGCTGCTGCCTTTGGCTTGTGGAAGAATCAGTGGCAGAGAAGAAGATGAATGAG

Protein sequence

MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNIDTIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGNFVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRSVSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDTASWNTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFAKKQDFDIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREQVLMDKFTIASVVSACSNAGVFELGHMEVTLQEQNMIQNLLDKVREVQDIEEEVCPDHAGDPAEEEPASEFRPPVSSLQLIRGLFPAGQGCNAIHCQADAQNGNETTGGQYASERRNGEAFGSNQSSGGGRSKAGTELTGNGGGCSGHGCCCLWLVEESVAEKKMNE
Homology
BLAST of Cp4.1LG11g06770 vs. ExPASy Swiss-Prot
Match: Q93Y38 (GDT1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=At5g36290 PE=2 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 1.4e-97
Identity = 209/297 (70.37%), Postives = 238/297 (80.13%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEV---EKEELDGP-KDLGRRSKISW 60
           MGL SNPT        LIL+  +  L +S+    + V   E++E +G  K+LGRR  +  
Sbjct: 1   MGLISNPT-------RLILVATIFFLVSSISGQDSVVENNERQESEGSGKELGRRGMVGT 60

Query: 61  NNI--DTIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAM 120
             I  DT+    D + +  LNLDLD+    VFDA F+S SMI+V+EIGDETFIIAALMAM
Sbjct: 61  ERIGVDTVV---DNIGALGLNLDLDATAPSVFDALFSSFSMILVTEIGDETFIIAALMAM 120

Query: 121 RHPKSIVLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRS 180
           RHPK+ VLSGALSAL VMT+LSTGLGRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS
Sbjct: 121 RHPKATVLSGALSALFVMTILSTGLGRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS 180

Query: 181 KADSKSSTKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIAT 240
             DSKS+ KKEMEEVEEKLE+GQ KT FRR F RFCTPIFLESFILTFLAEWGDRSQIAT
Sbjct: 181 -TDSKSNQKKEMEEVEEKLESGQGKTPFRRLFSRFCTPIFLESFILTFLAEWGDRSQIAT 240

Query: 241 IALATHKNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           IALATHKNA+GVA+GA +GH+VCTS+AV+GGSMLAS+ISQ TVATVGGLLFLGFS S
Sbjct: 241 IALATHKNAIGVAIGASIGHTVCTSLAVVGGSMLASRISQRTVATVGGLLFLGFSVS 286

BLAST of Cp4.1LG11g06770 vs. ExPASy Swiss-Prot
Match: Q6ZIB9 (GDT1-like protein 4 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0528500 PE=2 SV=1)

HSP 1 Score: 355.9 bits (912), Expect = 1.2e-96
Identity = 201/272 (73.90%), Postives = 225/272 (82.72%), Query Frame = 0

Query: 20  LLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNIDTIAARKDVVDSEDLNLDLD 79
           LLLL+ L A+    +A  ++E+  G  D G         +   AAR     S+     ++
Sbjct: 10  LLLLLLLVAAAAAAAAAGDQEDPRGGGDNGTARLDRRTKMFLHAARA----SDGGATGME 69

Query: 80  SVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVMTVLSTGL 139
             GLG+FDAFFASLSMI+VSEIGDETFIIAALMAMRHPKS VLSGALSAL+VMT+LSTGL
Sbjct: 70  KAGLGLFDAFFASLSMILVSEIGDETFIIAALMAMRHPKSTVLSGALSALVVMTILSTGL 129

Query: 140 GRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEKLEAGQSK 199
           GRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS  DSK+S KKE+EEVEEKLEAGQ K
Sbjct: 130 GRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS--DSKASQKKEIEEVEEKLEAGQGK 189

Query: 200 TTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAILGHSVCTS 259
           +TFRR F RFCTPIFLESF+LTFLAEWGDRSQIATIALATHKNA+GVAVGA LGH++CTS
Sbjct: 190 STFRRIFSRFCTPIFLESFVLTFLAEWGDRSQIATIALATHKNAVGVAVGATLGHTICTS 249

Query: 260 MAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
            AV+GGSMLASKISQGTVAT+GGLLFLGFS S
Sbjct: 250 FAVVGGSMLASKISQGTVATIGGLLFLGFSLS 275

BLAST of Cp4.1LG11g06770 vs. ExPASy Swiss-Prot
Match: A2YXC7 (GDT1-like protein 4 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_29993 PE=3 SV=1)

HSP 1 Score: 355.5 bits (911), Expect = 1.5e-96
Identity = 205/276 (74.28%), Postives = 229/276 (82.97%), Query Frame = 0

Query: 17  LILLLLLVSLFASVQVFSAEVEKEELD-GPKDLGRRSKISWNNIDTIAARKDVVDSEDLN 76
           L+LLLLLV+  A+      E  +   D G   L RR+K+  +     AAR     S+   
Sbjct: 10  LLLLLLLVAAAAAAAAGDQEDPRGGGDNGTARLDRRTKMFLH-----AARA----SDGGA 69

Query: 77  LDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVMTVL 136
             ++  GLG+FDAFFASLSMI+VSEIGDETFIIAALMAMRHPKS VLSGALSAL+VMT+L
Sbjct: 70  TGMEKAGLGLFDAFFASLSMILVSEIGDETFIIAALMAMRHPKSTVLSGALSALVVMTIL 129

Query: 137 STGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEKLEA 196
           STGLGRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS  DSK+S KKE+EEVEEKLEA
Sbjct: 130 STGLGRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS--DSKASQKKEIEEVEEKLEA 189

Query: 197 GQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAILGHS 256
           GQ K+TFRR F RFCTPIFLESF+LTFLAEWGDRSQIATIALATHKNA+GVAVGA LGH+
Sbjct: 190 GQGKSTFRRIFSRFCTPIFLESFVLTFLAEWGDRSQIATIALATHKNAVGVAVGATLGHT 249

Query: 257 VCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           +CTS AV+GGSMLASKISQGTVAT+GGLLFLGFS S
Sbjct: 250 ICTSFAVVGGSMLASKISQGTVATIGGLLFLGFSLS 274

BLAST of Cp4.1LG11g06770 vs. ExPASy Swiss-Prot
Match: A2ZE50 (GDT1-like protein 3 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_36063 PE=3 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.7e-95
Identity = 206/279 (73.84%), Postives = 232/279 (83.15%), Query Frame = 0

Query: 14  SPSLILLLLLVSLFASVQVFSAEVEKEELDGPK-DLGRRSKISWNNIDTIAARKDVVDSE 73
           +P L++LL+L++  A+V V  AE + E   G K  LGRR+    + +     +K+ V   
Sbjct: 4   NPRLLILLVLLAFSATVAV--AE-DGESTGGSKVSLGRRAGGFLHGL-----KKEAVVEG 63

Query: 74  DLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVM 133
           D  + LD VG G+FDA FASLSMI+VSEIGDETFIIAALMAMRHPKSIVLSGALSAL VM
Sbjct: 64  DHGVALDEVGPGLFDALFASLSMILVSEIGDETFIIAALMAMRHPKSIVLSGALSALYVM 123

Query: 134 TVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEK 193
           TVLSTGLGRIVPNLISRKHTN+AATVLY FFGLRLLYIAW  K+D K S KKEMEEVEEK
Sbjct: 124 TVLSTGLGRIVPNLISRKHTNSAATVLYLFFGLRLLYIAW--KSDPKGSQKKEMEEVEEK 183

Query: 194 LEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAIL 253
           LE+GQ K+T RRFF RFCTPIFLE+FILTFLAEWGDRSQIATIALATHKNA+GVAVGA L
Sbjct: 184 LESGQGKSTLRRFFGRFCTPIFLEAFILTFLAEWGDRSQIATIALATHKNAIGVAVGASL 243

Query: 254 GHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           GH+VCTS+AVIGGSMLASKISQ TVAT+GG+LFLGFS S
Sbjct: 244 GHTVCTSLAVIGGSMLASKISQRTVATIGGVLFLGFSVS 272

BLAST of Cp4.1LG11g06770 vs. ExPASy Swiss-Prot
Match: Q2R4J1 (GDT1-like protein 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os11g0472500 PE=2 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.7e-95
Identity = 206/279 (73.84%), Postives = 232/279 (83.15%), Query Frame = 0

Query: 14  SPSLILLLLLVSLFASVQVFSAEVEKEELDGPK-DLGRRSKISWNNIDTIAARKDVVDSE 73
           +P L++LL+L++  A+V V  AE + E   G K  LGRR+    + +     +K+ V   
Sbjct: 4   NPRLLILLVLLAFSATVAV--AE-DGESTGGSKVSLGRRAGGFLHGL-----KKEAVVEG 63

Query: 74  DLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVM 133
           D  + LD VG G+FDA FASLSMI+VSEIGDETFIIAALMAMRHPKSIVLSGALSAL VM
Sbjct: 64  DHGVALDEVGPGLFDALFASLSMILVSEIGDETFIIAALMAMRHPKSIVLSGALSALYVM 123

Query: 134 TVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEK 193
           TVLSTGLGRIVPNLISRKHTN+AATVLY FFGLRLLYIAW  K+D K S KKEMEEVEEK
Sbjct: 124 TVLSTGLGRIVPNLISRKHTNSAATVLYLFFGLRLLYIAW--KSDPKGSQKKEMEEVEEK 183

Query: 194 LEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAIL 253
           LE+GQ K+T RRFF RFCTPIFLE+FILTFLAEWGDRSQIATIALATHKNA+GVAVGA L
Sbjct: 184 LESGQGKSTLRRFFGRFCTPIFLEAFILTFLAEWGDRSQIATIALATHKNAIGVAVGASL 243

Query: 254 GHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           GH+VCTS+AVIGGSMLASKISQ TVAT+GG+LFLGFS S
Sbjct: 244 GHTVCTSLAVIGGSMLASKISQRTVATIGGVLFLGFSVS 272

BLAST of Cp4.1LG11g06770 vs. NCBI nr
Match: KAG7029890.1 (putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1015 bits (2624), Expect = 0.0
Identity = 569/741 (76.79%), Postives = 576/741 (77.73%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELD PKDLGRRSKISWNN+D
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDVPKDLGRRSKISWNNVD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAA+KDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAAKKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSF          
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSF---------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                                                         GYS STAFLK  RS
Sbjct: 301 ----------------------------------------------GYSASTAFLKLFRS 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
           VSQVTMAQKIIPFNFSAHHLFESCS+HSSNDSLPNTLHA MVKNGSIFESRKFILSSYVK
Sbjct: 361 VSQVTMAQKIIPFNFSAHHLFESCSFHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGISLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTA+WNTVICGLMQGGYLNEALELLYEMVENE
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTATWNTVICGLMQGGYLNEALELLYEMVENE 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLL+ ELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLVSELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. NCBI nr
Match: XP_023545881.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 958 bits (2477), Expect = 0.0
Identity = 549/741 (74.09%), Postives = 553/741 (74.63%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                    +  F P  L ++                                       
Sbjct: 301 ---------SYFFPPLFLVALE-------------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                   +YHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK
Sbjct: 361 ------------------------NYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTASWNTVICGLMQGGYLNEALELLYEMVENE
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTASWNTVICGLMQGGYLNEALELLYEMVENE 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. NCBI nr
Match: XP_022997610.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita maxima])

HSP 1 Score: 941 bits (2433), Expect = 0.0
Identity = 539/741 (72.74%), Postives = 549/741 (74.09%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQV+SAEVEKEELDGPKDLGRRSKISWNN+D
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVYSAEVEKEELDGPKDLGRRSKISWNNVD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAA+KDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAAKKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                    +  F P  L ++                                       
Sbjct: 301 ---------SYFFPPLFLVALE-------------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                   +YHSSNDSLPNTLHA MVKNGSIFESRKFILSSYVK
Sbjct: 361 ------------------------NYHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVYPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTASWNTVICGLMQGGYLNEALELLYEMVEN+
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTASWNTVICGLMQGGYLNEALELLYEMVENQ 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLLI ELGRQVHGRI+RCGFHNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLIIELGRQVHGRILRCGFHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. NCBI nr
Match: XP_022929759.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita moschata])

HSP 1 Score: 939 bits (2427), Expect = 0.0
Identity = 539/741 (72.74%), Postives = 548/741 (73.95%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELD PKDLGRRSKISWNN+D
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDVPKDLGRRSKISWNNVD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAA+KDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAAKKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                    +  F P  L ++                                       
Sbjct: 301 ---------SYFFPPLFLVALE-------------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                   ++HSSNDSLPNTLHA MVKNGSIFESRKFILSSYVK
Sbjct: 361 ------------------------NFHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTASWNTVICGLMQGGYLNEALELLYEMVENE
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTASWNTVICGLMQGGYLNEALELLYEMVENE 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLLI ELGRQVHGRIVRCG HNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRCGLHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. NCBI nr
Match: XP_011648996.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucumis sativus])

HSP 1 Score: 851 bits (2199), Expect = 2.38e-296
Identity = 494/766 (64.49%), Postives = 531/766 (69.32%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPT  LSFS S +L LL +SLFAS QV+SAEVEK++LDGPKDLGRRSK+SW+N D
Sbjct: 1   MGLRSNPTATLSFSASFLLFLLFLSLFASHQVYSAEVEKDDLDGPKDLGRRSKMSWSNSD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           T+A +KD VDSEDLNLD+DS+GLGVFDAFFASLSMI+VSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TVATKKDGVDSEDLNLDMDSIGLGVFDAFFASLSMILVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGAL+ALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSK++ KS
Sbjct: 121 VLSGALAALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKSE-KS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHS+CTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSICTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                                PPL                                    
Sbjct: 301 -----------------SYFFPPL------------------------------------ 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                            HHLF+S SYH+SN    NTLHA MVK GSIF S KF+L+SYVK
Sbjct: 361 ----------------XHHLFKSFSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MALQLFREMLVE            
Sbjct: 421 SEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKST 540

Query: 541 -----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVEN 600
                                        + ASWNT+ICGLMQGGYLN ALELLYEMVEN
Sbjct: 541 DTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVEN 600

Query: 601 EPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKA 658
           E EFN  TSSIALSVVSSLLI ELGRQVHGRIVRCG HNDGFVKS+LINMYIKCGNLEKA
Sbjct: 601 ESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKA 660

BLAST of Cp4.1LG11g06770 vs. ExPASy TrEMBL
Match: A0A6J1KA70 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111492492 PE=3 SV=1)

HSP 1 Score: 941 bits (2433), Expect = 0.0
Identity = 539/741 (72.74%), Postives = 549/741 (74.09%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQV+SAEVEKEELDGPKDLGRRSKISWNN+D
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVYSAEVEKEELDGPKDLGRRSKISWNNVD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAA+KDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAAKKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                    +  F P  L ++                                       
Sbjct: 301 ---------SYFFPPLFLVALE-------------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                   +YHSSNDSLPNTLHA MVKNGSIFESRKFILSSYVK
Sbjct: 361 ------------------------NYHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVYPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTASWNTVICGLMQGGYLNEALELLYEMVEN+
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTASWNTVICGLMQGGYLNEALELLYEMVENQ 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLLI ELGRQVHGRI+RCGFHNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLIIELGRQVHGRILRCGFHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. ExPASy TrEMBL
Match: A0A6J1EPP7 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita moschata OX=3662 GN=LOC111436248 PE=3 SV=1)

HSP 1 Score: 939 bits (2427), Expect = 0.0
Identity = 539/741 (72.74%), Postives = 548/741 (73.95%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELD PKDLGRRSKISWNN+D
Sbjct: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDVPKDLGRRSKISWNNVD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           TIAA+KDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TIAAKKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS
Sbjct: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                    +  F P  L ++                                       
Sbjct: 301 ---------SYFFPPLFLVALE-------------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                   ++HSSNDSLPNTLHA MVKNGSIFESRKFILSSYVK
Sbjct: 361 ------------------------NFHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKST 540

Query: 541 ----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVENE 600
                                       DTASWNTVICGLMQGGYLNEALELLYEMVENE
Sbjct: 541 ATYNILLGVHVRSDVNKSLDLFRNLPCRDTASWNTVICGLMQGGYLNEALELLYEMVENE 600

Query: 601 PEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKAS 641
           PEFNKVTSSIALSVVSSLLI ELGRQVHGRIVRCG HNDGFVKSSLINMYIKCGNLEKAS
Sbjct: 601 PEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRCGLHNDGFVKSSLINMYIKCGNLEKAS 660

BLAST of Cp4.1LG11g06770 vs. ExPASy TrEMBL
Match: A0A6J1HR62 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111465385 PE=3 SV=1)

HSP 1 Score: 849 bits (2194), Expect = 5.94e-296
Identity = 489/742 (65.90%), Postives = 525/742 (70.75%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNP  NLSFSP L LLLLLVSLFASVQ FSAEVEK ELDGPKDLGRRSKIS +N D
Sbjct: 1   MGLRSNPIRNLSFSPFL-LLLLLVSLFASVQGFSAEVEKVELDGPKDLGRRSKISLSNAD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           T+AA KD VDS+DLNLDLDS+GLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TVAANKDGVDSKDLNLDLDSIGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGAL+ALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSK+DSKS
Sbjct: 121 VLSGALTALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKSDSKS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKT+FRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTSFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHS+CTSMAVIGGS+LASKISQGT+ATVGGLLFLGFSFS         
Sbjct: 241 KNALGVAVGAILGHSICTSMAVIGGSLLASKISQGTIATVGGLLFLGFSFS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                                PPL                                    
Sbjct: 301 -----------------SYFFPPLX----------------------------------- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                              LF+SC YH+SN +  +TLHA MVKNGSI    KFI+SS+VK
Sbjct: 361 -------------------LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           SE+L+DA+KVFDEMP RDVL+WTVLISGFARVNCSEMALQLFREMLVE            
Sbjct: 421 SERLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSRVGDLQMGKGIHGWILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKST 540

Query: 541 -----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVEN 600
                                        D ASWNT+ICGLMQGGYLN A+ELLYEMV+N
Sbjct: 541 ATYNIMLGVYVRSCDVNKSLDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKN 600

Query: 601 EPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKA 641
           EPEFNKVTSSIALSVVSSLLI +LGRQVHGRI R GFHNDGFV SSLINMYIKCGNLEKA
Sbjct: 601 EPEFNKVTSSIALSVVSSLLIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKA 660

BLAST of Cp4.1LG11g06770 vs. ExPASy TrEMBL
Match: A0A1S3B4E3 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=3 SV=1)

HSP 1 Score: 847 bits (2189), Expect = 3.17e-295
Identity = 490/742 (66.04%), Postives = 520/742 (70.08%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEVEKEELDGPKDLGRRSKISWNNID 60
           MGLRSNPT  LSFSPS +L LLL+SLFAS+QV+SAE EK+ELDGPKDLGRRSKISW+N D
Sbjct: 1   MGLRSNPTTTLSFSPSFLLFLLLLSLFASIQVYSAEAEKDELDGPKDLGRRSKISWSNSD 60

Query: 61  TIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120
           T+AA+KD VDSEDLNLD+DS+GLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI
Sbjct: 61  TVAAKKDGVDSEDLNLDMDSIGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSI 120

Query: 121 VLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKS 180
           VLSGAL+ALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSK++ KS
Sbjct: 121 VLSGALAALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKSE-KS 180

Query: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240
           STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH
Sbjct: 181 STKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATH 240

Query: 241 KNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFSFGEVRCFGN 300
           KNALGVAVGAILGHS+CTSMAVIGGSMLASKISQGTVATVGGLLFLGFS S         
Sbjct: 241 KNALGVAVGAILGHSICTSMAVIGGSMLASKISQGTVATVGGLLFLGFSLS--------- 300

Query: 301 FVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMKWMNLCSGYSPSTAFLKFSRS 360
                                PPL                               KF   
Sbjct: 301 -----------------SYFFPPL------------------------------XKFC-- 360

Query: 361 VSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNTLHAMMVKNGSIFESRKFILSSYVK 420
                                    YH+SN    NTLHA MVK GSI ES KF+L+SYVK
Sbjct: 361 -------------------------YHTSNSFSSNTLHAKMVKIGSIIESGKFVLTSYVK 420

Query: 421 SEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVE------------ 480
           S+KLNDA+K+FDEMP+RDVLTWT +ISGF+RVNCS MALQLFREMLVE            
Sbjct: 421 SKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEGVCPNHFTLSTV 480

Query: 481 ------------------------------------------------------------ 540
                                                                       
Sbjct: 481 LKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYARKLYDSMGEKST 540

Query: 541 -----------------------------DTASWNTVICGLMQGGYLNEALELLYEMVEN 600
                                        + ASWNT+ICGLMQGGYLN ALELLYEMVEN
Sbjct: 541 DTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVEN 600

Query: 601 EPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKA 641
           E EFN  TSSIALSV SSLLI ELGRQVHGRIVRCG HNDGFVKS+LINMYIKCGNLEKA
Sbjct: 601 ESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKA 658

BLAST of Cp4.1LG11g06770 vs. ExPASy TrEMBL
Match: B9RM88 (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis OX=3988 GN=RCOM_1078580 PE=3 SV=1)

HSP 1 Score: 575 bits (1483), Expect = 2.30e-191
Identity = 353/665 (53.08%), Postives = 443/665 (66.62%), Query Frame = 0

Query: 14  SPSLILL---LLLVSLFASVQVFSAEVEKEELDGP-KDLGRRSKISWNNIDTIAARKDVV 73
           SP L++L   LLL     + Q    E EKEE     KDLGRR  I   +ID         
Sbjct: 6   SPRLLILFAFLLLGLPLIAAQDSLVENEKEESTASIKDLGRRGMIVTKDIDG-------- 65

Query: 74  DSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSAL 133
           +S +L L +DS GLGVFDAF ASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGAL+AL
Sbjct: 66  NSVNLGLHVDS-GLGVFDAFIASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALTAL 125

Query: 134 IVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEV 193
           IVMTVLSTGLGRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS  DSK S KKEMEEV
Sbjct: 126 IVMTVLSTGLGRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS--DSKVSQKKEMEEV 185

Query: 194 EEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVG 253
           EEKLE+GQ KTTFRRFF RFCTPIFLESFILTFLAEWGDRSQIATIALATHKNA+GVAVG
Sbjct: 186 EEKLESGQGKTTFRRFFSRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNAIGVAVG 245

Query: 254 AILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS------------------ 313
           A +GH++CTS+AV+GGSMLASKISQGTVAT+GGLLFLGFS S                  
Sbjct: 246 ATIGHTICTSLAVVGGSMLASKISQGTVATIGGLLFLGFSLSSYFYPPLFLQSLSSSYSN 305

Query: 314 ---FGE--------VRCFGNFVDFKKKFEAAGFEPHSLTSVPPLWSEFSHMNSKTWFKMK 373
              + E        +  F N        +       +L+    L+ E    + +TW    
Sbjct: 306 NHKYAELLHSNVIKIGSFHNLGITNHLLDLYAKNSQNLSHAHKLFDEILCRDVRTW---- 365

Query: 374 WMNLCSGYSPSTAFLKFSRSVSQVTMAQKIIPFNFSAHHLFESCSYHSSNDSLPNT--LH 433
              L SG++ +  F K    + +    + + P  F+   + + CS   S   + N   +H
Sbjct: 366 -TILISGFAQTRNF-KMVSGLFRRMQKEGVCPNQFTLSSVLKCCS---SICEIRNGKGIH 425

Query: 434 AMMVKNGSIFES--RKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSE 493
             ++ +G  F+      IL  YVK    + A+ +FD M  +  ++W ++I G+ R+   E
Sbjct: 426 GWILTSGIGFDIVLENSILDLYVKCGAFDYAKSLFDSMAEKGTVSWNIMIGGYLRMGDVE 485

Query: 494 MALQLFREMLVEDTASWNTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVS 553
            +L+LF+ +  ++ ASWNT+I GLM+ G+   ALELLY+MVE+   FN VT S+AL++VS
Sbjct: 486 SSLELFQSLYFKNIASWNTIIDGLMKNGFETIALELLYKMVESGLGFNSVTFSVALNLVS 545

Query: 554 SLLIFELGRQVHGRIVRCGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFAKKQDF 613
            L+  ELG+Q+HGRI+R   H++GF+++SL++MY KCG +E+AS ++  +P         
Sbjct: 546 CLVNLELGKQIHGRILRLIIHDNGFIRNSLLDMYCKCGKMEEASRMFRNVP--------V 605

Query: 614 DIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREQVLMDKFTIASVVSACSNAG 641
           +I C D + EIVS SSM+SGYVRNG+YE A +TF+SMV EQVL+DKFT+ SVVSAC+N G
Sbjct: 606 EISCDDPLGEIVSWSSMISGYVRNGEYEYALRTFISMVHEQVLVDKFTLTSVVSACANTG 642

BLAST of Cp4.1LG11g06770 vs. TAIR 10
Match: AT5G36290.1 (Uncharacterized protein family (UPF0016) )

HSP 1 Score: 359.0 bits (920), Expect = 9.7e-99
Identity = 209/297 (70.37%), Postives = 238/297 (80.13%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEV---EKEELDGP-KDLGRRSKISW 60
           MGL SNPT        LIL+  +  L +S+    + V   E++E +G  K+LGRR  +  
Sbjct: 1   MGLISNPT-------RLILVATIFFLVSSISGQDSVVENNERQESEGSGKELGRRGMVGT 60

Query: 61  NNI--DTIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAM 120
             I  DT+    D + +  LNLDLD+    VFDA F+S SMI+V+EIGDETFIIAALMAM
Sbjct: 61  ERIGVDTVV---DNIGALGLNLDLDATAPSVFDALFSSFSMILVTEIGDETFIIAALMAM 120

Query: 121 RHPKSIVLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRS 180
           RHPK+ VLSGALSAL VMT+LSTGLGRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS
Sbjct: 121 RHPKATVLSGALSALFVMTILSTGLGRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS 180

Query: 181 KADSKSSTKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIAT 240
             DSKS+ KKEMEEVEEKLE+GQ KT FRR F RFCTPIFLESFILTFLAEWGDRSQIAT
Sbjct: 181 -TDSKSNQKKEMEEVEEKLESGQGKTPFRRLFSRFCTPIFLESFILTFLAEWGDRSQIAT 240

Query: 241 IALATHKNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           IALATHKNA+GVA+GA +GH+VCTS+AV+GGSMLAS+ISQ TVATVGGLLFLGFS S
Sbjct: 241 IALATHKNAIGVAIGASIGHTVCTSLAVVGGSMLASRISQRTVATVGGLLFLGFSVS 286

BLAST of Cp4.1LG11g06770 vs. TAIR 10
Match: AT5G36290.2 (Uncharacterized protein family (UPF0016) )

HSP 1 Score: 359.0 bits (920), Expect = 9.7e-99
Identity = 209/297 (70.37%), Postives = 238/297 (80.13%), Query Frame = 0

Query: 1   MGLRSNPTGNLSFSPSLILLLLLVSLFASVQVFSAEV---EKEELDGP-KDLGRRSKISW 60
           MGL SNPT        LIL+  +  L +S+    + V   E++E +G  K+LGRR  +  
Sbjct: 1   MGLISNPT-------RLILVATIFFLVSSISGQDSVVENNERQESEGSGKELGRRGMVGT 60

Query: 61  NNI--DTIAARKDVVDSEDLNLDLDSVGLGVFDAFFASLSMIIVSEIGDETFIIAALMAM 120
             I  DT+    D + +  LNLDLD+    VFDA F+S SMI+V+EIGDETFIIAALMAM
Sbjct: 61  ERIGVDTVV---DNIGALGLNLDLDATAPSVFDALFSSFSMILVTEIGDETFIIAALMAM 120

Query: 121 RHPKSIVLSGALSALIVMTVLSTGLGRIVPNLISRKHTNNAATVLYAFFGLRLLYIAWRS 180
           RHPK+ VLSGALSAL VMT+LSTGLGRIVPNLISRKHTN+AATVLYAFFGLRLLYIAWRS
Sbjct: 121 RHPKATVLSGALSALFVMTILSTGLGRIVPNLISRKHTNSAATVLYAFFGLRLLYIAWRS 180

Query: 181 KADSKSSTKKEMEEVEEKLEAGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIAT 240
             DSKS+ KKEMEEVEEKLE+GQ KT FRR F RFCTPIFLESFILTFLAEWGDRSQIAT
Sbjct: 181 -TDSKSNQKKEMEEVEEKLESGQGKTPFRRLFSRFCTPIFLESFILTFLAEWGDRSQIAT 240

Query: 241 IALATHKNALGVAVGAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGFSFS 292
           IALATHKNA+GVA+GA +GH+VCTS+AV+GGSMLAS+ISQ TVATVGGLLFLGFS S
Sbjct: 241 IALATHKNAIGVAIGASIGHTVCTSLAVVGGSMLASRISQRTVATVGGLLFLGFSVS 286

BLAST of Cp4.1LG11g06770 vs. TAIR 10
Match: AT1G25520.1 (Uncharacterized protein family (UPF0016) )

HSP 1 Score: 161.0 bits (406), Expect = 3.9e-39
Identity = 98/220 (44.55%), Postives = 137/220 (62.27%), Query Frame = 0

Query: 85  VFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVMTVLSTGLGRIVP 144
           V   F  SL+M  VSEIGD+TF  AA++AMR+P+ +VL+G LSALIVMT+LS  LG   P
Sbjct: 4   VLQGFTKSLAMTFVSEIGDKTFFAAAILAMRYPRRLVLAGCLSALIVMTILSATLGWAAP 63

Query: 145 NLISRKHTNNAATVLYAFFGLRLLYIAWRSKADSKSSTKKEMEEVEEKLEA--------- 204
           NLISRK T++  T+L+  FGL  L+  ++          +E+ EVE +L+A         
Sbjct: 64  NLISRKWTHHITTLLFFGFGLWSLWDGFKEGGGG----SEELAEVEAELDADLKANGKSP 123

Query: 205 -------GQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAV 264
                   ++K   R F  +F +PIFL++F + F  EWGD+SQ+ATI LA  +N  GV +
Sbjct: 124 KDSSKREDENKKQNRAFLTQFFSPIFLKAFSINFFGEWGDKSQLATIGLAADENPFGVVL 183

Query: 265 GAILGHSVCTSMAVIGGSMLASKISQGTVATVGGLLFLGF 289
           G ++   +CT+ AVIGG  LAS+IS+  VA  GG+LF+ F
Sbjct: 184 GGVVAQFLCTTAAVIGGKSLASQISERIVALSGGMLFIIF 219

BLAST of Cp4.1LG11g06770 vs. TAIR 10
Match: AT1G68650.1 (Uncharacterized protein family (UPF0016) )

HSP 1 Score: 159.8 bits (403), Expect = 8.6e-39
Identity = 95/214 (44.39%), Postives = 135/214 (63.08%), Query Frame = 0

Query: 85  VFDAFFASLSMIIVSEIGDETFIIAALMAMRHPKSIVLSGALSALIVMTVLSTGLGRIVP 144
           +   F  SL+M  +SEIGD+TF  AA++AMR+P+ +VL+G LSALIVMT+LS  LG   P
Sbjct: 4   LLQGFTKSLAMTFLSEIGDKTFFAAAILAMRYPRRLVLAGCLSALIVMTILSATLGWAAP 63

Query: 145 NLISRKHTNNAATVLYAFFGLRLLYIAWRS----------KADSKSSTKKEMEEVEEKLE 204
           NLISRK T++  T L+  FGL  L+  ++           +A+  S  KK  ++ +    
Sbjct: 64  NLISRKWTHHITTFLFFGFGLWSLWDGFKEGGGSEELAEVEAELDSDLKKTNDQSKNSKI 123

Query: 205 AGQSKTTFRRFFLRFCTPIFLESFILTFLAEWGDRSQIATIALATHKNALGVAVGAILGH 264
             + K   R F   F +PIFL++F + F  EWGD+SQ+ATI LA  +N LGV +G I+  
Sbjct: 124 EDEQKKQKRPFLTAFFSPIFLKAFSINFFGEWGDKSQLATIGLAADENPLGVVLGGIVAQ 183

Query: 265 SVCTSMAVIGGSMLASKISQGTVATVGGLLFLGF 289
           ++CT+ AV+GG  LAS+IS+  VA  GG+LF+ F
Sbjct: 184 TLCTTAAVLGGKSLASQISERIVALSGGMLFIIF 217

BLAST of Cp4.1LG11g06770 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 149.8 bits (377), Expect = 8.9e-36
Identity = 81/228 (35.53%), Postives = 132/228 (57.89%), Query Frame = 0

Query: 414 ILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDTASW 473
           +L+ YV+ E++  A+++FD MP R+V TW  +I+G+A+      A  LF +M   D  SW
Sbjct: 318 MLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSW 377

Query: 474 NTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVR 533
             +I G  Q G+  EAL L  +M       N+ + S ALS  + ++  ELG+Q+HGR+V+
Sbjct: 378 AAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVK 437

Query: 534 CGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFAKKQDFDIVCSDTMTEIVSRSSM 593
            G+    FV ++L+ MY KCG++E+A+ ++ +M                   +IVS ++M
Sbjct: 438 GGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAG----------------KDIVSWNTM 497

Query: 594 VSGYVRNGKYEDAFKTFVSMVREQVLMDKFTIASVVSACSNAGVFELG 642
           ++GY R+G  E A + F SM RE +  D  T+ +V+SACS+ G+ + G
Sbjct: 498 IAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKG 529

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q93Y381.4e-9770.37GDT1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=At5g36290 PE=2 SV=1[more]
Q6ZIB91.2e-9673.90GDT1-like protein 4 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0528500 PE=... [more]
A2YXC71.5e-9674.28GDT1-like protein 4 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_29993 PE=3 SV=... [more]
A2ZE501.7e-9573.84GDT1-like protein 3 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_36063 PE=3 SV=... [more]
Q2R4J11.7e-9573.84GDT1-like protein 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os11g0472500 PE=... [more]
Match NameE-valueIdentityDescription
KAG7029890.10.076.79putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma sub... [more]
XP_023545881.10.074.09putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita pepo s... [more]
XP_022997610.10.072.74putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita maxima... [more]
XP_022929759.10.072.74putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita moscha... [more]
XP_011648996.12.38e-29664.49LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
Match NameE-valueIdentityDescription
A0A6J1KA700.072.74putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxi... [more]
A0A6J1EPP70.072.74putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita mosc... [more]
A0A6J1HR625.94e-29665.90LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
A0A1S3B4E33.17e-29566.04LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
B9RM882.30e-19153.08Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis OX=398... [more]
Match NameE-valueIdentityDescription
AT5G36290.19.7e-9970.37Uncharacterized protein family (UPF0016) [more]
AT5G36290.29.7e-9970.37Uncharacterized protein family (UPF0016) [more]
AT1G25520.13.9e-3944.55Uncharacterized protein family (UPF0016) [more]
AT1G68650.18.6e-3944.39Uncharacterized protein family (UPF0016) [more]
AT4G02750.18.9e-3635.53Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 179..199
NoneNo IPR availablePANTHERPTHR12608:SF9GDT1-LIKE PROTEIN 3coord: 18..290
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 414..438
e-value: 0.017
score: 15.4
coord: 472..499
e-value: 1.3E-5
score: 25.1
coord: 441..466
e-value: 1.1E-5
score: 25.3
coord: 544..567
e-value: 0.021
score: 15.0
coord: 589..617
e-value: 3.5E-5
score: 23.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 472..499
e-value: 2.5E-5
score: 22.2
coord: 591..619
e-value: 6.6E-5
score: 20.8
coord: 441..468
e-value: 4.9E-5
score: 21.2
coord: 414..439
e-value: 0.0029
score: 15.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 438..468
score: 9.284279
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 469..503
score: 9.689847
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 586..620
score: 9.152743
IPR001727Gdt1 familyPFAMPF01169UPF0016coord: 91..164
e-value: 2.1E-17
score: 63.2
coord: 216..288
e-value: 1.2E-19
score: 70.3
IPR001727Gdt1 familyPANTHERPTHR12608TRANSMEMBRANE PROTEIN HTP-1 RELATEDcoord: 18..290
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 518..675
e-value: 2.2E-16
score: 62.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 383..517
e-value: 1.4E-23
score: 85.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG11g06770.1Cp4.1LG11g06770.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0043231 intracellular membrane-bounded organelle
cellular_component GO:0016020 membrane
molecular_function GO:0005515 protein binding