Cp4.1LG01g13140.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g13140.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG01 : 8393917 .. 8400612 (-)
Sequence length: 4950
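
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that "(-)" denotes the minus strand; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the genomic
    span is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "span": end - start + 1,  # genomic span, introns included
    }

loc = parse_location("Cp4.1LG01 : 8393917 .. 8400612 (-)")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under that assumption the genomic span is 6696 positions, which is larger than the listed sequence length of 4950 because the span includes introns.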
The following sequences are available for this feature:

Gene sequence (with intron)

TATGCCCTTTGGGGGAAATTACACAAATCTGCTCGCGCTATCAATGGCCGAAAACGGCAGCAACCCAAAACCAGCAGTGGCTGCCGCCCTTCGGAATTGCCGCGAAACCCATCGGTTCGCTAAAACCCTAACTCCCTTTCCTTACCTCGTTTCATACTGCTTTCTGATCCCTCCAACTTATCCCTATCTTTGTTGCTTTTAATTTCACGTTGTCTTACACGAGAAATCGGCAGGGTTTCTTTCCAATCCTGCTCCCATTTCCATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTTTTTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCCGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTCGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAGTATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCTGGGTTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGTATCAGTTCTTGGCATCTCTTTCTTCATGATTGATTGGCTGATATTAGAGAAAAGCAGAGAAGTCGGGCATTCTCCTTCTTAACGCCTAGTGTGTATATTGGCTGAACTAAGAGATTCATGGTTATCATGATAATGTGCATACTTAAGAGTAAAAAAAAAAATGGTAGACTAATTCCCCACATTTTCTGCAGTTGAAAAGCACCGTGCTTTCTTTCATGCCTTCGCTGTTTCTTTGATGTATATATAAACCTCGTCTTTTAACGTACTTCTTTTAGACCATCCTTATTTTTCAGTTCATCAACCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTAACATTGGAATGTCTATTTTATGTTGTTTATGTCTCTCGATATTTACTCTCTGGTAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCATCGCAAAGATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTGAGGCACACTGATGTTATGTGGGATATATGCAATGAGGTCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACTTCAATACTTATACACGGCCTATGTGCGCAATCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGTGTCTATCAATACCGTTATGTCAAAGTTTTGCAAAGTGGGGCTAGTAGATGTTGCAAGGTCATTTTTCTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAGCAGGTTCCATGGACGAAGCTCTGGAATTCACAGATGACATGGAAAAGCATGGTGTGGAGCCTGATGTAGTAACATACAACACACTTGCTAAAGGGTTTCTCTTGCTTGGTTTTATGAGTGGGGCCTGGAAAGTCGTCCAGAAAATGTTGCTAAAAGGTCTAAATCCGGATATTGTGACATATACAATACTGATATGTGGGCATTGTCAAATGGGAAATATTGAGGAAGCCCTTAAGCTGCGGCAAGAAACCCTTTCAAGGGGG
TTTCAGTTGAACATCATTTCGTACAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAGGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTTCTCAACGTGCAGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAAATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATGTTATTTTGTTTAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCGACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCAAAATCATTAGGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTGCTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCAATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTCTTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAAGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTTATCTGAAGGTGTAACTCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGTTCAGTATTTGAATTTCTTGCTGTGATGGTTAAATCTGGCGTCATTTCACATTGATCCATCAGAAGAGGTCAGTCGATCACATCACGCCGTTCCAGATAACCATATTTTGGCATAAGAGGTTAGCATCTGAGGTACAGGTGAGCATTATGAACTTAATCAAAATTCAAGGATGAATTTTTCCCCTAGATTCTAATTTTATTAAGAATGCATGGATCCAGAATATCGGTGTTAGGAGAGGTTGTTCTCGTAGAGTACCATTCTCTAGTTATATCAAATTAACTCTGGTTTTCCCAATTTATTCAACTTGATGGTCACATTTGTTTCAGCTCGAGTATTTGCAACCATAACCATGTACTTTTACGCAAGGTCAAGCTTAACATGAAACGATCACGCTGAATGTCTTACACATATCAAATCCTTTACCCTCTGTTTCGATTACTGTGGGTATGGCACATTAAAATTCTTGATTGATATCTTGAAACTACTACTTTGACTTCCTTTATGAGTTTCTCATGAGACTATCATGGGTAGGAAGCAATCATTCATTTGTGTTATTCTTGTTTCCTCTCCCTAAAATTGCTAAAACCTTCAAAGGAAGCTAGCGATCAGTTATATGACTTGCTTGCTTGTTCATGTTGCAGGATGTAGGGATAATATTCCCTCTGTGGCATGTATCAAACTTTATGCCTCAGATAAGCAACGAAAGGAAAGTCATGGCAATGACCCTTGTGGGCTCTCCCAGCTTGGAGGCTGCAGCAGACGGTCTTATAGAGTGGTTA
TGACAAGTTGTACTACGAAGTATTAGCTGAGCCTGATTCCTTGCAGCGAGTCGAATAGAAAGTAAGGTATATAGCTTCTCAAAGGCTCATAGGATATCTTCTGACTGAGGGGAAAAAGTTGAAAATTAGTAAATGATGGGCGAGCCCCTTCCCAACTGTTGTCATTGAGGGGAAAAAAATCTTGTTGACTTATTTCTTAGCAATTTCGTTGTGCAGGAATTCTCTGAACCTAGAGGGGCATCACTTTCAGAACTCCCGCCTGTATTTGTTAAGTATGGAGGATGGTCGATACTTGAAGGGATTTAAAGATAAATCAACTAACATATTTGATCTCTGATAGGATAGCTCACAACAAAGAAGAACAAGTGGTAAGATCTTCTTTCCTATCAGTTATTATTTTGATACCCAGATGCTGCCCTGATCTTCATGTAAATAAGGAAGAAAAGAACGTTTACAATTGGACAGCATAATGAAAGAATTTGTTGAACTTTACCATAATAAAAATTGGCCCTTTATCGTTTGGGTAATTCTCAATTAATTATATTTTGGTCTCTCTGAATGCTTCTCAAACACATGAAATGGTTTTACTTTTTCTTTGGGGGTGGGGAAGGAGGTCAAGGAGAGATTTTCATATATTGATTTTATAAGTTGAGAATTACTGTCAAATGAATGAACTAAGGTTAGATTATATTTGTCATGAATATAATTGATGATACTCAATTAGCCAAGACGTTATAAACTGATTGAAAAAGTTGGAATTTGGAATAGACTATAAGATAAACAGTAGACGAGATGTTGAAAAGCAAGAACCAACGGAAGACAATTAAGGTTCGGCCAATTTTATTTGAAATAAAGACAAGGCCATACACATGAATTGGTAGCGAAGTCACTTGGTTGGTGGAGGAAGGCTGGCCAGCTTCTTGGCAAATTCTTGCAGGTTCCTCAATGAACTCCCACTGGGGCCAGTTGCATCCGCCACGGCTTTGCTCAGAGCTCTGGCTTTGGCTTTCAGTTTGTCGGAGTTGAAAGACTCGGCGAGGACCTCCCCCAGTTTGTCCGGGTCAGGCACCGAGTTCTCCCCATGGCAGACTTGCACCGCCACTCCAATCTGATCCACCAATACCTTCGAGTTTATCATTTGGTCTCCTTCCATGGCCCAAGCAAATATTGGTACTCCACTCGCTAGGCTCTCCAGTATAGAGTTCCACCCGCAGTGGCTCAGAAACCCTCTCACGGCTCTGTGCCTCAGTATCGCCTCCTGTGGTACCCATCTCTTCACCACCATTCCACGACCAGACACCCGATCCTCAAACCCGATTGGGATTCCGTCGGACCCACCATCTGTTTGACGAATTGTCTTAACCACCCACACGAATCGAGTCCCGCTTTTCTCAATCCCGGATGCCAGTGCTTCCATTTGCTGCTGATTCAGTTGCTTTTGACTCCCAAAACTAATGTATAGCACCGAACCATCGGGGCATTCATCAAGCCATTTCAGAACCTCATCTGAACTCGATTCCACAGTTGGGTTGTGCCCTTTGACAAGGCTTAGCGGTCCAACTCCAAATACATTCTCGTTGTTGTTTAATTTCCTGTAGAACTCCAAATACTCGGGCTCCAACTCGTCGAAGGTATTGAAAACATTAGCCCAGCAGGTCCCGGCTTCAAACCAGTCGGCTCTTATGGTACTGAATAGTGGGTCAGAATCCCGGTACACGGTGACAAATTCCGGCAAATCCTGATTAGTGAAGGAGGGGGAATTAGGCAATTCAGCCATATGCTTCACTGGGGATTTACGGAACTCTTCGGAAGGTACGTGACGCCAACAGTAATCGAGGACGTCAACGACCCAACAGCCAGAGGAGAAGAAGGAGACTCTAGGGATTTGAAGCTTTTGGGCGAGAGACTGGGTCCATCCAAGGAAGAAATCAGAAATGATGGCGGAGGGAGGTGAAGGGTGGGATTTGAACCACTCAATAATGGGGTCTTGGAGTTGGCGG
AGAGCCACTATGATGGGTACGTTGCCTTCGCCGCCGATGTCTTTGATATTCTCGACGCCGGGAGGCAAACCGGGGACAGAAGGGAAGGGGAGAACAAGGGTTTGGACAGAAGGGTGGGCTGAAAGAAGAGGTTGAAGGATGGGGAGGTTCTTGGGGGTAACCAAAACGGTAATGGGGAAGCCATAAGAAGCTAAGAAATGGGTGAGATCGAGAAGTGGAAGCATATGGCCCTGAGCTGGGTATGGAAAAAGCAGAAGATGGGGTAATGGCGATGAAGGCGACGCCATTGGAACCAGATGTGATTGAAGGCTGAGGAAGGAATGAGGGAGAGTGGGTATTTATATTGGAAACATGTTTCTGAATTGAAGTACGGAAATCGCCATTGGAGATCAAGGAAAAACAAAGAACGAATAGGAAATTGACCAAAGAGAAGGGTGGTTGTCCAGGTGCCTGACAGCACGGCACGTGAACAAACAAGAGCGAAGAATGGGGGGAGGTGGGCGATTTGAGACGCTGAGAGAGAGAGAGTGTGTCAGAAACGGTGATGGGGTTGGTGGAGCCTTGAGATGCGATTGGGTGAGTGTCCTGTCATCAAAAATAAATAATTCGGAAAACAATATATTAAGCGAGTAACGTGAAGTCCCATAAAATGGGAAAACTTTATTTTAACAATGCGGGCCCTCCAACATACATATA

mRNA sequence

TATGCCCTTTGGGGGAAATTACACAAATCTGCTCGCGCTATCAATGGCCGAAAACGGCAGCAACCCAAAACCAGCAGTGGCTGCCGCCCTTCGGAATTGCCGCGAAACCCATCGGTTCGCTAAAACCCTAACTCCCTTTCCTTACCTCGTTTCATACTGCTTTCTGATCCCTCCAACTTATCCCTATCTTTGTTGCTTTTAATTTCACGTTGTCTTACACGAGAAATCGGCAGGGTTTCTTTCCAATCCTGCTCCCATTTCCATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTTTTTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCCGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTCGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAGTATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCTGGGTTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCATCGCAAAGATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTGAGGCACACTGATGTTATGTGGGATATATGCAATGAGGTCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACTTCAATACTTATACACGGCCTATGTGCGCAATCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGTGTCTATCAATACCGTTATGTCAAAGTTTTGCAAAGTGGGGCTAGTAGATGTTGCAAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAGGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTTCTCAACGTGCAGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAAATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATGTTATTTTGTTTAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCGACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCAAAATCATTAGGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTGCTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCAATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTC
TTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAAGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTTATCTGAAGGTGTAACTCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGATGTAGGGATAATATTCCCTCTGTGGCATGTATCAAACTTTATGCCTCAGATAAGCAACGAAAGGAAAGTCATGGCAATGACCCTTGTGGGCTCTCCCAGCTTGGAGGCTGCAGCAGACGGTCTTATAGAGTGGAATTCTCTGAACCTAGAGGGGCATCACTTTCAGAACTCCCGCCTGTATTTGTTAAGTATGGAGGATGGTCGATACTTGAAGGGATTTAAAGATAAATCAACTAACATATTTGATCTCTGATAGGATAGCTCACAACAAAGAAGAACAAGTGGTAAGATCTTCTTTCCTATCAGTTATTATTTTGATACCCAGATGCTGCCCTGATCTTCATGTAAATAAGGAAGAAAAGAACGTTTACAATTGGACAGCATAATGAAAGAATTTGTTGAACTTTACCATAATAAAAATTGGCCCTTTATCGTTTGGGTAATTCTCAATTAATTATATTTTGGTCTCTCTGAATGCTTCTCAAACACATGAAATGGTTTTACTTTTTCTTTGGGGGTGGGGAAGGAGGTCAAGGAGAGATTTTCATATATTGATTTTATAAGTTGAGAATTACTGTCAAATGAATGAACTAAGGTTAGATTATATTTGTCATGAATATAATTGATGATACTCAATTAGCCAAGACGTTATAAACTGATTGAAAAAGTTGGAATTTGGAATAGACTATAAGATAAACAGTAGACGAGATGTTGAAAAGCAAGAACCAACGGAAGACAATTAAGGTTCGGCCAATTTTATTTGAAATAAAGACAAGGCCATACACATGAATTGGTAGCGAAGTCACTTGGTTGGTGGAGGAAGGCTGGCCAGCTTCTTGGCAAATTCTTGCAGGTTCCTCAATGAACTCCCACTGGGGCCAGTTGCATCCGCCACGGCTTTGCTCAGAGCTCTGGCTTTGGCTTTCAGTTTGTCGGAGTTGAAAGACTCGGCGAGGACCTCCCCCAGTTTGTCCGGGTCAGGCACCGAGTTCTCCCCATGGCAGACTTGCACCGCCACTCCAATCTGATCCACCAATACCTTCGAGTTTATCATTTGGTCTCCTTCCATGGCCCAAGCAAATATTGGTACTCCACTCGCTAGGCTCTCCAGTATAGAGTTCCACCCGCAGTGGCTCAGAAACCCTCTCACGGCTCTGTGCCTCAGTATCGCCTCCTGTGGTACCCATCTCTTCACCACCATTCCACGACCAGACACCCGATCCTCAAACCCGATTGGGATTCCGTCGGACCCACCATCTGTTTGACGAATTGTCTTAACCACCCACACGAATCGAGTCCCGCTTTTCTCAATCCCGGATGCCAGTGCTTCCATTTGCTGCTGATTCAGTTGCTTTTGACTCCCAAAACTAATGTATAGCACCGAACCATCGGGGCATTCATCAAGCCATTTCAGAACCTCATCTGAACTCGATTCCACAGTTGGGTTGTGCCCTTTGACAAGGCTTAGCGGTCCAACTCCAAATACATTCTCGTTGTTGTTTAATTTCCTGTAGAACTCCAAATACTCGGGCTCCAACTCGTCGAAGGTATTGAAAACATTAGCCCAGCAGGTCCCGGCTTCAAACCAGTCGGCTCTTATGGTACTGAATAGTGGGTCAGAATCCCGGTACACGGTGACAAATTCCGGCAAATCCTGATTAG
TGAAGGAGGGGGAATTAGGCAATTCAGCCATATGCTTCACTGGGGATTTACGGAACTCTTCGGAAGGTACGTGACGCCAACAGTAATCGAGGACGTCAACGACCCAACAGCCAGAGGAGAAGAAGGAGACTCTAGGGATTTGAAGCTTTTGGGCGAGAGACTGGGTCCATCCAAGGAAGAAATCAGAAATGATGGCGGAGGGAGGTGAAGGGTGGGATTTGAACCACTCAATAATGGGGTCTTGGAGTTGGCGGAGAGCCACTATGATGGGTACGTTGCCTTCGCCGCCGATGTCTTTGATATTCTCGACGCCGGGAGGCAAACCGGGGACAGAAGGGAAGGGGAGAACAAGGGTTTGGACAGAAGGGTGGGCTGAAAGAAGAGGTTGAAGGATGGGGAGGTTCTTGGGGGTAACCAAAACGGTAATGGGGAAGCCATAAGAAGCTAAGAAATGGGTGAGATCGAGAAGTGGAAGCATATGGCCCTGAGCTGGGTATGGAAAAAGCAGAAGATGGGGTAATGGCGATGAAGGCGACGCCATTGGAACCAGATGTGATTGAAGGCTGAGGAAGGAATGAGGGAGAGTGGGTATTTATATTGGAAACATGTTTCTGAATTGAAGTACGGAAATCGCCATTGGAGATCAAGGAAAAACAAAGAACGAATAGGAAATTGACCAAAGAGAAGGGTGGTTGTCCAGGTGCCTGACAGCACGGCACGTGAACAAACAAGAGCGAAGAATGGGGGGAGGTGGGCGATTTGAGACGCTGAGAGAGAGAGAGTGTGTCAGAAACGGTGATGGGGTTGGTGGAGCCTTGAGATGCGATTGGGTGAGTGTCCTGTCATCAAAAATAAATAATTCGGAAAACAATATATTAAGCGAGTAACGTGAAGTCCCATAAAATGGGAAAACTTTATTTTAACAATGCGGGCCCTCCAACATACATATA

Coding sequence (CDS)

ATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTTTTTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCCGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTCGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAGTATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCTGGGTTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCATCGCAAAGATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTGAGGCACACTGATGTTATGTGGGATATATGCAATGAGGTCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACTTCAATACTTATACACGGCCTATGTGCGCAATCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGTGTCTATCAATACCGTTATGTCAAAGTTTTGCAAAGTGGGGCTAGTAGATGTTGCAAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAGGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTTCTCAACGTGCAGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAAATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATGTTATTTTGTTTAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCGACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCAAAATCATTAGGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTGCTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCAATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTCTTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAAGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTT
ATCTGAAGGTGTAACTCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGATGTAGGGATAATATTCCCTCTGTGGCATGTATCAAACTTTATGCCTCAGATAAGCAACGAAAGGAAAGTCATGGCAATGACCCTTGTGGGCTCTCCCAGCTTGGAGGCTGCAGCAGACGGTCTTATAGAGTGGAATTCTCTGAACCTAGAGGGGCATCACTTTCAGAACTCCCGCCTGTATTTGTTAAGTATGGAGGATGGTCGATACTTGAAGGGATTTAA

Protein sequence

MLSRIHQWKPLHFLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVREILTGLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLSIQNPDVAVAFFYLLRNKYGFRHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSRCRDNIPSVACIKLYASDKQRKESHGNDPCGLSQLGGCSRRSYRVEFSEPRGASLSELPPVFVKYGGWSILEGI
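
The protein sequence follows from the coding sequence by codon-wise translation. The sketch below illustrates this on the first and last few codons of the CDS shown above, assuming the standard genetic code (NCBI translation table 1); the helper `translate` is a hypothetical name written for this illustration, not a function of the source database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTTAGCCGTATTCATCAATGGAAGCCA"  # first 10 codons of the CDS
cds_end = "ATACTTGAAGGGATTTAA"               # last 6 codons, including stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MLSRIHQWKP and ILEGI, matching the start and end of the protein sequence listed above.
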
BLAST of Cp4.1LG01g13140.1 vs. Swiss-Prot
Match: PPR41_ARATH (Putative pentatricopeptide repeat-containing protein At1g13630 OS=Arabidopsis thaliana GN=At1g13630 PE=2 SV=3)

HSP 1 Score: 399.4 bits (1025), Expect = 8.6e-110
Identity = 212/511 (41.49%), Positives = 314/511 (61.45%), Query Frame = 1

Query: 189 IHDALFVIAKMKDLNLQASVPTYNSL---LHNLRHTDVMWDICNEVKASGAPQSEYTTSI 248
           I +AL + + M    ++    TYN L    H L      W++  ++   G      T +I
Sbjct: 315 IAEALELASDMNKHGVEPDSVTYNILAKGFHLLGMISGAWEVIRDMLDKGLSPDVITYTI 374

Query: 249 LIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGR 308
           L+ G C    +   +  L+D              ++S+  ++  +   SV+LS LCK GR
Sbjct: 375 LLCGQCQLGNIDMGLVLLKD--------------MLSRGFELNSIIPCSVMLSGLCKTGR 434

Query: 309 IEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRA 368
           I+EAL+L N+M+   L PDL+ YSI+IHGLCK G    A  LY++M  KR  PN  +  A
Sbjct: 435 IDEALSLFNQMKADGLSPDLVAYSIVIHGLCKLGKFDMALWLYDEMCDKRILPNSRTHGA 494

Query: 369 VLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERG 428
           +LLG  + G + EAR   D+L       D++L+NI+IDGY + G I EA++L+  + E G
Sbjct: 495 LLLGLCQKGMLLEARSLLDSLISSGETLDIVLYNIVIDGYAKSGCIEEALELFKVVIETG 554

Query: 429 ITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMF 488
           ITP+V TFN+L++G+C+  ++ EARK+  +I+L GL PSVV+YTTLM+AY   GN + + 
Sbjct: 555 ITPSVATFNSLIYGYCKTQNIAEARKILDVIKLYGLAPSVVSYTTLMDAYANCGNTKSID 614

Query: 489 DLLHEMEANAVVPTHITYTVLIKGLCR----QNKMH--------EALQLLEYMYAKGLMP 548
           +L  EM+A  + PT++TY+V+ KGLCR    +N  H        +  Q L  M ++G+ P
Sbjct: 615 ELRREMKAEGIPPTNVTYSVIFKGLCRGWKHENCNHVLRERIFEKCKQGLRDMESEGIPP 674

Query: 549 DQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRML 608
           DQITYNTIIQ  C+ + ++ AF     M   NLD +  TYN+LI  LCVYG ++ AD  +
Sbjct: 675 DQITYNTIIQYLCRVKHLSGAFVFLEIMKSRNLDASSATYNILIDSLCVYGYIRKADSFI 734

Query: 609 VSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKR 668
            S+++QN+SL+K AY T+IKAHC KG    A+  F+Q+L + F +SIRDYSA+INRLC+R
Sbjct: 735 YSLQEQNVSLSKFAYTTLIKAHCVKGDPEMAVKLFHQLLHRGFNVSIRDYSAVINRLCRR 794

Query: 669 GLITEAKYLFAMMLSEGVTPDSEICETMLNA 685
            L+ E+K+ F +MLS+G++PD +ICE M+ +
Sbjct: 795 HLVNESKFFFCLMLSQGISPDLDICEVMIKS 811
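
The percentages in an HSP summary line are the matched counts divided by the alignment length. A minimal sketch of that arithmetic follows, assuming the summary line always lists the identity fraction first and the positives fraction second; `hsp_percentages` is a hypothetical helper, not part of BLAST.

```python
import re

def hsp_percentages(stat_line):
    """Return (identity %, positives %) recomputed from an HSP summary line.

    Assumption: the first 'n/m' fraction is identities and the second is
    positives, both over the same alignment length m.
    """
    fractions = re.findall(r"(\d+)/(\d+)", stat_line)
    if len(fractions) < 2:
        raise ValueError("expected identity and positives fractions")
    (ident, length), (pos, _) = fractions[0], fractions[1]
    return (
        round(100 * int(ident) / int(length), 2),
        round(100 * int(pos) / int(length), 2),
    )

line = "Identity = 212/511 (41.49%), Positives = 314/511 (61.45%), Query Frame = 1"
print(hsp_percentages(line))
```

For the PPR41_ARATH hit this reproduces the reported 41.49% identity and 61.45% positives over 511 aligned positions.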

BLAST of Cp4.1LG01g13140.1 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 5.3e-67
Identity = 159/587 (27.09%), Positives = 286/587 (48.72%), Query Frame = 1

Query: 116 FRHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVV 175
           F+H+  S  A+ HIL   GR  +    + +++   G  S     + L + F N  SN  V
Sbjct: 109 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV-SRLEIVNSLDSTFSNCGSNDSV 168

Query: 176 WDMLAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRH---TDVMWDICNEVK 235
           +D+L   Y +   + +A      ++      S+   N+L+ +L      ++ W +  E+ 
Sbjct: 169 FDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEIS 228

Query: 236 ASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVD 295
            SG   + YT +I+++ LC   K++   +FL    E  V P IV+ NT++S +   GL+ 
Sbjct: 229 RSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLM- 288

Query: 296 VASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQM 355
                          EEA  L+N M      P +  Y+ +I+GLCK G  +RA +++ +M
Sbjct: 289 ---------------EEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEM 348

Query: 356 RLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDI 415
                 P+  + R++L+   + G++ E  K F  +   D++ D++ F+ M+  + R G++
Sbjct: 349 LRSGLSPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNL 408

Query: 416 SEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTL 475
            +A+  +  + E G+ P  V +  L+ G+CR G +  A  +   +   G    VVTY T+
Sbjct: 409 DKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTI 468

Query: 476 MNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGL 535
           ++  C+   + E   L +EM   A+ P   T T+LI G C+   +  A++L + M  K +
Sbjct: 469 LHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRI 528

Query: 536 MPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADR 595
             D +TYNT++  F K  DI  A +++ +M+   + PT ++Y++L++ LC  G L +A R
Sbjct: 529 RLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFR 588

Query: 596 MLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLC 655
           +   M  +NI  T +   ++IK +C  G  S    F  +M+++ FV     Y+ +I    
Sbjct: 589 VWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFV 648

Query: 656 KRGLITEAKYLFAMMLSE--GVTPDSEICETMLNAFHQHGDSRCRDN 697
           +   +++A  L   M  E  G+ PD     ++L+ F       CR N
Sbjct: 649 REENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGF-------CRQN 671

BLAST of Cp4.1LG01g13140.1 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 7.6e-66
Identity = 152/545 (27.89%), Positives = 270/545 (49.54%), Query Frame = 1

Query: 158 FCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHN 217
           F DLL+  +++W S+  V+D+         ++ +A  V  KM +  L  SV + N  L  
Sbjct: 160 FFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTR 219

Query: 218 LRH----TDVMWDICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVG-- 277
           L      T     +  E    G   +  + +I+IH +C   ++++A   L    E+ G  
Sbjct: 220 LSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLL-LMELKGYT 279

Query: 278 PSIVSINTVMSKFCKVGLVD-------------------VASVLLSCLCKVGRIEEALAL 337
           P ++S +TV++ +C+ G +D                   +   ++  LC++ ++ EA   
Sbjct: 280 PDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEA 339

Query: 338 LNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFE 397
            +EM    + PD +VY+ LI G CK G ++ A + + +M  +   P+  +  A++ GF +
Sbjct: 340 FSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQ 399

Query: 398 NGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVT 457
            G++ EA K F  +    L  D + F  +I+GY + G + +A +++  M + G +P VVT
Sbjct: 400 IGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVT 459

Query: 458 FNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEME 517
           + TL+ G C+ GDL  A ++   +   GL P++ TY +++N  C++GN++E   L+ E E
Sbjct: 460 YTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFE 519

Query: 518 ANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIA 577
           A  +    +TYT L+   C+  +M +A ++L+ M  KGL P  +T+N ++  FC    + 
Sbjct: 520 AAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLE 579

Query: 578 KAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTII 637
              ++ N ML   + P   T+N L+   C+  +LK A  +   M  + +      Y  ++
Sbjct: 580 DGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLV 639

Query: 638 KAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVT 678
           K HC    + +A   F +M  K F +S+  YS +I    KR    EA+ +F  M  EG+ 
Sbjct: 640 KGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLA 699

BLAST of Cp4.1LG01g13140.1 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 252.7 bits (644), Expect = 1.3e-65
Identity = 157/569 (27.59%), Positives = 282/569 (49.56%), Query Frame = 1

Query: 127 SHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRH 186
           +HIL     +     ++K+L     SG +S     L+  +R  +SN  V+D+L   Y R 
Sbjct: 79  THILVRARMYDPARHILKEL--SLMSGKSSFVFGALMTTYRLCNSNPSVYDILIRVYLRE 138

Query: 187 EMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHT--DV-MWDICNEVKASGAPQSEYTT 246
            MI D+L +   M       SV T N++L ++  +  DV +W    E+          T 
Sbjct: 139 GMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATF 198

Query: 247 SILIHGLCAQSKLQDAISFLQDSNEVVG--PSIVSINTVMSKFCKVGLVDVASVLLSCLC 306
           +ILI+ LCA+   + + S+L    E  G  P+IV+ NTV+  +CK               
Sbjct: 199 NILINVLCAEGSFEKS-SYLMQKMEKSGYAPTIVTYNTVLHWYCKK-------------- 258

Query: 307 KVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYF 366
             GR + A+ LL+ M++  +  D+  Y++LIH LC+   + + Y L   MR +   PN  
Sbjct: 259 --GRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEV 318

Query: 367 SQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRM 426
           +   ++ GF   G +  A +  + +    L  + + FN +IDG++  G+  EA++++Y M
Sbjct: 319 TYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMM 378

Query: 427 FERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNM 486
             +G+TP+ V++  L+ G C+N +   AR  +  ++ NG+    +TYT +++  C+ G +
Sbjct: 379 EAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFL 438

Query: 487 QEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTI 546
            E   LL+EM  + + P  +TY+ LI G C+  +   A +++  +Y  GL P+ I Y+T+
Sbjct: 439 DEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTL 498

Query: 547 IQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNI 606
           I   C+   + +A ++Y  M+L      H T+NVL++ LC  G + +A+  +  M    I
Sbjct: 499 IYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGI 558

Query: 607 SLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKY 666
               V++  +I  +   G+  KA   F++M       +   Y +++  LCK G + EA+ 
Sbjct: 559 LPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEK 618

Query: 667 LFAMMLSEGVTPDSEICETMLNAFHQHGD 691
               + +     D+ +  T+L A  + G+
Sbjct: 619 FLKSLHAVPAAVDTVMYNTLLTAMCKSGN 628

BLAST of Cp4.1LG01g13140.1 vs. Swiss-Prot
Match: PP412_ARATH (Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidopsis thaliana GN=At5g41170 PE=2 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 1.4e-64
Identity = 155/545 (28.44%), Positives = 272/545 (49.91%), Query Frame = 1

Query: 135 RFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALF 194
           RF +LH   +  + +  SG A SF  LL   F  W      +  +          ++AL 
Sbjct: 4   RFFQLH---RNRLVKGNSGKALSFSRLLDLSF--WVRAFCNYREILRNGLHSLQFNEALD 63

Query: 195 VIAKMKDLNLQASVPTYNSLLH---NLRHTDVMWDICNEVKASGAPQSEYTTSILIHGLC 254
           +   M +     S+  +  LL+    ++  DV+ ++C+ ++  G     YT ++L++  C
Sbjct: 64  LFTHMVESRPLPSIIDFTKLLNVIAKMKKFDVVINLCDHLQIMGVSHDLYTCNLLMNCFC 123

Query: 255 AQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEAL 314
             S+   A SFL    ++   P IV+  ++++ FC +G                R+EEA+
Sbjct: 124 QSSQPYLASSFLGKMMKLGFEPDIVTFTSLINGFC-LG---------------NRMEEAM 183

Query: 315 ALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGF 374
           +++N+M  + +KPD+++Y+ +I  LCK G V  A  L++QM      P+     +++ G 
Sbjct: 184 SMVNQMVEMGIKPDVVMYTTIIDSLCKNGHVNYALSLFDQMENYGIRPDVVMYTSLVNGL 243

Query: 375 FENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTV 434
             +G   +A      +T   +  DVI FN +ID +V+ G   +A +LY  M    I P +
Sbjct: 244 CNSGRWRDADSLLRGMTKRKIKPDVITFNALIDAFVKEGKFLDAEELYNEMIRMSIAPNI 303

Query: 435 VTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHE 494
            T+ +L++GFC  G + EAR+MF ++   G  P VV YT+L+N +C+   + +   + +E
Sbjct: 304 FTYTSLINGFCMEGCVDEARQMFYLMETKGCFPDVVAYTSLINGFCKCKKVDDAMKIFYE 363

Query: 495 MEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARD 554
           M    +    ITYT LI+G  +  K + A ++  +M ++G+ P+  TYN ++ C C    
Sbjct: 364 MSQKGLTGNTITYTTLIQGFGQVGKPNVAQEVFSHMVSRGVPPNIRTYNVLLHCLCYNGK 423

Query: 555 IAKAFQVYNEMLLHNLD---PTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVA 614
           + KA  ++ +M    +D   P   TYNVL+ GLC  G L+ A  +   M  + + +  + 
Sbjct: 424 VKKALMIFEDMQKREMDGVAPNIWTYNVLLHGLCYNGKLEKALMVFEDMRKREMDIGIIT 483

Query: 615 YMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMML 673
           Y  II+  C  G+V  A+  F  + +K    ++  Y+ +I+ L + GL  EA  LF  M 
Sbjct: 484 YTIIIQGMCKAGKVKNAVNLFCSLPSKGVKPNVVTYTTMISGLFREGLKHEAHVLFRKMK 527

BLAST of Cp4.1LG01g13140.1 vs. TrEMBL
Match: A0A0A0L9A2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G642640 PE=4 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 3.9e-194
Identity = 351/549 (63.93%), Positives = 420/549 (76.50%), Query Frame = 1

Query: 156 SSFC-----DLLLNKFRNWDSNGVVWDMLAFAYSRHEM-----IHDALFVIAKMKDLNLQ 215
           S FC     D+  + F     NG++ D  ++    H +     + +AL     M+   ++
Sbjct: 279 SKFCKVGLIDVARSFFCLMVKNGLLHDSFSYNILLHGLCVAGSMDEALGFTDDMEKHGVE 338

Query: 216 ASVPTYNSLLHNLRHTDVMWD---ICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISF 275
             V TYN+L        +M     +  ++   G      T + LI G C    +++A+  
Sbjct: 339 PDVVTYNTLAKGFLLLGLMSGARKVVQKMLLQGLNPDLVTYTTLICGHCQMGNIEEALKL 398

Query: 276 LQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLK 335
            Q++              +S+  K+ ++   ++LLSCLCKVGRIEEAL L +EMETLRL+
Sbjct: 399 RQET--------------LSRGFKLNVI-FYNMLLSCLCKVGRIEEALTLFDEMETLRLE 458

Query: 336 PDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKY 395
           PD IVYSILIHGLCKEGFVQRAYQLYEQMRLKR FP++F+QRAVLLG F+NGNISEAR Y
Sbjct: 459 PDFIVYSILIHGLCKEGFVQRAYQLYEQMRLKRKFPHHFAQRAVLLGLFKNGNISEARNY 518

Query: 396 FDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCR 455
           FD  T MDL+EDV+L+NIMIDGYVRL  I+EAMQLYY+M ERGITP+VVTFNTL++GFCR
Sbjct: 519 FDTWTRMDLMEDVVLYNIMIDGYVRLDGIAEAMQLYYKMIERGITPSVVTFNTLINGFCR 578

Query: 456 NGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHIT 515
            GDL+EARKM ++IRL GL+PSVVTYTTLMNAYCE GNMQEMF  LHEMEANAVVPTH+T
Sbjct: 579 RGDLMEARKMLEVIRLKGLVPSVVTYTTLMNAYCEVGNMQEMFHFLHEMEANAVVPTHVT 638

Query: 516 YTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEML 575
           YTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD +TYNTIIQCFCK ++I KA Q+YN ML
Sbjct: 639 YTVLIKGLCRQNKMHESLQLLEYMYAKGLLPDSVTYNTIIQCFCKGKEITKALQLYNMML 698

Query: 576 LHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVS 635
           LHN DPT VTY VLI+ LC++GDLKD DRM+VS+ED+NI+L KV YMTIIKAHCAKGQVS
Sbjct: 699 LHNCDPTQVTYKVLINALCIFGDLKDVDRMVVSIEDRNITLKKVTYMTIIKAHCAKGQVS 758

Query: 636 KALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETML 692
           KALG+FNQMLAK FVISIRDYSA+INRLCKRGLITEAKY F MMLSEGVTPD EIC+T+L
Sbjct: 759 KALGYFNQMLAKGFVISIRDYSAVINRLCKRGLITEAKYFFVMMLSEGVTPDPEICKTVL 812

BLAST of Cp4.1LG01g13140.1 vs. TrEMBL
Match: M5XH23_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022936mg PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 4.6e-150
Identity = 271/456 (59.43%), Positives = 344/456 (75.44%), Query Frame = 1

Query: 235 GAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVAS 294
           G      T +ILI G C    +++A+ F ++              ++S+  ++ ++ V S
Sbjct: 5   GLNPDHVTYTILICGHCHAGNIEEALKFRKE--------------MLSRGFQLSVI-VYS 64

Query: 295 VLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLK 354
           VLLS LCK GR+EEAL LL EME + L+PDLI YSILIHGLCK+G VQRA +LY +M +K
Sbjct: 65  VLLSSLCKSGRVEEALRLLYEMEAVGLEPDLITYSILIHGLCKQGDVQRASELYREMYMK 124

Query: 355 RNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEA 414
           R  PNYF+ R++LLG  E G+ISEARKYFD L   D+ ED++L+NIM+DGYV+LG+I E+
Sbjct: 125 RIIPNYFAHRSILLGLREKGDISEARKYFDNLLTRDVTEDIVLYNIMMDGYVKLGNIVES 184

Query: 415 MQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNA 474
            +LY ++ E+GI P++VTFNTL++GFC+ G L EA KM   I+L+GLLPS  TYTTLMNA
Sbjct: 185 TRLYKQIIEKGINPSIVTFNTLIYGFCKTGKLAEAHKMLDTIKLHGLLPSPFTYTTLMNA 244

Query: 475 YCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPD 534
             E GN+  M  LL EMEANAV PTH++YTV+IK L +  K+ EA+ L+E MYAKGL PD
Sbjct: 245 NIERGNIHGMLKLLQEMEANAVQPTHVSYTVVIKALFKLGKLQEAVHLVEDMYAKGLTPD 304

Query: 535 QITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLV 594
           QITYNT+I+CFC+ARD  KAFQ++NEML+HNL+PT VTYNVLI+GLCVYGDL DADR+LV
Sbjct: 305 QITYNTLIKCFCRARDFLKAFQLHNEMLVHNLEPTPVTYNVLINGLCVYGDLMDADRLLV 364

Query: 595 SMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRG 654
           S+ D NI+LTKVAY T+IKAHCAKG V +A+G F+QM+ K F ISI+DYSA+INRLCKR 
Sbjct: 365 SLCDCNINLTKVAYTTLIKAHCAKGDVHRAVGLFHQMVKKGFEISIQDYSAVINRLCKRC 424

Query: 655 LITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGD 691
           LIT+AKY F MMLS G+ PD E+C  MLN F   GD
Sbjct: 425 LITDAKYFFCMMLSNGICPDQELCGVMLNTFRHVGD 445

BLAST of Cp4.1LG01g13140.1 vs. TrEMBL
Match: D7TA84_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g00630 PE=4 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 1.8e-146
Identity = 265/529 (50.09%), Positives = 370/529 (69.94%), Query Frame = 1

Query: 174 VVWDMLAFAYSRHEMIH---------DALFVIAKMKDLNLQASVPTYNSLLHNLRHTDVM 233
           + + +L   YS + ++H         +AL     M++  ++  + TYN L +  R   ++
Sbjct: 297 IKYGLLPDVYSYNILLHGLCVAGSMEEALEFTNDMENHGVEPDIVTYNILANGFRILGLI 356

Query: 234 ---WDICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVM 293
              W +   +  +G      T +ILI G C    ++++    +               ++
Sbjct: 357 SGAWKVVQRMLLNGLNPDLVTYTILICGHCQMGNIEESFKLKEK--------------ML 416

Query: 294 SKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFV 353
           S+  K+ +V   +VLLS LCK GRI+EA+ LL+EME + LKPDL+ YS         G V
Sbjct: 417 SQGLKLSIVTY-TVLLSSLCKSGRIDEAVILLHEMEVIGLKPDLLTYS--------RGAV 476

Query: 354 QRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIM 413
           + A +LYE+M  KR +PN F   A++ G FE G ISEA+ YFD++T  D+ E++IL+NIM
Sbjct: 477 EEAIELYEEMCSKRIYPNSFVCSAIISGLFEKGAISEAQMYFDSVTKSDVAEEIILYNIM 536

Query: 414 IDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGL 473
           IDGY +LG+I EA++ Y ++ E+GI+PT+VTFN+L++GFC+ G L EA K+   I+++GL
Sbjct: 537 IDGYAKLGNIGEAVRSYKQIIEKGISPTIVTFNSLIYGFCKKGKLAEAVKLLDTIKVHGL 596

Query: 474 LPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQ 533
           +P+ VTYTTLMN YCE G+M  MFD+LHEMEA A+ PT ITYTV++KGLC++ ++HE++Q
Sbjct: 597 VPTSVTYTTLMNGYCEEGDMHSMFDMLHEMEAKAIKPTQITYTVVVKGLCKEGRLHESVQ 656

Query: 534 LLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLC 593
           LL+YMYA+GL PDQITYNT+IQ FCKA D+ KAFQ++N+ML H+L P+ VTYNVLI+GLC
Sbjct: 657 LLKYMYARGLFPDQITYNTVIQSFCKAHDLQKAFQLHNQMLQHSLQPSPVTYNVLINGLC 716

Query: 594 VYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIR 653
           VYG+LKDADR+LV+++DQ+I LTKVAY TIIKAHCAKG V  AL FF+QM+ + F +SIR
Sbjct: 717 VYGNLKDADRLLVTLQDQSIRLTKVAYTTIIKAHCAKGDVQNALVFFHQMVERGFEVSIR 776

Query: 654 DYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGD 691
           DYSA+INRLCKR LIT+AK+ F MML+ G+ PD +IC  MLNAFH+ GD
Sbjct: 777 DYSAVINRLCKRNLITDAKFFFCMMLTHGIPPDQDICLVMLNAFHRSGD 802

BLAST of Cp4.1LG01g13140.1 vs. TrEMBL
Match: B9HWT8_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0010s14700g PE=4 SV=2)

HSP 1 Score: 517.7 bits (1332), Expect = 2.4e-143
Identity = 278/581 (47.85%), Positives = 389/581 (66.95%), Query Frame = 1

Query: 130 LAGKGRFKELHCVIKQLVEEQGSGSASSF-------CDL-LLNKFRNWDSNGVVWDMLAF 189
           L G+ RF++    ++Q   ++ + S  SF       C L L +  +++    + + +L  
Sbjct: 136 LCGQSRFRDAVLFLRQNDGKEFAPSVVSFNTIMSRYCKLGLADVAKSFFCMMLKYGILPD 195

Query: 190 AYSRHEMIH---------DALFVIAKMKDLNLQASVPTYNSLLHNLRHTDVMWD----IC 249
            YS + +IH         +AL +   M+   LQ  + TY  +        +M      I 
Sbjct: 196 TYSYNILIHGLIVAGSMEEALELTNDMEKQGLQPDMVTYKIVAKGFHLLGLMSGAREIIQ 255

Query: 250 NEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVG 309
             +   G      T ++LI G C    +++A+   +D              ++S   ++ 
Sbjct: 256 KMLTDEGLKPDLVTYTVLICGHCQMGNIEEALRLRRD--------------LLSSGFQLN 315

Query: 310 LVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLY 369
           ++ + SVLLS LCK G+++EAL LL EME   L+PDL+ YSILIHGLCK+G VQ+A QLY
Sbjct: 316 VI-LYSVLLSSLCKRGQVDEALQLLYEMEANNLQPDLVTYSILIHGLCKQGKVQQAIQLY 375

Query: 370 EQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRL 429
           ++M   R FPN F+   +L G  E G +S+AR YFD+L   +L  DV L+NIMIDGYV+L
Sbjct: 376 KEMCFNRIFPNSFAHSGILKGLCEKGMLSDARMYFDSLIMSNLRPDVTLYNIMIDGYVKL 435

Query: 430 GDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTY 489
           GD+ EA++LY R+ ++ ITP++VTFN+L++GFC+N  +VEAR++ + I+L+GL PS VTY
Sbjct: 436 GDVEEAVRLYKRLRDKAITPSIVTFNSLIYGFCKNRKVVEARRLLESIKLHGLEPSAVTY 495

Query: 490 TTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYA 549
           TTLMNAYCE GN+ ++ +LL EM    + PT +TYTV+IKGLC+Q K+ E++QLLE M A
Sbjct: 496 TTLMNAYCEEGNINKLHELLLEMNLKDIEPTVVTYTVVIKGLCKQRKLEESVQLLEDMRA 555

Query: 550 KGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKD 609
           KGL PDQITYNTIIQCFCKA+D+ KAF++ ++ML+HNL+PT  TYNVLI GLC YGD++D
Sbjct: 556 KGLAPDQITYNTIIQCFCKAKDMRKAFELLDDMLIHNLEPTPATYNVLIDGLCRYGDVED 615

Query: 610 ADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIIN 669
           ADR+LVS++D+NI+LTKVAY T+IKAHC KG   +A+  F+QM+ K F +SI+DYSA+IN
Sbjct: 616 ADRVLVSLQDRNINLTKVAYTTMIKAHCVKGDAQRAVKVFHQMVEKGFEVSIKDYSAVIN 675

Query: 670 RLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHG 690
           RLCKR LI EAKY F +MLS+GV+PD EI E MLNAFH+ G
Sbjct: 676 RLCKRCLINEAKYYFCIMLSDGVSPDQEIFEMMLNAFHRAG 701

BLAST of Cp4.1LG01g13140.1 vs. TrEMBL
Match: A0A067LDB5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01872 PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 7.8e-142
Identity = 267/524 (50.95%), Positives = 363/524 (69.27%), Query Frame = 1

Query: 178 MLAFAYSRHEMIH---------DALFVIAKMKDLNLQASVPTYNSLLHNLRHTDVM---W 237
           +L  AYS + +IH         +AL    +M+   +Q  + TY  L        +M   W
Sbjct: 119 LLPDAYSYNILIHGLCLAGSIEEALEFANEMEKHGVQPDMVTYKILAKGFHLVGLMSGAW 178

Query: 238 DICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFC 297
            I  E           T +ILI G C    +++A    ++              ++S+  
Sbjct: 179 KIIQETLIKRQIPDLVTYTILICGNCQIGNIEEASRLHKE--------------MISQGF 238

Query: 298 KVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAY 357
           ++ ++   +VLLS LCK G+++EAL LL EM+   L+PDL+ Y+ILIHGLCK+G V RA 
Sbjct: 239 QLSIISY-TVLLSSLCKSGQVDEALKLLGEMKANGLQPDLVTYTILIHGLCKQGEVPRAI 298

Query: 358 QLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGY 417
           QLY++M L R FP+ F+  A+L+G  + G I +AR YFD+L   +L  D+IL+NIMIDGY
Sbjct: 299 QLYDEMYLSRIFPSSFTHSAILMGLRDKGMILKARMYFDSLMSSNLTPDIILYNIMIDGY 358

Query: 418 VRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSV 477
           V+ G+I +A+ LY +M E+GI+PT+VTFN L++GFC+   + EAR +   I+L+GL PS 
Sbjct: 359 VKHGNIRQAINLYRQMGEKGISPTIVTFNCLINGFCKTKKVAEARWLLHTIKLHGLEPSA 418

Query: 478 VTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEY 537
           VTYTTLMNAYCE GN+Q + +LL EMEA A+ PTH+TYTV+IKGLC+Q K+ E+ QLLE 
Sbjct: 419 VTYTTLMNAYCEEGNIQNLLELLSEMEAKAIGPTHVTYTVMIKGLCKQWKLRESCQLLEE 478

Query: 538 MYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGD 597
           M+AKGL PDQ+TYN IIQ FCKARD+ KAFQ++++MLLHNL+PT VTYNVLI GLCVYGD
Sbjct: 479 MHAKGLTPDQVTYNIIIQAFCKARDMRKAFQLFDKMLLHNLEPTSVTYNVLIKGLCVYGD 538

Query: 598 LKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSA 657
           LK AD ++VS++ + I+L+K+AY TIIKAHCAKG V +A+ +F+QM  + F +SIRDYSA
Sbjct: 539 LKAADNLVVSLQARKINLSKIAYTTIIKAHCAKGDVHRAIAYFHQMSKRGFEVSIRDYSA 598

Query: 658 IINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHG 690
           +I+RLCKR LIT+AKY F MML++GV+PD EICE ML+AF   G
Sbjct: 599 VISRLCKRCLITKAKYFFCMMLADGVSPDQEICEVMLDAFQLGG 627

BLAST of Cp4.1LG01g13140.1 vs. TAIR10
Match: AT1G13630.1 (AT1G13630.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 365.2 bits (936), Expect = 1.0e-100
Identity = 199/483 (41.20%), Positives = 292/483 (60.46%), Query Frame = 1

Query: 189 IHDALFVIAKMKDLNLQASVPTYNSL---LHNLRHTDVMWDICNEVKASGAPQSEYTTSI 248
           I +AL + + M    ++    TYN L    H L      W++  ++   G      T +I
Sbjct: 273 IAEALELASDMNKHGVEPDSVTYNILAKGFHLLGMISGAWEVIRDMLDKGLSPDVITYTI 332

Query: 249 LIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGR 308
           L+ G C    +   +  L+D              ++S+  ++  +   SV+LS LCK GR
Sbjct: 333 LLCGQCQLGNIDMGLVLLKD--------------MLSRGFELNSIIPCSVMLSGLCKTGR 392

Query: 309 IEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRA 368
           I+EAL+L N+M+   L PDL+ YSI+IHGLCK G    A  LY++M  KR  PN  +  A
Sbjct: 393 IDEALSLFNQMKADGLSPDLVAYSIVIHGLCKLGKFDMALWLYDEMCDKRILPNSRTHGA 452

Query: 369 VLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERG 428
           +LLG  + G + EAR   D+L       D++L+NI+IDGY + G I EA++L+  + E G
Sbjct: 453 LLLGLCQKGMLLEARSLLDSLISSGETLDIVLYNIVIDGYAKSGCIEEALELFKVVIETG 512

Query: 429 ITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMF 488
           ITP+V TFN+L++G+C+  ++ EARK+  +I+L GL PSVV+YTTLM+AY   GN + + 
Sbjct: 513 ITPSVATFNSLIYGYCKTQNIAEARKILDVIKLYGLAPSVVSYTTLMDAYANCGNTKSID 572

Query: 489 DLLHEMEANAVVPTHITYTVLIKGLCR----QNKMH--------EALQLLEYMYAKGLMP 548
           +L  EM+A  + PT++TY+V+ KGLCR    +N  H        +  Q L  M ++G+ P
Sbjct: 573 ELRREMKAEGIPPTNVTYSVIFKGLCRGWKHENCNHVLRERIFEKCKQGLRDMESEGIPP 632

Query: 549 DQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRML 608
           DQITYNTIIQ  C+ + ++ AF     M   NLD +  TYN+LI  LCVYG ++ AD  +
Sbjct: 633 DQITYNTIIQYLCRVKHLSGAFVFLEIMKSRNLDASSATYNILIDSLCVYGYIRKADSFI 692

Query: 609 VSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKR 657
            S+++QN+SL+K AY T+IKAHC KG    A+  F+Q+L + F +SIRDYSA+INRLC+R
Sbjct: 693 YSLQEQNVSLSKFAYTTLIKAHCVKGDPEMAVKLFHQLLHRGFNVSIRDYSAVINRLCRR 741

BLAST of Cp4.1LG01g13140.1 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 257.3 bits (656), Expect = 3.0e-68
Identity = 159/587 (27.09%), Positives = 286/587 (48.72%), Query Frame = 1

Query: 116 FRHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVV 175
           F+H+  S  A+ HIL   GR  +    + +++   G  S     + L + F N  SN  V
Sbjct: 109 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV-SRLEIVNSLDSTFSNCGSNDSV 168

Query: 176 WDMLAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRH---TDVMWDICNEVK 235
           +D+L   Y +   + +A      ++      S+   N+L+ +L      ++ W +  E+ 
Sbjct: 169 FDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEIS 228

Query: 236 ASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVD 295
            SG   + YT +I+++ LC   K++   +FL    E  V P IV+ NT++S +   GL+ 
Sbjct: 229 RSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLM- 288

Query: 296 VASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQM 355
                          EEA  L+N M      P +  Y+ +I+GLCK G  +RA +++ +M
Sbjct: 289 ---------------EEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEM 348

Query: 356 RLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDI 415
                 P+  + R++L+   + G++ E  K F  +   D++ D++ F+ M+  + R G++
Sbjct: 349 LRSGLSPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNL 408

Query: 416 SEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTL 475
            +A+  +  + E G+ P  V +  L+ G+CR G +  A  +   +   G    VVTY T+
Sbjct: 409 DKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTI 468

Query: 476 MNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGL 535
           ++  C+   + E   L +EM   A+ P   T T+LI G C+   +  A++L + M  K +
Sbjct: 469 LHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRI 528

Query: 536 MPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADR 595
             D +TYNT++  F K  DI  A +++ +M+   + PT ++Y++L++ LC  G L +A R
Sbjct: 529 RLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFR 588

Query: 596 MLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLC 655
           +   M  +NI  T +   ++IK +C  G  S    F  +M+++ FV     Y+ +I    
Sbjct: 589 VWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFV 648

Query: 656 KRGLITEAKYLFAMMLSE--GVTPDSEICETMLNAFHQHGDSRCRDN 697
           +   +++A  L   M  E  G+ PD     ++L+ F       CR N
Sbjct: 649 REENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGF-------CRQN 671

BLAST of Cp4.1LG01g13140.1 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 253.4 bits (646), Expect = 4.3e-67
Identity = 152/545 (27.89%), Positives = 270/545 (49.54%), Query Frame = 1

Query: 158 FCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHN 217
           F DLL+  +++W S+  V+D+         ++ +A  V  KM +  L  SV + N  L  
Sbjct: 160 FFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTR 219

Query: 218 LRH----TDVMWDICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVG-- 277
           L      T     +  E    G   +  + +I+IH +C   ++++A   L    E+ G  
Sbjct: 220 LSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLL-LMELKGYT 279

Query: 278 PSIVSINTVMSKFCKVGLVD-------------------VASVLLSCLCKVGRIEEALAL 337
           P ++S +TV++ +C+ G +D                   +   ++  LC++ ++ EA   
Sbjct: 280 PDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEA 339

Query: 338 LNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFE 397
            +EM    + PD +VY+ LI G CK G ++ A + + +M  +   P+  +  A++ GF +
Sbjct: 340 FSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQ 399

Query: 398 NGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVT 457
            G++ EA K F  +    L  D + F  +I+GY + G + +A +++  M + G +P VVT
Sbjct: 400 IGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVT 459

Query: 458 FNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEME 517
           + TL+ G C+ GDL  A ++   +   GL P++ TY +++N  C++GN++E   L+ E E
Sbjct: 460 YTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFE 519

Query: 518 ANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIA 577
           A  +    +TYT L+   C+  +M +A ++L+ M  KGL P  +T+N ++  FC    + 
Sbjct: 520 AAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLE 579

Query: 578 KAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTII 637
              ++ N ML   + P   T+N L+   C+  +LK A  +   M  + +      Y  ++
Sbjct: 580 DGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLV 639

Query: 638 KAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVT 678
           K HC    + +A   F +M  K F +S+  YS +I    KR    EA+ +F  M  EG+ 
Sbjct: 640 KGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLA 699

BLAST of Cp4.1LG01g13140.1 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 252.7 bits (644), Expect = 7.3e-67
Identity = 157/569 (27.59%), Positives = 282/569 (49.56%), Query Frame = 1

Query: 127 SHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRH 186
           +HIL     +     ++K+L     SG +S     L+  +R  +SN  V+D+L   Y R 
Sbjct: 119 THILVRARMYDPARHILKEL--SLMSGKSSFVFGALMTTYRLCNSNPSVYDILIRVYLRE 178

Query: 187 EMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHT--DV-MWDICNEVKASGAPQSEYTT 246
            MI D+L +   M       SV T N++L ++  +  DV +W    E+          T 
Sbjct: 179 GMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATF 238

Query: 247 SILIHGLCAQSKLQDAISFLQDSNEVVG--PSIVSINTVMSKFCKVGLVDVASVLLSCLC 306
           +ILI+ LCA+   + + S+L    E  G  P+IV+ NTV+  +CK               
Sbjct: 239 NILINVLCAEGSFEKS-SYLMQKMEKSGYAPTIVTYNTVLHWYCKK-------------- 298

Query: 307 KVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYF 366
             GR + A+ LL+ M++  +  D+  Y++LIH LC+   + + Y L   MR +   PN  
Sbjct: 299 --GRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEV 358

Query: 367 SQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRM 426
           +   ++ GF   G +  A +  + +    L  + + FN +IDG++  G+  EA++++Y M
Sbjct: 359 TYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMM 418

Query: 427 FERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNM 486
             +G+TP+ V++  L+ G C+N +   AR  +  ++ NG+    +TYT +++  C+ G +
Sbjct: 419 EAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFL 478

Query: 487 QEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTI 546
            E   LL+EM  + + P  +TY+ LI G C+  +   A +++  +Y  GL P+ I Y+T+
Sbjct: 479 DEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTL 538

Query: 547 IQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNI 606
           I   C+   + +A ++Y  M+L      H T+NVL++ LC  G + +A+  +  M    I
Sbjct: 539 IYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGI 598

Query: 607 SLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKY 666
               V++  +I  +   G+  KA   F++M       +   Y +++  LCK G + EA+ 
Sbjct: 599 LPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEK 658

Query: 667 LFAMMLSEGVTPDSEICETMLNAFHQHGD 691
               + +     D+ +  T+L A  + G+
Sbjct: 659 FLKSLHAVPAAVDTVMYNTLLTAMCKSGN 668

BLAST of Cp4.1LG01g13140.1 vs. TAIR10
Match: AT5G41170.1 (AT5G41170.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 249.2 bits (635), Expect = 8.1e-66
Identity = 155/545 (28.44%), Positives = 272/545 (49.91%), Query Frame = 1

Query: 135 RFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALF 194
           RF +LH   +  + +  SG A SF  LL   F  W      +  +          ++AL 
Sbjct: 4   RFFQLH---RNRLVKGNSGKALSFSRLLDLSF--WVRAFCNYREILRNGLHSLQFNEALD 63

Query: 195 VIAKMKDLNLQASVPTYNSLLH---NLRHTDVMWDICNEVKASGAPQSEYTTSILIHGLC 254
           +   M +     S+  +  LL+    ++  DV+ ++C+ ++  G     YT ++L++  C
Sbjct: 64  LFTHMVESRPLPSIIDFTKLLNVIAKMKKFDVVINLCDHLQIMGVSHDLYTCNLLMNCFC 123

Query: 255 AQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEAL 314
             S+   A SFL    ++   P IV+  ++++ FC +G                R+EEA+
Sbjct: 124 QSSQPYLASSFLGKMMKLGFEPDIVTFTSLINGFC-LG---------------NRMEEAM 183

Query: 315 ALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGF 374
           +++N+M  + +KPD+++Y+ +I  LCK G V  A  L++QM      P+     +++ G 
Sbjct: 184 SMVNQMVEMGIKPDVVMYTTIIDSLCKNGHVNYALSLFDQMENYGIRPDVVMYTSLVNGL 243

Query: 375 FENGNISEARKYFDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTV 434
             +G   +A      +T   +  DVI FN +ID +V+ G   +A +LY  M    I P +
Sbjct: 244 CNSGRWRDADSLLRGMTKRKIKPDVITFNALIDAFVKEGKFLDAEELYNEMIRMSIAPNI 303

Query: 435 VTFNTLVHGFCRNGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHE 494
            T+ +L++GFC  G + EAR+MF ++   G  P VV YT+L+N +C+   + +   + +E
Sbjct: 304 FTYTSLINGFCMEGCVDEARQMFYLMETKGCFPDVVAYTSLINGFCKCKKVDDAMKIFYE 363

Query: 495 MEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARD 554
           M    +    ITYT LI+G  +  K + A ++  +M ++G+ P+  TYN ++ C C    
Sbjct: 364 MSQKGLTGNTITYTTLIQGFGQVGKPNVAQEVFSHMVSRGVPPNIRTYNVLLHCLCYNGK 423

Query: 555 IAKAFQVYNEMLLHNLD---PTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVA 614
           + KA  ++ +M    +D   P   TYNVL+ GLC  G L+ A  +   M  + + +  + 
Sbjct: 424 VKKALMIFEDMQKREMDGVAPNIWTYNVLLHGLCYNGKLEKALMVFEDMRKREMDIGIIT 483

Query: 615 YMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMML 673
           Y  II+  C  G+V  A+  F  + +K    ++  Y+ +I+ L + GL  EA  LF  M 
Sbjct: 484 YTIIIQGMCKAGKVKNAVNLFCSLPSKGVKPNVVTYTTMISGLFREGLKHEAHVLFRKMK 527

BLAST of Cp4.1LG01g13140.1 vs. NCBI nr
Match: gi|659130189|ref|XP_008465042.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucumis melo])

HSP 1 Score: 713.0 bits (1839), Expect = 5.6e-202
Identity = 366/549 (66.67%), Positives = 430/549 (78.32%), Query Frame = 1

Query: 156 SSFCDL-LLNKFRNWDSNGVVWDMLAFAYSRHEMIH---------DALFVIAKMKDLNLQ 215
           S FC + L++  R++    V   +L  ++S + ++H         +AL     M+   ++
Sbjct: 277 SKFCKVGLIDVARSFFCLLVKSGLLHDSFSYNILVHGLCVAGSMDEALEFTDDMEKHGVE 336

Query: 216 ASVPTYNSLLHNLRHTDVMWD---ICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISF 275
             V TYN+L        +M     +  ++   G      T +ILI G C    +++A+  
Sbjct: 337 PDVVTYNTLAKGFLLLGLMSGARKVVQKMLLQGLNPDIVTYTILICGHCQMGNIEEALKL 396

Query: 276 LQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLK 335
            Q++              +S+  K+ ++   SVLLSCLCKVGRIEEAL L +EMETL LK
Sbjct: 397 RQET--------------LSRGFKLNIISY-SVLLSCLCKVGRIEEALTLFDEMETLHLK 456

Query: 336 PDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKY 395
           PD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YF+QRAVLLG F+NGNISEARKY
Sbjct: 457 PDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRIFPHYFAQRAVLLGLFKNGNISEARKY 516

Query: 396 FDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCR 455
           FD L  MDLIEDV+L+NIMIDGYVRLGDI+EAMQLYY M ERGITP+VVTFNTL++GFCR
Sbjct: 517 FDTLNRMDLIEDVVLYNIMIDGYVRLGDIAEAMQLYYNMIERGITPSVVTFNTLINGFCR 576

Query: 456 NGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHIT 515
            GDL+EARKM  +IRL GL+PSVVTYTTLMNAYCE GNMQEMF  LHEMEANAVVPTH+T
Sbjct: 577 RGDLMEARKMLDVIRLKGLVPSVVTYTTLMNAYCEVGNMQEMFHFLHEMEANAVVPTHVT 636

Query: 516 YTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEML 575
           YTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD +TYNTIIQCFCK ++I KAFQ+YN+ML
Sbjct: 637 YTVLIKGLCRQNKMHESLQLLEYMYAKGLVPDPVTYNTIIQCFCKGKEITKAFQLYNKML 696

Query: 576 LHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVS 635
           LHN DPTHVTYNVLI+GLC+YGDLKD DRM+VSMED+NI LTKVAYMTII+AHCAKGQVS
Sbjct: 697 LHNCDPTHVTYNVLINGLCIYGDLKDVDRMVVSMEDRNIILTKVAYMTIIQAHCAKGQVS 756

Query: 636 KALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETML 692
           KALG+FNQMLAK+FVISIRDYSA+INRLCKRGLITEAKY F MMLSEG+TPD EICET+L
Sbjct: 757 KALGYFNQMLAKNFVISIRDYSAVINRLCKRGLITEAKYFFVMMLSEGITPDPEICETVL 810

BLAST of Cp4.1LG01g13140.1 vs. NCBI nr
Match: gi|449453449|ref|XP_004144470.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucumis sativus])

HSP 1 Score: 686.4 bits (1770), Expect = 5.6e-194
Identity = 351/549 (63.93%), Positives = 420/549 (76.50%), Query Frame = 1

Query: 156 SSFC-----DLLLNKFRNWDSNGVVWDMLAFAYSRHEM-----IHDALFVIAKMKDLNLQ 215
           S FC     D+  + F     NG++ D  ++    H +     + +AL     M+   ++
Sbjct: 279 SKFCKVGLIDVARSFFCLMVKNGLLHDSFSYNILLHGLCVAGSMDEALGFTDDMEKHGVE 338

Query: 216 ASVPTYNSLLHNLRHTDVMWD---ICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISF 275
             V TYN+L        +M     +  ++   G      T + LI G C    +++A+  
Sbjct: 339 PDVVTYNTLAKGFLLLGLMSGARKVVQKMLLQGLNPDLVTYTTLICGHCQMGNIEEALKL 398

Query: 276 LQDSNEVVGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLK 335
            Q++              +S+  K+ ++   ++LLSCLCKVGRIEEAL L +EMETLRL+
Sbjct: 399 RQET--------------LSRGFKLNVI-FYNMLLSCLCKVGRIEEALTLFDEMETLRLE 458

Query: 336 PDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKY 395
           PD IVYSILIHGLCKEGFVQRAYQLYEQMRLKR FP++F+QRAVLLG F+NGNISEAR Y
Sbjct: 459 PDFIVYSILIHGLCKEGFVQRAYQLYEQMRLKRKFPHHFAQRAVLLGLFKNGNISEARNY 518

Query: 396 FDALTHMDLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCR 455
           FD  T MDL+EDV+L+NIMIDGYVRL  I+EAMQLYY+M ERGITP+VVTFNTL++GFCR
Sbjct: 519 FDTWTRMDLMEDVVLYNIMIDGYVRLDGIAEAMQLYYKMIERGITPSVVTFNTLINGFCR 578

Query: 456 NGDLVEARKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHIT 515
            GDL+EARKM ++IRL GL+PSVVTYTTLMNAYCE GNMQEMF  LHEMEANAVVPTH+T
Sbjct: 579 RGDLMEARKMLEVIRLKGLVPSVVTYTTLMNAYCEVGNMQEMFHFLHEMEANAVVPTHVT 638

Query: 516 YTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEML 575
           YTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD +TYNTIIQCFCK ++I KA Q+YN ML
Sbjct: 639 YTVLIKGLCRQNKMHESLQLLEYMYAKGLLPDSVTYNTIIQCFCKGKEITKALQLYNMML 698

Query: 576 LHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVS 635
           LHN DPT VTY VLI+ LC++GDLKD DRM+VS+ED+NI+L KV YMTIIKAHCAKGQVS
Sbjct: 699 LHNCDPTQVTYKVLINALCIFGDLKDVDRMVVSIEDRNITLKKVTYMTIIKAHCAKGQVS 758

Query: 636 KALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETML 692
           KALG+FNQMLAK FVISIRDYSA+INRLCKRGLITEAKY F MMLSEGVTPD EIC+T+L
Sbjct: 759 KALGYFNQMLAKGFVISIRDYSAVINRLCKRGLITEAKYFFVMMLSEGVTPDPEICKTVL 812

BLAST of Cp4.1LG01g13140.1 vs. NCBI nr
Match: gi|658009094|ref|XP_008339746.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 isoform X1 [Malus domestica])

HSP 1 Score: 558.9 bits (1439), Expect = 1.4e-155
Identity = 290/541 (53.60%), Positives = 378/541 (69.87%), Query Frame = 1

Query: 158 FCDLLLNKFRNWDSNGVVWDMLAFAYSRHEM-----IHDALFVIAKMKDLNLQASVPTYN 217
           F D+  + F      G+V D  ++    H +     + +AL     M+   +Q    TYN
Sbjct: 290 FVDVAKSFFCVXXKYGLVPDSYSYNILIHGLCVAGSLEEALEFTKDMERHGVQPDTVTYN 349

Query: 218 SLLHNLRHTDVMWD---ICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV 277
            L        +M     +  ++   G      T +I+I G C    + +A+   ++    
Sbjct: 350 ILCKGFHLLGLMSGARKVIQKMLVKGLNPDHVTYTIMICGHCHVGNIDEALKLQKE---- 409

Query: 278 VGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYS 337
                     ++S+  ++ ++ V SVLLS +CK GR+E AL LL EME + L+PDLI YS
Sbjct: 410 ----------MISRGFQLSVI-VYSVLLSSMCKSGRVEXALRLLYEMEAVGLEPDLITYS 469

Query: 338 ILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHM 397
           ILIHGLCK+G VQRA ++Y +M +KR  PNYF+ RA+LLG  E G+I EARKYFD LT  
Sbjct: 470 ILIHGLCKQGDVQRASEIYREMYMKRIIPNYFAHRAILLGLREKGDIYEARKYFDHLTTR 529

Query: 398 DLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEA 457
            + ED++L+NIM+DGYV+LG+++EA+QLY ++ E+G+ P+ VTFNTL+HGFC+NG LVEA
Sbjct: 530 AVTEDIVLYNIMMDGYVKLGNVAEAIQLYKQIIEKGLNPSTVTFNTLIHGFCKNGKLVEA 589

Query: 458 RKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKG 517
           R+M   I L+GLLPS VTYTTLMNA CE GN+  M +LL EMEA  V PTH++YTV+IKG
Sbjct: 590 RRMLDTIELHGLLPSPVTYTTLMNANCEQGNINGMXELLXEMEAKDVEPTHVSYTVVIKG 649

Query: 518 LCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPT 577
           LCRQ K  +A+ L+E MYAKGL PDQITYNTII+CFCKA+D  KAFQ++NEML+HNL PT
Sbjct: 650 LCRQGKRWDAVHLVEEMYAKGLSPDQITYNTIIKCFCKAQDFEKAFQLHNEMLMHNLAPT 709

Query: 578 HVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFN 637
            VTYN+LI+GLCVYGDL+DADR+LVS+ D NI+LTKVAY T+IKAHCAKG V +A+  F+
Sbjct: 710 PVTYNLLINGLCVYGDLEDADRLLVSLNDSNINLTKVAYTTLIKAHCAKGDVYRAVALFH 769

Query: 638 QMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHG 691
           QM+ K F ISIRDYSA+INRLCKR  ITEAKY F MMLS+G++PD E+CE MLN F Q G
Sbjct: 770 QMVEKGFEISIRDYSAVINRLCKRCWITEAKYFFCMMLSDGISPDQELCEVMLNVFXQGG 815

BLAST of Cp4.1LG01g13140.1 vs. NCBI nr
Match: gi|694310974|ref|XP_009355583.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Pyrus x bretschneideri])

HSP 1 Score: 556.6 bits (1433), Expect = 6.7e-155
Identity = 286/541 (52.87%), Positives = 381/541 (70.43%), Query Frame = 1

Query: 158 FCDLLLNKFRNWDSNGVVWDMLAFAYSRHEM-----IHDALFVIAKMKDLNLQASVPTYN 217
           F D+  + F      G+V D  ++    H +     + +AL     M+   +Q    TYN
Sbjct: 290 FVDVAKSFFCMMFKYGLVPDSYSYNILIHGLCVAGSLEEALEFTKDMERHGVQPDTVTYN 349

Query: 218 SLLHNLRHTDVMWD---ICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV 277
            L        +M     +  ++   G      T +I+I G C    + +A+   ++    
Sbjct: 350 ILCKGFHLLGLMSGARKVIQKMLVRGLNPDHVTYTIMICGHCHVGNIDEALKLRKE---- 409

Query: 278 VGPSIVSINTVMSKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYS 337
                     ++S+  ++ ++ V SVLLS +CK GR+EEAL LL EME + L+PDLI YS
Sbjct: 410 ----------MISRGFQLSVI-VYSVLLSSMCKSGRVEEALRLLYEMEAVGLEPDLITYS 469

Query: 338 ILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHM 397
           ILIHGLCK+G VQRA ++Y +M +KR  PNYF+ RA+LLG  E G++ EARKYFD LT  
Sbjct: 470 ILIHGLCKQGDVQRASEIYREMYMKRIIPNYFAHRAILLGLREKGDLYEARKYFDHLTTR 529

Query: 398 DLIEDVILFNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEA 457
            + ED++L+NIM+DGYV+LG+++EA+QLY ++ E+G+ P+ VTFNTL+HGFC+ G LVEA
Sbjct: 530 TVTEDIVLYNIMMDGYVKLGNVAEAIQLYKQIIEKGLNPSTVTFNTLIHGFCKTGKLVEA 589

Query: 458 RKMFKIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKG 517
           R++   I L+GLLPS VTYTTLMNA CE GN+  M +LL EMEA  V PTH++YTVLIKG
Sbjct: 590 RRILDTIELHGLLPSPVTYTTLMNANCEQGNINGMLELLREMEAKDVEPTHVSYTVLIKG 649

Query: 518 LCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPT 577
           LCRQ K+ +A+ L+  MYAKGL PDQITYNT+I+CFCKA+D  KAFQ++NEML+HNL+PT
Sbjct: 650 LCRQGKLWDAVHLVGEMYAKGLSPDQITYNTVIKCFCKAQDFEKAFQLHNEMLMHNLEPT 709

Query: 578 HVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFN 637
            VTYN+LI+GLCVYGDL+DADR+LVS+ D NI+LTKVAY T+IKAHCAKG V +A+  F+
Sbjct: 710 PVTYNLLINGLCVYGDLEDADRLLVSLNDSNINLTKVAYSTLIKAHCAKGDVYRAVELFH 769

Query: 638 QMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHG 691
           QM+ K F ISIRDYSA+INRLCKR  +TEAKY F MMLS+G++PD E+CE MLNAF+Q G
Sbjct: 770 QMVDKGFEISIRDYSAVINRLCKRCWMTEAKYFFCMMLSDGISPDQELCEVMLNAFYQGG 815

BLAST of Cp4.1LG01g13140.1 vs. NCBI nr
Match: gi|359473479|ref|XP_002267299.2| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g13630 [Vitis vinifera])

HSP 1 Score: 552.0 bits (1421), Expect = 1.7e-153
Identity = 272/529 (51.42%), Positives = 378/529 (71.46%), Query Frame = 1

Query: 174 VVWDMLAFAYSRHEMIH---------DALFVIAKMKDLNLQASVPTYNSLLHNLRHTDVM 233
           + + +L   YS + ++H         +AL     M++  ++  + TYN L +  R   ++
Sbjct: 297 IKYGLLPDVYSYNILLHGLCVAGSMEEALEFTNDMENHGVEPDIVTYNILANGFRILGLI 356

Query: 234 ---WDICNEVKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVM 293
              W +   +  +G      T +ILI G C    ++++    +               ++
Sbjct: 357 SGAWKVVQRMLLNGLNPDLVTYTILICGHCQMGNIEESFKLKEK--------------ML 416

Query: 294 SKFCKVGLVDVASVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFV 353
           S+  K+ +V   +VLLS LCK GRI+EA+ LL+EME + LKPDL+ YS+LIHGLCK G V
Sbjct: 417 SQGLKLSIVTY-TVLLSSLCKSGRIDEAVILLHEMEVIGLKPDLLTYSVLIHGLCKRGAV 476

Query: 354 QRAYQLYEQMRLKRNFPNYFSQRAVLLGFFENGNISEARKYFDALTHMDLIEDVILFNIM 413
           + A +LYE+M  KR +PN F   A++ G FE G ISEA+ YFD++T  D+ E++IL+NIM
Sbjct: 477 EEAIELYEEMCSKRIYPNSFVCSAIISGLFEKGAISEAQMYFDSVTKSDVAEEIILYNIM 536

Query: 414 IDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFKIIRLNGL 473
           IDGY +LG+I EA++ Y ++ E+GI+PT+VTFN+L++GFC+ G L EA K+   I+++GL
Sbjct: 537 IDGYAKLGNIGEAVRSYKQIIEKGISPTIVTFNSLIYGFCKKGKLAEAVKLLDTIKVHGL 596

Query: 474 LPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQ 533
           +P+ VTYTTLMN YCE G+M  MFD+LHEMEA A+ PT ITYTV++KGLC++ ++HE++Q
Sbjct: 597 VPTSVTYTTLMNGYCEEGDMHSMFDMLHEMEAKAIKPTQITYTVVVKGLCKEGRLHESVQ 656

Query: 534 LLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLC 593
           LL+YMYA+GL PDQITYNT+IQ FCKA D+ KAFQ++N+ML H+L P+ VTYNVLI+GLC
Sbjct: 657 LLKYMYARGLFPDQITYNTVIQSFCKAHDLQKAFQLHNQMLQHSLQPSPVTYNVLINGLC 716

Query: 594 VYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIR 653
           VYG+LKDADR+LV+++DQ+I LTKVAY TIIKAHCAKG V  AL FF+QM+ + F +SIR
Sbjct: 717 VYGNLKDADRLLVTLQDQSIRLTKVAYTTIIKAHCAKGDVQNALVFFHQMVERGFEVSIR 776

Query: 654 DYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGD 691
           DYSA+INRLCKR LIT+AK+ F MML+ G+ PD +IC  MLNAFH+ GD
Sbjct: 777 DYSAVINRLCKRNLITDAKFFFCMMLTHGIPPDQDICLVMLNAFHRSGD 810

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PPR41_ARATH  8.6e-110  41.49         Putative pentatricopeptide repeat-containing protein At1g13630 OS=Arabidopsis th...
PP360_ARATH  5.3e-67   27.09         Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN...
PPR12_ARATH  7.6e-66   27.89         Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
PP432_ARATH  1.3e-65   27.59         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP412_ARATH  1.4e-64   28.44         Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidop...

Match Name        E-value   Identity (%)  Description
A0A0A0L9A2_CUCSA  3.9e-194  63.93         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G642640 PE=4 SV=1
M5XH23_PRUPE      4.6e-150  59.43         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022936mg PE=4 SV=1
D7TA84_VITVI      1.8e-146  50.09         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g00630 PE=4 SV=...
B9HWT8_POPTR      2.4e-143  47.85         Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP...
A0A067LDB5_JATCU  7.8e-142  50.95         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01872 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT1G13630.1  1.0e-100  41.20         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G01110.1  3.0e-68   27.09         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1  4.3e-67   27.89         Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G55840.1  7.3e-67   27.59         Pentatricopeptide repeat (PPR) superfamily protein
AT5G41170.1  8.1e-66   28.44         Pentatricopeptide repeat (PPR-like) superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|659130189|ref|XP_008465042.1|  5.6e-202  66.67         PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucum...
gi|449453449|ref|XP_004144470.1|  5.6e-194  63.93         PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucum...
gi|658009094|ref|XP_008339746.1|  1.4e-155  53.60         PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 isofor...
gi|694310974|ref|XP_009355583.1|  6.7e-155  52.87         PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Pyrus...
gi|359473479|ref|XP_002267299.2|  1.7e-153  51.42         PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
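
The identity values listed for each match correspond to the matched-residue count divided by the alignment length given in the HSP header lines of the BLAST sections on this page (for example, 265/529 for D7TA84_VITVI). A minimal Python sketch for recomputing that percentage is given below; the function name and the regular expression are illustrative assumptions based on the header layout shown here, not part of any BLAST tool.

```python
import re

# Pattern for the identity field of an HSP header line such as
# "Identity = 265/529 (50.09%)". This layout is assumed from the
# headers shown on this page.
IDENTITY_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def identity_percent(header):
    """Return (recomputed, reported) percent identity for one HSP header line."""
    m = IDENTITY_RE.search(header)
    if m is None:
        raise ValueError("no identity field in: %r" % header)
    matches = int(m.group(1))      # identical residues in the alignment
    length = int(m.group(2))       # alignment length, gaps included
    reported = float(m.group(3))   # percentage printed in the report
    return round(100.0 * matches / length, 2), reported


line = "Identity = 265/529 (50.09%), Query Frame = 1"
print(identity_percent(line))  # (50.09, 50.09)
```

Comparing the two returned numbers is a quick consistency check when rows are copied from the detailed alignments into a summary table.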
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG01g13140  Cp4.1LG01g13140  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG01g13140.1  Cp4.1LG01g13140.1-protein  polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                          Unique Name                           Type
Cp4.1LG01g13140.1:five_prime_utr:001  Cp4.1LG01g13140.1:five_prime_utr:001  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG01g13140.1:cds:001  Cp4.1LG01g13140.1:cds:001  CDS
Cp4.1LG01g13140.1:cds:002  Cp4.1LG01g13140.1:cds:002  CDS
Cp4.1LG01g13140.1:cds:003  Cp4.1LG01g13140.1:cds:003  CDS
Cp4.1LG01g13140.1:cds:004  Cp4.1LG01g13140.1:cds:004  CDS
Cp4.1LG01g13140.1:cds:005  Cp4.1LG01g13140.1:cds:005  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                           Unique Name                            Type
Cp4.1LG01g13140.1:three_prime_utr:001  Cp4.1LG01g13140.1:three_prime_utr:001  three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source    Source Term       Source Description
    Alignment

IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 643..671  score: 8.9E-4
    coord: 606..634  score: 2.

IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 533..581  score: 2.7E-17
    coord: 463..512  score: 1.4E-17
    coord: 394..442  score: 5.4E-15
    coord: 295..337  score: 1.9

IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 397..429  score: 3.4E-9
    coord: 606..639  score: 1.4E-5
    coord: 431..464  score: 1.7E-7
    coord: 501..534  score: 5.5E-8
    coord: 571..603  score: 1.2E-6
    coord: 466..499  score: 3.0E-9
    coord: 294..324  score: 4.5E-7
    coord: 326..358  score: 1.1E-6
    coord: 643..675  score: 3.8E-7
    coord: 536..569  score: 2.

IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 359..393  score: 6.051
    coord: 172..206  score: 7.596
    coord: 569..603  score: 10.194
    coord: 239..269  score: 6.127
    coord: 289..323  score: 9.843
    coord: 534..568  score: 12.419
    coord: 499..533  score: 12.079
    coord: 429..463  score: 12.079
    coord: 464..498  score: 12.474
    coord: 604..638  score: 9.317
    coord: 394..428  score: 13.241
    coord: 639..673  score: 10.622
    coord: 324..358  score: 1

IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 371..549  score: 5.8E-7
    coord: 550..633  score: 8.

None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 162..282  score: 9.3E-185
    coord: 299..690  score: 9.3E

None       No IPR available                       unknown   SSF81901          HCP-like
    coord: 370..590  score: 1.3
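
The PROSITE profile (PS51375) coordinates in the InterPro annotations above are listed out of order, which hides how the PPR motifs are laid out along the protein. The following is a minimal Python sketch, not part of the database record, that sorts those hits and groups motifs that follow one another without a gap. It assumes the coordinates are 1-based, inclusive protein positions; the names `PS51375_HITS`, `tandem_runs`, and `summarize` are illustrative only.

```python
# PS51375 (PROSITE profile) PPR hits copied from the InterPro table above.
# Assumption: coordinates are 1-based, inclusive protein positions.
PS51375_HITS = [
    (359, 393), (172, 206), (569, 603), (239, 269), (289, 323),
    (534, 568), (499, 533), (429, 463), (464, 498), (604, 638),
    (394, 428), (639, 673), (324, 358),
]


def tandem_runs(hits):
    """Group hits into runs in which each hit starts right after the previous one ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))
        else:
            runs.append([(start, end)])
    return runs


def summarize(hits):
    """Return (first_start, last_end, number_of_motifs) for each run."""
    return [(run[0][0], run[-1][1], len(run)) for run in tandem_runs(hits)]


if __name__ == "__main__":
    for first, last, n in summarize(PS51375_HITS):
        print(f"{first}..{last}: {n} motif(s)")
```

With the coordinates listed above, this groups the 13 hits into two isolated motifs (172..206 and 239..269) and one run of 11 back-to-back motifs spanning 289..673.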