Cp4.1LG09g00990 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g00990
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG09 : 577024 .. 584341 (-)
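The location field above packs the sequence ID, start, end, and strand into one string. The following is a minimal sketch of how it could be split into parts; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Location string copied from the record above.
LOCATION = "Cp4.1LG09 : 577024 .. 584341 (-)"


def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its parts (hypothetical helper)."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }


loc = parse_location(LOCATION)
print(loc)
```

Under that assumption the gene spans 7318 bp on the minus strand of Cp4.1LG09.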
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
AAATATAATAAAAGAAAATTCCCTATTCCTTCCTTTTGTTCTTTCTTTTACAATGGCCTCAAAATGGCCATCTCCGAAACCCACGAGAAAAATCTTCACCATCTCTCTAATCTTCCTCCTCTGCATATTTGCTTACTTTCTCGGCCTCCGTCAACACCCCTCCTCCGCCGTCGCCCTCCTCCGCACCACCCCATGCACCACCCTCCCAAACACCACCACCACCGCCGGAAAACCCAATTACTTCCCGGCATGCGGCATGGAGTACAGCGAGTACACGCCATGCGAGGACACGAAAAGGTCGCTGAAGTTCACAAGACACCGGCTGATTTACCGAGAGAGGCACTGCCCGGAGAAGGAGGAGATATTGAAGTGCCGTGTTCCTCCGCCGCCGGGGTACCGGAATCCGTTCCCATGGCCGGAGAGTAGAGATTACGCTTGGTACTTGAATGCGCCGCATAAATCCCTGACGGTGGAGAAGGCGGTGCAGAATTGGATAATATACGAAGGGGAGAGGTTCAGATTTCCCGGCGGAGGGACGATGTTCCCTAACGGAGCTGATGCGTATATTGACAAAATTGGGAAACTCATTAATCTGAAAGATGGGTCCATCAGAACCGCCATTGATACCGGCTGTGGGGTAATTAACCTCTTCATTCTTTCCCTTTCTTAACATCAAAATTCAATTATAAGTTTTCAAAATCTTTATGAACATTCTGTCCCAATGTTGACTCCGTGAAATTTTATTTTTAATTTTGAAAATATGTGTCGTAGTCAAAGGTTTATTTATTGGTTTGACCTCAACTTCCTGGGTCCAAGTGGCAGTCATGGAGTTACAATTTGGTTTGGAAAGTCCCATCATTCTCTGTCACCATATTTATAACAAAGTAATTGTTCCCAATATTTGCCTTCAAAATATTTAAATTAATTCGAACTTTTTATTTTTTTATTTTATTTTATTTTATTTATAAAGGTATTTTTAAATTTTCATGAGTATTTTTAAGCCTAATAATAATAATAAAGGTATTTATTTTTTAATTTTTAAAAAATAGTTGATGGGCAGTAATGAAACTTTAAATACTATTAAATTTAAAGATATTATTGAAATTTTTAAAACTTTATGAATATTTTTTATAGACAAAATACTAAATTTAAGATTATTTTTATTAATTGAGTCTAAAAATGATTTATTATATTATTCAAATTTCAACTTTTCAACTTTTTTTTTTACAATTAAAAAAAAAATTATAAATATATTATAAAATTGAAAATATAGAAACAAATTAGAAACTTTCATATATATATATATATAAACCAACCTTCCCACAATTGAANAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGATCGTCTAGATTGAAGGAGTCGTTTGAATTTATCCATGGTTTGTTTATTCCTTATCTCGAAAATCTTATGTCATTCGAATCTAATATAATTTAAAGTTGTTGAACATAGTTAAGACGTATATACTCAATTAAGGTCCGATGTCATAAGTTTCTCTACTCCCACACGAGCCTTAGTTGTTGAACATTGGTAAGACGCATAAGGTGTTTTTGTACATATATGCTTTGTGGAGTAAGGCATAGTTGAATGGAGAAGAGAAGGGTGATTTATAAAGATTTGTTGATGATGATGATGAAGGTGGGAAGTTGGGGAGCTTATCTTCTGTCAAGGGACATTGTAACCATGTCGTTTGCGCCAAGGGACACCCACGAAGCTCAAGTTCAGTTCGCTTTGGAGCGAGGGGTTCCTGCTCTCATTGGCGTTCTCTCCTCCAAGAGACTGCCTTATCCTTCCTCTGCCTTTGACATGGCTCATTGCTCTCGCTGCCTCATTCCATGGCCACAACATGGTATTCTTTTCTATCAACATATCTTCACAACACTATTTGAACTCATTCAATTGATGTTGTTGTAGATGGCATCCTCCTCATTGAAGTCGATCGAGTTCTCCGCCCCGGCGGATACTGGATCCTCTC
TGGCCCTCCCATCAACTGGAAGCAACACTGGAAAGGCTGGCAAAGAACAAAGGAAGATCTGAACTCTGAACAGCTCGCCATTGAAAAGTTAGCCAAAAGTCTCTGTTGGACTAAGCTTGTGGAACATGGTGATATTGCCATTTGGCAAAAGCCCATCAATCACTTGAACTGCAAAACCAACCCCAAGAACACCAACAATCCACCTTTCTGCAAGCCCCAAGATCCTGACAAAGCTTGGTATGATGTTCTTCATAGGCTTGGAGCTTGAGTTTTCTTTTCTCTTCCATTTCTCATTTGGTGCTTGAATTTGTCCTTTCAGGTATACAGAAATGCAAGCTTGTTTGACGCATTTGCCTCAAGTTTCAAGCAGTAAGGAAATTGCAGGAGGGAAATTGGCAAGATGGCCAGAGAGGCTAAGTGCAATCCCACAAAGGGTTAGCAGAGGAACTGTAAAAGGGGTCACTGAAGAAACGTTCATTCATGATTCTGAGCTATGGAGGAAGAGGTTGTTATATTACAGAACCATCAACAATCAGTTGAGTCAGCCTGGACGGTACCGCAATTTCTTAGACATGAATGCTTTCTTGGGTGGATTTGCTGCTGCTCTGGTTGATGACCCAGTTTGGGTGATGAATGTTGTCCCTGTGGATGTCAAAGTCAATACGCTCGGAGTTATATACGATAGGGGTTTGATCGGCACGTATCAAGATTGGTAAGTACTTAAGGCTGTATAGATTCATGTTCGTGTTTTTTCATGTTTTGGTATGCGCATTGAGCGAGACATAAGAGAGAGGCTTTTTCATTTAGCTACATTATAGTTGTTACAAAGAAATTGATATGTGACCGAACCTAAAAAAGCTCGTTGTATCCGAGAACTTACATTATGTCCTGTGGATGCATTGCAGGTGTGAGGCAATGTCTACATATCCAAGAACTTATGACTTCATTCATGCGGATTCAGTTTTTAGCCTCTATGAGAATAGGTACTCTCTTCTTCCATAGACTATGGAATTGTGAAGTTTTGTTCGTTCTTTTGCGAGATTCGACATTGATTGGAGAGGAACGAGTGCCAGCGAGGACGCTCGTTGGCCCCAAAGGAGATGGATCGTGAGATTCCACATCGATTGGAGAGAGAGGAACGAGTGCCAGCGAGGATGCTGGGACCTAAAGGGGGTGGATTGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAATATCCTTTATAAGGGTGTAGAAACCTCTCTCTAGCAGACGCGTTTTAAAAACCTTGAGGGGAAACCCGAAAGTGAAAGGAGAGAGGAACCAAACATTCTCTATAAGGGTGTTGAAACCTCTCCCTAGTAGACGCATTTTAAAAACCTTGACCGAAAACCCAAACATTCTTTATAAGGTTGTTGAAACCTCGCCCTGACATACATGTTTTATAAACCTTGAGGGGAAACCTGAAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGCCGAGGACTTGGACAGTTACTTCAAGTTTATGTTGATTTTCACGCGCATCTAAATGATGTGAAACTGATGTTCTGCTGAAACTGCATTGCAGATGTGAAATGGAAGATATTCTACTCGAGATGGACAGAATTCTGAGACCAGAAGGAAGCGTAATCTTCCGAGAAAACATCGTTATATTGACGAAAATCAAGACGATAACCGATAAGTTGAACTGGAGCAGTCAAATTGTGCATTCTGAAGATGGATCTTATCATATGGAGAAGCTTCTATTTGCAGTAAAAAACTATTGGACAGCACCTCCTGAGCATTCTAATCAACAAGGCTCTAAACCAACTTAACCACTCAATCACATCCATGTTCTTTTTAGTACTTGATTTACCATCATTTATTACCAAGAACAGTGAATTACATTCAATGAACTGATATGATCAATGGGAATGAATCATACTTTCAATCTTGATTTTGGTTAATTTTGGGTACCCAACACAAAAGCTATGAACTCAGAATTGGCATGTAGGT
GATTCATGCAAACCTCAATCTTTCAACACCTATAAATAACACCCAAAACCCTGAAGAAATCATCAAACAAACAGATTCAGAAGAGAAAAAACAAATGGCAGGAACCAAGTCTCATGGGGTGCAGCAAGTTATATTGCTCTTGTTGGTGTCTGTTCTGCTGTGGCAATCACAGGCACAGGCGCAAAGCTGCTCAACTCAGCTCAGCAACCTTAATGGGTGTGCACCTTTTGTGCTTCCTGGTGCCAGTAACCCAAGCCCAGAGTGTTGTGCAGCGCTTGGAGCTGTGCAGCAGGATTGCCTGTGCAGCACCCTTAGAATTTCCTCCACTCTGCCTTCGCTTTGCCGCCTTCCACCTCTCTCTTGTGGTCAGTTCTCTTCGTTTTTCTAACCATTTTTACTTGAGAATGGACTGTAATGAATGAAGGGGTTTTCTTTGATTTGTGCTTCGTTTCAGGTACAAACTAGAGGAGATGGCTGATGGGTATTACAAGAAATGAATAATGCTCCACTTATATCATTCCTTTCTCTTTTCTTTACTGTTCTTGGCGTCCTTTGATCTAAAGTCAATGTTCTTAGAAGTCTGGTTTCGTTGAATAAAATGGAAGCTTTTATCTTTTTCTTCTCTCTATGATTACATTACAATAACTACAGAAAGTTAAGAAACTGCCTAATCCTGCACTAATTATCATAATTTTAACTTAATCACAGAAAGAGATGGTTTAAGCTGTTGGTGAATGAATTTATTAAACAGTTCAAAAAATGGGGTTAGGTGAATCACACTCTTTTCCACCCGCTATCAACCTCATATCAAGAATGTGTGTGAGATCCTACGTCGGTTAGAGAGGGAAACGAAGCATTCTTTAAGGGGTGGAAATCTCTTCGAGGGTGAAATCATGCATTGGTTAGAGAGGGGAACGAAACATTCCTTGTACATGTGTGAAAATCTATCCCTACCAGACTCGTTTTAAAACTTTGAAAGAAATGTCGAAAGGAAAACTCAAGAAGACAATATCTGTTAGCGACGGGCTCGAAAGGAAAAATTTAAAAATGACACCATCTGCTCTCGAAATAGAAAATCGAGACGGGACAATATCGGGTGGGCGAGCTCAAAAGGAAAAACTATTACTATTATTTACCCATTTTTACAAAAGTTAACACCAAACTAAGTATAGGATTAATTTAAGATTAAATTCAAAGATTTAGATTAAAAAAAAAAGAACCCACATGAGAAAAAAAATGATTTTTTAAGGAAGATTATTTAGTGATGAAATGCCAAGCCCGTCGCCGCTCTCAACGATGTCCCAAACTTTTACCCGCCGCCCGCGCAAACACGGCAAGGCAGCAAGCAGACATGCTTACTATGATGGGCAGTCAAGCTTCAATACTCAGTACAGCTAAAGCACAACGAACAGGGGAGAAGAGCGAGGATGCATAACAAATTCCCCTTCAACTCGAGTTTTCAGAGTACGCTCAATCTCCTCTCCTTTGCCCTCCAATTCATCTTCCTTCATGATGAAATTAATTTCAGAGCTCTTCTTATATGTTGGATGTTTTTTCCCTTTTCTTTTTTGGCGAGGGGGGCCTATGGAATTCCCCCTCTTTCTGGAAATTCGTTGCTAGTTTCTTGAATGAGCAAATTGATTGTATTCTATTATGTGGACCATGAACTGTTATTTCCTTAATTCTTTACGGAGTTTTCTTTGCTCTCCTCTTAAACCCTTTGCTACTATTACCCATTTGTTCACTTCTACCAAATGTATTAACACATCGGTTAAGTGTATAAGCGACCTACGAACCAATGATGTTTCTGGGTTTATTCAAAATGGGAGTGGCAACACTTCATCCATTTCCTATTCCAAGCTTTTATTAGAGTTTACTGCTTCCAAGGATGTAAAATCAGGCATGGAAATCCATGCTCGTATGATCAGGTTAGGATTGTGTACAGATACAGGGGTAAGGAACAAATTGATAAACTTGTACTCAAAATGTCAGTGTTTTCC
AGCTGCTCGAAAACTTGTTATGGACGGTACCGAGCCAGATTTAGTTTCTTGGTCTGCTTTGATATCTGGGTATGCTCAGAATGGTCGTGGAGAAGAAGCCCTTTTGACCTTTTATGAAATGCATTTGTTGGGAGTGAAGGGCAATGAGTTCACTTTCCCTAGTGTTTTAAAAGCCTGTTCTTTAACGAGGAACTTGGAACTGGGGAAGCAGATTCATGGGATTGCTTTAGTGACAGGTTTTGAATCTGATGTGTTTGTTGCCAATACTTTGGTTGTTATGTATGCTAAATGTGGGGAGTTTAGTGATTCGAAGAAGCTGTTCGAGGAAATTCCAGAACGAAACGTCGTATCTTGGAATGCTTTGTTTTCTTGTTATGTGCAGATTGATTTCTTTTCAGAAGCGATTAATTTGTTTCGAGAAATGGTTTCTACTGGACTTACTCCAAATGAATTTAGTCTCTCCACTGTATTAAACGCTTGTGCTGGTTTGGAGCACATCGATTCCGGAATGGAAATTCATGGATACTTGATAAAGCTTGGGTATGATTCTGATCCCTTTTCTGCTAATGCACTTCTTGACATGTATGCTAAAGCTGGATGTCCTGAATCTGCAATAGCCGTGTTTTATGAAATCCCGAAACCTGATATCGTTTCATGGAATGCTGTAATTGCTGGCTGTGTTCTTCATGAGTATAATGATTTAGCTCTTAAATTGTTTGGGATGATGGGAAGCTTTAGAGTGAGTCCTAACATGTTTACCCTATCAAGTGCTCTTAAAGCTTGTGCTGGGTTAGGGCTTATCAAAATAGGTAGACAATTGCACTCTGCCTTGATGATGATGAATATGGATTCAGATTCATTTGTGGGTGTTGGATTGATAGATATGTATTCGAAATGTGGTTTACTGCAAGATGCACGGAAGGTGTTTGATCTAATTCCTAAAAGGGACTCGATCGCATGGAATTCTATTATTTCCAGTTACTCCAATTGTGGGTATGATATGGAAGCTATATCCCTCTTTACAATGATGTATAAAGAAGGTTTAGAATTCAACCAGACCACATTGTCAACAATCCTCAAATCTTCAGCTGGCTCGCAGGCCATTGCGTTTTGCGAACAAGTTCATGCGATATCGATCAAATCAGGTTACCAATATGATGGTTACGTAGCAAATAGCCTGCTCGATTCTTATGGAAAAGGCTGTCGATTAGAAGAGGCAGAAAAAGTTTTTGAAGAGTGTCCTGCTGAAGATTTGGTGGCGTATACGTCAATGATTACTGCTTACTCCCAATATGGCTTGGGAGAAGAGGCTCTAAAA

mRNA sequence

AAATATAATAAAAGAAAATTCCCTATTCCTTCCTTTTGTTCTTTCTTTTACAATGGCCTCAAAATGGCCATCTCCGAAACCCACGAGAAAAATCTTCACCATCTCTCTAATCTTCCTCCTCTGCATATTTGCTTACTTTCTCGGCCTCCGTCAACACCCCTCCTCCGCCGTCGCCCTCCTCCGCACCACCCCATGCACCACCCTCCCAAACACCACCACCACCGCCGGAAAACCCAATTACTTCCCGGCATGCGGCATGGAGTACAGCGAGTACACGCCATGCGAGGACACGAAAAGGTCGCTGAAGTTCACAAGACACCGGCTGATTTACCGAGAGAGGCACTGCCCGGAGAAGGAGGAGATATTGAAGTGCCGTGTTCCTCCGCCGCCGGGGTACCGGAATCCGTTCCCATGGCCGGAGAGTAGAGATTACGCTTGGTACTTGAATGCGCCGCATAAATCCCTGACGGTGGAGAAGGCGGTGCAGAATTGGATAATATACGAAGGGGAGAGGTTCAGATTTCCCGGCGGAGGGACGATGTTCCCTAACGGAGCTGATGCGTATATTGACAAAATTGGGAAACTCATTAATCTGAAAGATGGGTCCATCAGAACCGCCATTGATACCGGCTGTGGGGTGGGAAGTTGGGGAGCTTATCTTCTGTCAAGGGACATTGTAACCATGTCGTTTGCGCCAAGGGACACCCACGAAGCTCAAGTTCAGTTCGCTTTGGAGCGAGGGGTTCCTGCTCTCATTGGCGTTCTCTCCTCCAAGAGACTGCCTTATCCTTCCTCTGCCTTTGACATGGCTCATTGCTCTCGCTGCCTCATTCCATGGCCACAACATGATGGCATCCTCCTCATTGAAGTCGATCGAGTTCTCCGCCCCGGCGGATACTGGATCCTCTCTGGCCCTCCCATCAACTGGAAGCAACACTGGAAAGGCTGGCAAAGAACAAAGGAAGATCTGAACTCTGAACAGCTCGCCATTGAAAAGTTAGCCAAAAGTCTCTGTTGGACTAAGCTTGTGGAACATGGTGATATTGCCATTTGGCAAAAGCCCATCAATCACTTGAACTGCAAAACCAACCCCAAGAACACCAACAATCCACCTTTCTGCAAGCCCCAAGATCCTGACAAAGCTTGGTATACAGAAATGCAAGCTTGTTTGACGCATTTGCCTCAAGTTTCAAGCAGTAAGGAAATTGCAGGAGGGAAATTGGCAAGATGGCCAGAGAGGCTAAGTGCAATCCCACAAAGGGTTAGCAGAGGAACTGTAAAAGGGGTCACTGAAGAAACGTTCATTCATGATTCTGAGCTATGGAGGAAGAGGTTGTTATATTACAGAACCATCAACAATCAGTTGAGTCAGCCTGGACGGTACCGCAATTTCTTAGACATGAATGCTTTCTTGGGTGGATTTGCTGCTGCTCTGGTTGATGACCCAGTTTGGGTGATGAATGTTGTCCCTGTGGATGTCAAAGTCAATACGCTCGGAGTTATATACGATAGGGGTTTGATCGGCACGTATCAAGATTGGTGTGAGGCAATGTCTACATATCCAAGAACTTATGACTTCATTCATGCGGATTCAGTTTTTAGCCTCTATGAGAATAGATGTGAAATGGAAGATATTCTACTCGAGATGGACAGAATTCTGAGACCAGAAGGAAGCGTAATCTTCCGAGAAAACATCGTTATATTGACGAAAATCAAGACGATAACCGATAAGTTGAACTGGAGCAGTCAAATTGTGCATTCTGAAGATGGATCTTATCATATGGAGAAGCTTCTATTTGCAGTAAAAAACTATTGGACAGCACCTCCTGAGCATTCTAATCAACAAGGCTCTAAACCAACTTAACCACTCAATCACATCCATGTTCTTTTTAGTACTTGATTTACCATCATTTATTACCAAGAACAGTGAATTACATTCAATGAACTGATATGATCAATGGGAATGAATCATACTTTCAATCTTGATTTTGGTTAATTTT
GGGTACCCAACACAAAAGCTATGAACTCAGAATTGGCATGTAGGTGATTCATGCAAACCTCAATCTTTCAACACCTATAAATAACACCCAAAACCCTGAAGAAATCATCAAACAAACAGATTCAGAAGAGAAAAAACAAATGGCAGGAACCAAGTCTCATGGGGTGCAGCAAGTTATATTGCTCTTGTTGGTGTCTGTTCTGCTGTGGCAATCACAGGCACAGGCGCAAAGCTGCTCAACTCAGCTCAGCAACCTTAATGGGTGTGCACCTTTTGTGCTTCCTGGTGCCAGTAACCCAAGCCCAGAGTGTTGTGCAGCGCTTGGAGCTGTGCAGCAGGATTGCCTGTGCAGCACCCTTAGAATTTCCTCCACTCTGCCTTCGCTTTGCCGCCTTCCACCTCTCTCTTGTGGCATGGAAATCCATGCTCGTATGATCAGGTTAGGATTGTGTACAGATACAGGGGTAAGGAACAAATTGATAAACTTGTACTCAAAATGTCAGTGTTTTCCAGCTGCTCGAAAACTTGTTATGGACGGTACCGAGCCAGATTTAGTTTCTTGGTCTGCTTTGATATCTGGGTATGCTCAGAATGGTCGTGGAGAAGAAGCCCTTTTGACCTTTTATGAAATGCATTTGTTGGGAGTGAAGGGCAATGAGTTCACTTTCCCTAGTGTTTTAAAAGCCTGTTCTTTAACGAGGAACTTGGAACTGGGGAAGCAGATTCATGGGATTGCTTTAGTGACAGGTTTTGAATCTGATGTGTTTGTTGCCAATACTTTGGTTGTTATGTATGCTAAATGTGGGGAGTTTAGTGATTCGAAGAAGCTGTTCGAGGAAATTCCAGAACGAAACGTCGTATCTTGGAATGCTTTGTTTTCTTGTTATGTGCAGATTGATTTCTTTTCAGAAGCGATTAATTTGTTTCGAGAAATGGTTTCTACTGGACTTACTCCAAATGAATTTAGTCTCTCCACTGTATTAAACGCTTGTGCTGGTTTGGAGCACATCGATTCCGGAATGGAAATTCATGGATACTTGATAAAGCTTGGGTATGATTCTGATCCCTTTTCTGCTAATGCACTTCTTGACATGTATGCTAAAGCTGGATGTCCTGAATCTGCAATAGCCGTGTTTTATGAAATCCCGAAACCTGATATCGTTTCATGGAATGCTGCCATTGCGTTTTGCGAACAAGTTCATGCGATATCGATCAAATCAGGTTACCAATATGATGGTTACGTAGCAAATAGCCTGCTCGATTCTTATGGAAAAGGCTGTCGATTAGAAGAGGCAGAAAAAGTTTTTGAAGAGTGTCCTGCTGAAGATTTGGTGGCGTATACGTCAATGATTACTGCTTACTCCCAATATGGCTTGGGAGAAGAGGCTCTAAAA

Coding sequence (CDS)

ATGGCAGGAACCAAGTCTCATGGGGTGCAGCAAGTTATATTGCTCTTGTTGGTGTCTGTTCTGCTGTGGCAATCACAGGCACAGGCGCAAAGCTGCTCAACTCAGCTCAGCAACCTTAATGGGTGTGCACCTTTTGTGCTTCCTGGTGCCAGTAACCCAAGCCCAGAGTGTTGTGCAGCGCTTGGAGCTGTGCAGCAGGATTGCCTGTGCAGCACCCTTAGAATTTCCTCCACTCTGCCTTCGCTTTGCCGCCTTCCACCTCTCTCTTGTGGCATGGAAATCCATGCTCGTATGATCAGGTTAGGATTGTGTACAGATACAGGGGTAAGGAACAAATTGATAAACTTGTACTCAAAATGTCAGTGTTTTCCAGCTGCTCGAAAACTTGTTATGGACGGTACCGAGCCAGATTTAGTTTCTTGGTCTGCTTTGATATCTGGGTATGCTCAGAATGGTCGTGGAGAAGAAGCCCTTTTGACCTTTTATGAAATGCATTTGTTGGGAGTGAAGGGCAATGAGTTCACTTTCCCTAGTGTTTTAAAAGCCTGTTCTTTAACGAGGAACTTGGAACTGGGGAAGCAGATTCATGGGATTGCTTTAGTGACAGGTTTTGAATCTGATGTGTTTGTTGCCAATACTTTGGTTGTTATGTATGCTAAATGTGGGGAGTTTAGTGATTCGAAGAAGCTGTTCGAGGAAATTCCAGAACGAAACGTCGTATCTTGGAATGCTTTGTTTTCTTGTTATGTGCAGATTGATTTCTTTTCAGAAGCGATTAATTTGTTTCGAGAAATGGTTTCTACTGGACTTACTCCAAATGAATTTAGTCTCTCCACTGTATTAAACGCTTGTGCTGGTTTGGAGCACATCGATTCCGGAATGGAAATTCATGGATACTTGATAAAGCTTGGGTATGATTCTGATCCCTTTTCTGCTAATGCACTTCTTGACATGTATGCTAAAGCTGGATGTCCTGAATCTGCAATAGCCGTGTTTTATGAAATCCCGAAACCTGATATCGTTTCATGGAATGCTGCCATTGCGTTTTGCGAACAAGTTCATGCGATATCGATCAAATCAGGTTACCAATATGATGGTTACGTAGCAAATAGCCTGCTCGATTCTTATGGAAAAGGCTGTCGATTAGAAGAGGCAGAAAAAGTTTTTGAAGAGTGTCCTGCTGAAGATTTGGTGGCGTATACGTCAATGATTACTGCTTACTCCCAATATGGCTTGGGAGAAGAGGCTCTAAAA

Protein sequence

MAGTKSHGVQQVILLLLVSVLLWQSQAQAQSCSTQLSNLNGCAPFVLPGASNPSPECCAALGAVQQDCLCSTLRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFCEQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSMITAYSQYGLGEEALK
BLAST of Cp4.1LG09g00990 vs. Swiss-Prot
Match: PPR75_ARATH (Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana GN=PCMP-E42 PE=2 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.6e-54
Identity = 125/370 (33.78%), Positives = 196/370 (52.97%), Query Frame = 1

Query: 73  LRISSTLPSLCRLPPL---------SCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCF 132
           +R +  +PS    PPL         S   + HA +++ GL +D  VRN LI+ YS    F
Sbjct: 95  MRRNGVIPSRHTFPPLLKAVFKLRDSNPFQFHAHIVKFGLDSDPFVRNSLISGYSSSGLF 154

Query: 133 PAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKAC 192
             A +L     + D+V+W+A+I G+ +NG   EA++ F EM   GV  NE T  SVLKA 
Sbjct: 155 DFASRLFDGAEDKDVVTWTAMIDGFVRNGSASEAMVYFVEMKKTGVAANEMTVVSVLKAA 214

Query: 193 SLTRNLELGKQIHGIALVTG-FESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSW 252
               ++  G+ +HG+ L TG  + DVF+ ++LV MY KC  + D++K+F+E+P RNVV+W
Sbjct: 215 GKVEDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTW 274

Query: 253 NALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIK 312
            AL + YVQ   F + + +F EM+ + + PNE +LS+VL+ACA +  +  G  +H Y+IK
Sbjct: 275 TALIAGYVQSRCFDKGMLVFEEMLKSDVAPNEKTLSSVLSACAHVGALHRGRRVHCYMIK 334

Query: 313 LGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAI-AFCEQVHA------ 372
              + +  +   L+D+Y K GC E AI VF  + + ++ +W A I  F    +A      
Sbjct: 335 NSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAHGYARDAFDL 394

Query: 373 --ISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEEC-------PAEDLVAYTSMITA 417
               + S    +     ++L +   G  +EE  ++F          P  D   Y  M+  
Sbjct: 395 FYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKAD--HYACMVDL 454
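Each hit reports its identity and positives as a count over the alignment length, followed by a percentage. As a minimal sketch, the percentages can be recomputed from the counts in the statistics line of the first Swiss-Prot hit; the helper name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

# Statistics line of the first Swiss-Prot hit (PPR75_ARATH).
STATS = "Identity = 125/370 (33.78%), Positives = 196/370 (52.97%), Query Frame = 1"


def parse_hsp_stats(line):
    """Extract the counts and recompute the percentages (hypothetical helper)."""
    # 'Posi?tives' also accepts the 'Postives' spelling found in some exports.
    match = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(STATS)
print(stats)
```

The recomputed values agree with the percentages printed in the report (33.78% and 52.97%).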

BLAST of Cp4.1LG09g00990 vs. Swiss-Prot
Match: PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.7e-51
Identity = 121/343 (35.28%), Positives = 182/343 (53.06%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTE 135
           ++ L SL     +  G +IH   I+ GL     + N L+ +YSKC+    A K+     +
Sbjct: 225 TAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGD 284

Query: 136 PDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQI 195
            + ++WSA+++GY+QNG   EA+  F  M   G+K +E+T   VL ACS    LE GKQ+
Sbjct: 285 RNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQL 344

Query: 196 HGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFF 255
           H   L  GFE  +F    LV MYAK G  +D++K F+ + ER+V  W +L S YVQ    
Sbjct: 345 HSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDN 404

Query: 256 SEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANAL 315
            EA+ L+R M + G+ PN+ ++++VL AC+ L  ++ G ++HG+ IK G+  +    +AL
Sbjct: 405 EEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSAL 464

Query: 316 LDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFCEQVHAISIKSGYQYDGYVANSLLDS 375
             MY+K G  E    VF   P  D+VSWNA I            SG  ++G         
Sbjct: 465 STMYSKCGSLEDGNLVFRRTPNKDVVSWNAMI------------SGLSHNG--------- 524

Query: 376 YGKGCRLEEAEKVFEECPAE----DLVAYTSMITAYSQYGLGE 415
                + +EA ++FEE  AE    D V + ++I+A S  G  E
Sbjct: 525 -----QGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVE 541

BLAST of Cp4.1LG09g00990 vs. Swiss-Prot
Match: PP337_ARATH (Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E53 PE=3 SV=1)

HSP 1 Score: 198.0 bits (502), Expect = 2.1e-49
Identity = 115/358 (32.12%), Positives = 195/358 (54.47%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLV--MDG 135
           +S L +   L  +  G+ +H  +    L  + G+ +KL+ LY+ C     A ++   M  
Sbjct: 96  ASLLETCYSLRAIDHGVRVHHLIPPYLLRNNLGISSKLVRLYASCGYAEVAHEVFDRMSK 155

Query: 136 TEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGK 195
            +    +W++LISGYA+ G+ E+A+  +++M   GVK + FTFP VLKAC    ++++G+
Sbjct: 156 RDSSPFAWNSLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGE 215

Query: 196 QIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQID 255
            IH   +  GF  DV+V N LVVMYAKCG+   ++ +F+ IP ++ VSWN++ + Y+   
Sbjct: 216 AIHRDLVKEGFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHG 275

Query: 256 FFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSAN 315
              EA+++FR MV  G+ P++ ++S+VL      +H   G ++HG++I+ G + +   AN
Sbjct: 276 LLHEALDIFRLMVQNGIEPDKVAISSVLARVLSFKH---GRQLHGWVIRRGMEWELSVAN 335

Query: 316 ALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIA----------FCEQVHAISIKSGYQ 375
           AL+ +Y+K G    A  +F ++ + D VSWNA I+          + EQ+H  + K    
Sbjct: 336 ALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKP--- 395

Query: 376 YDGYVANSLLDSYGKGCRLEEAEKVFEECPAE-----DLVAYTSMITAYSQYGLGEEA 417
            DG    S+L        +E+ E++F     E      +  Y  M+  Y + G+ EEA
Sbjct: 396 -DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEA 446

BLAST of Cp4.1LG09g00990 vs. Swiss-Prot
Match: PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 3.6e-49
Identity = 128/435 (29.43%), Positives = 211/435 (48.51%), Query Frame = 1

Query: 29  AQSCSTQLSNLNGCAPFVLPGASNPSPECCAALGAVQQDCLCSTLRISSTLPSLCRLPPL 88
           A S S  +    G + ++  G  +   +C A +  V+ D  C  +     L +  ++  L
Sbjct: 274 ASSVSEIIFRNKGLSEYLHSGQYSALLKCFADM--VESDVECDQVTFILMLATAVKVDSL 333

Query: 89  SCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGY 148
           + G ++H   ++LGL     V N LIN+Y K + F  AR +  + +E DL+SW+++I+G 
Sbjct: 334 ALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGI 393

Query: 149 AQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACS-LTRNLELGKQIHGIALVTGFESD 208
           AQNG   EA+  F ++   G+K +++T  SVLKA S L   L L KQ+H  A+     SD
Sbjct: 394 AQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSD 453

Query: 209 VFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVS 268
            FV+  L+  Y++     +++ LFE     ++V+WNA+ + Y Q     + + LF  M  
Sbjct: 454 SFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHK 513

Query: 269 TGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPES 328
            G   ++F+L+TV   C  L  I+ G ++H Y IK GYD D + ++ +LDMY K G   +
Sbjct: 514 QGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSA 573

Query: 329 AIAVFYEIPKPDIVSWNAAIAFC------------------------------------- 388
           A   F  IP PD V+W   I+ C                                     
Sbjct: 574 AQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSC 633

Query: 389 -------EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSM 419
                   Q+HA ++K     D +V  SL+D Y K   +++A  +F+     ++ A+ +M
Sbjct: 634 LTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAM 693

BLAST of Cp4.1LG09g00990 vs. Swiss-Prot
Match: PP357_ARATH (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 4.6e-49
Identity = 115/347 (33.14%), Positives = 181/347 (52.16%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTE 135
           SS L S   L  L  G ++HA  I+  L  D+ V N LI++Y+KC C   ARK+      
Sbjct: 354 SSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAA 413

Query: 136 PDLVSWSALISGYAQNGRG---EEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELG 195
            D+V ++A+I GY++ G      EAL  F +M    ++ +  TF S+L+A +   +L L 
Sbjct: 414 ADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLS 473

Query: 196 KQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQI 255
           KQIHG+    G   D+F  + L+ +Y+ C    DS+ +F+E+  +++V WN++F+ YVQ 
Sbjct: 474 KQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQ 533

Query: 256 DFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSA 315
               EA+NLF E+  +   P+EF+ + ++ A   L  +  G E H  L+K G + +P+  
Sbjct: 534 SENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYIT 593

Query: 316 NALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFCEQVHAISIKSGYQYDGYVANSL 375
           NALLDMYAK G PE A   F      D+V WN+ I                       S 
Sbjct: 594 NALLDMYAKCGSPEDAHKAFDSAASRDVVCWNSVI-----------------------SS 653

Query: 376 LDSYGKGCR-LEEAEKVFEECPAEDLVAYTSMITAYSQYGLGEEALK 419
             ++G+G + L+  EK+  E    + + +  +++A S  GL E+ LK
Sbjct: 654 YANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLK 677

BLAST of Cp4.1LG09g00990 vs. TrEMBL
Match: A0A0A0L8M4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G180420 PE=4 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 5.0e-127
Identity = 255/428 (59.58%), Positives = 296/428 (69.16%), Query Frame = 1

Query: 36  LSNL--NGCAPFVLPGASNPSPECCAALGAVQQDCLCSTLRISSTLPSLCRLPPLSCGME 95
           +SNL  N  + F+L  +SNPS                 ++     L        +S GM 
Sbjct: 29  ISNLRPNDVSGFILDSSSNPS-----------------SISYPKLLLQFTASKDVSSGMA 88

Query: 96  IHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGR 155
           IHAR+IRLGL    G+RN+L+NLYSKCQCF  ARKLV+D +EPDLVSWSALISGY QNGR
Sbjct: 89  IHARIIRLGLL---GLRNRLVNLYSKCQCFRVARKLVIDSSEPDLVSWSALISGYVQNGR 148

Query: 156 GEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANT 215
           GEEALLT+YEM+LLG KGNEFTF SVLK CSLTRNLELGKQIH +ALVTGFESDVFVANT
Sbjct: 149 GEEALLTYYEMYLLGAKGNEFTFSSVLKGCSLTRNLELGKQIHRVALVTGFESDVFVANT 208

Query: 216 LVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPN 275
           LVVMYAKCGEF DSKKLFE IPERNVVSWNALFSCYVQIDFF EAINLF+EM+STG++PN
Sbjct: 209 LVVMYAKCGEFGDSKKLFEAIPERNVVSWNALFSCYVQIDFFGEAINLFQEMISTGISPN 268

Query: 276 EFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFY 335
           EFSLSTVLNACAGLE  + GM++HGYLIKLGYDSDPFSANALLDMYAK+GCPE+AIAVFY
Sbjct: 269 EFSLSTVLNACAGLEDENYGMKVHGYLIKLGYDSDPFSANALLDMYAKSGCPEAAIAVFY 328

Query: 336 EIPKPDIVSWNAAIAFC--EQVHAISIKSGYQYDGY-VANSL--LDSYGKGCR------- 395
           EIPKPDIVSWNA IA C   + + +++K   +   Y VA S+  L S  K C        
Sbjct: 329 EIPKPDIVSWNAVIAGCVLHEKNDLALKLLGKMGSYRVAPSMFTLSSALKACAAIGLVKL 388

Query: 396 ------------LEE--------------------AEKVFEECPAEDLVAYTSMITAYSQ 418
                       +E                     A  VF+  P +D++ + S+I+ YS 
Sbjct: 389 GRQLHSALMKMDMEPDSFVGVGLIDMYSKCGLLQDARMVFDLMPKKDVIVWNSIISGYSN 436

BLAST of Cp4.1LG09g00990 vs. TrEMBL
Match: A5B2K7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_023708 PE=4 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 2.3e-111
Identity = 213/381 (55.91%), Positives = 261/381 (68.50%), Query Frame = 1

Query: 55  PECCAALGAVQQDCLCST-LRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKL 114
           P+  A L  + +     T +  S  L   C    L  G++IHA + + GL  D  +RN L
Sbjct: 38  PQTTAILNLIDKGNFTPTSVSYSKLLSQCCTTKSLRPGLQIHAHITKSGLSDDPSIRNHL 97

Query: 115 INLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNE 174
           INLYSKC+ F  ARKLV + +EPDLVSWSALISGYAQNG G  AL+ F+EMHLLGVK NE
Sbjct: 98  INLYSKCRXFGYARKLVDESSEPDLVSWSALISGYAQNGLGGGALMAFHEMHLLGVKCNE 157

Query: 175 FTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEE 234
           FTF SVLKACS+ ++L +GKQ+HG+ +V+GFE DVFVANTLVVMYAKC EF DSK+LF+E
Sbjct: 158 FTFSSVLKACSIVKDLRIGKQVHGVVVVSGFEGDVFVANTLVVMYAKCDEFLDSKRLFDE 217

Query: 235 IPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSG 294
           IPERNVVSWNALFSCYVQ DF  EA+ LF EMV +G+ PNEFSLS+++NAC GL     G
Sbjct: 218 IPERNVVSWNALFSCYVQXDFCGEAVGLFYEMVLSGIKPNEFSLSSMVNACTGLRDSSRG 277

Query: 295 MEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC--- 354
             IHGYLIKLGYD DPFSANAL+DMYAK G    AI+VF +I +PDIVSWNA IA C   
Sbjct: 278 KIIHGYLIKLGYDWDPFSANALVDMYAKVGDLADAISVFEKIKQPDIVSWNAVIAGCVLH 337

Query: 355 --------------EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAED 414
                          Q+H+  +K   + D +V+  L+D Y K   LE+A   F   P +D
Sbjct: 338 EHHEQALELLGQMKRQLHSSLMKMDMESDLFVSVGLVDMYSKCDLLEDARMAFNLLPEKD 397

Query: 415 LVAYTSMITAYSQYGLGEEAL 418
           L+A+ ++I+ YSQY    EAL
Sbjct: 398 LIAWNAIISGYSQYWEDMEAL 418

BLAST of Cp4.1LG09g00990 vs. TrEMBL
Match: A0A061GGD5_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_029817 PE=4 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 3.0e-103
Identity = 199/373 (53.35%), Positives = 246/373 (65.95%), Query Frame = 1

Query: 89  SCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGY 148
           S GM+IHA  I+ G   D   RN LI+LY+KC+ F  ARKLV +  EPDLVSWSALISGY
Sbjct: 86  SPGMQIHAITIKFGSTKDPKSRNLLISLYAKCKLFRYARKLVDESPEPDLVSWSALISGY 145

Query: 149 AQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQIHGIALVTGFESDV 208
           AQNG G+EA+L FYEMHLLGV+ N+FTFPSVLKAC+ TR+LELG+QIH + +VTGFE D 
Sbjct: 146 AQNGFGKEAILAFYEMHLLGVRCNDFTFPSVLKACTFTRDLELGRQIHAVVVVTGFECDE 205

Query: 209 FVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVST 268
           +VAN+LVVMYAKCGEF DS++LFE++PER+VVSWNAL SCYVQ D+  EA+ LF EMVS+
Sbjct: 206 YVANSLVVMYAKCGEFGDSRRLFEDMPERSVVSWNALLSCYVQSDYCGEAVELFHEMVSS 265

Query: 269 GLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESA 328
           G+ PNEFSLS+++NA  GLE    G + HG+LIKLGYDSDPFS NAL+DM AK G  E A
Sbjct: 266 GIKPNEFSLSSMINAYTGLEDSGQGRKTHGFLIKLGYDSDPFSKNALVDMCAKVGSLEDA 325

Query: 329 IAVFYEIPKPDIVSWNAAIAFC-------------------------------------- 388
           + VF EI +PDIVSWNA IA C                                      
Sbjct: 326 VFVFEEIARPDIVSWNAVIAGCVLHENHDWALELFGQMRRSGTHPNMFTLSSALKACAGT 385

Query: 389 ------EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSMI 418
                  Q+H   IK     D +V   L+D Y K   + +A  VF   P +DL+A+ ++I
Sbjct: 386 GHKKLGRQLHCNLIKINVGSDPFVDVGLIDMYSKTYLMNDARMVFNLMPDKDLIAWNAVI 445

BLAST of Cp4.1LG09g00990 vs. TrEMBL
Match: B9H2R5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s04740g PE=4 SV=2)

HSP 1 Score: 380.9 bits (977), Expect = 1.9e-102
Identity = 194/369 (52.57%), Positives = 247/369 (66.94%), Query Frame = 1

Query: 92  MEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQN 151
           MEIHAR+I+ GL  D  +RN L+NLYSKCQ F  ARKL+   TEPDLVSWSALISGY+QN
Sbjct: 1   MEIHARVIKFGLSQDPKIRNYLVNLYSKCQLFGYARKLLDRSTEPDLVSWSALISGYSQN 60

Query: 152 GRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVA 211
           G  +EA+L FYEMHLLG+K NEF FPSVLKAC++T++L LGKQ+HGI +VTGF+SD FVA
Sbjct: 61  GFCQEAVLAFYEMHLLGIKCNEFAFPSVLKACTVTKDLVLGKQVHGIVVVTGFDSDEFVA 120

Query: 212 NTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLT 271
           N+LV++YAKCG F D++ LF+ IP+R+VVSWNALFSCYV  D   EA++LF +MV +G+ 
Sbjct: 121 NSLVILYAKCGGFGDARSLFDAIPDRSVVSWNALFSCYVHSDMHGEAVSLFHDMVLSGIR 180

Query: 272 PNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAV 331
           PNEFSLS+++N C GLE    G +IHGYLIKLGYDSD FSANAL+DMYAK G  E A +V
Sbjct: 181 PNEFSLSSMINVCTGLEDSVQGRKIHGYLIKLGYDSDAFSANALVDMYAKVGILEDASSV 240

Query: 332 FYEIPKPDIVSWN-----------------------------------AAIAFC------ 391
           F EI KPDIVSWN                                   +A+  C      
Sbjct: 241 FDEIAKPDIVSWNAIIAGCVLHEYHHRALELLREMNKSGMCPNMFTLSSALKACAGMALR 300

Query: 392 ---EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSMITAY 417
               Q+H+  IK     D ++   L+D Y K   +++A  VF+  P  D++A+ ++I+ +
Sbjct: 301 ELGRQLHSSLIKMDMGSDSFLGVGLIDMYSKCNSMDDARLVFKLMPERDMIAWNAVISGH 360

BLAST of Cp4.1LG09g00990 vs. TrEMBL
Match: A0A103XF42_CYNCS (Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus GN=Ccrd_008503 PE=4 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 4.8e-101
Identity = 177/280 (63.21%), Positives = 221/280 (78.93%), Query Frame = 1

Query: 71  STLRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLV 130
           S++  S  L   C+   LS G++IH  +I++GL  D+  RN LINLYSKC+ F  AR+L+
Sbjct: 56  SSISYSKLLSQCCQSKSLSPGLQIHTHLIKIGLANDSKHRNHLINLYSKCRLFGCARRLL 115

Query: 131 MDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLE 190
            +  EPDLV+WS+LISGYAQNG GEEA+L F EMH LG++ NEFTFPSVLKACS+ +++ 
Sbjct: 116 DESPEPDLVAWSSLISGYAQNGLGEEAILAFSEMHSLGIRCNEFTFPSVLKACSIKKDIV 175

Query: 191 LGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYV 250
            GKQIHGI +VTGFESDVFVANTLVV+YAKCGEF DS++LF++IP+RN+VSWNALFSCY 
Sbjct: 176 GGKQIHGIVVVTGFESDVFVANTLVVVYAKCGEFLDSRRLFDQIPDRNIVSWNALFSCYT 235

Query: 251 QIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPF 310
           Q DFF EAI LF++MVS+G+ P+EFSLST++NAC GL  ++ G +IHGYL+K G+ SDPF
Sbjct: 236 QGDFFKEAIYLFQDMVSSGIRPDEFSLSTIINACTGLHDVNQGKKIHGYLMKHGFSSDPF 295

Query: 311 SANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC 351
           S NAL+DMY+K G  E    VF  IP PDIVSWNA IA C
Sbjct: 296 SCNALVDMYSKVGDFEDCKQVFEHIPNPDIVSWNAVIAGC 335

BLAST of Cp4.1LG09g00990 vs. TAIR10
Match: AT1G50270.1 (AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 214.9 bits (546), Expect = 9.3e-56
Identity = 125/370 (33.78%), Positives = 196/370 (52.97%), Query Frame = 1

Query: 73  LRISSTLPSLCRLPPL---------SCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCF 132
           +R +  +PS    PPL         S   + HA +++ GL +D  VRN LI+ YS    F
Sbjct: 95  MRRNGVIPSRHTFPPLLKAVFKLRDSNPFQFHAHIVKFGLDSDPFVRNSLISGYSSSGLF 154

Query: 133 PAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKAC 192
             A +L     + D+V+W+A+I G+ +NG   EA++ F EM   GV  NE T  SVLKA 
Sbjct: 155 DFASRLFDGAEDKDVVTWTAMIDGFVRNGSASEAMVYFVEMKKTGVAANEMTVVSVLKAA 214

Query: 193 SLTRNLELGKQIHGIALVTG-FESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSW 252
               ++  G+ +HG+ L TG  + DVF+ ++LV MY KC  + D++K+F+E+P RNVV+W
Sbjct: 215 GKVEDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTW 274

Query: 253 NALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIK 312
            AL + YVQ   F + + +F EM+ + + PNE +LS+VL+ACA +  +  G  +H Y+IK
Sbjct: 275 TALIAGYVQSRCFDKGMLVFEEMLKSDVAPNEKTLSSVLSACAHVGALHRGRRVHCYMIK 334

Query: 313 LGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAI-AFCEQVHA------ 372
              + +  +   L+D+Y K GC E AI VF  + + ++ +W A I  F    +A      
Sbjct: 335 NSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAHGYARDAFDL 394

Query: 373 --ISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEEC-------PAEDLVAYTSMITA 417
               + S    +     ++L +   G  +EE  ++F          P  D   Y  M+  
Sbjct: 395 FYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKAD--HYACMVDL 454

BLAST of Cp4.1LG09g00990 vs. TAIR10
Match: AT2G33680.1 (AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 204.9 bits (520), Expect = 9.6e-53
Identity = 121/343 (35.28%), Positives = 182/343 (53.06%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTE 135
           ++ L SL     +  G +IH   I+ GL     + N L+ +YSKC+    A K+     +
Sbjct: 225 TAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGD 284

Query: 136 PDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQI 195
            + ++WSA+++GY+QNG   EA+  F  M   G+K +E+T   VL ACS    LE GKQ+
Sbjct: 285 RNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQL 344

Query: 196 HGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFF 255
           H   L  GFE  +F    LV MYAK G  +D++K F+ + ER+V  W +L S YVQ    
Sbjct: 345 HSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDN 404

Query: 256 SEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANAL 315
            EA+ L+R M + G+ PN+ ++++VL AC+ L  ++ G ++HG+ IK G+  +    +AL
Sbjct: 405 EEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSAL 464

Query: 316 LDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFCEQVHAISIKSGYQYDGYVANSLLDS 375
             MY+K G  E    VF   P  D+VSWNA I            SG  ++G         
Sbjct: 465 STMYSKCGSLEDGNLVFRRTPNKDVVSWNAMI------------SGLSHNG--------- 524

Query: 376 YGKGCRLEEAEKVFEECPAE----DLVAYTSMITAYSQYGLGE 415
                + +EA ++FEE  AE    D V + ++I+A S  G  E
Sbjct: 525 -----QGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVE 541
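
The Identity and Positives figures in each HSP header are ratios over the alignment length (for example 121/343 for the AT2G33680.1 hit above). The following is a minimal sketch, not part of the original report, that re-derives those percentages from a header line. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, and the pattern tolerates the "Postives" spelling that some report renderings emit.

```python
import re

# Matches the statistics line of an HSP header; accepts "Positives" or "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts from an HSP statistics line plus recomputed percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length, pos, pos_length = int(ident), int(length), int(pos), int(pos_length)
    return {
        "identical": ident,
        "positives": pos,
        "length": length,
        # Percentages are recomputed from the counts, not copied from the line.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 121/343 (35.28%), Positives = 182/343 (53.06%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when these reports are scraped from a web page.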

BLAST of Cp4.1LG09g00990 vs. TAIR10
Match: AT4G25270.1 (AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 198.0 bits (502), Expect = 1.2e-50
Identity = 115/358 (32.12%), Positives = 195/358 (54.47%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLV--MDG 135
           +S L +   L  +  G+ +H  +    L  + G+ +KL+ LY+ C     A ++   M  
Sbjct: 96  ASLLETCYSLRAIDHGVRVHHLIPPYLLRNNLGISSKLVRLYASCGYAEVAHEVFDRMSK 155

Query: 136 TEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGK 195
            +    +W++LISGYA+ G+ E+A+  +++M   GVK + FTFP VLKAC    ++++G+
Sbjct: 156 RDSSPFAWNSLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGE 215

Query: 196 QIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQID 255
            IH   +  GF  DV+V N LVVMYAKCG+   ++ +F+ IP ++ VSWN++ + Y+   
Sbjct: 216 AIHRDLVKEGFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHG 275

Query: 256 FFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSAN 315
              EA+++FR MV  G+ P++ ++S+VL      +H   G ++HG++I+ G + +   AN
Sbjct: 276 LLHEALDIFRLMVQNGIEPDKVAISSVLARVLSFKH---GRQLHGWVIRRGMEWELSVAN 335

Query: 316 ALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIA----------FCEQVHAISIKSGYQ 375
           AL+ +Y+K G    A  +F ++ + D VSWNA I+          + EQ+H  + K    
Sbjct: 336 ALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKP--- 395

Query: 376 YDGYVANSLLDSYGKGCRLEEAEKVFEECPAE-----DLVAYTSMITAYSQYGLGEEA 417
            DG    S+L        +E+ E++F     E      +  Y  M+  Y + G+ EEA
Sbjct: 396 -DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEA 446

BLAST of Cp4.1LG09g00990 vs. TAIR10
Match: AT4G33170.1 (AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 197.2 bits (500), Expect = 2.0e-50
Identity = 128/435 (29.43%), Positives = 211/435 (48.51%), Query Frame = 1

Query: 29  AQSCSTQLSNLNGCAPFVLPGASNPSPECCAALGAVQQDCLCSTLRISSTLPSLCRLPPL 88
           A S S  +    G + ++  G  +   +C A +  V+ D  C  +     L +  ++  L
Sbjct: 274 ASSVSEIIFRNKGLSEYLHSGQYSALLKCFADM--VESDVECDQVTFILMLATAVKVDSL 333

Query: 89  SCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGY 148
           + G ++H   ++LGL     V N LIN+Y K + F  AR +  + +E DL+SW+++I+G 
Sbjct: 334 ALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGI 393

Query: 149 AQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACS-LTRNLELGKQIHGIALVTGFESD 208
           AQNG   EA+  F ++   G+K +++T  SVLKA S L   L L KQ+H  A+     SD
Sbjct: 394 AQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSD 453

Query: 209 VFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVS 268
            FV+  L+  Y++     +++ LFE     ++V+WNA+ + Y Q     + + LF  M  
Sbjct: 454 SFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHK 513

Query: 269 TGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPES 328
            G   ++F+L+TV   C  L  I+ G ++H Y IK GYD D + ++ +LDMY K G   +
Sbjct: 514 QGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSA 573

Query: 329 AIAVFYEIPKPDIVSWNAAIAFC------------------------------------- 388
           A   F  IP PD V+W   I+ C                                     
Sbjct: 574 AQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSC 633

Query: 389 -------EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSM 419
                   Q+HA ++K     D +V  SL+D Y K   +++A  +F+     ++ A+ +M
Sbjct: 634 LTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAM 693

BLAST of Cp4.1LG09g00990 vs. TAIR10
Match: AT4G39530.1 (AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 196.8 bits (499), Expect = 2.6e-50
Identity = 115/347 (33.14%), Positives = 181/347 (52.16%), Query Frame = 1

Query: 76  SSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTE 135
           SS L S   L  L  G ++HA  I+  L  D+ V N LI++Y+KC C   ARK+      
Sbjct: 354 SSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAA 413

Query: 136 PDLVSWSALISGYAQNGRG---EEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELG 195
            D+V ++A+I GY++ G      EAL  F +M    ++ +  TF S+L+A +   +L L 
Sbjct: 414 ADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLS 473

Query: 196 KQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQI 255
           KQIHG+    G   D+F  + L+ +Y+ C    DS+ +F+E+  +++V WN++F+ YVQ 
Sbjct: 474 KQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQ 533

Query: 256 DFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSA 315
               EA+NLF E+  +   P+EF+ + ++ A   L  +  G E H  L+K G + +P+  
Sbjct: 534 SENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYIT 593

Query: 316 NALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFCEQVHAISIKSGYQYDGYVANSL 375
           NALLDMYAK G PE A   F      D+V WN+ I                       S 
Sbjct: 594 NALLDMYAKCGSPEDAHKAFDSAASRDVVCWNSVI-----------------------SS 653

Query: 376 LDSYGKGCR-LEEAEKVFEECPAEDLVAYTSMITAYSQYGLGEEALK 419
             ++G+G + L+  EK+  E    + + +  +++A S  GL E+ LK
Sbjct: 654 YANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLK 677

BLAST of Cp4.1LG09g00990 vs. NCBI nr
Match: gi|659077399|ref|XP_008439183.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis melo])

HSP 1 Score: 476.5 bits (1225), Expect = 4.8e-131
Identity = 252/391 (64.45%), Positives = 288/391 (73.66%), Query Frame = 1

Query: 71  STLRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLV 130
           S++  S  L        ++ GM IHAR+IRLGLC D G+RN+LINLYSKCQCF  ARKLV
Sbjct: 49  SSISYSKLLLQFTASKDVNSGMAIHARIIRLGLCRDVGLRNRLINLYSKCQCFRVARKLV 108

Query: 131 MDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLE 190
           MD TEPDLVSWSALISGYAQNGRGEEALLT+YEM+LLGVKGNEFTFPSVLK CSLTRNLE
Sbjct: 109 MDSTEPDLVSWSALISGYAQNGRGEEALLTYYEMYLLGVKGNEFTFPSVLKGCSLTRNLE 168

Query: 191 LGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYV 250
           LGKQIHG+ALVTGFESD FVANTLVVMYAKCGEF DSKKLFE IPER+VVSWNALFSCYV
Sbjct: 169 LGKQIHGVALVTGFESDEFVANTLVVMYAKCGEFGDSKKLFEAIPERSVVSWNALFSCYV 228

Query: 251 QIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPF 310
           QIDFF EAINLF+EM+STG++PNEFSLSTVLNACAGLE  + GM+IHG LIKLGY+SDPF
Sbjct: 229 QIDFFGEAINLFQEMISTGISPNEFSLSTVLNACAGLEDENYGMKIHGCLIKLGYESDPF 288

Query: 311 SANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC--EQVHAISIKSGYQYDGY- 370
           SANALLDMYAK+GCPE+AIAVFYEIPKPDIVSWNA IA C   + + +++K   +   Y 
Sbjct: 289 SANALLDMYAKSGCPEAAIAVFYEIPKPDIVSWNAVIAGCVLHEKNDLALKLLGKMGSYR 348

Query: 371 VANSL--LDSYGKGCR-------------------LEE--------------------AE 418
           VA S+  L S  K C                    +E                     A 
Sbjct: 349 VAPSMFALSSALKACAAIGLVKLGRQLHSALMKRDMESDSFVGVGLIDMYSKCGLLQDAR 408

BLAST of Cp4.1LG09g00990 vs. NCBI nr
Match: gi|778679510|ref|XP_011651139.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis sativus])

HSP 1 Score: 462.6 bits (1189), Expect = 7.2e-127
Identity = 255/428 (59.58%), Positives = 296/428 (69.16%), Query Frame = 1

Query: 36  LSNL--NGCAPFVLPGASNPSPECCAALGAVQQDCLCSTLRISSTLPSLCRLPPLSCGME 95
           +SNL  N  + F+L  +SNPS                 ++     L        +S GM 
Sbjct: 29  ISNLRPNDVSGFILDSSSNPS-----------------SISYPKLLLQFTASKDVSSGMA 88

Query: 96  IHARMIRLGLCTDTGVRNKLINLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGR 155
           IHAR+IRLGL    G+RN+L+NLYSKCQCF  ARKLV+D +EPDLVSWSALISGY QNGR
Sbjct: 89  IHARIIRLGLL---GLRNRLVNLYSKCQCFRVARKLVIDSSEPDLVSWSALISGYVQNGR 148

Query: 156 GEEALLTFYEMHLLGVKGNEFTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANT 215
           GEEALLT+YEM+LLG KGNEFTF SVLK CSLTRNLELGKQIH +ALVTGFESDVFVANT
Sbjct: 149 GEEALLTYYEMYLLGAKGNEFTFSSVLKGCSLTRNLELGKQIHRVALVTGFESDVFVANT 208

Query: 216 LVVMYAKCGEFSDSKKLFEEIPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPN 275
           LVVMYAKCGEF DSKKLFE IPERNVVSWNALFSCYVQIDFF EAINLF+EM+STG++PN
Sbjct: 209 LVVMYAKCGEFGDSKKLFEAIPERNVVSWNALFSCYVQIDFFGEAINLFQEMISTGISPN 268

Query: 276 EFSLSTVLNACAGLEHIDSGMEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFY 335
           EFSLSTVLNACAGLE  + GM++HGYLIKLGYDSDPFSANALLDMYAK+GCPE+AIAVFY
Sbjct: 269 EFSLSTVLNACAGLEDENYGMKVHGYLIKLGYDSDPFSANALLDMYAKSGCPEAAIAVFY 328

Query: 336 EIPKPDIVSWNAAIAFC--EQVHAISIKSGYQYDGY-VANSL--LDSYGKGCR------- 395
           EIPKPDIVSWNA IA C   + + +++K   +   Y VA S+  L S  K C        
Sbjct: 329 EIPKPDIVSWNAVIAGCVLHEKNDLALKLLGKMGSYRVAPSMFTLSSALKACAAIGLVKL 388

Query: 396 ------------LEE--------------------AEKVFEECPAEDLVAYTSMITAYSQ 418
                       +E                     A  VF+  P +D++ + S+I+ YS 
Sbjct: 389 GRQLHSALMKMDMEPDSFVGVGLIDMYSKCGLLQDARMVFDLMPKKDVIVWNSIISGYSN 436

BLAST of Cp4.1LG09g00990 vs. NCBI nr
Match: gi|147805537|emb|CAN74095.1| (hypothetical protein VITISV_023708 [Vitis vinifera])

HSP 1 Score: 410.6 bits (1054), Expect = 3.3e-111
Identity = 213/381 (55.91%), Positives = 261/381 (68.50%), Query Frame = 1

Query: 55  PECCAALGAVQQDCLCST-LRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKL 114
           P+  A L  + +     T +  S  L   C    L  G++IHA + + GL  D  +RN L
Sbjct: 38  PQTTAILNLIDKGNFTPTSVSYSKLLSQCCTTKSLRPGLQIHAHITKSGLSDDPSIRNHL 97

Query: 115 INLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNE 174
           INLYSKC+ F  ARKLV + +EPDLVSWSALISGYAQNG G  AL+ F+EMHLLGVK NE
Sbjct: 98  INLYSKCRXFGYARKLVDESSEPDLVSWSALISGYAQNGLGGGALMAFHEMHLLGVKCNE 157

Query: 175 FTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEE 234
           FTF SVLKACS+ ++L +GKQ+HG+ +V+GFE DVFVANTLVVMYAKC EF DSK+LF+E
Sbjct: 158 FTFSSVLKACSIVKDLRIGKQVHGVVVVSGFEGDVFVANTLVVMYAKCDEFLDSKRLFDE 217

Query: 235 IPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSG 294
           IPERNVVSWNALFSCYVQ DF  EA+ LF EMV +G+ PNEFSLS+++NAC GL     G
Sbjct: 218 IPERNVVSWNALFSCYVQXDFCGEAVGLFYEMVLSGIKPNEFSLSSMVNACTGLRDSSRG 277

Query: 295 MEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC--- 354
             IHGYLIKLGYD DPFSANAL+DMYAK G    AI+VF +I +PDIVSWNA IA C   
Sbjct: 278 KIIHGYLIKLGYDWDPFSANALVDMYAKVGDLADAISVFEKIKQPDIVSWNAVIAGCVLH 337

Query: 355 --------------EQVHAISIKSGYQYDGYVANSLLDSYGKGCRLEEAEKVFEECPAED 414
                          Q+H+  +K   + D +V+  L+D Y K   LE+A   F   P +D
Sbjct: 338 EHHEQALELLGQMKRQLHSSLMKMDMESDLFVSVGLVDMYSKCDLLEDARMAFNLLPEKD 397

Query: 415 LVAYTSMITAYSQYGLGEEAL 418
           L+A+ ++I+ YSQY    EAL
Sbjct: 398 LIAWNAIISGYSQYWEDMEAL 418

BLAST of Cp4.1LG09g00990 vs. NCBI nr
Match: gi|359483488|ref|XP_002273710.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Vitis vinifera])

HSP 1 Score: 401.4 bits (1030), Expect = 2.0e-108
Identity = 214/408 (52.45%), Positives = 262/408 (64.22%), Query Frame = 1

Query: 55  PECCAALGAVQQDCLCST-LRISSTLPSLCRLPPLSCGMEIHARMIRLGLCTDTGVRNKL 114
           P+  A L  + +     T +  S  L   C    L  G++IHA + + GL  D  +RN L
Sbjct: 38  PQTTAILNLIDKGNFTPTSVSYSKLLSQCCTTKSLRPGLQIHAHITKSGLSDDPSIRNHL 97

Query: 115 INLYSKCQCFPAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNE 174
           INLYSKC+ F  ARKLV + +EPDLVSWSALISGYAQNG G  AL+ F+EMHLLGVK NE
Sbjct: 98  INLYSKCRNFGYARKLVDESSEPDLVSWSALISGYAQNGLGGGALMAFHEMHLLGVKCNE 157

Query: 175 FTFPSVLKACSLTRNLELGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEE 234
           FTF SVLKACS+ ++L +GKQ+HG+ +V+GFE DVFVANTLVVMYAKC EF DSK+LF+E
Sbjct: 158 FTFSSVLKACSIVKDLRIGKQVHGVVVVSGFEGDVFVANTLVVMYAKCDEFLDSKRLFDE 217

Query: 235 IPERNVVSWNALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSG 294
           IPERNVVSWNALFSCYVQIDF  EA+ LF EMV +G+ PNEFSLS+++NAC GL     G
Sbjct: 218 IPERNVVSWNALFSCYVQIDFCGEAVGLFYEMVLSGIKPNEFSLSSMVNACTGLRDSSRG 277

Query: 295 MEIHGYLIKLGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC--- 354
             IHGYLIKLGYD DPFSANAL+DMYAK G    AI+VF +I +PDIVSWNA IA C   
Sbjct: 278 KIIHGYLIKLGYDWDPFSANALVDMYAKVGDLADAISVFEKIKQPDIVSWNAVIAGCVLH 337

Query: 355 -----------------------------------------EQVHAISIKSGYQYDGYVA 414
                                                     Q+H+  +K   + D +V+
Sbjct: 338 EHHEQALELLGQMKRSGICPNIFTLSSALKACAGMGLKELGRQLHSSLMKMDMESDLFVS 397

Query: 415 NSLLDSYGKGCRLEEAEKVFEECPAEDLVAYTSMITAYSQYGLGEEAL 418
             L+D Y K   LE+A   F   P +DL+A+ ++I+ YSQY    EAL
Sbjct: 398 VGLVDMYSKCDLLEDARMAFNLLPEKDLIAWNAIISGYSQYWEDMEAL 445

BLAST of Cp4.1LG09g00990 vs. NCBI nr
Match: gi|645237289|ref|XP_008225136.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Prunus mume])

HSP 1 Score: 393.3 bits (1009), Expect = 5.4e-106
Identity = 209/402 (51.99%), Positives = 264/402 (65.67%), Query Frame = 1

Query: 61  LGAVQQDCLCSTLRISSTLPSLCRLPP-LSCGMEIHARMIRLGLCTDTGVRNKLINLYSK 120
           L +VQ+     T    S L S C     +  GME+HA +IR G   D  +RN LINLYSK
Sbjct: 58  LSSVQKGNFSPTSISYSKLLSQCAASKSVGVGMEVHAHIIRCGCSGDQSLRNHLINLYSK 117

Query: 121 CQCFPAARKLVMDGTEPDLVSWSALISGYAQNGRGEEALLTFYEMHLLGVKGNEFTFPSV 180
           C+ F  ARKLV + TEPDLVSWSALISGYAQNG G+EAL  F EMH LGVK NEFTFPSV
Sbjct: 118 CRFFRHARKLVDESTEPDLVSWSALISGYAQNGLGKEALSAFREMHSLGVKCNEFTFPSV 177

Query: 181 LKACSLTRNLELGKQIHGIALVTGFESDVFVANTLVVMYAKCGEFSDSKKLFEEIPERNV 240
           LKACS+TR+  LGKQ+HGIAL+TGFESD FVANTLVVMYAKCGEF DS++LF+ IPERNV
Sbjct: 178 LKACSITRDSVLGKQVHGIALLTGFESDEFVANTLVVMYAKCGEFGDSRRLFDAIPERNV 237

Query: 241 VSWNALFSCYVQIDFFSEAINLFREMVSTGLTPNEFSLSTVLNACAGLEHIDSGMEIHGY 300
           VSWNALFSCYVQ D + EA++LF+EM+ +G+ PNE+SLS+++NAC GL     G +IHGY
Sbjct: 238 VSWNALFSCYVQSDSYGEAMDLFQEMILSGVRPNEYSLSSIINACTGLGDGSRGRKIHGY 297

Query: 301 LIKLGYDSDPFSANALLDMYAKAGCPESAIAVFYEIPKPDIVSWNAAIAFC--------- 360
           ++KLGY+SD FSANAL+DMYAK    E AI+VF +I +PDIVSWNA IA C         
Sbjct: 298 MVKLGYESDSFSANALVDMYAKVKGLEDAISVFEKIAQPDIVSWNAVIAGCVLHEYHDWA 357

Query: 361 -----------------------------------EQVHAISIKSGYQYDGYVANSLLDS 418
                                               Q+H+  +K   + D +V   L+D 
Sbjct: 358 LQFFGQMNGSGICPNMFTLSSALKACAGLGFEKLGRQLHSFLLKMDTESDSFVNVGLIDM 417

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PPR75_ARATH	1.6e-54	33.78	Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana GN...
PP181_ARATH	1.7e-51	35.28	Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN...
PP337_ARATH	2.1e-49	32.12	Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidop...
PP347_ARATH	3.6e-49	29.43	Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana GN...
PP357_ARATH	4.6e-49	33.14	Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN...
Match Name	E-value	Identity	Description
A0A0A0L8M4_CUCSA	5.0e-127	59.58	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G180420 PE=4 SV=1
A5B2K7_VITVI	2.3e-111	55.91	Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_023708 PE=4 SV=1
A0A061GGD5_THECC	3.0e-103	53.35	Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0298...
B9H2R5_POPTR	1.9e-102	52.57	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s04740g PE=4 SV=2
A0A103XF42_CYNCS	4.8e-101	63.21	Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus ...
Match Name	E-value	Identity	Description
AT1G50270.1	9.3e-56	33.78	Pentatricopeptide repeat (PPR) superfamily protein
AT2G33680.1	9.6e-53	35.28	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G25270.1	1.2e-50	32.12	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33170.1	2.0e-50	29.43	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G39530.1	2.6e-50	33.14	Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name	E-value	Identity	Description
gi|659077399|ref|XP_008439183.1|	4.8e-131	64.45	PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis m...
gi|778679510|ref|XP_011651139.1|	7.2e-127	59.58	PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis s...
gi|147805537|emb|CAN74095.1|	3.3e-111	55.91	hypothetical protein VITISV_023708 [Vitis vinifera]
gi|359483488|ref|XP_002273710.2|	2.0e-108	52.45	PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Vitis vin...
gi|645237289|ref|XP_008225136.1|	5.4e-106	51.99	PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Prunus mu...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR016140	Bifunc_inhib/LTP/seed_store
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005576	extracellular region
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG09g00990.1	Cp4.1LG09g00990.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 311..335, score: 0.003; coord: 370..391, score: 0.0023; coord: 398..418, score: 0.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 238..285, score: 5.8E-12; coord: 136..183, score: 5.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 369..391, score: 5.2E-4; coord: 139..172, score: 8.6E-7; coord: 240..274, score: 3.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 238..272, score: 11.871; coord: 172..206, score: 6.16; coord: 137..171, score: 10.665; coord: 308..342, score: 8.868; coord: 273..307, score: 6.982; coord: 207..237, score: 8.517; coord: 365..399, score: 8
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 123..333, score: 3.
IPR016140	Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain	PFAM	PF14368	LTP_2	coord: 17..73, score: 6.
IPR016140	Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain	SMART	SM00499	aai_6	coord: 32..90, score: 8.
IPR016140	Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain	unknown	SSF47699	Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin	coord: 29..91, score: 4.84
None	No IPR available	GENE3D	G3DSA:1.10.110.10		coord: 31..75, score: 5.
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 61..418, score: 2.0E
None	No IPR available	PANTHER	PTHR24015:SF811	SUBFAMILY NOT NAMED	coord: 61..418, score: 2.0E
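
The alignment coordinates above are inclusive residue ranges on the protein, so the length of each hit is end minus start plus one. The following is a minimal sketch, not part of the original annotation, that extracts the `coord: a..b` ranges for the PS51375 (PPR profile) hits and reports their spans in sequence order. The helper name `span_lengths` is an illustrative assumption, and the coordinates are copied from the table above.

```python
import re

# Matches one inclusive residue range as printed in the Alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for each 'coord: a..b' range, sorted by start."""
    spans = [(int(a), int(b)) for a, b in COORD.findall(text)]
    # Ranges are inclusive at both ends, hence the +1.
    return [(a, b, b - a + 1) for a, b in sorted(spans)]


# PS51375 hits for Cp4.1LG09g00990.1, in the order they appear in the table.
profile_hits = (
    "coord: 238..272 coord: 172..206 coord: 137..171 coord: 308..342 "
    "coord: 273..307 coord: 207..237 coord: 365..399"
)

for start, end, length in span_lengths(profile_hits):
    print(start, end, length)
```

Sorted this way, the hits from residue 137 to 342 tile the sequence back to back, mostly in 35-residue units, which is consistent with the tandem repeat arrangement expected for a pentatricopeptide repeat protein.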

The following gene(s) are paralogous to this gene:

None