Cp4.1LG06g00270 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG06g00270
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: pentatricopeptide repeat-containing protein isoform X1
Location: Cp4.1LG06: 146734 .. 153004 (+)
RNA-Seq Expression: Cp4.1LG06g00270
Synteny: Cp4.1LG06g00270
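
The location field in the overview can be split into its parts and used to derive the genomic span of the feature. The following is a minimal sketch, not part of the database tooling: the function names `parse_location` and `feature_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a location string such as 'Cp4.1LG06: 146734 .. 153004 (+)'
    into (sequence id, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Length of the feature in bases.

    Assumption: coordinates are 1-based and inclusive, so both end
    points count toward the length.
    """
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG06: 146734 .. 153004 (+)")
length = feature_length(start, end)
print(seqid, start, end, strand, length)
```

Under that assumption, the span on Cp4.1LG06 covers 6271 bases on the plus strand.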
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAACAGTATTCGGCTGCAGTCCAAGACTTGATTTACATTTCTTTCAAGACTTGATTTTTGCACGAACTCATGGCCCTGTCCGTATTGCCACTGCACTCGAAATCTGGTGTTCACTAAGCCTTTGTCGCCGTCTTTGCGGGGAAGACAATGCACAGGCAAGTAAAATTCTTCCCGGGTAACGATAGCAGATACAGATGCCTTAACCTCGTTTCTCCTCTGCTTTTTTCTCAGGGCAAGGTTTCGTTTGGGATCGATAGCTGACTCGCTATACAGATTCAGGCCACACGAACATGTAAGTATCTTCAGTTCGGAAGTGTGGCTGGATTTTATAATTTTGAATTTGTTATCTTCAAAGAAGAAGAAAAGGAGTAGGAAGCTGACTTAAGGGGCCAGCTGGTTCATCTAGTGCATAATTTTTATTGTTAGAATTTCTAAACACGGCCATTGCGAACAATGAACTTATAATTGGTTTAGCATTTGTGCAAATTAGTCATTCTTGTATGCCTGTGGCCGCCCTAATGTAGTAGGGCAAATTAGTTGTCTCGTAAGATTAGTCAAGATGCACGTAAGATCAGTCAAGATGCACTCGCCTGAAAACTAGCCATCTAGGAATTCCCAGTGATTCTTGTGAGTTTTCATTTATTCTCCTTACTGCGATCGACAGGGGCGAAAACAGGATGCGAATAAGATGGTATTCCGTCGAGCTCTTCTCATCTCCCAAGGTAATTTATGGCAAATAAGTTGTTTGAACTTTGAATACGTCCTGCTTTGCTAAATGAGTGCATCAATTTCGTTTCTTTAGCGTGGTATGTTATATTGATCTTCTTACGTTGTCTGCAACTCGAAACTTTGAAAATAGTTTGTTATATGCTATCTTTTCAGGTGGGACAGATCAAAATGGCAAAACCTTTTAAAAATCACCATCTCATGTGGAAAATGTTGACACATTAATGTTTCAGTGTAGGTATTGAATATTTGGGAAATGAAGCCGAGTCAACTAAGTTCATGCAGAGACAGATTGTCGATGCACTTCGAGTGGGCGATAGAAGTAGTGCTTCCAATCTGCTCATGGAACTTGGCCAGGAAAAACACTCTTTAACTGCAGATAATTTTGTTGGCATTTTGAGCTACTGTGCAAGATCACCTGATCCACTGGTAGTTCTTGCGTTCTTTATTATCCTCTTGAAGTTTAAAATAATAATGTTGAACTAGGAAACAACCTTTCTTCTAACTAATATTATATGTTTGTAACTTCGATCAGTTTCGATTTAGTCGTCGCCTTTTATGGCTACAATTTTCACTCTCCTCGAGTAAGTAAAATGGATCGGGGTTGAAGTTTAAAATAATAATGTTGTTTGGTGCATCTATCTAAGAATGTAATGTGCTTCGGCATGCCTATATTTCACTTGCATATCTTATGCTAGTATGGGAAAAAAAGAATGATGATGACCTTATTATTTTCTAGCGACTGATGAGTGTGCTGAAGTCTAAGGCTGCATACATATTTATTTTTCTTGCAGTTTGTCATGGAAACTTGGAAAATAATGGAAGAAAGAGGAGTTTTTCTGGATAACACATGCACTTTACTTATGATAAAAGCACTCTGTAAGGGCGGTTACTTGGATGAGGTATAAACATGTTTTTCATAAACTGAGATTGTATGCAATATTTTGCCCTCACATCGAGTTGCACTATTAACAACAGTTTTCTCCTCTTGTAATAACATAGGCATTTGGTCTAATAAGTTTCCTGGCAGAAAGTCGTGTCATGTTTCCTGTTCTGCCTGTGTACAATCTTTTTTTGAGAGCCTGTGGCAAAAGGCAAAGTACGGTTCATGTTAGTCAATGTTTGGATATGATGGATCGCAGAATGGTCGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGTTTGTAAGGAACCTGATTCCTTTTCATTTGTATAAGCTATATTTCAAAATAGTTTATTTCTTAGATTTCTTATGAACTGGCGATCAT
CATCAAGATACTTACAATTTCTCTTATGCATTTTACTTCTACACCAAAATTGGATCGATCATAGGCATCTTAGTATCAATCTTTTCAGTAAAAATAAATTAAAATTAGATGTGGATTCGATTCTAAATTTCTAAATGTATACTCAACTGTCCATGCACTCTTGTTTAGTGTGATAAGAACATATTGATTCATAAAAAAGTATCTCTGCAGAATAAAAGGATCATGGGATTGATTCTCTGGTCATTGACCTCCTCTAAACAATGTCTCCTGTATCAATCGCATTTTGCCTTTTGGCAGGAAAGTTTCTGCTATAAGGGTATTGTAACATCATATATGTACCTGTAATCCTTGTATTGCTGTTTTGAGAATGTATTTTTGATGTAGTGAATCTGAATTCTCTAATTTATTTATAGTGAATGAATACTGAGCTTTCTTTTCTCAATCAGGTTGCAGTTTGTCAGAAAAATTTGTCTTCTGTGCATGAAATTTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTTTTATCTCTAAGAAAGTTTATATGGTCTTATACAAGGCTGGGAGACCTAAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGTTATTGGAGCCGCAGGACAAAAGTTACCCTCTTTAGAATTGGACATTCCTGTACCCTTAAGAACTGAATCCTATCATGAAAATTTTAATTTCGAGGAAAATGGACCTTCTACCGACGAGTTGTACTGTAAGAAAATGGTCCCCTGCGAAGGTGACATTGGGCAATTTTCTGTTAATGGTATGAAGTGTGGAGAAGTTGAAAGTGGTCGATTAACTTTGCCGAGCAATTACAGAAGCAATTTTGTTATGAAGGTTTTGAGGTGGTCTTTCAATGATGTGATATGTGCATGTGCGCTTACTAGGAACTGTGGTCTTGCAGAGCAGTTAATGCAACAGGTCTTATTTCTTTCTTTGAGATCATTTTTGTTTAATTATATAGCCAATGGTTACAATAGTTGCAACTAAATAAGATAATTTTTTCATAATTTTATCTCTTGTTACTGCCATAATAGTTGCAATCTTAAATTGTTCATACTTTCTTGGTCTTGCTCATTACTGCATTATTAGTGAAATGATATTTTGTATTTGATTACTTGTACCTTATGTTCATATTTTTACACTTTATTTAGTTTAGCAGCATTTCATGTGTACTGACGACACTAGGGTTCATTTCTTTGAAGATGCATGAACTGGGATTGCAACCTTCGTCCCACACATTTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATTAAAATAGTAAGTTACTTAAGCTTTGGTTAGATCTTGATTATCATTACAAGATTTCTTCTTCTTTTTAATGGAAATGTATTGTTTTTCTTTTAATTATGCGGCCAATGATGTATCGTTTCCCTCGACCCCAAAAAAACCCTCCTTTTCAGTTAAAAATAATGCAACAGAGGAAATTGAAGCCATATGATTCAACTCTTGCTGCTGTTTCGATAAGTTGTAGCAAGGCGCTAGAACTTGATTTGGCTGAGGCTCTACTCGAACAAATTTCAGCTTGTGTGTTCCCACACCCCTTCAATGCATTTCTTTCAGCATGTGACATGATGGTAAGTTAATTTTTTTCTCTTGATAAGAAACTGAGTTTTTATTGAGAAAAAGGAAGGAATTTATTAGTGCATACAAAAAACAAGCCCACAAATGGAAGCCTCAACTACAAGAAAGGACTCCAATCCAAAAGATCGATTGGAATGATGCCAAGCTGATAATTACAAAAAAGGAAAAAGAAAAAAGGGAAGTCCATAAGCACTCATTAAAGCACACCCTCCCAATCCCCCTCCAAAGTCCTCTCAACCCTCTAAAAGTTTTACCATTGCTCTCGAGCCACACTCCCCATAAGATAGTTCTGTAGCACTTGGTTGGATTAGTTGAGCCCAGCTCAGATTGTGTGATCCTTATTGCAGGA
TCAGCCAGAACGTGCCATGCGTATGTTGGTTAAAATGAAACAAATGGAGGTGCTTCCAGATGTGAAGACCTATGAGCTTTTATATTCATTATTTGGTAACGTGAATGCTCCATACGAGGAGGGCAACAGATTGTCACAGGTGGATGCTGCAAAAAGGATACGCATGATAGAGATGGATATGGAGAAACATGGGATACAACACAGTCATTTCTCTATGATGAACTTGGTAAGACTCTCTCTCTCTTCCTTCCTACCAGGAAAAAAAAAAAAAAAACTTGCAGCTGTATATGCTCGTCTGCTAATGACCCACATCTGACTAGAGCACATTGCAGGACCCCATCTACTTTCCTCCCAAAATTTCAAGCTCTATTTGATTCTCGTTTCAAAGTCTCTTTCATTTTATATGTACACTACACCGTCGATATTGTTGGAAGAACATCTTCTCGGGATCCATGCAGATGTTCTTTGATATGTGTATATATGCTCACGTTTCACTTCGAGAATTGTTTTACATGTAGGTAAAAGGTTTTGATAAATTAAGTTGGATCAATCTTATTTAGCACTAGAACATAGCAGGAAGAGCATTGGGCGCACAATTAAATTATTTTTATTTGTTACTATAGTTATAAAATCTTTCTACTTGAACCATATTAGCAAGTGGAAGAATCAAGTTCCCTCACTGCTTATGGCATTATTATTGTTTCTCTTTTTCCTTTGATTCATTTGTTCCCTTTAGTTGAAAGCTCTTGGTGCGGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACGATAACACTTGTCTGGGGACGCCTATTTACAACACAGCGTTGCATTTTTTAGTTGAATCCAAAGAGGTAAGCGCTTGTTGTCATGCACTCTCTTCTGTTAAGCTATAGAGGACTTCCTCGTTTGTGTTTAACACTGTTCTATACTTTTGCAGATCCATATGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTCTCTTTCCAGATGCTGCGACATTTGAGATGATGATCAACTGTTGCAGTGTTATCGGATGCTTGAAATCTGCGTTTGCTCTTCTCTCCCTGATGATTCGCTCCGGGTTTTGTCCTCAGATATTAACTTATACCAGTCTAGTAAAGGTATCCAATGAATTTATGATTGTTAGGCATGTACATTTACTCACTGCAAGTATACTCCTGAAGTATTCATCTTGATTTAACAGATCGAACATTGTGGACTATCAACTATATATTTTTGTTTTCAGATTTAGAGTTTGTTAACTTGAAAGTTGCTGATTCCATCTCATCATATTTTCTTTTAACCGTCCTTGAGTTTGGCCGATTTACATGCCATTGTACAGTTCGAAAGCTGTCCACGAACTTATTCTTGAACCCCGACACCCGACACCCATGGTTCATGTATATTTGCCGACACTAGTGAACAATCCTAATAGCTTGGTGATTGAACTCTCTGGAGGATGCACTATTAAAATTTTCCATAAGTTTCCATGTCAATCATCGCCAATTGATACTCTTTCTTCTTACCATTAATTTTTTCCTTTATCGTGTTATTCAATCATTCAAATTTAACGTTTGGATTATCCTGATGCAGATTGTGCTGGGATTTGAGAGATTTGATGATGCCTTGAATCTCTTAGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTTGTTATAATGAATACAATCGTGCAGAAAGCTTGTGAAAAGGTAACCCCCATGGCTTCATGTCATTAACTTGATAAGAACAACACTCTCAATGCACCCATGCATAATTCCAACAACTGTTTAAGCTCATCTCTCATTCTGGGTACTTTTTTTTTTTTTTTTTTTTTTTTTNTGATGTGATTGAGTTTGTCGTTGAGAAGATGAAGCGCGAAAAGATCCAGCCCGACCCTTCAACGTGCCATAGTGTCTTCTCGGCATATGTGAGCCTTGGCTATCACAGCACCGCCATGGAAGC
ACTGCAAGTACTGAGCATGCGTATGCTATGCAAAGAACACGACACTTCTCCAGTCGTTACAGAATATGTCGAAGACTTTGTGCTTGCAGAAGACTCCGAAGCGGAATCACGGATTTTGGAATTCTTCAAATGCTCTGAAGAGAGCCTAAGTTTTGCCCTTCTCAACTTGAGATGGTCTGCCATGCTGGGATATTCCCTTTGTTCTTCCCCTAATCAGAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATGATGGCTACACAGCCTAA

mRNA sequence

CGAACAGTATTCGGCTGCAGTCCAAGACTTGATTTACATTTCTTTCAAGACTTGATTTTTGCACGAACTCATGGCCCTGTCCGTATTGCCACTGCACTCGAAATCTGGTGTTCACTAAGCCTTTGTCGCCGTCTTTGCGGGGAAGACAATGCACAGGGCAAGGTTTCGTTTGGGATCGATAGCTGACTCGCTATACAGATTCAGGCCACACGAACATGGGCGAAAACAGGATGCGAATAAGATGGTATTCCGTCGAGCTCTTCTCATCTCCCAAGGTATTGAATATTTGGGAAATGAAGCCGAGTCAACTAAGTTCATGCAGAGACAGATTGTCGATGCACTTCGAGTGGGCGATAGAAGTAGTGCTTCCAATCTGCTCATGGAACTTGGCCAGGAAAAACACTCTTTAACTGCAGATAATTTTGTTGGCATTTTGAGCTACTGTGCAAGATCACCTGATCCACTGTTTGTCATGGAAACTTGGAAAATAATGGAAGAAAGAGGAGTTTTTCTGGATAACACATGCACTTTACTTATGATAAAAGCACTCTGTAAGGGCGGTTACTTGGATGAGGCATTTGGTCTAATAAGTTTCCTGGCAGAAAGTCGTGTCATGTTTCCTGTTCTGCCTGTGTACAATCTTTTTTTGAGAGCCTGTGGCAAAAGGCAAAGTACGGTTCATGTTAGTCAATGTTTGGATATGATGGATCGCAGAATGGTCGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGTTGCAGTTTGTCAGAAAAATTTGTCTTCTGTGCATGAAATTTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTTTTATCTCTAAGAAAGTTTATATGGTCTTATACAAGGCTGGGAGACCTAAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGTTATTGGAGCCGCAGGACAAAAGTTACCCTCTTTAGAATTGGACATTCCTGTACCCTTAAGAACTGAATCCTATCATGAAAATTTTAATTTCGAGGAAAATGGACCTTCTACCGACGAGTTGTACTGTAAGAAAATGGTCCCCTGCGAAGGTGACATTGGGCAATTTTCTGTTAATGGTATGAAGTGTGGAGAAGTTGAAAGTGGTCGATTAACTTTGCCGAGCAATTACAGAAGCAATTTTGTTATGAAGGTTTTGAGGTGGTCTTTCAATGATGTGATATGTGCATGTGCGCTTACTAGGAACTGTGGTCTTGCAGAGCAGTTAATGCAACAGTTTAGCAGCATTTCATGTGTACTGACGACACTAGGGTTCATTTCTTTGAAGATGCATGAACTGGGATTGCAACCTTCGTCCCACACATTTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATTAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTGGTTAAAATGAAACAAATGGAGGTGCTTCCAGATGTGAAGACCTATGAGCTTTTATATTCATTATTTGGTAACGTGAATGCTCCATACGAGGAGGGCAACAGATTGTCACAGGTGGATGCTGCAAAAAGGATACGCATGATAGAGATGGATATGGAGAAACATGGGATACAACACAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTTGGTGCGGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACGATAACACTTGTCTGGGGACGCCTATTTACAACACAGCGTTGCATTTTTTAGTTGAATCCAAAGAGATCCATATGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTCTCTTTCCAGATGCTGCGACATTTGAGATGATGATCAACTGTTGCAGTGTTATCGGATGCTTGAAATCTGCGTTTGCTCTTCTCTCCCTGATGATTCGCTCCGGGTTTTGTCCTCAGATATTAACTTATACCAGTCTAGTAAAGATTGTGCTGGGATTTGAGAGATTTGATGATGCCT
TGAATCTCTTAGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTTGTTATAATGAATACAATCGTGCAGAAAGCTTGTGAAAAGATGAAGCGCGAAAAGATCCAGCCCGACCCTTCAACGTGCCATAGTGTCTTCTCGGCATATGTGAGCCTTGGCTATCACAGCACCGCCATGGAAGCACTGCAAGTACTGAGCATGCGTATGCTATGCAAAGAACACGACACTTCTCCAGTCGTTACAGAATATGTCGAAGACTTTGTGCTTGCAGAAGACTCCGAAGCGGAATCACGGATTTTGGAATTCTTCAAATGCTCTGAAGAGAGCCTAAGTTTTGCCCTTCTCAACTTGAGATGGTCTGCCATGCTGGGATATTCCCTTTGTTCTTCCCCTAATCAGAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATGATGGCTACACAGCCTAA

Coding sequence (CDS)

ATGCACAGGGCAAGGTTTCGTTTGGGATCGATAGCTGACTCGCTATACAGATTCAGGCCACACGAACATGGGCGAAAACAGGATGCGAATAAGATGGTATTCCGTCGAGCTCTTCTCATCTCCCAAGGTATTGAATATTTGGGAAATGAAGCCGAGTCAACTAAGTTCATGCAGAGACAGATTGTCGATGCACTTCGAGTGGGCGATAGAAGTAGTGCTTCCAATCTGCTCATGGAACTTGGCCAGGAAAAACACTCTTTAACTGCAGATAATTTTGTTGGCATTTTGAGCTACTGTGCAAGATCACCTGATCCACTGTTTGTCATGGAAACTTGGAAAATAATGGAAGAAAGAGGAGTTTTTCTGGATAACACATGCACTTTACTTATGATAAAAGCACTCTGTAAGGGCGGTTACTTGGATGAGGCATTTGGTCTAATAAGTTTCCTGGCAGAAAGTCGTGTCATGTTTCCTGTTCTGCCTGTGTACAATCTTTTTTTGAGAGCCTGTGGCAAAAGGCAAAGTACGGTTCATGTTAGTCAATGTTTGGATATGATGGATCGCAGAATGGTCGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGTTGCAGTTTGTCAGAAAAATTTGTCTTCTGTGCATGAAATTTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTTTTATCTCTAAGAAAGTTTATATGGTCTTATACAAGGCTGGGAGACCTAAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGTTATTGGAGCCGCAGGACAAAAGTTACCCTCTTTAGAATTGGACATTCCTGTACCCTTAAGAACTGAATCCTATCATGAAAATTTTAATTTCGAGGAAAATGGACCTTCTACCGACGAGTTGTACTGTAAGAAAATGGTCCCCTGCGAAGGTGACATTGGGCAATTTTCTGTTAATGGTATGAAGTGTGGAGAAGTTGAAAGTGGTCGATTAACTTTGCCGAGCAATTACAGAAGCAATTTTGTTATGAAGGTTTTGAGGTGGTCTTTCAATGATGTGATATGTGCATGTGCGCTTACTAGGAACTGTGGTCTTGCAGAGCAGTTAATGCAACAGTTTAGCAGCATTTCATGTGTACTGACGACACTAGGGTTCATTTCTTTGAAGATGCATGAACTGGGATTGCAACCTTCGTCCCACACATTTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATTAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTGGTTAAAATGAAACAAATGGAGGTGCTTCCAGATGTGAAGACCTATGAGCTTTTATATTCATTATTTGGTAACGTGAATGCTCCATACGAGGAGGGCAACAGATTGTCACAGGTGGATGCTGCAAAAAGGATACGCATGATAGAGATGGATATGGAGAAACATGGGATACAACACAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTTGGTGCGGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACGATAACACTTGTCTGGGGACGCCTATTTACAACACAGCGTTGCATTTTTTAGTTGAATCCAAAGAGATCCATATGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTCTCTTTCCAGATGCTGCGACATTTGAGATGATGATCAACTGTTGCAGTGTTATCGGATGCTTGAAATCTGCGTTTGCTCTTCTCTCCCTGATGATTCGCTCCGGGTTTTGTCCTCAGATATTAACTTATACCAGTCTAGTAAAGATTGTGCTGGGATTTGAGAGATTTGATGATGCCTTGAATCTCTTAGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTTGTTATAATGAATACAATCGTGCAGAAAGCTTGTGAAAAGATGAAGCGCGAAAAGATCCAGCCCGACCCTTCAACGTGCCATAGTGTCTTCTCGGCATATGT
GAGCCTTGGCTATCACAGCACCGCCATGGAAGCACTGCAAGTACTGAGCATGCGTATGCTATGCAAAGAACACGACACTTCTCCAGTCGTTACAGAATATGTCGAAGACTTTGTGCTTGCAGAAGACTCCGAAGCGGAATCACGGATTTTGGAATTCTTCAAATGCTCTGAAGAGAGCCTAAGTTTTGCCCTTCTCAACTTGAGATGGTCTGCCATGCTGGGATATTCCCTTTGTTCTTCCCCTAATCAGAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATGATGGCTACACAGCCTAA

Protein sequence

MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKMKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
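
The protein sequence above corresponds to a translation of the coding sequence. As a rough illustration of that relationship, the sketch below translates a DNA string with the standard genetic code (NCBI translation table 1). The function name `translate` is hypothetical, and the use of the standard nuclear code for this gene is an assumption; only the first 30 bases and the last 15 bases of the CDS shown above are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as '*'; codons containing characters other
    than A, C, G, T (for example N) are rendered as 'X'. Trailing bases
    that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)


# First 30 bases and last 15 bases of the CDS listed above.
cds_start = "ATGCACAGGGCAAGGTTTCGTTTGGGATCG"
cds_end = "GGCTACACAGCCTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MHRARFRLGS` and `GYTA*`, which agree with the first ten residues and the last four residues of the protein sequence, followed by the stop codon.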
Homology
BLAST of Cp4.1LG06g00270 vs. ExPASy Swiss-Prot
Match: Q9SGQ6 (Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana OX=3702 GN=At1g76280 PE=2 SV=2)

HSP 1 Score: 624.8 bits (1610), Expect = 1.3e-177
Identity = 358/755 (47.42%), Positives = 479/755 (63.44%), Query Frame = 0

Query: 47  LGNE----AESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARS 106
           +GNE     + +K +Q QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARS
Sbjct: 50  IGNEFIRCQDESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARS 109

Query: 107 PDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPV 166
           PDP+FVMET+ +M ++ + LD+   L ++K+LC GG+LD+A   I  + E   + P+LP+
Sbjct: 110 PDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPI 169

Query: 167 YNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFV 226
           YN FL AC + +S  H S+CL++MD+R VGKN  TY  LLK+AV Q+NLS+V++IW  +V
Sbjct: 170 YNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYV 229

Query: 227 KNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALV------IGAAGQKLPSLELDIPV 286
            +Y+  +LSLR+FIWS+TRLGDLKSAY  LQ MV L       + +   KL S  L IPV
Sbjct: 230 NHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKLHSTRLYIPV 289

Query: 287 PLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYR 346
           P + E+  E F F      TD     ++V C                  S ++ LP  + 
Sbjct: 290 PSKDETGSEKFAF----GVTD-----RIVDCN----------------SSSKVALPKGHN 349

Query: 347 SNFVMKVLRWSFNDVICACALTRNCGLAEQLM--------QQFSSISCVLTTLGFISLKM 406
               ++VLRWSFNDVI AC  ++N  LAEQLM        Q        L T+     K 
Sbjct: 350 KILAIRVLRWSFNDVIHACGQSKNSELAEQLMLQLKVMQQQNLKPYDSTLATVAAYCSKA 409

Query: 407 HELGLQPSSHTFDGFVRSVVSERGFSDGI--------KIDQPERAMRMLVKMKQMEVLPD 466
            ++ L  + H  D      +SE  +S            +DQPERA+R+L +MK++++ PD
Sbjct: 410 LQVDL--AEHLLD-----QISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPD 469

Query: 467 VKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALG 526
           ++TYELL+SLFGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALG
Sbjct: 470 MRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALG 529

Query: 527 AEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLF 586
           AEGM  E++++L  AENL  + N  LGTP YN  LH L+E+ E  M I +F  MK  G  
Sbjct: 530 AEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCP 589

Query: 587 PDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL 646
            D AT+ +MI+CCS+I   KSA AL+S+MIR GF P+ +T+T+L+KI+L    F++ALNL
Sbjct: 590 ADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNL 649

Query: 647 LDQASSEGIELDVVIMNTIVQKACEK------------MKREKIQPDPSTCHSVFSAYVS 706
           LDQA+ E I LDV+  NTI++KA EK            M REK+ PDP+TCH VFS YV 
Sbjct: 650 LDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHYVFSCYVE 709

Query: 707 LGYHSTAMEALQVLSMRMLCKEHDTS--PVVTEYVEDFVLAEDSEAESRILEFFKCSEES 762
            GYH+TA+EAL VLS+RML +E   S      E  E+FV++ED EAE++I+E F+ SEE 
Sbjct: 710 KGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKIIELFRKSEEH 769
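
The percentages in each HSP header are the match counts divided by the alignment length. The short sketch below recomputes them for the first Swiss-Prot hit (Q9SGQ6); the helper name `percent` is hypothetical, and rounding to two decimals is assumed from how the figures are displayed.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Values from the HSP header of the Q9SGQ6 hit: 358 identical and
# 479 positive-scoring positions over an alignment of 755 columns.
identity = percent(358, 755)
positives = percent(479, 755)
print(identity, positives)
```

The results, 47.42 and 63.44, match the values reported in the header.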

BLAST of Cp4.1LG06g00270 vs. ExPASy Swiss-Prot
Match: Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 2.7e-13
Identity = 78/291 (26.80%), Positives = 124/291 (42.61%), Query Frame = 0

Query: 387 KMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLVKMKQMEVLPDVKTY-E 446
           K+ + G+ P+  T++ F++  + +RG  DG        A+RM+  + +    PDV TY  
Sbjct: 241 KVIKRGVLPNLFTYNLFIQG-LCQRGELDG--------AVRMVGCLIEQGPKPDVITYNN 300

Query: 447 LLYSLFGN-------------VNAPYEEGN-----------RLSQVDAAKRIRMIEMDME 506
           L+Y L  N             VN   E  +           +   V  A+R   I  D  
Sbjct: 301 LIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAER---IVGDAV 360

Query: 507 KHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVES 566
            +G     F+  +L+  L  EG T   L   N A       N  L    YNT +  L   
Sbjct: 361 FNGFVPDQFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVIL----YNTLIKGLSNQ 420

Query: 567 KEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTY 626
             I  A +L N M   GL P+  TF +++N    +GC+  A  L+ +MI  G+ P I T+
Sbjct: 421 GMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTF 480

Query: 627 TSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKMKREKI 653
             L+       + ++AL +LD     G++ DV   N+++   C+  K E +
Sbjct: 481 NILIHGYSTQLKMENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDV 515

BLAST of Cp4.1LG06g00270 vs. ExPASy Swiss-Prot
Match: Q8L844 (Pentatricopeptide repeat-containing protein At5g42310, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRP1 PE=1 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 1.2e-10
Identity = 70/309 (22.65%), Positives = 126/309 (40.78%), Query Frame = 0

Query: 387 KMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLVKMKQMEVLPDVKTYEL 446
           ++ + G++P +  ++  ++  V      D         A  M+ +M++  V PD  TY L
Sbjct: 329 ELRQSGIKPRTRAYNALLKGYVKTGPLKD---------AESMVSEMEKRGVSPDEHTYSL 388

Query: 447 LYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTK 506
           L   +  VNA   E             R++  +ME   +Q + F    LL      G  +
Sbjct: 389 LIDAY--VNAGRWES-----------ARIVLKEMEAGDVQPNSFVFSRLLAGFRDRGEWQ 448

Query: 507 ELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATF 566
           +  Q L   +++    +       YN  +    +   +  A+  F+ M   G+ PD  T+
Sbjct: 449 KTFQVLKEMKSIGVKPD----RQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTW 508

Query: 567 EMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASS 626
             +I+C    G    A  +   M R G  P   TY  ++      ER+DD   LL +  S
Sbjct: 509 NTLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKS 568

Query: 627 EGIELDVVIMNTIVQ------------KACEKMKREKIQPDPSTCHSVFSAYVSLGYHST 684
           +GI  +VV   T+V             +  E+MK   ++P  +  +++ +AY   G    
Sbjct: 569 QGILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQ 611

BLAST of Cp4.1LG06g00270 vs. ExPASy Swiss-Prot
Match: Q9LMH5 (Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis thaliana OX=3702 GN=At1g13800 PE=3 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.8e-09
Identity = 50/203 (24.63%), Positives = 95/203 (46.80%), Query Frame = 0

Query: 418 KIDQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEG---------------N 477
           ++++P++A  +   MK+ +V PDV TY +L +    ++   E                 N
Sbjct: 647 RLNEPKQAYALFEDMKRRDVKPDVVTYSVLLNSDPELDMKREMEAFDVIPDVVYYTIMIN 706

Query: 478 RLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYD 537
           R   ++  K++  +  DM++  I     +   LLK      +++E+  + +V  ++FY  
Sbjct: 707 RYCHLNDLKKVYALFKDMKRREIVPDVVTYTVLLKNKPERNLSREMKAF-DVKPDVFY-- 766

Query: 538 NTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSA 597
                   Y   + +  +  ++  A  +F+ M  SG+ PDAA +  +I CC  +G LK A
Sbjct: 767 --------YTVLIDWQCKIGDLGEAKRIFDQMIESGVDPDAAPYTALIACCCKMGYLKEA 826

Query: 598 FALLSLMIRSGFCPQILTYTSLV 606
             +   MI SG  P ++ YT+L+
Sbjct: 827 KMIFDRMIESGVKPDVVPYTALI 838

BLAST of Cp4.1LG06g00270 vs. ExPASy Swiss-Prot
Match: Q9SHK2 (Pentatricopeptide repeat-containing protein At1g06580 OS=Arabidopsis thaliana OX=3702 GN=At1g06580 PE=2 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 2.3e-09
Identity = 50/168 (29.76%), Positives = 80/168 (47.62%), Query Frame = 0

Query: 531 YNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMI 590
           ++  L  + +  +    I LF +++  G+  D  +F  +I+C      L  A + L  M+
Sbjct: 82  FSRLLIAIAKLNKYEAVISLFRHLEMLGISHDLYSFTTLIDCFCRCARLSLALSCLGKMM 141

Query: 591 RSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK---- 650
           + GF P I+T+ SLV       RF +A++L+DQ    G E +VVI NTI+   CEK    
Sbjct: 142 KLGFEPSIVTFGSLVNGFCHVNRFYEAMSLVDQIVGLGYEPNVVIYNTIIDSLCEKGQVN 201

Query: 651 --------MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRM 687
                   MK+  I+PD  T +S+ +     G    +   L  + MRM
Sbjct: 202 TALDVLKHMKKMGIRPDVVTYNSLITRLFHSGTWGVSARILSDM-MRM 248

BLAST of Cp4.1LG06g00270 vs. NCBI nr
Match: XP_023536089.1 (pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1454 bits (3765), Expect = 0.0
Identity = 750/837 (89.61%), Positives = 750/837 (89.61%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ
Sbjct: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV
Sbjct: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS
Sbjct: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT
Sbjct: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY
Sbjct: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN
Sbjct: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI- 420
           CGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI 
Sbjct: 361 CGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKIL 420

Query: 421 ---------------------------------------------------------DQP 480
                                                                    DQP
Sbjct: 421 KIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVFPHPFNAFLSACDMMDQP 480

Query: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540
           ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK
Sbjct: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540

Query: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600
           HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK
Sbjct: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600

Query: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660
           EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT
Sbjct: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660

Query: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MKRE 720
           SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            MKRE
Sbjct: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKVTXDVIEFVVEKMKRE 720

Query: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 767
           KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS
Sbjct: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 780

BLAST of Cp4.1LG06g00270 vs. NCBI nr
Match: XP_022976056.1 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1422 bits (3681), Expect = 0.0
Identity = 736/837 (87.93%), Positives = 740/837 (88.41%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRAR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ
Sbjct: 1   MHRARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV
Sbjct: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS
Sbjct: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLDMMDRRMVGKNEATYSELLKVAV QKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT
Sbjct: 181 QCLDMMDRRMVGKNEATYSELLKVAVGQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDLKSAYTALQKMV LVIGAAGQKL SLELDIPVPLRTE YH+NFNFEENGPSTDELY
Sbjct: 241 RLGDLKSAYTALQKMVTLVIGAAGQKLSSLELDIPVPLRTEFYHDNFNFEENGPSTDELY 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKK+VPCEGDI QFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA TRN
Sbjct: 301 CKKVVPCEGDIWQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACARTRN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI- 420
           CGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI 
Sbjct: 361 CGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKIL 420

Query: 421 ---------------------------------------------------------DQP 480
                                                                    DQP
Sbjct: 421 KIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQP 480

Query: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540
           ERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK
Sbjct: 481 ERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540

Query: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600
           HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NTCLGTPIYNTALHFLVESK
Sbjct: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFLVESK 600

Query: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660
           EIHMA ELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQILTYT
Sbjct: 601 EIHMATELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660

Query: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MKRE 720
           SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            MKRE
Sbjct: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRE 720

Query: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 767
           KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS
Sbjct: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 780

BLAST of Cp4.1LG06g00270 vs. NCBI nr
Match: XP_022937086.1 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1419 bits (3672), Expect = 0.0
Identity = 735/837 (87.81%), Positives = 742/837 (88.65%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRAR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ
Sbjct: 1   MHRARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV
Sbjct: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS
Sbjct: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLD+MDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIW YT
Sbjct: 181 QCLDIMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWCYT 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDLKSA+TALQKMVALVIGAAGQKLPSLELDIPVPLRTE YH+NFNFEENGPSTDE+Y
Sbjct: 241 RLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVY 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKKMVPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACALTRN
Sbjct: 301 CKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI- 420
           CGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI 
Sbjct: 361 CGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKIL 420

Query: 421 ---------------------------------------------------------DQP 480
                                                                    DQP
Sbjct: 421 KIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQP 480

Query: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540
           ERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK
Sbjct: 481 ERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540

Query: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600
           HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NT LGTPIYNTALHFLVESK
Sbjct: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESK 600

Query: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660
           EIHMAIELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQILTYT
Sbjct: 601 EIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660

Query: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MKRE 720
           SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            MKR+
Sbjct: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRK 720

Query: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 767
           KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVLAEDS
Sbjct: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDS 780

BLAST of Cp4.1LG06g00270 vs. NCBI nr
Match: KAG6591227.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1411 bits (3653), Expect = 0.0
Identity = 737/864 (85.30%), Positives = 743/864 (86.00%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQ------------------ 60
           MHRAR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQ                  
Sbjct: 1   MHRARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQAWYVIVIFLRLSATQNFE 60

Query: 61  ---------GIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFV 120
                    GIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFV
Sbjct: 61  NSLLYAIFSGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFV 120

Query: 121 GILSYCARSPDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAES 180
           GILSYCARSPDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFL ES
Sbjct: 121 GILSYCARSPDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLGES 180

Query: 181 RVMFPVLPVYNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSS 240
           RVMFPVLPVYNLFLRACGKRQSTVHVSQCLD+MDRRMVGKNEATYSELLKVAVCQKNLSS
Sbjct: 181 RVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKVAVCQKNLSS 240

Query: 241 VHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALVIGAAGQKLPSLELD 300
           VHEIWTDFVKNYSPSVLSLRKFIW YTRLGDLKSA+TALQKMVALVIGAAGQKLPSLELD
Sbjct: 241 VHEIWTDFVKNYSPSVLSLRKFIWCYTRLGDLKSAHTALQKMVALVIGAAGQKLPSLELD 300

Query: 301 IPVPLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPS 360
           IPVPLRTE YH+NFNFEENGPSTDE+YCKKMVPCEGDI QFSVNGMKCGEVESGR TLPS
Sbjct: 301 IPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPS 360

Query: 361 NYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGL 420
           NYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ                 MHELGL
Sbjct: 361 NYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ-----------------MHELGL 420

Query: 421 QPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------- 480
           QPSSHTFDGFVRSVVSERGFSDGIKI                                  
Sbjct: 421 QPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEA 480

Query: 481 ------------------------DQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVN 540
                                   DQPERAMRML KMKQMEVLPDVKTYELLYSLFGNVN
Sbjct: 481 LLEQISACVFPHPFNAFLSACDMMDQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVN 540

Query: 541 APYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVA 600
           APYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVA
Sbjct: 541 APYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVA 600

Query: 601 ENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSV 660
           ENLFYY+NTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMI+CCSV
Sbjct: 601 ENLFYYNNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSV 660

Query: 661 IGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVI 720
           IGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVI
Sbjct: 661 IGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVI 720

Query: 721 MNTIVQKACEK------------MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLS 767
           MNTIVQKACEK            MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLS
Sbjct: 721 MNTIVQKACEKGRIDVIEFAVEKMKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLS 780
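
The percentages on each HSP summary line follow directly from the counts: matching (or positively scoring) columns divided by alignment length. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes both percentages. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the sample values are those reported for the KAG6591227.1 hit above.

```python
import re

# Matches the per-HSP summary line; "Posi?tives" also tolerates the
# "Postives" spelling that some exports of this page carry.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "positives": (int(pos), int(pos_len)),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed as 100 * count / alignment length, rounded to 2 decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }

stats = parse_hsp_stats(
    "Identity = 737/864 (85.30%), Positives = 743/864 (86.00%), Query Frame = 0"
)
```

Comparing the recomputed and reported fields is a quick consistency check when alignment summaries are copied between tools.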

BLAST of Cp4.1LG06g00270 vs. NCBI nr
Match: KAG7024113.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1317 bits (3408), Expect = 0.0
Identity = 682/781 (87.32%), Positives = 688/781 (88.09%), Query Frame = 0

Query: 57  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 116
           MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME
Sbjct: 1   MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 60

Query: 117 ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST 176
           ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFL ESRVMFPVLPVYNLFLRACGKRQST
Sbjct: 61  ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLGESRVMFPVLPVYNLFLRACGKRQST 120

Query: 177 VHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 236
           VHVSQCLD+MDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI
Sbjct: 121 VHVSQCLDIMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 180

Query: 237 WSYTRLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPST 296
           W YTRLGDLKSA+TALQKMVALVIGAAGQKLPSLELDIPVPLRTE YH+NFNFEENGPST
Sbjct: 181 WCYTRLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPST 240

Query: 297 DELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA 356
           +E+YCKKMVPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACA
Sbjct: 241 NEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACA 300

Query: 357 LTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDG 416
           LTRNCGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDG
Sbjct: 301 LTRNCGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDG 360

Query: 417 IKI--------------------------------------------------------- 476
           IKI                                                         
Sbjct: 361 IKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVFPHPFNAFLSACDM 420

Query: 477 -DQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 536
            DQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM
Sbjct: 421 MDQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 480

Query: 537 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFL 596
           DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NTCLGTPIYNTALHFL
Sbjct: 481 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFL 540

Query: 597 VESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQI 656
           VESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQI
Sbjct: 541 VESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQI 600

Query: 657 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------ 716
           LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            
Sbjct: 601 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEK 660

Query: 717 MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL 767
           MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL
Sbjct: 661 MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL 720

BLAST of Cp4.1LG06g00270 vs. ExPASy TrEMBL
Match: A0A6J1IFV2 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476573 PE=4 SV=1)

HSP 1 Score: 1422 bits (3681), Expect = 0.0
Identity = 736/837 (87.93%), Positives = 740/837 (88.41%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRAR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ
Sbjct: 1   MHRARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV
Sbjct: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS
Sbjct: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLDMMDRRMVGKNEATYSELLKVAV QKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT
Sbjct: 181 QCLDMMDRRMVGKNEATYSELLKVAVGQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDLKSAYTALQKMV LVIGAAGQKL SLELDIPVPLRTE YH+NFNFEENGPSTDELY
Sbjct: 241 RLGDLKSAYTALQKMVTLVIGAAGQKLSSLELDIPVPLRTEFYHDNFNFEENGPSTDELY 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKK+VPCEGDI QFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA TRN
Sbjct: 301 CKKVVPCEGDIWQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACARTRN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI- 420
           CGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI 
Sbjct: 361 CGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKIL 420

Query: 421 ---------------------------------------------------------DQP 480
                                                                    DQP
Sbjct: 421 KIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQP 480

Query: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540
           ERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK
Sbjct: 481 ERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540

Query: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600
           HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NTCLGTPIYNTALHFLVESK
Sbjct: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFLVESK 600

Query: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660
           EIHMA ELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQILTYT
Sbjct: 601 EIHMATELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660

Query: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MKRE 720
           SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            MKRE
Sbjct: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRE 720

Query: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 767
           KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS
Sbjct: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 780

BLAST of Cp4.1LG06g00270 vs. ExPASy TrEMBL
Match: A0A6J1F9C6 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443493 PE=4 SV=1)

HSP 1 Score: 1419 bits (3672), Expect = 0.0
Identity = 735/837 (87.81%), Positives = 742/837 (88.65%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRAR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ
Sbjct: 1   MHRARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV
Sbjct: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS
Sbjct: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLD+MDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIW YT
Sbjct: 181 QCLDIMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWCYT 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDLKSA+TALQKMVALVIGAAGQKLPSLELDIPVPLRTE YH+NFNFEENGPSTDE+Y
Sbjct: 241 RLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVY 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKKMVPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACALTRN
Sbjct: 301 CKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI- 420
           CGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI 
Sbjct: 361 CGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKIL 420

Query: 421 ---------------------------------------------------------DQP 480
                                                                    DQP
Sbjct: 421 KIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQP 480

Query: 481 ERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540
           ERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK
Sbjct: 481 ERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEK 540

Query: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESK 600
           HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NT LGTPIYNTALHFLVESK
Sbjct: 541 HGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESK 600

Query: 601 EIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660
           EIHMAIELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQILTYT
Sbjct: 601 EIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYT 660

Query: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MKRE 720
           SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            MKR+
Sbjct: 661 SLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRK 720

Query: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDS 767
           KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVLAEDS
Sbjct: 721 KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDS 780

BLAST of Cp4.1LG06g00270 vs. ExPASy TrEMBL
Match: A0A6J1IEQ4 (pentatricopeptide repeat-containing protein At1g76280 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111476573 PE=4 SV=1)

HSP 1 Score: 1315 bits (3402), Expect = 0.0
Identity = 681/781 (87.20%), Positives = 685/781 (87.71%), Query Frame = 0

Query: 57  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 116
           MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME
Sbjct: 1   MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 60

Query: 117 ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST 176
           ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Sbjct: 61  ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST 120

Query: 177 VHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 236
           VHVSQCLDMMDRRMVGKNEATYSELLKVAV QKNLSSVHEIWTDFVKNYSPSVLSLRKFI
Sbjct: 121 VHVSQCLDMMDRRMVGKNEATYSELLKVAVGQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 180

Query: 237 WSYTRLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPST 296
           WSYTRLGDLKSAYTALQKMV LVIGAAGQKL SLELDIPVPLRTE YH+NFNFEENGPST
Sbjct: 181 WSYTRLGDLKSAYTALQKMVTLVIGAAGQKLSSLELDIPVPLRTEFYHDNFNFEENGPST 240

Query: 297 DELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA 356
           DELYCKK+VPCEGDI QFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA
Sbjct: 241 DELYCKKVVPCEGDIWQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA 300

Query: 357 LTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDG 416
            TRNCGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDG
Sbjct: 301 RTRNCGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDG 360

Query: 417 IKI--------------------------------------------------------- 476
           IKI                                                         
Sbjct: 361 IKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDM 420

Query: 477 -DQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 536
            DQPERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM
Sbjct: 421 MDQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 480

Query: 537 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFL 596
           DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NTCLGTPIYNTALHFL
Sbjct: 481 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFL 540

Query: 597 VESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQI 656
           VESKEIHMA ELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQI
Sbjct: 541 VESKEIHMATELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQI 600

Query: 657 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------ 716
           LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            
Sbjct: 601 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEK 660

Query: 717 MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL 767
           MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL
Sbjct: 661 MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL 720

BLAST of Cp4.1LG06g00270 vs. ExPASy TrEMBL
Match: A0A6J1FA55 (pentatricopeptide repeat-containing protein At1g76280 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443493 PE=4 SV=1)

HSP 1 Score: 1311 bits (3393), Expect = 0.0
Identity = 680/781 (87.07%), Positives = 687/781 (87.96%), Query Frame = 0

Query: 57  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 116
           MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME
Sbjct: 1   MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIME 60

Query: 117 ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST 176
           ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Sbjct: 61  ERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST 120

Query: 177 VHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 236
           VHVSQCLD+MDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI
Sbjct: 121 VHVSQCLDIMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFI 180

Query: 237 WSYTRLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPST 296
           W YTRLGDLKSA+TALQKMVALVIGAAGQKLPSLELDIPVPLRTE YH+NFNFEENGPST
Sbjct: 181 WCYTRLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPST 240

Query: 297 DELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACA 356
           DE+YCKKMVPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACA
Sbjct: 241 DEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACA 300

Query: 357 LTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDG 416
           LTRNCGLAEQLMQQ                 MHELGLQPSSHTFDGFVRSVVSERGFSDG
Sbjct: 301 LTRNCGLAEQLMQQ-----------------MHELGLQPSSHTFDGFVRSVVSERGFSDG 360

Query: 417 IKI--------------------------------------------------------- 476
           IKI                                                         
Sbjct: 361 IKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDM 420

Query: 477 -DQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 536
            DQPERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM
Sbjct: 421 MDQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEM 480

Query: 537 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFL 596
           DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYY+NT LGTPIYNTALHFL
Sbjct: 481 DMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFL 540

Query: 597 VESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQI 656
           VESKEIHMAIELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQI
Sbjct: 541 VESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQI 600

Query: 657 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------ 716
           LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK            
Sbjct: 601 LTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEK 660

Query: 717 MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVL 767
           MKR+KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVL
Sbjct: 661 MKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL 720

BLAST of Cp4.1LG06g00270 vs. ExPASy TrEMBL
Match: A0A1S3CEN8 (pentatricopeptide repeat-containing protein At1g76280 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103500041 PE=4 SV=1)

HSP 1 Score: 1204 bits (3115), Expect = 0.0
Identity = 618/778 (79.43%), Positives = 677/778 (87.02%), Query Frame = 0

Query: 1   MHRARFRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQ 60
           MHRA FRLGSIADS+YRF+PHE  RKQDA+K+VF RALLIS+G E  GN AEST FMQ Q
Sbjct: 1   MHRASFRLGSIADSIYRFKPHELVRKQDASKLVFHRALLISKGSEIWGNGAESTAFMQIQ 60

Query: 61  IVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGV 120
           IVDALR+GDRS ASNLLM LGQEK SLTADNFV ILSYCA+SPDPLFVMETWKIMEERG+
Sbjct: 61  IVDALRLGDRSKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120

Query: 121 FLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVS 180
           FL+NTC+LLMI+ALCKGGYLDEAFGLI+FLAES VMFPVLPVYN FLRAC  RQSTVH S
Sbjct: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIRQSTVHAS 180

Query: 181 QCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYT 240
           QCLD+MD RMVGKNEATYSELLK+AVCQ+N SSVHEIWTDFVKNYSPSV SLRKFIWS+ 
Sbjct: 181 QCLDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLRKFIWSFA 240

Query: 241 RLGDLKSAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELY 300
           RLGDL SAYTALQKMVAL  GA G+KL SL  DIP+PLRTE YH NFNFEE  PS DE +
Sbjct: 241 RLGDLTSAYTALQKMVALATGATGRKLQSL--DIPIPLRTEFYHNNFNFEEKEPSIDEFF 300

Query: 301 CKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRN 360
           CKKMVP  GD+G  SVN MKCGE  +G LT+P+N+RS+FV KVLRWS NDV+ +C+L  N
Sbjct: 301 CKKMVPWNGDVGGISVNDMKCGE--TGPLTVPNNHRSSFVRKVLRWSSNDVMRSCSLAGN 360

Query: 361 CGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKID 420
           CGLAEQLMQQ                 MH+LGLQPSSHTFDGFVRSVVSERGFS G++ID
Sbjct: 361 CGLAEQLMQQ-----------------MHKLGLQPSSHTFDGFVRSVVSERGFSAGMEID 420

Query: 421 QPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDM 480
           QPERAMRMLVKMKQM+V+PDV+TYELLYSLFGNVNAPYEEG++LSQVDAAKRIRMIEMDM
Sbjct: 421 QPERAMRMLVKMKQMKVVPDVRTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMIEMDM 480

Query: 481 EKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVE 540
            KHGIQ+SHFSMMNLLKALGAEGM KE+LQYLN+AENLFYY+NT LG P+YNT LHFLV+
Sbjct: 481 GKHGIQYSHFSMMNLLKALGAEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVD 540

Query: 541 SKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILT 600
           SKEI+MAIELFNNMK+SG FPDAATFE+M++CCSV+GCLKSAFALLSLMIRSGFCPQILT
Sbjct: 541 SKEIYMAIELFNNMKNSGFFPDAATFEIMLDCCSVMGCLKSAFALLSLMIRSGFCPQILT 600

Query: 601 YTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEK------------MK 660
           YTSLVKIVLGF RFDDALNLLDQASSEGIELDV+IMNTI++KACEK            M 
Sbjct: 601 YTSLVKIVLGFGRFDDALNLLDQASSEGIELDVIIMNTIMRKACEKARIDVIEFLVEKMN 660

Query: 661 REKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRML-CKEHDTSPVVTEYVEDFVLA 720
           REKIQPDPSTCH+VFSAYV+LGYHSTAMEALQVLSMRML C+E D S  VTEY+E+FVLA
Sbjct: 661 REKIQPDPSTCHNVFSAYVNLGYHSTAMEALQVLSMRMLLCEEDDDS--VTEYMENFVLA 720

Query: 721 EDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGY 765
           ED+ A+SRI EFFKCS E L FAL NLRW AMLGYS+C SPNQSPWAMRLASSYD GY
Sbjct: 721 EDTGADSRIAEFFKCSREYLGFALFNLRWCAMLGYSVCCSPNQSPWAMRLASSYD-GY 754

BLAST of Cp4.1LG06g00270 vs. TAIR 10
Match: AT1G76280.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 624.8 bits (1610), Expect = 9.3e-179
Identity = 358/755 (47.42%), Positives = 479/755 (63.44%), Query Frame = 0

Query: 47  LGNE----AESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARS 106
           +GNE     + +K +Q QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARS
Sbjct: 50  IGNEFIRCQDESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARS 109

Query: 107 PDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPV 166
           PDP+FVMET+ +M ++ + LD+   L ++K+LC GG+LD+A   I  + E   + P+LP+
Sbjct: 110 PDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPI 169

Query: 167 YNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFV 226
           YN FL AC + +S  H S+CL++MD+R VGKN  TY  LLK+AV Q+NLS+V++IW  +V
Sbjct: 170 YNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYV 229

Query: 227 KNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALV------IGAAGQKLPSLELDIPV 286
            +Y+  +LSLR+FIWS+TRLGDLKSAY  LQ MV L       + +   KL S  L IPV
Sbjct: 230 NHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKLHSTRLYIPV 289

Query: 287 PLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYR 346
           P + E+  E F F      TD     ++V C                  S ++ LP  + 
Sbjct: 290 PSKDETGSEKFAF----GVTD-----RIVDCN----------------SSSKVALPKGHN 349

Query: 347 SNFVMKVLRWSFNDVICACALTRNCGLAEQLM--------QQFSSISCVLTTLGFISLKM 406
               ++VLRWSFNDVI AC  ++N  LAEQLM        Q        L T+     K 
Sbjct: 350 KILAIRVLRWSFNDVIHACGQSKNSELAEQLMLQLKVMQQQNLKPYDSTLATVAAYCSKA 409

Query: 407 HELGLQPSSHTFDGFVRSVVSERGFSDGI--------KIDQPERAMRMLVKMKQMEVLPD 466
            ++ L  + H  D      +SE  +S            +DQPERA+R+L +MK++++ PD
Sbjct: 410 LQVDL--AEHLLD-----QISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPD 469

Query: 467 VKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALG 526
           ++TYELL+SLFGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALG
Sbjct: 470 MRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALG 529

Query: 527 AEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLF 586
           AEGM  E++++L  AENL  + N  LGTP YN  LH L+E+ E  M I +F  MK  G  
Sbjct: 530 AEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCP 589

Query: 587 PDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL 646
            D AT+ +MI+CCS+I   KSA AL+S+MIR GF P+ +T+T+L+KI+L    F++ALNL
Sbjct: 590 ADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNL 649

Query: 647 LDQASSEGIELDVVIMNTIVQKACEK------------MKREKIQPDPSTCHSVFSAYVS 706
           LDQA+ E I LDV+  NTI++KA EK            M REK+ PDP+TCH VFS YV 
Sbjct: 650 LDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHYVFSCYVE 709

Query: 707 LGYHSTAMEALQVLSMRMLCKEHDTS--PVVTEYVEDFVLAEDSEAESRILEFFKCSEES 762
            GYH+TA+EAL VLS+RML +E   S      E  E+FV++ED EAE++I+E F+ SEE 
Sbjct: 710 KGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKIIELFRKSEEH 769

BLAST of Cp4.1LG06g00270 vs. TAIR 10
Match: AT1G76280.3 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 622.9 bits (1605), Expect = 3.5e-178
Identity = 361/797 (45.29%), Positives = 482/797 (60.48%), Query Frame = 0

Query: 47  LGNE----AESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARS 106
           +GNE     + +K +Q QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARS
Sbjct: 50  IGNEFIRCQDESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARS 109

Query: 107 PDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPV 166
           PDP+    T+ +M ++ + LD+   L ++K+LC GG+LD+A   I  + E   + P+LP+
Sbjct: 110 PDPV----TYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPI 169

Query: 167 YNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFV 226
           YN FL AC + +S  H S+CL++MD+R VGKN  TY  LLK+AV Q+NLS+V++IW  +V
Sbjct: 170 YNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYV 229

Query: 227 KNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALV------IGAAGQKLPSLELDIPV 286
            +Y+  +LSLR+FIWS+TRLGDLKSAY  LQ MV L       + +   KL S  L IPV
Sbjct: 230 NHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKLHSTRLYIPV 289

Query: 287 PLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYR 346
           P + E+  E F F      TD     ++V C                  S ++ LP  + 
Sbjct: 290 PSKDETGSEKFAF----GVTD-----RIVDCN----------------SSSKVALPKGHN 349

Query: 347 SNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPS 406
               ++VLRWSFNDVI AC  ++N  LAEQLM                 L+M  LGL PS
Sbjct: 350 KILAIRVLRWSFNDVIHACGQSKNSELAEQLM-----------------LQMQNLGLLPS 409

Query: 407 SHTFDGFVRSVVSERGFSDGI--------------------------------------- 466
           SHT+DGF+R+V    G+  G+                                       
Sbjct: 410 SHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLATVAAYCSKALQVDLAEHLLD 469

Query: 467 -------------------KIDQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPY 526
                               +DQPERA+R+L +MK++++ PD++TYELL+SLFGNVNAPY
Sbjct: 470 QISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPDMRTYELLFSLFGNVNAPY 529

Query: 527 EEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL 586
           EEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALGAEGM  E++++L  AENL
Sbjct: 530 EEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALGAEGMVNEMIRHLQKAENL 589

Query: 587 FYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGC 646
             + N  LGTP YN  LH L+E+ E  M I +F  MK  G   D AT+ +MI+CCS+I  
Sbjct: 590 SAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCPADVATYNIMIDCCSLIHS 649

Query: 647 LKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNT 706
            KSA AL+S+MIR GF P+ +T+T+L+KI+L    F++ALNLLDQA+ E I LDV+  NT
Sbjct: 650 YKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNLLDQAALEEIHLDVLSYNT 709

Query: 707 IVQKACEK------------MKREKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRM 762
           I++KA EK            M REK+ PDP+TCH VFS YV  GYH+TA+EAL VLS+RM
Sbjct: 710 ILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHYVFSCYVEKGYHATAIEALNVLSLRM 769

BLAST of Cp4.1LG06g00270 vs. TAIR 10
Match: AT1G76280.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 527.7 bits (1358), Expect = 1.5e-149
Identity = 300/670 (44.78%), Positives = 404/670 (60.30%), Query Frame = 0

Query: 47  LGNE----AESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARS 106
           +GNE     + +K +Q QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARS
Sbjct: 50  IGNEFIRCQDESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARS 109

Query: 107 PDPLFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPV 166
           PDP+FVMET+ +M ++ + LD+   L ++K+LC GG+LD+A   I  + E   + P+LP+
Sbjct: 110 PDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPI 169

Query: 167 YNLFLRACGKRQSTVHVSQCLDMMDRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFV 226
           YN FL AC + +S  H S+CL++MD+R VGKN  TY  LLK+AV Q+NLS+V++IW  +V
Sbjct: 170 YNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYV 229

Query: 227 KNYSPSVLSLRKFIWSYTRLGDLKSAYTALQKMVALV------IGAAGQKLPSLELDIPV 286
            +Y+  +LSLR+FIWS+TRLGDLKSAY  LQ MV L       + +   KL S  L IPV
Sbjct: 230 NHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKLHSTRLYIPV 289

Query: 287 PLRTESYHENFNFEENGPSTDELYCKKMVPCEGDIGQFSVNGMKCGEVESGRLTLPSNYR 346
           P + E+  E F F      TD     ++V C                  S ++ LP  + 
Sbjct: 290 PSKDETGSEKFAF----GVTD-----RIVDCN----------------SSSKVALPKGHN 349

Query: 347 SNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSSISCVLTTLGFISLKMHELGLQPS 406
               ++VLRWSFNDVI AC  ++N  LAEQLM                 L+M  LGL PS
Sbjct: 350 KILAIRVLRWSFNDVIHACGQSKNSELAEQLM-----------------LQMQNLGLLPS 409

Query: 407 SHTFDGFVRSVVSERGFSDGI--------------------------------------- 466
           SHT+DGF+R+V    G+  G+                                       
Sbjct: 410 SHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLATVAAYCSKALQVDLAEHLLD 469

Query: 467 -------------------KIDQPERAMRMLVKMKQMEVLPDVKTYELLYSLFGNVNAPY 526
                               +DQPERA+R+L +MK++++ PD++TYELL+SLFGNVNAPY
Sbjct: 470 QISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPDMRTYELLFSLFGNVNAPY 529

Query: 527 EEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL 586
           EEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALGAEGM  E++++L  AENL
Sbjct: 530 EEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALGAEGMVNEMIRHLQKAENL 589

Query: 587 FYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGC 646
             + N  LGTP YN  LH L+E+ E  M I +F  MK  G   D AT+ +MI+CCS+I  
Sbjct: 590 SAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCPADVATYNIMIDCCSLIHS 649

Query: 647 LKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNT 649
            KSA AL+S+MIR GF P+ +T+T+L+KI+L    F++ALNLLDQA+ E I LDV+  NT
Sbjct: 650 YKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNLLDQAALEEIHLDVLSYNT 677

BLAST of Cp4.1LG06g00270 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 79.0 bits (193), Expect = 1.9e-14
Identity = 78/291 (26.80%), Positives = 124/291 (42.61%), Query Frame = 0

Query: 387 KMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLVKMKQMEVLPDVKTY-E 446
           K+ + G+ P+  T++ F++  + +RG  DG        A+RM+  + +    PDV TY  
Sbjct: 241 KVIKRGVLPNLFTYNLFIQG-LCQRGELDG--------AVRMVGCLIEQGPKPDVITYNN 300

Query: 447 LLYSLFGN-------------VNAPYEEGN-----------RLSQVDAAKRIRMIEMDME 506
           L+Y L  N             VN   E  +           +   V  A+R   I  D  
Sbjct: 301 LIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAER---IVGDAV 360

Query: 507 KHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVES 566
            +G     F+  +L+  L  EG T   L   N A       N  L    YNT +  L   
Sbjct: 361 FNGFVPDQFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVIL----YNTLIKGLSNQ 420

Query: 567 KEIHMAIELFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTY 626
             I  A +L N M   GL P+  TF +++N    +GC+  A  L+ +MI  G+ P I T+
Sbjct: 421 GMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTF 480

Query: 627 TSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKMKREKI 653
             L+       + ++AL +LD     G++ DV   N+++   C+  K E +
Sbjct: 481 NILIHGYSTQLKMENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDV 515

BLAST of Cp4.1LG06g00270 vs. TAIR 10
Match: AT5G42310.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 70.1 bits (170), Expect = 8.8e-12
Identity = 70/309 (22.65%), Positives = 126/309 (40.78%), Query Frame = 0

Query: 387 KMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLVKMKQMEVLPDVKTYEL 446
           ++ + G++P +  ++  ++  V      D         A  M+ +M++  V PD  TY L
Sbjct: 329 ELRQSGIKPRTRAYNALLKGYVKTGPLKD---------AESMVSEMEKRGVSPDEHTYSL 388

Query: 447 LYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTK 506
           L   +  VNA   E             R++  +ME   +Q + F    LL      G  +
Sbjct: 389 LIDAY--VNAGRWES-----------ARIVLKEMEAGDVQPNSFVFSRLLAGFRDRGEWQ 448

Query: 507 ELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATF 566
           +  Q L   +++    +       YN  +    +   +  A+  F+ M   G+ PD  T+
Sbjct: 449 KTFQVLKEMKSIGVKPD----RQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTW 508

Query: 567 EMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASS 626
             +I+C    G    A  +   M R G  P   TY  ++      ER+DD   LL +  S
Sbjct: 509 NTLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKS 568

Query: 627 EGIELDVVIMNTIVQ------------KACEKMKREKIQPDPSTCHSVFSAYVSLGYHST 684
           +GI  +VV   T+V             +  E+MK   ++P  +  +++ +AY   G    
Sbjct: 569 QGILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQ 611

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q9SGQ6	1.3e-177	47.42	Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana OX...
Q9CA58	2.7e-13	26.80	Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
Q8L844	1.2e-10	22.65	Pentatricopeptide repeat-containing protein At5g42310, chloroplastic OS=Arabidop...
Q9LMH5	1.8e-09	24.63	Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis th...
Q9SHK2	2.3e-09	29.76	Pentatricopeptide repeat-containing protein At1g06580 OS=Arabidopsis thaliana OX...
Match Name	E-value	Identity (%)	Description
XP_023536089.1	0.0	89.61	pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pep...
XP_022976056.1	0.0	87.93	pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxi...
XP_022937086.1	0.0	87.81	pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita mosc...
KAG6591227.1	0.0	85.30	Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG7024113.1	0.0	87.32	Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
Match Name  E-value  Identity (%)  Description
A0A6J1IFV2  0.0      87.93         pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita ma...
A0A6J1F9C6  0.0      87.81         pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita mo...
A0A6J1IEQ4  0.0      87.20         pentatricopeptide repeat-containing protein At1g76280 isoform X2 OS=Cucurbita ma...
A0A6J1FA55  0.0      87.07         pentatricopeptide repeat-containing protein At1g76280 isoform X2 OS=Cucurbita mo...
A0A1S3CEN8  0.0      79.43         pentatricopeptide repeat-containing protein At1g76280 isoform X3 OS=Cucumis melo...
Match Name   E-value   Identity (%)  Description
AT1G76280.1  9.3e-179  47.42         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G76280.3  3.5e-178  45.29         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G76280.2  1.5e-149  44.78         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74580.1  1.9e-14   26.80         Pentatricopeptide repeat (PPR) superfamily protein
AT5G42310.1  8.8e-12   22.65         Pentatricopeptide repeat (PPR-like) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 54..226    e-value: 8.2E-16  score: 60.3
    coord: 341..523   e-value: 7.1E-9   score: 37.5
    coord: 524..728   e-value: 6.4E-27  score: 96.7
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 130..148   e-value: 0.082    score: 13.2
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 530..563   e-value: 3.2E-5   score: 21.8
    coord: 565..596   e-value: 1.0E-4   score: 20.2
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 530..573   e-value: 7.5E-10  score: 38.9
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 527..561   score: 9.086975
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 562..596   score: 9.88715
None       No IPR available  PANTHER  PTHR47859  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 419..762
    coord: 2..418

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g00270.1  Cp4.1LG06g00270.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding