Cp4.1LG14g05680 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG14g05680
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionpentatricopeptide repeat-containing protein isoform X1
LocationCp4.1LG14: 854399 .. 862675 (-)
RNA-Seq ExpressionCp4.1LG14g05680
SyntenyCp4.1LG14g05680
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATACCTTTCGCGCCTTCCTGTTCTCGCTCAGTTTCTCTTCTCAGTTCTCAGTTTTGCTCTTCCTTATGTAATCTTCAGGCCTCCAAAGCTGCCGGAGTTTATTGCAGGTATTCTTATGTAATTCCGGTGTTTAATTTATTGGCCTTTCAAGCTCTTGTTCTCTTTCAGTTTCTGTTATACTCAATCCTTCTCGGAGCCATTATTTGAACCGAATATTGGAGTTGTAATTTCGTCATGTTTTGATGGAGTCTTCATCGTTCCTCACTAGGTTTACTACGTGAACTCAACTTGCCCCTAATGCCATACTGTTGCTTGTTGTTACTAGCATTGCGTTTTAGAGATGATACGGGGACGGCCCTGTAAATATTACCTCTCTGTGAACTTCAGGAATCTGGTGACGACTTGTACAGTCCCACTTGATCCTCCAGTTACTTCGAGTTCCTCTTCTGCTAGCGAACACAAGACTTTGTGCTATTCCTTAGTGGAGCGACTAATTCGTCGTGGCTTGTTTTTGCCGGCGCAACAAGTGATACAACGAATTGTAACGCAATCTTCTTCAATTTCTGAAGCTATTTCTATTGTTGACTTCGCTGCTGAACGGGGTTTGGAGATTGATTTGGATACCCATGGTGTGTTTTGCCGGCAGCTTGTCTATTCTAGGCCCCAGTTGGCTGAACTGCTGTACGACAAAAAATTTACATTCGGAGGTGCTGAGCCAGATGCGTCAGTTTTGGACTCTATGGTAATCTGTTTCTGTAGGCTAGGAAAATTTGAGAAGGCACTGGCCTATTTTAATCAACTTCTGTCGTTAAATTACGTCCCAAGTAAAACTTCATTTAATGCTATCTTTCGAGAGCTTTGTGCGCAAGAAAGGGTTTTAGAGGCATTCGACTATTTTGTGAGAGTCAATGGAGGTGGTGTTCACTTGGGGTATTGGTGTTTTAATGTCTTGATAGATGGGCTATGCAATAAGGGGCATATGGAGGAAGCTCTTGAATTATTTGATATAATGCAAAACACTAATGGTTATCCTCCGTCGCTGCATTTGTTTAAGTCATTATTTTATGGCCTTTGTAAGAGCAAGTGGTTAGTGGAGGCAGAGTTGTTGATCAGAGAAATGGAGTTTCGGAGCCTATATCCTGACAAGACTATGTATACTTCTTTAGTTCATGAATATTGCAAAGATAAGAAAATGAAAATGGCAATGCAAGCCTTTTTTAGAATGATAAAAATAGGCTGTGAACCAGATAATTATACATTAAATACACTGATCCATGGGTTTGTGAAATTGGGTTTAGTCGATAAGGGTTGGTTGGTATATAACCTTATGGCAGAGTGGGGAATCCAACCTGATGTGGTAACTTTTCACATCATGATTAGTCAGTATTGTCAAGAAGGGAAGGTTGACTTTGCATTAACGATTTTGAATAATATGGTCAGCTGCAACTTTTCTCCTAGCTTGCATTGTTATACAGTTTTGATTAATGCTCTGCATAGGGATGATAGGTTAGAAGAAGTCAGTGAATTGCTTAGGAGTATCTTGGACAATGGAATTGTACCTGATCACGTGCTTTTCTTTACCCTTATGAAGATGTATCCAAAGGGACATGAACTTCAGCTTGCTTTAAATTTTTTGGAAGCCATTTTAAAGAATGGGTGTGGGTGTGATCCTTCTGTAATCTTAGCCAGTACAAAGTTACAAACATCAAGCAATCTGGAGCAAAAAATTGAAACGCTGCTGCAAGAAATTTTCAATAGCAACTTGAATCTAGCAGGTGTGGCATTTAGTATTGTCATTTGTGCCTTATGTGAGACCGAAAATTTGGATTGTGCTTTGGATTACTTCCATAAAATGGCAAGTCTTGGATGCAAGCCTTTGCTCTTTACTTATAATTCCTTGATTAAATGTCTTTGCAAGGAGGGGCTTTTTGAGGATGCCTTGTCTCTAATTGATCATATGCAGGAATGTAGTTTGCTTCCTGATACCACAACATATTTGATTATTATTAACGAGCATTGTAGGAAGGGTAATGTTAACTCAGCACATTATATTCATAGAAAAATGAGGCAGAGGGGATTGAAACCGAGTGTTGCTATTTATGATTCAATAATTGGTTGTTTAAGTAGGAAAAAGAGAATTTTTGAAGTAAAAGGAGTTTTTAAGAAGATGCTTAAAGCGGGTGTGGATCCGGATAAGCATTTGTATTTGACAATGATTAATGGCTATGGTAAAAATGGAAAGCTTCTTGAAGCTCGTAAATTGTTTGAGCAAATGGTTGAGAACTCTATTCCACCAAGCTCTCATATTTATACGGCACTGATTAGTGGTTTGGTTAAAAAAAATATGACTGATAAAGGATGTTTATATCTGGGCAAGATGTTAAGAGATGGGTTTTCACCTAATGCTGTATTGTATACCTCTCTTATCAATCATTACCTAAAGATCGGGGAGGTTGAATATGCCTTTCGATTAGTTGATTTGATGGAAAGGAGCCACATTGAACCCGATGTTATCTTCTATATCACATTAGTCAGTGGTATTTGCAAAAATTTAATTGTCGACAAGAAAAAATGGTTCCTGCTAGAGAAAGAGAATCAAAAGGCAAAAAGTACGTTGTTTCGTATGCTCCATGAAACAACTCTTGTTCCAAGGGATAATAATATGATAGTTTCTGCTAATTCTACTGAAGAAATGAAATCCTTTGCATTGAAGCTTATCCAGAAGGTTAAAGATGTATGCATTGTACCTAACTTGCATCTGTACAATAGCATAATATGTGGATATTGTCGGACAGATAGGATGCTGGATGCCAATCATCAGTTGGAATTGATGCAAAAAGAAGGGTTGCATCCAAACCAGGTTACTTTCACGATTCTTATGGATGGTTATATTCTTGCAGGTGATGTTAACTCTGCCATTGGGTTGTTTAATAAGATGAATGTAGATGGGTGTATTCCAGATAAAGTTGCATACATCACTTTACTGAAAGGCCTTTCCCAAGGAGGGAGACTTTCTGATGCATTGGCACTCCACGTACAATGCATAAAAAAGGGTTTTCCCCAAGTATACTAGGTTATCGTAATTTTGTGAGGAATTGATGCATGGTAAACTCTTGCCTCACTGGAAAAGATGATCGGCCGTCTACATCCTGTGTGGAGAAAATCATTGCAGGAAGCCTGTTTTGCTTTTAATATAAAGCTTGAGAGGAGCACACATAAAAAGAAAAACCAAATAGGTATTTGGTGGATTCATGGATGAAATTAGATGGGTCATCATGGTATAAAATGCATATCAAGACAGACTCAGACATGACACAATCATTGAAAACGGAGGTTAGTGATGAAGATTCACATGGATTCTCTTTATTTCCCCTCTTAACCTTTTCCTGTTGCCTTTCTGGTCACTATTTTTGTACAAGTTTTTTTTTTTTTTTTTTTGAACAAGACCTATCCAGTCAATATATATATATACATATACATATATTANACTATTTTTGTACAAGTTTTTTTCTTTATTTTTCTATTTTGACCCCAAATTTTGACATATTTTTAACTATTAATTTCAATTTAATTATTTTGATCCTTTCTTTATTTTACCTTNAGTCAATATATATATATACATATACATATATTATTGCAGATGTTCATTTGTAGCCGTTTGACATTAGCAAGATCCTTGTCTGCAACACTTTCCCATGGAAATCTTGGAATTTGCAATGTATAGGTAATTTGAATATTAACTTATTGCACATATGTAATTAAATTTTCGTACCAACTTTTGGCTTATTACCTTGTACACGATGAACAGAAGTGCATGCCATGTAAATAATTTCTTACAAAAGAATAATAATAATAAATAAATGTTATTATGTTTCAGAACTCGAACAAAAGTAAGACATTATTTTACAAGTGTTTGACCTATGTCGGAGATCCTTCCTGATATCAAAGATGGACAAATGGGCTATGTTCAAAAGATATATAGAGGTTGTTTACGTCCAATCCCAAAAGACGATCTTTTATGGATGAGAGTTATCATTCGAACTTATAGGTTGGGTACTCTGGGCTGTGCATAAGAATTTCTTTGATAGGGCTATATGGGGCCTTGGTACCACAAGTGCTATACATTGTGGGAGAAGTTGATTAGCCACAAGATTGGGTTGGCAAATTGTATCGGGTTTCAGGGAGATACTTGTTGTAAATTGTCATAAGTTTTGATTTGATTTGGTTAGGTTATTTGGATTAGATTTGATTGGGTTATTTAGATTAGATTTGATTTGGTTATTTGGATTAGATTTGATTTGATTATTTGAATTAGATTTAGTCTCAGTCATATCGGAGATTTGATTCTGTAATGCTATATAATGAGAGAGTTCTCCCTCCATTCTTAATATCCATTCCTAACAAGTGGTATCGGGATTTGGGAGACTTTTGGGTGTCACATCGTATTTGTTGAGTGACATATTGGTGAGAGACTTTGTAGTGTGAAAGTGAGGCATTCACTTGTAACACTTGGGGTTATTAGTGATTGATTGCTACCCGTAGCTGTAGGGAAACTTCTTCTCTCCGAACCACGTAAATATCTTGGTGTCTCTTTGTGTAATTCCGTTCATTGTTGTTGTGAGTACATTTGTTCCGCTTTCGGCGCACAACAATACTTTGGTTGACCATCGCCCCTCTAGAACTAGAACCTTGTTGTGCTCTGGGCTATACAACACTACTTGCTAATAATCTCCTCTGCAATTTGATTGATATTTGTTTCATGTGAAACACACTTGGTCTTCTTCAAAAATTTTTTTTATGGTCTATTGTTCTTTGCATACTTATTTTATCTAACGCTTGGTTTTCATAAATTCAGTAAAAAAGTATAATTAAATTAAAGTATTGCCTCAAATTGTGGTACCAATTAAGTTTATTTTACGTAATTGGTAACCTTTACTTTTTTAAAATTCTAACCATGTAATACCCTCGTTTTGAATTTGTATATTCTTCAGCATGGAACAGCTAGGGGGAAATGTGCTTGCCAACATTGGCTTTTGATGTTTGCTATTTTACTCTTAGAGTTTAGTTTATATATATAGCTATTCTTCGGTTGTAATTGTGTTGAGAAATTCAATAATGAATTTTAGCTAGAGAGGCTTTCTCCTGAGATTTGGTTCTACTGAGATATTAATCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANCCTCAATAGTGATTTGCAGTGTAATTGGAAATGGGTTTGTCGTTTGCTAGTTCTGATTTTATTAATGTAACTTTTCAATTTACATATATATGAGCTTGAATAATGATTTCTTAAATGTGCGTACAGAGATTAAAGAATCAAATGACGTGGTTATATTTATTTCAGAGGAAGAATGTAGAAAACCTATTCTCCAAGCGGAGTGAATGGGACCTCTGTCTGGCAACAATAACTGCCTTGCAAATGAAATTTTTGACGGTGAAAGGCGATACCTCCATTGAATTGCTTGTAACACGGTAACCTTTCCAGTCATTCTCTGGCTAGTTGATGAACAAGGAGGTAAGCTTTGTGGAAAACTGTTTGAGATTACTCCTTCCACGGCCTTCCCAAGTAGGTTTTCACTGTCTTCTCCATTTGTTTTGGTACATCTGTGGCCATGACCCTTGAATTGTGGATTGAAATGCCAGTATTCTGACCATTCATTGACCTTTTTGTTAAATCATGCAGTTCTGTAGAATAACTGCAGCATTGCCAAATATAAAATCTACAGTGCCGTAAATATAACACTACTTATAGAGCTATCTTCGAAAATGGACATGAAGGGTTTCTTGATACCCTTCAAAGGCACAACGGTAGAAGACAGAAAGATCTGCGCCAGATCGGAGAGCAATTGCTTGATGGTTTCCAGGACCAGTGGTGTTCCGGAATGTGATGCCTCGAGCTATAAATCCTTCACCTGTCACCGCTGATTGTCTCAGAGTACAAGTATCTCTCGTTAAACATATACTCTTCACATTTTCGGCTTTTTGAAAAATAACCAAATTTTCGAGTTTTTTTGTATGAAAGTGGTAAATGTGATTTCATATTGCCATTTTTGGAATCTATAAAAGAAAGTGACAATGTATATTACTGATTGAAACTTGACAGCCATACGATGTATACGATGATGCATTTATGATGAAATTCGATAAATTTTGTTTTTGCTATTTTTATATTGATATTGACAAAGATGGCTGCCACTAATAATTCTATTCTCATTGGAAATGAAGGAAAGGAGAAAAGGAGAAAATTTCAAGCACCATGCCTTGGAATGGAAGAAAAGTAATTCTAGACGATAAAGTTGCCAGGTAGAGTTGATGGACTTGGAACGAGAAAAATATGTAGGTCGACATGTTATAATGTCATTATAATGGAGGAAAATGAGCTTATAAGCAGTAGTCAACCCAGAGAGAGTCTTACCAACAGTAGCAGAATTGAAAGGGGTGGAACCTCCTCCTATGTTACCAGTAATGAAGGTAAATCTCATTCCATTTATCTTGTTTCTGTCCGGATTTAGTGCAACTTTTTGCTAGTTTGAATTAGTCAAGATATTCGTAATCTGTCAATATTTGAAGAATATATTAGTTATGGAATATATCTTACATATTCGTAATCAGTTAGTATTTTTCTTTTTATGATTTCATTAGTAGTATATTTTATTTTATTCTCAAGATTTAGTTAGTTTATATTTTCCTTATTTGTAGGTATTAGTGGGTAGCTTCTATCCTATTTAAACGTTGTGAATATCAATGTGAATATCAATGAAGATTGAACCTTCGATCCCAATTTTATTTCTCATTCTTAACTCTATTGTTTCCTTGAGTAATAATATTTTCAGATCTATTCACACTTTTGAGTTGTAGACATTGGGTCAGTTGGTATCATAGCCAAGTTGGTAACCAAAGGTTTGATATTATTACTCAACGAGATGAATGGTGGAAGTGCTTCTATGAGTATAGCTGTGGAAAAGCTTATTGGCAATAATTATAATTATTGGAAGTTATGTATGGAAGGTTATCTACAAGAGCAAGATTTGTGGGATTTAATTTAAGGTCATGACACAGAAATTCCAACAGATACTCCACATAATGCTGAATTACGTCGAAAATGGAAGATCAAATGTGGAAAAGCTTTATTTACTTTGAGAACCTATATATTGATTTCGTGACTTTAAATCACGAACATGATTAATATATTCATTGCTAATCAAAGTTCGCAAGGTAAATAAAGTTTTTCCACATTTGATCTTCCATTTTTGGCGTAATTCAACATTATGTGGAGTATTTGCTGGAATTTCTGTGTCTATCACCTTCATAGAAGCACTTCCACCATTCATCTCATTGAGTAATAGTATCAAACCCTGGTTACCAACTGACACAATGTCGCGTGAATAGATATGAAAATATTATTACTCAAGGAATCAAGGAAACAATACAGAGTTAAGAATGAGAAATAGAATTGGGGTGAATAGATATAAAAATATTATTACTCAAGGAATCAAGGAAACAATACAGTGTTAAGAATGAGAAATAGAATTGGGGTGAATAGATATGAAAATATTATTACTCAAGGAAACAATATAGAGTTAAGAATGAGAAATAGAATTGGGATCGAAGGCTCAATCTTCGTTGATATTCACAACCTTTAAATACGATACAAGCTACTCATTAATACCTACAAAAAGGAAAATATAAACTAACTAACTAACTAAATCTTGAGAATAAAATAAAATATGTTACTAATCAAATCATAAAGGAAAAATATTAACTGATTACGAATATCTAAGATATATTTCATAACTAATATATTCTCCAAATATTAACTGATTACGAATATCTTGATTAATTCAAACTAGCAATAAGTTGCACTAAATTCTAATAGCCTCACAGAGTGGAAACTTATTTCTCAATAACTTTGAGAGCCTTACAAAAAAAACTTCAATTACTTGTAAACCCCTCTCTGTTACTTGTATAACAAACTCCCATTCATGTTCCTCTTCGCAGCTTCCTCTAGAGCCTCTCCTACTATCTTCTGGTTACCCGACCTGTCCTGAGCCACCTCAAGGTCCGACTTTGTTGATGACTGCAGTATCCTCTTGTCACCACCTAAAAGCCAACTTGGAAAGCCCTTTTTATATGTTTCCTTCCCCAGGCCAGCTGAAGCATTGTTAATAGCCAAGCTGTTGCTTATCAGCTTTGTGACATTGTTCGATGTGATCAGCGGCAATACATAA

mRNA sequence

ATACCTTTCGCGCCTTCCTGTTCTCGCTCAGTTTCTCTTCTCAGTTCTCAGTTTTGCTCTTCCTTATGTAATCTTCAGGCCTCCAAAGCTGCCGGAGTTTATTGCAGGAATCTGGTGACGACTTGTACAGTCCCACTTGATCCTCCAGTTACTTCGAGTTCCTCTTCTGCTAGCGAACACAAGACTTTGTGCTATTCCTTAGTGGAGCGACTAATTCGTCGTGGCTTGTTTTTGCCGGCGCAACAAGTGATACAACGAATTGTAACGCAATCTTCTTCAATTTCTGAAGCTATTTCTATTGTTGACTTCGCTGCTGAACGGGGTTTGGAGATTGATTTGGATACCCATGGTGTGTTTTGCCGGCAGCTTGTCTATTCTAGGCCCCAGTTGGCTGAACTGCTGTACGACAAAAAATTTACATTCGGAGGTGCTGAGCCAGATGCGTCAGTTTTGGACTCTATGAGCAAGTGGTTAGTGGAGGCAGAGTTGTTGATCAGAGAAATGGAGTTTCGGAGCCTATATCCTGACAAGACTATGAAAAAGAGAATTTTTGAAGTAAAAGGAGTTTTTAAGAAGATGCTTAAAGCGGGTGTGGATCCGGATAAGCATTTGTATTTGACAATGATTAATGGCTATGGTAAAAATGGAAAGCTTCTTGAAGCTCGTAAATTGTTTGAGCAAATGGTTGAGAACTCTATTCCACCAAGCTCTCATATTTATACGGCACTGATTAGTGGTTTGGTTAAAAAAAATATGACTGATAAAGGATGTTTATATCTGGGCAAGATGTTAAGAGATGGGTTTTCACCTAATGCTGTATTGTATACCTCTCTTATCAATCATTACCTAAAGATCGGGGAGGTTGAATATGCCTTTCGATTAGTTGATTTGATGGAAAGGAGCCACATTGAACCCGATGTTATCTTCTATATCACATTAGTCAGTGGTATTTGCAAAAATTTAATTGTCGACAAGAAAAAATGGTTCCTGCTAGAGAAAGAGAATCAAAAGGCAAAAAGTACGTTGTTTCGTATGCTCCATGAAACAACTCTTGTTCCAAGGGATAATAATATGATAGTTTCTGCTAATTCTACTGAAGAAATGAAATCCTTTGCATTGAAGCTTATCCAGAAGGTTAAAGATGTATGCATTGTACCTAACTTGCATCTGTACAATAGCATAATATGTGGATATTGTCGGACAGATAGGATGCTGGATGCCAATCATCAGTTGGAATTGATGCAAAAAGAAGGGTTGCATCCAAACCAGGTTACTTTCACGATTCTTATGGATGGTGATGTTAACTCTGCCATTGGGTTGTTTAATAAGATGAATGTAGATGGGTGTATTCCAGATAAAGTTGCATACATCACTTTACTGAAAGGCCTTTCCCAAGGAGGGAGACTTTCTGATGCATTGGCACTCCACATGTTCATTTGTAGCCGTTTGACATTAGCAAGATCCTTGTCTGCAACACTTTCCCATGGAAATCTTGGAATTTGCAATAGGAAGAATGTAGAAAACCTATTCTCCAAGCGGAGTGAATGGGACCTCTGTCTGGCAACAATAACTGCCTTGCAAATGAAATTTTTGACGGTGAAAGGCGATACCTCCATTGAATTGCTTGTAACACGGAAAGGAGAAAAGGAGAAAATTTCAAGCACCATGCCTTGGAATGGAAGAAAAGTAATTCTAGACGATAAAGTTGCCAGGCCAGCTGAAGCATTGTTAATAGCCAAGCTGTTGCTTATCAGCTTTGTGACATTGTTCGATGTGATCAGCGGCAATACATAA

Coding sequence (CDS)

ATACCTTTCGCGCCTTCCTGTTCTCGCTCAGTTTCTCTTCTCAGTTCTCAGTTTTGCTCTTCCTTATGTAATCTTCAGGCCTCCAAAGCTGCCGGAGTTTATTGCAGGAATCTGGTGACGACTTGTACAGTCCCACTTGATCCTCCAGTTACTTCGAGTTCCTCTTCTGCTAGCGAACACAAGACTTTGTGCTATTCCTTAGTGGAGCGACTAATTCGTCGTGGCTTGTTTTTGCCGGCGCAACAAGTGATACAACGAATTGTAACGCAATCTTCTTCAATTTCTGAAGCTATTTCTATTGTTGACTTCGCTGCTGAACGGGGTTTGGAGATTGATTTGGATACCCATGGTGTGTTTTGCCGGCAGCTTGTCTATTCTAGGCCCCAGTTGGCTGAACTGCTGTACGACAAAAAATTTACATTCGGAGGTGCTGAGCCAGATGCGTCAGTTTTGGACTCTATGAGCAAGTGGTTAGTGGAGGCAGAGTTGTTGATCAGAGAAATGGAGTTTCGGAGCCTATATCCTGACAAGACTATGAAAAAGAGAATTTTTGAAGTAAAAGGAGTTTTTAAGAAGATGCTTAAAGCGGGTGTGGATCCGGATAAGCATTTGTATTTGACAATGATTAATGGCTATGGTAAAAATGGAAAGCTTCTTGAAGCTCGTAAATTGTTTGAGCAAATGGTTGAGAACTCTATTCCACCAAGCTCTCATATTTATACGGCACTGATTAGTGGTTTGGTTAAAAAAAATATGACTGATAAAGGATGTTTATATCTGGGCAAGATGTTAAGAGATGGGTTTTCACCTAATGCTGTATTGTATACCTCTCTTATCAATCATTACCTAAAGATCGGGGAGGTTGAATATGCCTTTCGATTAGTTGATTTGATGGAAAGGAGCCACATTGAACCCGATGTTATCTTCTATATCACATTAGTCAGTGGTATTTGCAAAAATTTAATTGTCGACAAGAAAAAATGGTTCCTGCTAGAGAAAGAGAATCAAAAGGCAAAAAGTACGTTGTTTCGTATGCTCCATGAAACAACTCTTGTTCCAAGGGATAATAATATGATAGTTTCTGCTAATTCTACTGAAGAAATGAAATCCTTTGCATTGAAGCTTATCCAGAAGGTTAAAGATGTATGCATTGTACCTAACTTGCATCTGTACAATAGCATAATATGTGGATATTGTCGGACAGATAGGATGCTGGATGCCAATCATCAGTTGGAATTGATGCAAAAAGAAGGGTTGCATCCAAACCAGGTTACTTTCACGATTCTTATGGATGGTGATGTTAACTCTGCCATTGGGTTGTTTAATAAGATGAATGTAGATGGGTGTATTCCAGATAAAGTTGCATACATCACTTTACTGAAAGGCCTTTCCCAAGGAGGGAGACTTTCTGATGCATTGGCACTCCACATGTTCATTTGTAGCCGTTTGACATTAGCAAGATCCTTGTCTGCAACACTTTCCCATGGAAATCTTGGAATTTGCAATAGGAAGAATGTAGAAAACCTATTCTCCAAGCGGAGTGAATGGGACCTCTGTCTGGCAACAATAACTGCCTTGCAAATGAAATTTTTGACGGTGAAAGGCGATACCTCCATTGAATTGCTTGTAACACGGAAAGGAGAAAAGGAGAAAATTTCAAGCACCATGCCTTGGAATGGAAGAAAAGTAATTCTAGACGATAAAGTTGCCAGGCCAGCTGAAGCATTGTTAATAGCCAAGCTGTTGCTTATCAGCTTTGTGACATTGTTCGATGTGATCAGCGGCAATACATAA

Protein sequence

IPFAPSCSRSVSLLSSQFCSSLCNLQASKAAGVYCRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSISEAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMSKWLVEAELLIREMEFRSLYPDKTMKKRIFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMDGDVNSAIGLFNKMNVDGCIPDKVAYITLLKGLSQGGRLSDALALHMFICSRLTLARSLSATLSHGNLGICNRKNVENLFSKRSEWDLCLATITALQMKFLTVKGDTSIELLVTRKGEKEKISSTMPWNGRKVILDDKVARPAEALLIAKLLLISFVTLFDVISGNT
Homology
BLAST of Cp4.1LG14g05680 vs. ExPASy Swiss-Prot
Match: Q9LVA2 (Pentatricopeptide repeat-containing protein At5g62370 OS=Arabidopsis thaliana OX=3702 GN=At5g62370 PE=2 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 2.9e-63
Identity = 161/433 (37.18%), Postives = 243/433 (56.12%), Query Frame = 0

Query: 50  VTSSSSSASEHKTLCYSLVERLIRRGLF-LP-AQQVIQRIVTQSSSISEAISIVDFAAER 109
           V +++  +  +     S +E+++  G   LP +   + + + Q + I +  S+V+   E 
Sbjct: 482 VVTTALCSQRNYIAALSRIEKMVNLGCTPLPFSYNSVIKCLFQENIIEDLASLVNIIQEL 541

Query: 110 GLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMSKWLVEAELLIRE 169
               D+DT+ +   +L            D+   F       +++D+M +  +   + I  
Sbjct: 542 DFVPDVDTYLIVVNELCKKN--------DRDAAF-------AIIDAMEELGLRPTVAI-- 601

Query: 170 MEFRSLYPDKTMKKRIFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQ 229
             + S+      + R+ E +  F KML++G+ PD+  Y+ MIN Y +NG++ EA +L E+
Sbjct: 602 --YSSIIGSLGKQGRVVEAEETFAKMLESGIQPDEIAYMIMINTYARNGRIDEANELVEE 661

Query: 230 MVENSIPPSSHIYTALISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGE 289
           +V++ + PSS  YT LISG VK  M +KGC YL KML DG SPN VLYT+LI H+LK G+
Sbjct: 662 VVKHFLRPSSFTYTVLISGFVKMGMMEKGCQYLDKMLEDGLSPNVVLYTALIGHFLKKGD 721

Query: 290 VEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLH 349
            +++F L  LM  + I+ D I YITL+SG+ + +   KK+  ++E   +K    L R++ 
Sbjct: 722 FKFSFTLFGLMGENDIKHDHIAYITLLSGLWRAMARKKKRQVIVEPGKEK---LLQRLIR 781

Query: 350 ETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDA 409
              LV      I S+      KSFA+++I KVK   I+PNL+L+N+II GYC   R+ +A
Sbjct: 782 TKPLV-----SIPSSLGNYGSKSFAMEVIGKVKK-SIIPNLYLHNTIITGYCAAGRLDEA 841

Query: 410 NHQLELMQKEGLHPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDKVAYITLLKG 469
            + LE MQKEG+ PN VT+TILM      GD+ SAI LF   N   C PD+V Y TLLKG
Sbjct: 842 YNHLESMQKEGIVPNLVTYTILMKSHIEAGDIESAIDLFEGTN---CEPDQVMYSTLLKG 883

Query: 470 LSQGGRLSDALAL 476
           L    R  DALAL
Sbjct: 902 LCDFKRPLDALAL 883

BLAST of Cp4.1LG14g05680 vs. ExPASy Swiss-Prot
Match: Q8S8P6 (Pentatricopeptide repeat-containing protein At2g32630 OS=Arabidopsis thaliana OX=3702 GN=At2g32630 PE=3 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 5.5e-30
Identity = 110/439 (25.06%), Postives = 197/439 (44.87%), Query Frame = 0

Query: 83  VIQRIVTQSSSISEAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFG 142
           ++ R+   +    E + + D+  ++GL ID  +  VF       R     L   ++    
Sbjct: 159 LVFRVYVDNGMFEEGLRVFDYMVKKGLSIDERSCIVFLVAAKKRRRIDLCLEIFRRMVDS 218

Query: 143 GAEPDASVLDSMSKWLV------EAELLIREMEFRSLYPD---------KTMKKRIFE-V 202
           G +     L  + + L       +++ LI+E   + + P+           +K+R F  V
Sbjct: 219 GVKITVYSLTIVVEGLCRRGEVEKSKKLIKEFSVKGIKPEAYTYNTIINAYVKQRDFSGV 278

Query: 203 KGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISG 262
           +GV K M K GV  +K  Y  ++    KNGK+ +A KLF++M E  I    H+YT+LIS 
Sbjct: 279 EGVLKVMKKDGVVYNKVTYTLLMELSVKNGKMSDAEKLFDEMRERGIESDVHVYTSLISW 338

Query: 263 LVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPD 322
             +K    +  L   ++   G SP++  Y +LI+   K+GE+  A  L++ M+   +   
Sbjct: 339 NCRKGNMKRAFLLFDELTEKGLSPSSYTYGALIDGVCKVGEMGAAEILMNEMQSKGVNIT 398

Query: 323 VIFYITLVSGICKNLIVD---------KKKWFLLE--------------KENQKAKSTLF 382
            + + TL+ G C+  +VD         ++K F  +              K   +AK  LF
Sbjct: 399 QVVFNTLIDGYCRKGMVDEASMIYDVMEQKGFQADVFTCNTIASCFNRLKRYDEAKQWLF 458

Query: 383 RMLH-ETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTD 442
           RM+     L       ++     E     A +L  ++    + PN   YN +I  YC+  
Sbjct: 459 RMMEGGVKLSTVSYTNLIDVYCKEGNVEEAKRLFVEMSSKGVQPNAITYNVMIYAYCKQG 518

Query: 443 RMLDANHQLELMQKEGLHPNQVTFTILMDGD-----VNSAIGLFNKMNVDGCIPDKVAYI 477
           ++ +A      M+  G+ P+  T+T L+ G+     V+ A+ LF++M + G   + V Y 
Sbjct: 519 KIKEARKLRANMEANGMDPDSYTYTSLIHGECIADNVDEAMRLFSEMGLKGLDQNSVTYT 578

BLAST of Cp4.1LG14g05680 vs. ExPASy Swiss-Prot
Match: Q9LQ16 (Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX=3702 GN=At1g62910 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 1.2e-29
Identity = 88/308 (28.57%), Postives = 148/308 (48.05%), Query Frame = 0

Query: 189 VFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLV 248
           V  KM+K G +PD     +++NGY  + ++ +A  L +QMVE    P +  +T LI GL 
Sbjct: 140 VLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIHGLF 199

Query: 249 KKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVI 308
             N   +    + +M++ G  P+ V Y +++N   K G+++ A  L+  ME+  IE DV+
Sbjct: 200 LHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALSLLKKMEKGKIEADVV 259

Query: 309 FYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLF-------------------RMLHET 368
            Y T++ G+CK   +D       E +N+  +  +F                   R+L + 
Sbjct: 260 IYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSD- 319

Query: 369 TLVPRDNN-------MIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTD 428
            ++ R  N        ++ A   E     A KL  ++    I P++  Y+S+I G+C  D
Sbjct: 320 -MIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTYSSLINGFCMHD 379

Query: 429 RMLDANHQLELMQKEGLHPNQVTFTILMDG-----DVNSAIGLFNKMNVDGCIPDKVAYI 466
           R+ +A H  ELM  +   PN VT++ L+ G      V   + LF +M+  G + + V Y 
Sbjct: 380 RLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYT 439

BLAST of Cp4.1LG14g05680 vs. ExPASy Swiss-Prot
Match: Q9FIT7 (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 4.6e-29
Identity = 87/288 (30.21%), Postives = 139/288 (48.26%), Query Frame = 0

Query: 182 RIFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYT 241
           R  +   V K+M + G+ PD   Y ++I G  K  ++ EAR    +MVEN + P++  Y 
Sbjct: 467 RFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYG 526

Query: 242 ALISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERS 301
           A ISG ++ +       Y+ +M   G  PN VL T LIN Y K G+V  A      M   
Sbjct: 527 AFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQ 586

Query: 302 HIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVS 361
            I  D   Y  L++G+ KN  VD  +              +FR +    + P   +  V 
Sbjct: 587 GILGDAKTYTVLMNGLFKNDKVDDAE-------------EIFREMRGKGIAPDVFSYGVL 646

Query: 362 ANSTEEMKSF--ALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGL 421
            N   ++ +   A  +  ++ +  + PN+ +YN ++ G+CR+  +  A   L+ M  +GL
Sbjct: 647 INGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGL 706

Query: 422 HPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDKVAYITLLKG 463
           HPN VT+  ++D     GD+  A  LF++M + G +PD   Y TL+ G
Sbjct: 707 HPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTTLVDG 741

BLAST of Cp4.1LG14g05680 vs. ExPASy Swiss-Prot
Match: Q9SH26 (Pentatricopeptide repeat-containing protein At1g63400 OS=Arabidopsis thaliana OX=3702 GN=At1g63400 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 1.3e-28
Identity = 86/303 (28.38%), Postives = 142/303 (46.86%), Query Frame = 0

Query: 192 KMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKN 251
           KM+K G +P      +++NGY    ++ +A  L +QMVE    P +  +T LI GL   N
Sbjct: 145 KMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEMGYRPDTITFTTLIHGLFLHN 204

Query: 252 MTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYI 311
              +    + +M++ G  PN V Y  ++N   K G+++ AF L++ ME + IE +V+ Y 
Sbjct: 205 KASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDIDLAFNLLNKMEAAKIEANVVIYS 264

Query: 312 TLVSGICKNLIVDKKKWFLLEKENQKAK----------------------STLFRMLHET 371
           T++  +CK    D       E EN+  +                      S L   + E 
Sbjct: 265 TVIDSLCKYRHEDDALNLFTEMENKGVRPNVITYSSLISCLCNYERWSDASRLLSDMIER 324

Query: 372 TLVPR--DNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDA 431
            + P     N ++ A   E     A KL  ++    I P++  Y+S+I G+C  DR+ +A
Sbjct: 325 KINPNVVTFNALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEA 384

Query: 432 NHQLELMQKEGLHPNQVTFTILMDG-----DVNSAIGLFNKMNVDGCIPDKVAYITLLKG 466
            H  ELM  +   PN VT+  L++G      ++  + LF +M+  G + + V Y TL+ G
Sbjct: 385 KHMFELMISKDCFPNVVTYNTLINGFCKAKRIDEGVELFREMSQRGLVGNTVTYTTLIHG 444

BLAST of Cp4.1LG14g05680 vs. NCBI nr
Match: XP_023552131.1 (pentatricopeptide repeat-containing protein At5g62370 [Cucurbita pepo subsp. pepo] >XP_023552132.1 pentatricopeptide repeat-containing protein At5g62370 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 692 bits (1787), Expect = 1.36e-238
Identity = 441/884 (49.89%), Postives = 442/884 (50.00%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM 
Sbjct: 76  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 ICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVH 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                              SKWLVEAEL
Sbjct: 196 LGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKSKWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEFRSLYPDKTM                                            
Sbjct: 256 LIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAI 435

Query: 456 ------------------------------------------------------------ 477
                                                                       
Sbjct: 436 LKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENL 495

BLAST of Cp4.1LG14g05680 vs. NCBI nr
Match: XP_022922745.1 (pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita moschata] >XP_022922746.1 pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita moschata] >XP_022922747.1 pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita moschata])

HSP 1 Score: 667 bits (1722), Expect = 8.81e-229
Identity = 429/884 (48.53%), Postives = 437/884 (49.43%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVE+LIRRGLFLPAQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAISIVDFAAERGLE+DLDTHGVF RQLVYSRPQLAELLYDKKFTF GAEPDASVLDSM 
Sbjct: 76  EAISIVDFAAERGLELDLDTHGVFWRQLVYSRPQLAELLYDKKFTFRGAEPDASVLDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 ICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVH 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                               KWLVEAEL
Sbjct: 196 LGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEFRSLYPDKTM                                            
Sbjct: 256 LIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAI 435

Query: 456 ------------------------------------------------------------ 477
                                                                       
Sbjct: 436 LKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENL 495

BLAST of Cp4.1LG14g05680 vs. NCBI nr
Match: XP_022985467.1 (pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita maxima] >XP_022985468.1 pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita maxima])

HSP 1 Score: 656 bits (1693), Expect = 2.09e-224
Identity = 422/882 (47.85%), Postives = 433/882 (49.09%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLV++LIRRGLFLPAQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVDQLIRRGLFLPAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAISIVDFAAERGLE+DL THGV CRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM 
Sbjct: 76  EAISIVDFAAERGLELDLATHGVLCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 TCFCRLGKFEKALAYFNQLLSLNYVPSKSSFNAIFRELCAQERVLEAFDYFMRVNGAGVH 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                              SKWLVEAEL
Sbjct: 196 LGYWCFNVLIDGLCNKGHMEEALELFDIMQSTNGYPPSLHLFKSLFYGLCKSKWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEFRSL+PDKTM                                            
Sbjct: 256 LIREMEFRSLHPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNISPSLH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALHRDDRLEEVSELLKSMLDNGIIPDHVLFFTLMKMYPKGHELQLALNVLEAI 435

Query: 456 ------------------------------------------------------------ 475
                                                                       
Sbjct: 436 LKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENL 495

BLAST of Cp4.1LG14g05680 vs. NCBI nr
Match: KAG6576797.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 640 bits (1650), Expect = 1.65e-217
Identity = 413/877 (47.09%), Postives = 424/877 (48.35%), Query Frame = 0

Query: 54  SSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSISEAISIVDFAAERGLEIDL 113
           + +A EHKTLCYSLVE+LIRRGLFLPAQQVIQRIVTQSSSI EAISIVDFAAERGLE+DL
Sbjct: 42  TGTALEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIYEAISIVDFAAERGLELDL 101

Query: 114 DTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM------------------- 173
           DTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM                   
Sbjct: 102 DTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMVICFCRLGKFEKALAYFNQ 161

Query: 174 ------------------------------------------------------------ 233
                                                                       
Sbjct: 162 LLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGH 221

Query: 234 ---------------------------------SKWLVEAELLIREMEFRSLYPDKTM-- 293
                                             KWLVEAELLIREMEFRSLYPDKTM  
Sbjct: 222 MEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYT 281

Query: 294 ------------------------------------------------------------ 353
                                                                       
Sbjct: 282 SLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEW 341

Query: 354 ------------------------------------------------------------ 413
                                                                       
Sbjct: 342 GIQPDVVTFHIMISQYCQEGKVDFALTILNSMVSCNFSPSLHCYTVLINALHRDDRLEEV 401

Query: 414 ------------------------------------------------------------ 473
                                                                       
Sbjct: 402 SELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNGCGCDPSVILASTKL 461

Query: 474 ------------------------------------------------------------ 488
                                                                       
Sbjct: 462 QTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDCALGYFHKMASLGCKPL 521

BLAST of Cp4.1LG14g05680 vs. NCBI nr
Match: XP_038882384.1 (pentatricopeptide repeat-containing protein At5g62370 [Benincasa hispida])

HSP 1 Score: 558 bits (1439), Expect = 4.04e-186
Identity = 369/882 (41.84%), Postives = 401/882 (45.46%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLD P TSSSSSAS+HK LC+SLVE+LIRRGLFL AQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDIPTTSSSSSASQHKNLCFSLVEQLIRRGLFLSAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAIS++DFAAERGLE+DL THG  CRQ VYS+PQLAELLY++ F FGGAEPD  ++DSM 
Sbjct: 76  EAISVLDFAAERGLELDLATHGWLCRQFVYSKPQLAELLYNRNFVFGGAEPDVLLMDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 ICFCRLGKFEEALTHFNRLLSLNYVPSKVSFNAIFRELCAQERVLEAFDYFVRVNGAGVY 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                              S+WLVEAEL
Sbjct: 196 LGHWCFNVLMDGLCNKGYMEEALELFDIMQSTNGYPPTLHLFKTLFYGLCKSRWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEF+SLYPD+TM                                            
Sbjct: 256 LIREMEFQSLYPDETMYTSLIHGYCKDKKMKMAMQALFRMVKIGCKPDSFTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLDLVEKGWLVYNLMAEWGIQPNVVTFHIMISKYCQEGKVDTALAFLNSMVNSNLSPSVH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALYRDDRLEEVSELLKSMLDNGIIPDHVLFFTLMKMYPRGHELQLALNTLGAI 435

Query: 456 ------------------------------------------------------------ 475
                                                                       
Sbjct: 436 VKNGCGCDPSVILASTKWQTSSTLEQKIETLLREIFNSNLNLAGVAFSIVISALCETKNL 495

BLAST of Cp4.1LG14g05680 vs. ExPASy TrEMBL
Match: A0A6J1E4Z0 (pentatricopeptide repeat-containing protein At5g62370 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430647 PE=4 SV=1)

HSP 1 Score: 667 bits (1722), Expect = 4.27e-229
Identity = 429/884 (48.53%), Postives = 437/884 (49.43%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVE+LIRRGLFLPAQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAISIVDFAAERGLE+DLDTHGVF RQLVYSRPQLAELLYDKKFTF GAEPDASVLDSM 
Sbjct: 76  EAISIVDFAAERGLELDLDTHGVFWRQLVYSRPQLAELLYDKKFTFRGAEPDASVLDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 ICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVH 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                               KWLVEAEL
Sbjct: 196 LGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEFRSLYPDKTM                                            
Sbjct: 256 LIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAI 435

Query: 456 ------------------------------------------------------------ 477
                                                                       
Sbjct: 436 LKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENL 495

BLAST of Cp4.1LG14g05680 vs. ExPASy TrEMBL
Match: A0A6J1J4Z3 (pentatricopeptide repeat-containing protein At5g62370 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483468 PE=4 SV=1)

HSP 1 Score: 656 bits (1693), Expect = 1.01e-224
Identity = 422/882 (47.85%), Postives = 433/882 (49.09%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLV++LIRRGLFLPAQQVIQRIVTQSSSIS
Sbjct: 16  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVDQLIRRGLFLPAQQVIQRIVTQSSSIS 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM- 155
           EAISIVDFAAERGLE+DL THGV CRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSM 
Sbjct: 76  EAISIVDFAAERGLELDLATHGVLCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMV 135

Query: 156 ------------------------------------------------------------ 215
                                                                       
Sbjct: 136 TCFCRLGKFEKALAYFNQLLSLNYVPSKSSFNAIFRELCAQERVLEAFDYFMRVNGAGVH 195

Query: 216 ---------------------------------------------------SKWLVEAEL 275
                                                              SKWLVEAEL
Sbjct: 196 LGYWCFNVLIDGLCNKGHMEEALELFDIMQSTNGYPPSLHLFKSLFYGLCKSKWLVEAEL 255

Query: 276 LIREMEFRSLYPDKTM-------------------------------------------- 335
           LIREMEFRSL+PDKTM                                            
Sbjct: 256 LIREMEFRSLHPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFV 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 KLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNISPSLH 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 CYTVLINALHRDDRLEEVSELLKSMLDNGIIPDHVLFFTLMKMYPKGHELQLALNVLEAI 435

Query: 456 ------------------------------------------------------------ 475
                                                                       
Sbjct: 436 LKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENL 495

BLAST of Cp4.1LG14g05680 vs. ExPASy TrEMBL
Match: A0A6J1DJ30 (pentatricopeptide repeat-containing protein At5g62370 OS=Momordica charantia OX=3673 GN=LOC111021369 PE=4 SV=1)

HSP 1 Score: 502 bits (1292), Expect = 2.39e-164
Identity = 348/890 (39.10%), Postives = 390/890 (43.82%), Query Frame = 0

Query: 36  RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVERLIRRGLFLPAQQVIQRIVTQSSSIS 95
           +  VTTCTVP+D P T SS+ ASEHKTLCYSLVE+LI RGLF  AQQVIQRI+ QSSS+ 
Sbjct: 16  KRSVTTCTVPIDAPTTLSSTCASEHKTLCYSLVEQLIGRGLFSSAQQVIQRIIRQSSSVC 75

Query: 96  EAISIVDFAAERGLEIDLDTHGVFCRQLVYS-RPQLAELLYDKKFTFGGAEPDASVLDSM 155
           EAISIVDFA+ERGLE+DL +HGV  R+LVYS RPQLAE L+  K   GGA PD  VLD M
Sbjct: 76  EAISIVDFASERGLELDLASHGVLFRKLVYSSRPQLAEELFYNKIISGGAYPDPLVLDYM 135

Query: 156 ---------------------------SK------------------------------- 215
                                      SK                               
Sbjct: 136 VICFCRLEKFEEALAHFDQLISLNYIPSKASFNAIFRELCAQGRVLEAFNYFVRVNGAGV 195

Query: 216 ------------------------------------------------------WLVEAE 275
                                                                 WLVEAE
Sbjct: 196 YLGYWCFNVLIDGLCYKEYMGEALQLFDIMQITNRYPPTLHLFKSLFYGLCKRGWLVEAE 255

Query: 276 LLIREMEFRSLYPDKTM------------------------------------------- 335
           LLIREMEF+ LYPDKTM                                           
Sbjct: 256 LLIREMEFQGLYPDKTMYTSLIHEYCKEKKMKMAMQAFFRMIKIGCKPDNYTLNTLIHGF 315

Query: 336 ------------------------------------------------------------ 395
                                                                       
Sbjct: 316 VKLGLVDKGWLVYNLMEEWGVQPDVVTFHIMINKYCQEGKVDSALAIFNNMVSCNLSPSL 375

Query: 396 ------------------------------------------------------------ 455
                                                                       
Sbjct: 376 HCYTVLINALHRDNRLEEVDVFSRSMLDSGIVPDHVLFFTLMKMYPKGHELQLALTILEA 435

Query: 456 ------------------------------------------------------------ 482
                                                                       
Sbjct: 436 IVKNGCGFDPSIISSCKKLQSSSNLEKKIEMLLQEIFDSNLNLAGVAFSIVISALCEIEK 495

BLAST of Cp4.1LG14g05680 vs. ExPASy TrEMBL
Match: A0A5N6QXY1 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_008191 PE=4 SV=1)

HSP 1 Score: 375 bits (962), Expect = 1.15e-114
Identity = 213/467 (45.61%), Postives = 301/467 (64.45%), Query Frame = 0

Query: 46  LDPPVTSSSSSASEHKTL---CYSLVERLIRRGLFLP--AQQVIQRIVTQSSSISEAISI 105
           +DP + + S+S +    L      L+ER++R  L L   A  V    + +   I  A+  
Sbjct: 445 VDPSMLAFSASVNSTGDLEREIEILLERIVRSNLNLANVAFSVFISALCEEGRIDCALIC 504

Query: 106 VDFAAERGLEIDLDTHG----VFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMSK 165
           +D     G    L T+       C++ +++    AE L D     G     A+ L  ++ 
Sbjct: 505 MDKMVRVGCVPLLFTYNSLIKCLCQEGLFAD---AESLIDIMQVHGAVPDQATYLIMINA 564

Query: 166 ------WLVEAELLIREMEFRSLYPDKTM----------KKRIFEVKGVFKKMLKAGVDP 225
                 W V A  ++ +ME R L P   +          +KRIFE + +FK+MLK GVDP
Sbjct: 565 HCKRGDW-VSAFDILDQMEERGLRPYVAVYDTIIRCLSREKRIFEAEELFKRMLKFGVDP 624

Query: 226 DKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDKGCLYL 285
           D+ +Y+TMI+GY KNG+ +EA + F++M+ENSI PSS+ YTALISGLVKKNMTDKGC+YL
Sbjct: 625 DEVVYMTMIDGYSKNGRAIEAHQFFDKMIENSIRPSSYSYTALISGLVKKNMTDKGCIYL 684

Query: 286 GKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKN 345
            +ML DG  PNAVLYT LINH+LK GE E+AFRLVDLM+++ +E D++ YI+LVSGI +N
Sbjct: 685 DRMLADGLEPNAVLYTLLINHFLKKGEFEFAFRLVDLMDKNQVEHDLVMYISLVSGISRN 744

Query: 346 LIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSFALKLIQKVK 405
           +   KKKW +L K +++A+     +LH+ TL+PR+N + VS  S EEMK FALKL+QKVK
Sbjct: 745 ITGIKKKWRILNKGSERAREMFLHLLHQRTLIPRENILRVSVISVEEMKCFALKLMQKVK 804

Query: 406 DVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMDG-----DVN 465
           ++ ++PNL++YN II G+CR ++M DA    E+MQ+EG+ PNQVT+TIL+DG     D++
Sbjct: 805 EIGLMPNLYIYNGIISGFCRAEQMQDAYDHFEMMQREGVRPNQVTYTILVDGHIQLGDID 864

Query: 466 SAIGLFNKMNVDGCIPDKVAYITLLKGLSQGGRLSDALALHMFICSR 482
           SA+GLFNKMN  G  PD++AY TLL+GL + GRL DAL++   +  R
Sbjct: 865 SAVGLFNKMNEGGFAPDRIAYNTLLRGLCKAGRLLDALSISYMMRKR 907

BLAST of Cp4.1LG14g05680 vs. ExPASy TrEMBL
Match: A0A6J1AG82 (pentatricopeptide repeat-containing protein At5g62370 isoform X2 OS=Herrania umbratica OX=108875 GN=LOC110417676 PE=4 SV=1)

HSP 1 Score: 363 bits (932), Expect = 4.53e-112
Identity = 194/393 (49.36%), Postives = 267/393 (67.94%), Query Frame = 0

Query: 88  VTQSSSISEAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPD 147
           ++Q     +A S+VD   ++G+  D  T+ +   +    R  LA                
Sbjct: 332 LSQEGLFEDAKSLVDLMQDQGIFPDQATYLIMINEHC-KRGNLASAF------------- 391

Query: 148 ASVLDSMSKWLVEAELLIREMEFRSLYPDKTMKKRIFEVKGVFKKMLKAGVDPDKHLYLT 207
             +LD M    ++  + I +    SL   K    R+FE + +F +ML++G DPD+ +Y+T
Sbjct: 392 -DILDQMEDRGMKPGVAIYDCIIGSLCQQK----RMFEAEDMFIRMLESGEDPDEIVYMT 451

Query: 208 MINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDKGCLYLGKMLRDG 267
           MINGY KNG+L+EAR+LFE+M+E++I P+SH YTALISGLVKK+MTDKGC+YL +ML DG
Sbjct: 452 MINGYSKNGRLIEARQLFEKMIEDAIRPTSHSYTALISGLVKKDMTDKGCMYLDRMLGDG 511

Query: 268 FSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKK 327
             PNAVLYTSLIN++L+ GE E+AFRLVDLM+R+ IE D+I YI LVSG+C+N I  +K+
Sbjct: 512 LVPNAVLYTSLINNFLRKGEFEFAFRLVDLMDRNQIEHDLITYIALVSGVCRN-ITSRKR 571

Query: 328 WFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPN 387
           W  +++ +++A+  LF +LH   L+PR+  + VS +S E MK FALK++QKVK+   +PN
Sbjct: 572 WCSIKRSSERAREMLFCLLHYRCLLPREKKLRVSDSSPEAMKCFALKVMQKVKETRFMPN 631

Query: 388 LHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMDG-----DVNSAIGLFN 447
           L+LYN II G+C  DRM DA    ELMQKEG+ PNQVTFTILM G     +++ AI LFN
Sbjct: 632 LYLYNGIISGFCWADRMQDAYDHFELMQKEGVRPNQVTFTILMGGHIKAGEIDHAIDLFN 691

Query: 448 KMNVDGCIPDKVAYITLLKGLSQGGRLSDALAL 475
           KMN D C PDK+AY TL+KGL Q GRL +A++L
Sbjct: 692 KMNADECTPDKIAYNTLIKGLCQAGRLLEAVSL 704

BLAST of Cp4.1LG14g05680 vs. TAIR 10
Match: AT5G62370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 244.6 bits (623), Expect = 2.0e-64
Identity = 161/433 (37.18%), Postives = 243/433 (56.12%), Query Frame = 0

Query: 50  VTSSSSSASEHKTLCYSLVERLIRRGLF-LP-AQQVIQRIVTQSSSISEAISIVDFAAER 109
           V +++  +  +     S +E+++  G   LP +   + + + Q + I +  S+V+   E 
Sbjct: 482 VVTTALCSQRNYIAALSRIEKMVNLGCTPLPFSYNSVIKCLFQENIIEDLASLVNIIQEL 541

Query: 110 GLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFGGAEPDASVLDSMSKWLVEAELLIRE 169
               D+DT+ +   +L            D+   F       +++D+M +  +   + I  
Sbjct: 542 DFVPDVDTYLIVVNELCKKN--------DRDAAF-------AIIDAMEELGLRPTVAI-- 601

Query: 170 MEFRSLYPDKTMKKRIFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQ 229
             + S+      + R+ E +  F KML++G+ PD+  Y+ MIN Y +NG++ EA +L E+
Sbjct: 602 --YSSIIGSLGKQGRVVEAEETFAKMLESGIQPDEIAYMIMINTYARNGRIDEANELVEE 661

Query: 230 MVENSIPPSSHIYTALISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGE 289
           +V++ + PSS  YT LISG VK  M +KGC YL KML DG SPN VLYT+LI H+LK G+
Sbjct: 662 VVKHFLRPSSFTYTVLISGFVKMGMMEKGCQYLDKMLEDGLSPNVVLYTALIGHFLKKGD 721

Query: 290 VEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLH 349
            +++F L  LM  + I+ D I YITL+SG+ + +   KK+  ++E   +K    L R++ 
Sbjct: 722 FKFSFTLFGLMGENDIKHDHIAYITLLSGLWRAMARKKKRQVIVEPGKEK---LLQRLIR 781

Query: 350 ETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDA 409
              LV      I S+      KSFA+++I KVK   I+PNL+L+N+II GYC   R+ +A
Sbjct: 782 TKPLV-----SIPSSLGNYGSKSFAMEVIGKVKK-SIIPNLYLHNTIITGYCAAGRLDEA 841

Query: 410 NHQLELMQKEGLHPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDKVAYITLLKG 469
            + LE MQKEG+ PN VT+TILM      GD+ SAI LF   N   C PD+V Y TLLKG
Sbjct: 842 YNHLESMQKEGIVPNLVTYTILMKSHIEAGDIESAIDLFEGTN---CEPDQVMYSTLLKG 883

Query: 470 LSQGGRLSDALAL 476
           L    R  DALAL
Sbjct: 902 LCDFKRPLDALAL 883

BLAST of Cp4.1LG14g05680 vs. TAIR 10
Match: AT2G32630.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 134.0 bits (336), Expect = 3.9e-31
Identity = 110/439 (25.06%), Postives = 197/439 (44.87%), Query Frame = 0

Query: 83  VIQRIVTQSSSISEAISIVDFAAERGLEIDLDTHGVFCRQLVYSRPQLAELLYDKKFTFG 142
           ++ R+   +    E + + D+  ++GL ID  +  VF       R     L   ++    
Sbjct: 159 LVFRVYVDNGMFEEGLRVFDYMVKKGLSIDERSCIVFLVAAKKRRRIDLCLEIFRRMVDS 218

Query: 143 GAEPDASVLDSMSKWLV------EAELLIREMEFRSLYPD---------KTMKKRIFE-V 202
           G +     L  + + L       +++ LI+E   + + P+           +K+R F  V
Sbjct: 219 GVKITVYSLTIVVEGLCRRGEVEKSKKLIKEFSVKGIKPEAYTYNTIINAYVKQRDFSGV 278

Query: 203 KGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISG 262
           +GV K M K GV  +K  Y  ++    KNGK+ +A KLF++M E  I    H+YT+LIS 
Sbjct: 279 EGVLKVMKKDGVVYNKVTYTLLMELSVKNGKMSDAEKLFDEMRERGIESDVHVYTSLISW 338

Query: 263 LVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPD 322
             +K    +  L   ++   G SP++  Y +LI+   K+GE+  A  L++ M+   +   
Sbjct: 339 NCRKGNMKRAFLLFDELTEKGLSPSSYTYGALIDGVCKVGEMGAAEILMNEMQSKGVNIT 398

Query: 323 VIFYITLVSGICKNLIVD---------KKKWFLLE--------------KENQKAKSTLF 382
            + + TL+ G C+  +VD         ++K F  +              K   +AK  LF
Sbjct: 399 QVVFNTLIDGYCRKGMVDEASMIYDVMEQKGFQADVFTCNTIASCFNRLKRYDEAKQWLF 458

Query: 383 RMLH-ETTLVPRDNNMIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTD 442
           RM+     L       ++     E     A +L  ++    + PN   YN +I  YC+  
Sbjct: 459 RMMEGGVKLSTVSYTNLIDVYCKEGNVEEAKRLFVEMSSKGVQPNAITYNVMIYAYCKQG 518

Query: 443 RMLDANHQLELMQKEGLHPNQVTFTILMDGD-----VNSAIGLFNKMNVDGCIPDKVAYI 477
           ++ +A      M+  G+ P+  T+T L+ G+     V+ A+ LF++M + G   + V Y 
Sbjct: 519 KIKEARKLRANMEANGMDPDSYTYTSLIHGECIADNVDEAMRLFSEMGLKGLDQNSVTYT 578

BLAST of Cp4.1LG14g05680 vs. TAIR 10
Match: AT1G62910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 132.9 bits (333), Expect = 8.6e-31
Identity = 88/308 (28.57%), Postives = 148/308 (48.05%), Query Frame = 0

Query: 189 VFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLV 248
           V  KM+K G +PD     +++NGY  + ++ +A  L +QMVE    P +  +T LI GL 
Sbjct: 140 VLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIHGLF 199

Query: 249 KKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSHIEPDVI 308
             N   +    + +M++ G  P+ V Y +++N   K G+++ A  L+  ME+  IE DV+
Sbjct: 200 LHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALSLLKKMEKGKIEADVV 259

Query: 309 FYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLF-------------------RMLHET 368
            Y T++ G+CK   +D       E +N+  +  +F                   R+L + 
Sbjct: 260 IYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSD- 319

Query: 369 TLVPRDNN-------MIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTD 428
            ++ R  N        ++ A   E     A KL  ++    I P++  Y+S+I G+C  D
Sbjct: 320 -MIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTYSSLINGFCMHD 379

Query: 429 RMLDANHQLELMQKEGLHPNQVTFTILMDG-----DVNSAIGLFNKMNVDGCIPDKVAYI 466
           R+ +A H  ELM  +   PN VT++ L+ G      V   + LF +M+  G + + V Y 
Sbjct: 380 RLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYT 439

BLAST of Cp4.1LG14g05680 vs. TAIR 10
Match: AT5G61990.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 131.0 bits (328), Expect = 3.3e-30
Identity = 87/288 (30.21%), Postives = 139/288 (48.26%), Query Frame = 0

Query: 182 RIFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYT 241
           R  +   V K+M + G+ PD   Y ++I G  K  ++ EAR    +MVEN + P++  Y 
Sbjct: 467 RFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYG 526

Query: 242 ALISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERS 301
           A ISG ++ +       Y+ +M   G  PN VL T LIN Y K G+V  A      M   
Sbjct: 527 AFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQ 586

Query: 302 HIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVS 361
            I  D   Y  L++G+ KN  VD  +              +FR +    + P   +  V 
Sbjct: 587 GILGDAKTYTVLMNGLFKNDKVDDAE-------------EIFREMRGKGIAPDVFSYGVL 646

Query: 362 ANSTEEMKSF--ALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGL 421
            N   ++ +   A  +  ++ +  + PN+ +YN ++ G+CR+  +  A   L+ M  +GL
Sbjct: 647 INGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGL 706

Query: 422 HPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDKVAYITLLKG 463
           HPN VT+  ++D     GD+  A  LF++M + G +PD   Y TL+ G
Sbjct: 707 HPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTTLVDG 741

BLAST of Cp4.1LG14g05680 vs. TAIR 10
Match: AT1G09680.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 129.4 bits (324), Expect = 9.5e-30
Identity = 86/300 (28.67%), Postives = 149/300 (49.67%), Query Frame = 0

Query: 183 IFEVKGVFKKMLKAGVDPDKHLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 242
           I + + VF ++ K  + P    + T+INGY K G L E  +L  QM ++   P    Y+A
Sbjct: 256 ISDAQKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEKSRTRPDVFTYSA 315

Query: 243 LISGLVKKNMTDKGCLYLGKMLRDGFSPNAVLYTSLINHYLKIGEVEYAFRLVDLMERSH 302
           LI+ L K+N  D       +M + G  PN V++T+LI+ + + GE++        M    
Sbjct: 316 LINALCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDLMKESYQKMLSKG 375

Query: 303 IEPDVIFYITLVSGICKN--LIVDKKKWFLLEKENQKAKSTLFRMLHETTLVP---RDNN 362
           ++PD++ Y TLV+G CKN  L+  +     + +   +     +     TTL+    R  +
Sbjct: 376 LQPDIVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITY-----TTLIDGFCRGGD 435

Query: 363 MIVSANSTEEMKSFALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKE 422
           +  +    +EM    ++L  +V           +++++CG C+  R++DA   L  M + 
Sbjct: 436 VETALEIRKEMDQNGIEL-DRVG----------FSALVCGMCKEGRVIDAERALREMLRA 495

Query: 423 GLHPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDKVAYITLLKGLSQGGRLSDA 473
           G+ P+ VT+T++MD     GD  +   L  +M  DG +P  V Y  LL GL + G++ +A
Sbjct: 496 GIKPDDVTYTMMMDAFCKKGDAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNA 539

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LVA22.9e-6337.18Pentatricopeptide repeat-containing protein At5g62370 OS=Arabidopsis thaliana OX... [more]
Q8S8P65.5e-3025.06Pentatricopeptide repeat-containing protein At2g32630 OS=Arabidopsis thaliana OX... [more]
Q9LQ161.2e-2928.57Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX... [more]
Q9FIT74.6e-2930.21Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop... [more]
Q9SH261.3e-2828.38Pentatricopeptide repeat-containing protein At1g63400 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_023552131.11.36e-23849.89pentatricopeptide repeat-containing protein At5g62370 [Cucurbita pepo subsp. pep... [more]
XP_022922745.18.81e-22948.53pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita mosc... [more]
XP_022985467.12.09e-22447.85pentatricopeptide repeat-containing protein At5g62370 isoform X1 [Cucurbita maxi... [more]
KAG6576797.11.65e-21747.09Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_038882384.14.04e-18641.84pentatricopeptide repeat-containing protein At5g62370 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1E4Z04.27e-22948.53pentatricopeptide repeat-containing protein At5g62370 isoform X1 OS=Cucurbita mo... [more]
A0A6J1J4Z31.01e-22447.85pentatricopeptide repeat-containing protein At5g62370 isoform X1 OS=Cucurbita ma... [more]
A0A6J1DJ302.39e-16439.10pentatricopeptide repeat-containing protein At5g62370 OS=Momordica charantia OX=... [more]
A0A5N6QXY11.15e-11445.61Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_008191 PE=4 SV=1[more]
A0A6J1AG824.53e-11249.36pentatricopeptide repeat-containing protein At5g62370 isoform X2 OS=Herrania umb... [more]
Match NameE-valueIdentityDescription
AT5G62370.12.0e-6437.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G32630.13.9e-3125.06Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G62910.18.6e-3128.57Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G61990.13.3e-3030.21Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G09680.19.5e-3028.67Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 390..422
e-value: 3.0E-5
score: 21.9
coord: 239..272
e-value: 3.6E-7
score: 27.9
coord: 273..307
e-value: 1.3E-6
score: 26.2
coord: 204..236
e-value: 7.7E-9
score: 33.2
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 200..249
e-value: 5.1E-11
score: 42.6
coord: 386..432
e-value: 2.8E-11
score: 43.4
coord: 270..319
e-value: 5.0E-10
score: 39.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 9.744654
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 201..235
score: 12.561707
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 10.818861
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 387..421
score: 10.709248
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 345..520
e-value: 2.5E-21
score: 78.4
coord: 86..344
e-value: 8.8E-35
score: 122.5
NoneNo IPR availablePANTHERPTHR47933:SF28OS10G0116000 PROTEINcoord: 180..482
coord: 44..155
NoneNo IPR availablePANTHERPTHR47933PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 1, MITOCHONDRIALcoord: 180..482
coord: 44..155

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g05680.1Cp4.1LG14g05680.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding