Cp4.1LG06g05950 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG06g05950
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG06: 3657315 .. 3660713 (-)
RNA-Seq ExpressionCp4.1LG06g05950
SyntenyCp4.1LG06g05950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGGAAGCCATGGCTGAAAACCCTAAAATTGGTGTAAAAAAGCCTTTCAAACTCCCCCCCAGTTCCTCGCCTCCGTTGGTCAAATCCCCTCGCCTCCCCGTGCATCTCGACGCGCCTGACATATCGCCGGCTGCCAAAACCGTTTGCAAAGTCCTGGTCAGAGCCTCGCCAAATGATGTAGAAGCCGCGCTCTCCGCGACGGGAGTAGTTCCGTCGCCAGAGCTTGTCCAAGAAGTTCTAAGGGTTTCTTACAACTATCCGTCGTCGGCGATCAAATTCTTCCGGTGGGCGAGGCAGTTGGCGAAGCAGTCGGCGTACTCGTGGAATCTGATGGTTGATTTGCTGGGCAAGAATGAATTGTTCGATCAAATGTGGAACGCCATCCGCACCATGAGAGAAGAAAAAGTCCTTTCATTACCAACTTTCGTGTCTGTTTTCGGGAGTTATTGTTCGGCTGGCAGGTTCAAAGATGCGATTAAGAGCTTTGAAGTGATGGATAGGTACGACGTCGAGAAGGATGTGGTGGCGGTGAATTCTCTACTGAGTGCAATTTGCACCGAGGAAAATCAAACATCAAAGGCTTGGGAGTTCTTTAAAAAGCACAAAGAGAAGATTCCTCTAGATGGGGACTCATACGCCATTTTATTGGAAGGATGGGAGAAAGAAGGCAATGTGGAGAAAGCCGAGGTTACATTTGATGAAATGGTGAAAAGAGTTGGTTGGAATCCTGAAAATGTTTCAGCTTATGATGCATTTTTGATAACATTGGTTCGTCGGGGCCAATCTGCAGAGGCAGTTAAAGTTCTACTAGAAATAAAGAACAATGGTTGCTCTCCAGGTTTGAAATTCCTATCCAATGCTCTTGATAGTCTCATTAACCAAAATGATGCAACCCATGCGATTCTATTGTGGGATATAGCTGTAGGAAATGGATTAGTTCCAAACTTGATTATGTACAATGCCATAATCGGATTGCTTAGTGAGAAGGGTAAAATTGAAGACTCATTTCGGCTCTTGGACGCCATGGTTTTCCATGGAGCTTTTCCCAACTCCCTCACTTACAACCTGATCTTCAGCTGTTTGATTAAGAACAAGAAAGTTAAGGAAGCAAGCCAATTTTTCAGGGAAATGGTAAAGAATGAGTGCCCTCCCACCCCTTCTAATTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTTCATGGTTGATAATAACATCAAGCCTATGGATGCAAGTGCAAATGCACTGCTTATTGGTCTCTGCGACTTGGATCGATTAACAGAGGTTAGACGATTTGCTGATGATATGCTAGATCAACGAATCGGAATATTCGAATCGACTATGAAGTTCTTACGGAATGGTTTCTATCAGCAGAGGGGAAGATTCAGAGATAGTTATGATAGTCTTTTTCGTAGGTGGAGAGACTCCTAAACTTTGTAGTTTTTGCCCTCTCTTCATCCATTTTTGGTGTTTTCACAAGCTTAATACATAGAAACATTAGTGATTTCCTAGAGGCAAGTAGGGTTACCTGTTCTACACAAATCATTTTGACCTTAGCAATTTCACCTACTTCAACTTATCAGAAGCTCTTTTTTCTCCCTTACTATTGCCCCATCTTTGTATAGTCTTCGTTCTTAAGGTAGGTTGAAGTAACAGTGAAATGAAAATATTGTTGAAGTCATAAAGATGCAAATCAAATTATATAATTCAAATGGTAAAATAATCGTATCTGATATCTAAATGTTATAAGCTTTTTGTCCATAGTTGGGTTGAGATAAATATTAGTCGAATTACCTGTTGGAACATTCAGATGTCGATAACGTCATTTTCCCATAAGCTCGATCATAACTCGACTGCTTTCGACATGCACTGTCAATAGATCAAAAGCAACTCAGTCGTCTTTCTCTGTGCACCAGCTAGAAAATTCCAGTGTAAGTTTGGAATTTTGCTTTCTAAACCAAAGTAAAGAGAACAAAAAAAAAAGAAGAAAAAAAACTCAGTTCCAATAATATGTCCAATTATTACGTATAGTTCAAGTGATTAAACAAGTTACTTTTAAAGTATACCAACGATAACTAAACGTAAAAGGAAGCCATCTAGATTGATGTAAAAATGAATCTCTGAAATCGTAGGAAGGAGAGAAATGGCCTAACTAAAGAACTGTTGTGAGGATGTAGTTCATCGATAATTTAATTTTTTATCTAGCTGTTGTTGAACTGTATATATATATACAGAAAACAGAGCCAATTTGTTCAAAAGAAGTTCGAAGTATGATTTCTAAAGCAGAACTACAAAAGTAATATCATTGTTGAATTCGGACTAAACATTTAATCCGATCTAAATAGAACACACAATCATGAACATTGTCTTTTGTTAATTTAAATGAAGCATTACATTATCATCAGTACTGTTATGGATTCAAAAAGAGACTCTTAAGTATAATTGAAGGAAACAATTACTCGACAAAGCTTTGGATAAATTCTCAGATAACAAATATGCTCTAACAGTAGGAGTTCTTTACCTTTAGATGCGTTAAAAATAAGGAAAGGAAGATACCAAAGCCTTCGCCGCCCATCGCTGCAACTTCTTGACAAGCTTGCATTCCTTCCTCGGTGTCTTTTCTCTGCCATGCCTAAAATTGAGCTCAGCAACACAACAAAAGTGGTGAGATCATCTGCCAATTCAAATGTACCTTCACTTTTACTTTCTTTTCGGTCTAGATTGCAATTAGTTTATGAGAAATTGCATTGGTCTCAGATGCCACCAAATTTCCTATCCAGCCTAAAGCGACATACTCGTCCCCTAATTTGGATGGCCGGCATAATCTGCGCCACCATAGCTGTTGCTGTGATTATCGCAGGCATCGTTAACTTCATCGGCTACGTGACGATCCGCCCTACGGTGCCTTCAATCAGCGTAACCTACGGACATCTCGATAGAATCCGAAACAGCAGAATCGGATTGCTTGAAGTCCAGATGAAGATCGTCGTCCGAGCCGAGAATCAAAATGCTAGAGCACAGGCAAGCTTCTCACATACCGATTTCGTCCTGATCTTCGACGGCATAGAAATTGCATCACTGATGGCTCACCGGCCGTTCAAAGTGAATAAGATGAGCTACCTGGATTTGCATTTCTTAGTGGAATCATCGGCTATTCCGCTCAATCCTATGCAGATGCAGCATCTAAGCTGGTCGCTGAATAGGAATTTGATGCAATTCGATCTTAAAGGAAGCTCGAGAACTCGATGGCGAGTTGGAGTTCTGGGACCGCTCAAGTTTTGGTGCCATTTGAACTGTCGCCTCAGGTTTTACCCGCGCAATGGAAGCTACATTCCCGCGCCTTGTTCTTCAAAGGATAAGTAA

mRNA sequence

ATGGTGGAAGCCATGGCTGAAAACCCTAAAATTGGTGTAAAAAAGCCTTTCAAACTCCCCCCCAGTTCCTCGCCTCCGTTGGTCAAATCCCCTCGCCTCCCCGTGCATCTCGACGCGCCTGACATATCGCCGGCTGCCAAAACCGTTTGCAAAGTCCTGGTCAGAGCCTCGCCAAATGATGTAGAAGCCGCGCTCTCCGCGACGGGAGTAGTTCCGTCGCCAGAGCTTGTCCAAGAAGTTCTAAGGGTTTCTTACAACTATCCGTCGTCGGCGATCAAATTCTTCCGGTGGGCGAGGCAGTTGGCGAAGCAGTCGGCGTACTCGTGGAATCTGATGGTTGATTTGCTGGGCAAGAATGAATTGTTCGATCAAATGTGGAACGCCATCCGCACCATGAGAGAAGAAAAAGTCCTTTCATTACCAACTTTCGTGTCTGTTTTCGGGAGTTATTGTTCGGCTGGCAGGTTCAAAGATGCGATTAAGAGCTTTGAAGTGATGGATAGGTACGACGTCGAGAAGGATGTGGTGGCGGTGAATTCTCTACTGAGTGCAATTTGCACCGAGGAAAATCAAACATCAAAGGCTTGGGAGTTCTTTAAAAAGCACAAAGAGAAGATTCCTCTAGATGGGGACTCATACGCCATTTTATTGGAAGGATGGGAGAAAGAAGGCAATGTGGAGAAAGCCGAGGTTACATTTGATGAAATGGTGAAAAGAGTTGGTTGGAATCCTGAAAATGTTTCAGCTTATGATGCATTTTTGATAACATTGGTTCGTCGGGGCCAATCTGCAGAGGCAGTTAAAGTTCTACTAGAAATAAAGAACAATGGTTGCTCTCCAGGTTTGAAATTCCTATCCAATGCTCTTGATAGTCTCATTAACCAAAATGATGCAACCCATGCGATTCTATTGTGGGATATAGCTGTAGGAAATGGATTAGTTCCAAACTTGATTATGTACAATGCCATAATCGGATTGCTTAGTGAGAAGGGTAAAATTGAAGACTCATTTCGGCTCTTGGACGCCATGGTTTTCCATGGAGCTTTTCCCAACTCCCTCACTTACAACCTGATCTTCAGCTGTTTGATTAAGAACAAGAAAGTTAAGGAAGCAAGCCAATTTTTCAGGGAAATGGTAAAGAATGAGTGCCCTCCCACCCCTTCTAATTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTTCATGGTTGATAATAACATCAAGCCTATGGATGCAAGTGCAAATGCACTGCTTATTGGTCTCTGCGACTTGGATCGATTAACAGAGATCAAAAGCAACTCAGTCGTCTTTCTCTGTGCACCAGCTAGAAAATTCCAGTATGCGTTAAAAATAAGGAAAGGAAGATACCAAAGCCTTCGCCGCCCATCGCTGCAACTTCTTGACAAGCTTGCATTCCTTCCTCGGTGTCTTTTCTCTGCCATGCCTAAAATTGAGCTCAGCAACACAACAAAAGTGGTGAGATCATCTGCCAATTCAAATGTACCTTCACTTTTACTTTCTTTTCGGTCTAGATTGCAATTAGTTTATGAGAAATTGCATTGGTCTCAGATGCCACCAAATTTCCTATCCAGCCTAAAGCGACATACTCGTCCCCTAATTTGGATGGCCGGCATAATCTGCGCCACCATAGCTGTTGCTGTGATTATCGCAGGCATCGTTAACTTCATCGGCTACGTGACGATCCGCCCTACGGTGCCTTCAATCAGCGTAACCTACGGACATCTCGATAGAATCCGAAACAGCAGAATCGGATTGCTTGAAGTCCAGATGAAGATCGTCGTCCGAGCCGAGAATCAAAATGCTAGAGCACAGGCAAGCTTCTCACATACCGATTTCGTCCTGATCTTCGACGGCATAGAAATTGCATCACTGATGGCTCACCGGCCGTTCAAAGTGAATAAGATGAGCTACCTGGATTTGCATTTCTTAGTGGAATCATCGGCTATTCCGCTCAATCCTATGCAGATGCAGCATCTAAGCTGGTCGCTGAATAGGAATTTGATGCAATTCGATCTTAAAGGAAGCTCGAGAACTCGATGGCGAGTTGGAGTTCTGGGACCGCTCAAGTTTTGGTGCCATTTGAACTGTCGCCTCAGGTTTTACCCGCGCAATGGAAGCTACATTCCCGCGCCTTGTTCTTCAAAGGATAAGTAA

Coding sequence (CDS)

ATGGTGGAAGCCATGGCTGAAAACCCTAAAATTGGTGTAAAAAAGCCTTTCAAACTCCCCCCCAGTTCCTCGCCTCCGTTGGTCAAATCCCCTCGCCTCCCCGTGCATCTCGACGCGCCTGACATATCGCCGGCTGCCAAAACCGTTTGCAAAGTCCTGGTCAGAGCCTCGCCAAATGATGTAGAAGCCGCGCTCTCCGCGACGGGAGTAGTTCCGTCGCCAGAGCTTGTCCAAGAAGTTCTAAGGGTTTCTTACAACTATCCGTCGTCGGCGATCAAATTCTTCCGGTGGGCGAGGCAGTTGGCGAAGCAGTCGGCGTACTCGTGGAATCTGATGGTTGATTTGCTGGGCAAGAATGAATTGTTCGATCAAATGTGGAACGCCATCCGCACCATGAGAGAAGAAAAAGTCCTTTCATTACCAACTTTCGTGTCTGTTTTCGGGAGTTATTGTTCGGCTGGCAGGTTCAAAGATGCGATTAAGAGCTTTGAAGTGATGGATAGGTACGACGTCGAGAAGGATGTGGTGGCGGTGAATTCTCTACTGAGTGCAATTTGCACCGAGGAAAATCAAACATCAAAGGCTTGGGAGTTCTTTAAAAAGCACAAAGAGAAGATTCCTCTAGATGGGGACTCATACGCCATTTTATTGGAAGGATGGGAGAAAGAAGGCAATGTGGAGAAAGCCGAGGTTACATTTGATGAAATGGTGAAAAGAGTTGGTTGGAATCCTGAAAATGTTTCAGCTTATGATGCATTTTTGATAACATTGGTTCGTCGGGGCCAATCTGCAGAGGCAGTTAAAGTTCTACTAGAAATAAAGAACAATGGTTGCTCTCCAGGTTTGAAATTCCTATCCAATGCTCTTGATAGTCTCATTAACCAAAATGATGCAACCCATGCGATTCTATTGTGGGATATAGCTGTAGGAAATGGATTAGTTCCAAACTTGATTATGTACAATGCCATAATCGGATTGCTTAGTGAGAAGGGTAAAATTGAAGACTCATTTCGGCTCTTGGACGCCATGGTTTTCCATGGAGCTTTTCCCAACTCCCTCACTTACAACCTGATCTTCAGCTGTTTGATTAAGAACAAGAAAGTTAAGGAAGCAAGCCAATTTTTCAGGGAAATGGTAAAGAATGAGTGCCCTCCCACCCCTTCTAATTGTGCTGCAGCTATCACAATGTTGTTTGATGGTTATGACCCTGAAACAGCCATTGATATATGGAACTTCATGGTTGATAATAACATCAAGCCTATGGATGCAAGTGCAAATGCACTGCTTATTGGTCTCTGCGACTTGGATCGATTAACAGAGATCAAAAGCAACTCAGTCGTCTTTCTCTGTGCACCAGCTAGAAAATTCCAGTATGCGTTAAAAATAAGGAAAGGAAGATACCAAAGCCTTCGCCGCCCATCGCTGCAACTTCTTGACAAGCTTGCATTCCTTCCTCGGTGTCTTTTCTCTGCCATGCCTAAAATTGAGCTCAGCAACACAACAAAAGTGGTGAGATCATCTGCCAATTCAAATGTACCTTCACTTTTACTTTCTTTTCGGTCTAGATTGCAATTAGTTTATGAGAAATTGCATTGGTCTCAGATGCCACCAAATTTCCTATCCAGCCTAAAGCGACATACTCGTCCCCTAATTTGGATGGCCGGCATAATCTGCGCCACCATAGCTGTTGCTGTGATTATCGCAGGCATCGTTAACTTCATCGGCTACGTGACGATCCGCCCTACGGTGCCTTCAATCAGCGTAACCTACGGACATCTCGATAGAATCCGAAACAGCAGAATCGGATTGCTTGAAGTCCAGATGAAGATCGTCGTCCGAGCCGAGAATCAAAATGCTAGAGCACAGGCAAGCTTCTCACATACCGATTTCGTCCTGATCTTCGACGGCATAGAAATTGCATCACTGATGGCTCACCGGCCGTTCAAAGTGAATAAGATGAGCTACCTGGATTTGCATTTCTTAGTGGAATCATCGGCTATTCCGCTCAATCCTATGCAGATGCAGCATCTAAGCTGGTCGCTGAATAGGAATTTGATGCAATTCGATCTTAAAGGAAGCTCGAGAACTCGATGGCGAGTTGGAGTTCTGGGACCGCTCAAGTTTTGGTGCCATTTGAACTGTCGCCTCAGGTTTTACCCGCGCAATGGAAGCTACATTCCCGCGCCTTGTTCTTCAAAGGATAAGTAA

Protein sequence

MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKPMDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALKIRKGRYQSLRRPSLQLLDKLAFLPRCLFSAMPKIELSNTTKVVRSSANSNVPSLLLSFRSRLQLVYEKLHWSQMPPNFLSSLKRHTRPLIWMAGIICATIAVAVIIAGIVNFIGYVTIRPTVPSISVTYGHLDRIRNSRIGLLEVQMKIVVRAENQNARAQASFSHTDFVLIFDGIEIASLMAHRPFKVNKMSYLDLHFLVESSAIPLNPMQMQHLSWSLNRNLMQFDLKGSSRTRWRVGVLGPLKFWCHLNCRLRFYPRNGSYIPAPCSSKDK
Homology
BLAST of Cp4.1LG06g05950 vs. ExPASy Swiss-Prot
Match: Q9FVX2 (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 192.6 bits (488), Expect = 1.6e-47
Identity = 119/402 (29.60%), Postives = 216/402 (53.73%), Query Frame = 0

Query: 41  DISPAAKTVCKVLVRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQ 100
           D++  AK + KVL+ +    +++AL  +G+  S E+V++VL    N      +FF+W+ +
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 101 LA--KQSAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKD 160
               + S  ++++M++   K   +  MW+ I  MR++K+L++ TF  V   Y  A +  +
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 161 AIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLE 220
           AI +F VM++YD+  ++VA N LLSA+C  +N   KA E F+  +++   D  +Y+ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKN-VRKAQEVFENMRDRFTPDSKTYSILLE 246

Query: 221 GWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGC 280
           GW KE N+ KA   F EM+   G +P+ V+ Y   +  L + G+  EA+ ++  +  + C
Sbjct: 247 GWGKEPNLPKAREVFREMID-AGCHPDIVT-YSIMVDILCKAGRVDEALGIVRSMDPSIC 306

Query: 281 SPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFR 340
            P     S  + +   +N    A+  +     +G+  ++ ++N++IG   +  ++++ +R
Sbjct: 307 KPTTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYR 366

Query: 341 LLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLF 400
           +L  M   G  PNS + N+I   LI+  +  EA   FR+M+K  C P        I M  
Sbjct: 367 VLKEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIK-VCEPDADTYTMVIKMFC 426

Query: 401 DGYDPETAIDIWNFMVDNNIKPMDASANALLIGLCDLDRLTE 441
           +  + ETA  +W +M    + P   + + L+ GLC+ +R T+
Sbjct: 427 EKKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCE-ERTTQ 463

BLAST of Cp4.1LG06g05950 vs. ExPASy Swiss-Prot
Match: Q9C9A2 (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 1.3e-41
Identity = 111/406 (27.34%), Postives = 199/406 (49.01%), Query Frame = 0

Query: 37  LDAPDISPAAKTVCKVLVRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFR 96
           + A D S  A+ +CK+L + + + VE  L+   V  SP L++EVL+   N    A+  F+
Sbjct: 57  VSANDASQDAERICKILTKFTDSKVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFK 116

Query: 97  WARQLA--KQSAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAG 156
           WA      K +  ++N +++ LGK + F  +W+ +  M+ +K+LS  TF  +   Y  A 
Sbjct: 117 WAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARAR 176

Query: 157 RFKDAIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYA 216
           + K+AI +F  M+ +  + +    N +L  +    N       F K  K++   D  SY 
Sbjct: 177 KVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYT 236

Query: 217 ILLEGWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIK 276
           ILLEGW +E N+ + +    EM K  G+ P+ V AY   +    +  +  EA++   E++
Sbjct: 237 ILLEGWGQELNLLRVDEVNREM-KDEGFEPD-VVAYGIIINAHCKAKKYEEAIRFFNEME 296

Query: 277 NNGCSPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIE 336
              C P      + ++ L ++     A+  ++ +  +G       YNA++G      ++E
Sbjct: 297 QRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRME 356

Query: 337 DSFRLLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAI 396
           D+++ +D M   G  PN+ TY++I   LI+ ++ KEA + ++ M    C PT S     +
Sbjct: 357 DAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEPTVSTYEIMV 416

Query: 397 TMLFDGYDPETAIDIWNFMVDNNIKPMDASANALLIGLCDLDRLTE 441
            M  +    + AI IW+ M    + P     ++L+  LC  ++L E
Sbjct: 417 RMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDE 457

BLAST of Cp4.1LG06g05950 vs. ExPASy Swiss-Prot
Match: Q9LZP3 (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 5.5e-32
Identity = 101/388 (26.03%), Postives = 187/388 (48.20%), Query Frame = 0

Query: 49  VCKVL--VRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWA--RQLAKQ 108
           VCKV+  + A   ++EA L    +  S +L+ EVL    +    A +FF WA  RQ    
Sbjct: 134 VCKVIDELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAH 193

Query: 109 SAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFE 168
            + ++N M+ +L K   F+ M + +  M  + +L++ TF     ++ +A   K A+  FE
Sbjct: 194 DSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFE 253

Query: 169 VMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEG 228
           +M +Y  +  V  +N LL ++        +A   F K KE+   +  +Y +LL GW +  
Sbjct: 254 LMKKYKFKIGVETINCLLDSL-GRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVR 313

Query: 229 NVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKF 288
           N+ +A   +++M+ + G  P+ + A++  L  L+R  + ++A+K+   +K+ G  P ++ 
Sbjct: 314 NLIEAARIWNDMIDQ-GLKPD-IVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRS 373

Query: 289 LSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMV 348
            +  +     Q+    AI  +D  V +GL P+  +Y  +I     + K++  + LL  M 
Sbjct: 374 YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQ 433

Query: 349 FHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPE 408
             G  P+  TYN +   +   K  + A++ + +M++NE  P+       +   F   + E
Sbjct: 434 EKGHPPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFMARNYE 493

Query: 409 TAIDIWNFMVDNNIKPMDASANALLIGL 433
               +W  M+   I P D S   L+ GL
Sbjct: 494 MGRAVWEEMIKKGICPDDNSYTVLIRGL 518

BLAST of Cp4.1LG06g05950 vs. ExPASy Swiss-Prot
Match: Q9S7R4 (Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=OTP43 PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 9.4e-32
Identity = 103/397 (25.94%), Postives = 182/397 (45.84%), Query Frame = 0

Query: 44  PAAKTVCKVLVRASPN----DVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWAR 103
           PA       L+ +SPN    D +  LS      +P LV  VL+  +N+   A++FF +  
Sbjct: 22  PADSAAIAKLILSSPNTTHQDDQFLLSTKTTPWTPNLVNSVLKRLWNHGPKALQFFHFLD 81

Query: 104 QLAKQ---SAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLP-TFVSVFGSYCSAGR 163
              ++    A S++L +D+  +  L   +W+ I  MR  ++   P TF  V   Y SAG+
Sbjct: 82  NHHREYVHDASSFDLAIDIAARLHLHPTVWSLIHRMRSLRIGPSPKTFAIVAERYASAGK 141

Query: 164 FKDAIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAI 223
              A+K F  M  +   +D+ + N++L  +C +  +  KA+E F+  + +  +D  +Y +
Sbjct: 142 PDKAVKLFLNMHEHGCFQDLASFNTILDVLC-KSKRVEKAYELFRALRGRFSVDTVTYNV 201

Query: 224 LLEGWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKN 283
           +L GW       KA     EMV+R G NP N++ Y+  L    R GQ   A +  LE+K 
Sbjct: 202 ILNGWCLIKRTPKALEVLKEMVER-GINP-NLTTYNTMLKGFFRAGQIRHAWEFFLEMKK 261

Query: 284 NGCSPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIED 343
             C   +   +  +       +   A  ++D  +  G++P++  YNA+I +L +K  +E+
Sbjct: 262 RDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGVLPSVATYNAMIQVLCKKDNVEN 321

Query: 344 SFRLLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAIT 403
           +  + + MV  G  PN  TYN++   L    +     +  + M    C P        I 
Sbjct: 322 AVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEELMQRMENEGCEPNFQTYNMMIR 381

Query: 404 MLFDGYDPETAIDIWNFMVDNNIKPMDASANALLIGL 433
              +  + E A+ ++  M   +  P   + N L+ G+
Sbjct: 382 YYSECSEVEKALGLFEKMGSGDCLPNLDTYNILISGM 415

BLAST of Cp4.1LG06g05950 vs. ExPASy Swiss-Prot
Match: Q9SSR6 (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 1.2e-31
Identity = 106/383 (27.68%), Postives = 183/383 (47.78%), Query Frame = 0

Query: 55  RASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLA--KQSAYSWNLM 114
           R   +D+E  L A     S  LV++VL+   N    A +FF WAR++     S  S++++
Sbjct: 49  RNPKDDLEHTLVAYSPRVSSNLVEQVLKRCKNLGFPAHRFFLWARRIPDFAHSLESYHIL 108

Query: 115 VDLLGKNELFDQMWNAIRTMREEKV--LSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYD 174
           V++LG ++ F  +W+ +   RE     +S   F  VF +Y  A    +A ++F  M  + 
Sbjct: 109 VEILGSSKQFALLWDFLIEAREYNYFEISSKVFWIVFRAYSRANLPSEACRAFNRMVEFG 168

Query: 175 VEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKE-KIPLDGDSYAILLEGWEKEGNVEKA 234
           ++  V  ++ LL ++C ++   + A EFF K K   I     +Y+IL+ GW +  +   A
Sbjct: 169 IKPCVDDLDQLLHSLC-DKKHVNHAQEFFGKAKGFGIVPSAKTYSILVRGWARIRDASGA 228

Query: 235 EVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNAL 294
              FDEM++R      ++ AY+A L  L + G      K+  E+ N G  P     +  +
Sbjct: 229 RKVFDEMLERN--CVVDLLAYNALLDALCKSGDVDGGYKMFQEMGNLGLKPDAYSFAIFI 288

Query: 295 DSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAF 354
            +  +  D   A  + D      LVPN+  +N II  L +  K++D++ LLD M+  GA 
Sbjct: 289 HAYCDAGDVHSAYKVLDRMKRYDLVPNVYTFNHIIKTLCKNEKVDDAYLLLDEMIQKGAN 348

Query: 355 PNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDI 414
           P++ TYN I +    + +V  A++    M + +C P        + +L      + A +I
Sbjct: 349 PDTWTYNSIMAYHCDHCEVNRATKLLSRMDRTKCLPDRHTYNMVLKLLIRIGRFDRATEI 408

Query: 415 WNFMVDNNIKPMDASANALLIGL 433
           W  M +    P  A+   ++ GL
Sbjct: 409 WEGMSERKFYPTVATYTVMIHGL 428

BLAST of Cp4.1LG06g05950 vs. NCBI nr
Match: KGN53998.2 (hypothetical protein Csa_011831 [Cucumis sativus])

HSP 1 Score: 1118 bits (2892), Expect = 0.0
Identity = 570/740 (77.03%), Postives = 636/740 (85.95%), Query Frame = 0

Query: 5   MAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPNDVEAA 64
           MAENPK+GVK P K+PPSSSP    SPR P+HLD PDISPAAKT+C+VLVR S N+V+ A
Sbjct: 1   MAENPKMGVKNPSKIPPSSSPRSRNSPRFPLHLDLPDISPAAKTICEVLVRVSRNEVDGA 60

Query: 65  LSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNELFDQ 124
           L ATG+ PSPELVQEVLRVSYN PSSAIKFFRWARQLAKQSAYSWNLM+DLLGKNELF++
Sbjct: 61  LLATGLAPSPELVQEVLRVSYNSPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 125 MWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNSLLSA 184
           MWN IRTMR+EK+LSLPTFVSVFGSYCSAGR K+A  +FEVMDRY+VEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRSKEARMTFEVMDRYEVEKDVVAVNSLLSA 180

Query: 185 ICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRVGWNP 244
           IC+EENQTS+AWEFF+KHKEKIPLDG+S+AILLEGWEKEGNVEKA+VTFDEMVKRVGWNP
Sbjct: 181 ICSEENQTSEAWEFFEKHKEKIPLDGESFAILLEGWEKEGNVEKAKVTFDEMVKRVGWNP 240

Query: 245 ENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATHAILL 304
           ENVS+YDAFLITLVR G+S +A+KVLL++K N C PGLKFLSNALDSLI QNDA HAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLKLKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 305 WDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFSCLIK 364
           WDI VG+GLVPNLI+YNAIIGLLSE  KI+DSFRLLD+MVFHGAFPNSLTYNLIFS LIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDSFRLLDSMVFHGAFPNSLTYNLIFSSLIK 360

Query: 365 NKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKPMDAS 424
           NKKVKE SQFFREMVKNECPPTPS+CAAAITMLFDGYDPETAIDIWN+M +N+I+PMD S
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSSCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDTS 420

Query: 425 ANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALKIRKGRYQSLRRPSLQLLDKLA-- 484
           ANALLIGLC+L+RLTE++  +   +       +  +K+ K  +   R    +  D L   
Sbjct: 421 ANALLIGLCNLNRLTEVRRFADDMIDQRIDILESTMKLLKNCFYQQRGNFRENYDGLLRR 480

Query: 485 ------FLPRCLFSAMPKIELSNTTKVVRSSANSNVPSLLLSF-RSRLQLVYEKLHWSQM 544
                 F  + +F  +PKIEL NTTKVV+S  +        SF  S L LVYEKLH SQM
Sbjct: 481 LVALKFFSTKRVF--LPKIELCNTTKVVKSEISCQSRYTRTSFFHSLLSLVYEKLHRSQM 540

Query: 545 PPNFLSSLKRHTRPLIWMAGIICATIAVAVIIAGIVNFIGYVTIRPTVPSISVTYGHLDR 604
           PPNFLSSLKRHT PLIW+AGIICATIA+AVIIAGIV FIGYVTIRP VPSISVT GHL+R
Sbjct: 541 PPNFLSSLKRHTHPLIWIAGIICATIALAVIIAGIVIFIGYVTIRPRVPSISVTDGHLER 600

Query: 605 IRNSRIGLLEVQMKIVVRAENQNARAQASFSHTDFVLIFDGIEIASLMAHRPFKVNKMSY 664
           IR+SR GLLEVQMKIVVRAENQNA+A A FS TDFVL+FDGIEIASL+AHRPFKVNKM+Y
Sbjct: 601 IRSSRTGLLEVQMKIVVRAENQNAKAHAGFSKTDFVLLFDGIEIASLVAHRPFKVNKMNY 660

Query: 665 LDLHFLVESSAIPLNPMQMQHLSWSLNRNLMQFDLKGSSRTRWRVGVLGPLKFWCHLNCR 724
           LDLHFLVESSAIPL+  QMQHLSWSL R+L+QFDLKGSSRTRWRVGVLGPLKFWC L+C 
Sbjct: 661 LDLHFLVESSAIPLDSTQMQHLSWSLKRDLIQFDLKGSSRTRWRVGVLGPLKFWCRLDCH 720

Query: 725 LRFYPRNGSYIPAPCSSKDK 735
           LRF+PRNGSYIP PCSSK K
Sbjct: 721 LRFFPRNGSYIPTPCSSKQK 738

BLAST of Cp4.1LG06g05950 vs. NCBI nr
Match: XP_023536455.1 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 883 bits (2282), Expect = 0.0
Identity = 447/470 (95.11%), Postives = 453/470 (96.38%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE
Sbjct: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALK-IRKGRYQS 469
           MDASANALLIGLCDLDRLTE++  +   L      F+  +K +R G YQ 
Sbjct: 421 MDASANALLIGLCDLDRLTEVRRFADDMLDQRIGIFESTMKFLRNGFYQQ 470

BLAST of Cp4.1LG06g05950 vs. NCBI nr
Match: XP_022976764.1 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 874 bits (2259), Expect = 2.05e-314
Identity = 436/442 (98.64%), Postives = 440/442 (99.55%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPL KSPRLPVHLDAPDISPAAKTVCKVLVRASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLAKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWN+MVDLLGKNE
Sbjct: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNMMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDI VG+GLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIVVGSGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIK 442
           MDASANALLIGLCDLDRLTE++
Sbjct: 421 MDASANALLIGLCDLDRLTEVR 442

BLAST of Cp4.1LG06g05950 vs. NCBI nr
Match: KAG6591855.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 874 bits (2258), Expect = 2.99e-314
Identity = 441/470 (93.83%), Postives = 450/470 (95.74%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPL KSPRLPVHLDAPDISPAAKTVCKVL+RASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLAKSPRLPVHLDAPDISPAAKTVCKVLIRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE
Sbjct: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFF+KHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFEKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDI VG+GLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIVVGSGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALK-IRKGRYQS 469
           MDASANALLIGLCDLDRLTE++  +   L       +  +K +R G YQ 
Sbjct: 421 MDASANALLIGLCDLDRLTEVRRFADDMLDQRIGILESTMKFLRNGFYQQ 470

BLAST of Cp4.1LG06g05950 vs. NCBI nr
Match: XP_022937011.1 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 872 bits (2254), Expect = 1.21e-313
Identity = 441/470 (93.83%), Postives = 450/470 (95.74%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPL KSPRLPVHLDAPDISPAAKTVCKVLVRASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLAKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPS ELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE
Sbjct: 61  VEAALSATGVVPSTELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFF+KHKEKIPLDGDSYAILLEGWEK+GNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFEKHKEKIPLDGDSYAILLEGWEKDGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDI VG+GLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIVVGSGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALK-IRKGRYQS 469
           MDASANALLIGLCDLDRLTE++  +   L      F+  +K +R G YQ 
Sbjct: 421 MDASANALLIGLCDLDRLTEVRRFADDMLDQRIGIFESTMKFLRNGFYQQ 470

BLAST of Cp4.1LG06g05950 vs. ExPASy TrEMBL
Match: A0A6J1IPL0 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111477048 PE=4 SV=1)

HSP 1 Score: 874 bits (2259), Expect = 9.91e-315
Identity = 436/442 (98.64%), Postives = 440/442 (99.55%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPL KSPRLPVHLDAPDISPAAKTVCKVLVRASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLAKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWN+MVDLLGKNE
Sbjct: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNMMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDI VG+GLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIVVGSGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIK 442
           MDASANALLIGLCDLDRLTE++
Sbjct: 421 MDASANALLIGLCDLDRLTEVR 442

BLAST of Cp4.1LG06g05950 vs. ExPASy TrEMBL
Match: A0A6J1F937 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111443435 PE=4 SV=1)

HSP 1 Score: 872 bits (2254), Expect = 5.87e-314
Identity = 441/470 (93.83%), Postives = 450/470 (95.74%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60
           MVEAMAENPKIGVKKPFKLPPSSSPPL KSPRLPVHLDAPDISPAAKTVCKVLVRASPND
Sbjct: 1   MVEAMAENPKIGVKKPFKLPPSSSPPLAKSPRLPVHLDAPDISPAAKTVCKVLVRASPND 60

Query: 61  VEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120
           VEAALSATGVVPS ELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE
Sbjct: 61  VEAALSATGVVPSTELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNE 120

Query: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180
           LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS
Sbjct: 121 LFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNS 180

Query: 181 LLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRV 240
           LLSAICTEENQTSKAWEFF+KHKEKIPLDGDSYAILLEGWEK+GNVEKAEVTFDEMVKRV
Sbjct: 181 LLSAICTEENQTSKAWEFFEKHKEKIPLDGDSYAILLEGWEKDGNVEKAEVTFDEMVKRV 240

Query: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300
           GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH
Sbjct: 241 GWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATH 300

Query: 301 AILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360
           AILLWDI VG+GLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS
Sbjct: 301 AILLWDIVVGSGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFS 360

Query: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420
           CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP
Sbjct: 361 CLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKP 420

Query: 421 MDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALK-IRKGRYQS 469
           MDASANALLIGLCDLDRLTE++  +   L      F+  +K +R G YQ 
Sbjct: 421 MDASANALLIGLCDLDRLTEVRRFADDMLDQRIGIFESTMKFLRNGFYQQ 470

BLAST of Cp4.1LG06g05950 vs. ExPASy TrEMBL
Match: A0A6J1CG53 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Momordica charantia OX=3673 GN=LOC111011123 PE=4 SV=1)

HSP 1 Score: 778 bits (2010), Expect = 7.87e-277
Identity = 388/473 (82.03%), Postives = 428/473 (90.49%), Query Frame = 0

Query: 1   MVEAMAENPKIGVKKPFKLPPSSSPPL---VKSPRLPVHLDAPDISPAAKTVCKVLVRAS 60
           M+E+MAENPKIG+  P K+PP  SPP     +SPR P+HLD PD+SPAAKTVC+VL+RAS
Sbjct: 1   MMESMAENPKIGLGNPPKIPPGFSPPSRPPAQSPRFPLHLDEPDVSPAAKTVCEVLIRAS 60

Query: 61  PNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLG 120
           P DVEAAL+ATGV PSPELVQEVLR+SYNYPSSAIKFFRWA QLA+QSAYSWNLM+DLLG
Sbjct: 61  PKDVEAALAATGVAPSPELVQEVLRLSYNYPSSAIKFFRWAGQLAQQSAYSWNLMIDLLG 120

Query: 121 KNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVA 180
           KNELFDQMWNAIRTMR+EKVLSLPTFVSVFGSYCSAGRFKDA+ SFEVMDRY+VEKDVVA
Sbjct: 121 KNELFDQMWNAIRTMRKEKVLSLPTFVSVFGSYCSAGRFKDAMMSFEVMDRYEVEKDVVA 180

Query: 181 VNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMV 240
           VNSLLSAICTEENQTSKAWEFF+K+KEKIP+DGDS+AILLEGWEKEGNVE+A+VTF EMV
Sbjct: 181 VNSLLSAICTEENQTSKAWEFFEKNKEKIPVDGDSFAILLEGWEKEGNVEEAKVTFGEMV 240

Query: 241 KRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQND 300
           KRVGWNP+NVSAYDAFLITLVR GQS EA++ LLE+K NGC PGLKFLSNALDSLI QND
Sbjct: 241 KRVGWNPQNVSAYDAFLITLVRGGQSGEAIEFLLEMKKNGCLPGLKFLSNALDSLIKQND 300

Query: 301 ATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNL 360
           A HAI+LWDI VG+GLVPNLIMYNAIIGLLSE GKI+DSFRLLDAMVFHGAFPNSLTYNL
Sbjct: 301 ADHAIILWDIVVGSGLVPNLIMYNAIIGLLSENGKIDDSFRLLDAMVFHGAFPNSLTYNL 360

Query: 361 IFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNN 420
           IFSCLIK KKV+EASQ FREMVKNECPPTPSNCAAAI+M FDGYDPETAIDIWNFMV+N+
Sbjct: 361 IFSCLIKTKKVREASQIFREMVKNECPPTPSNCAAAISMFFDGYDPETAIDIWNFMVENH 420

Query: 421 IKPMDASANALLIGLCDLDRLTEIKSNSVVFLCAPARKFQYALKIRK-GRYQS 469
           IKPMDASANALLIGLC+LDRLTE++S +   L      ++  +KI K G YQ 
Sbjct: 421 IKPMDASANALLIGLCNLDRLTEVRSFADDMLDRRIGIYESTMKILKNGFYQQ 473

BLAST of Cp4.1LG06g05950 vs. ExPASy TrEMBL
Match: A0A5A7UDH9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold113G00620 PE=4 SV=1)

HSP 1 Score: 766 bits (1977), Expect = 6.96e-272
Identity = 370/439 (84.28%), Postives = 411/439 (93.62%), Query Frame = 0

Query: 5   MAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPNDVEAA 64
           MAENPK+GV+ P K+PPSS+P    SPR P HLD PDISPAAKT+C+VL++   N+V+AA
Sbjct: 1   MAENPKMGVRNPSKIPPSSAPRSPNSPRFPSHLDLPDISPAAKTICEVLIKVPRNEVDAA 60

Query: 65  LSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNELFDQ 124
           LSATG+ PSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLM+DLLGKNELF++
Sbjct: 61  LSATGLAPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 125 MWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNSLLSA 184
           MWN IRTMR+EK+LSLPTFVSVFGSYCSAGRFK+A  SFEVMDRY+VEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRFKEATMSFEVMDRYEVEKDVVAVNSLLSA 180

Query: 185 ICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRVGWNP 244
           IC+EENQTSKAWEFF+KHKEKIPLDGDS+AILLEGWEKEGNVEKAEVTFDEMVKR+GWNP
Sbjct: 181 ICSEENQTSKAWEFFEKHKEKIPLDGDSFAILLEGWEKEGNVEKAEVTFDEMVKRIGWNP 240

Query: 245 ENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATHAILL 304
           ENVS+YDAFLITLVR G+S +A+KVLLE+K N C PGLKFLSNALDSLI QNDA HAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLELKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 305 WDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFSCLIK 364
           WDI VG+GLVPNLI+YNAIIGLLSE  KI+D+FRLLD+MVFHGAFPNS+TYNLIFSCLIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDTFRLLDSMVFHGAFPNSVTYNLIFSCLIK 360

Query: 365 NKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKPMDAS 424
           NKKVKE SQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWN+M +N+I+PMDAS
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDAS 420

Query: 425 ANALLIGLCDLDRLTEIKS 443
           ANALLIGLC+L+RLTE++S
Sbjct: 421 ANALLIGLCNLNRLTEVRS 439

BLAST of Cp4.1LG06g05950 vs. ExPASy TrEMBL
Match: A0A1S3B9W9 (pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103487631 PE=4 SV=1)

HSP 1 Score: 762 bits (1968), Expect = 5.29e-271
Identity = 369/436 (84.63%), Postives = 408/436 (93.58%), Query Frame = 0

Query: 5   MAENPKIGVKKPFKLPPSSSPPLVKSPRLPVHLDAPDISPAAKTVCKVLVRASPNDVEAA 64
           MAENPK+GV+ P K+PPSS+P    SPR P HLD PDISPAAKT+C+VL++   N+V+AA
Sbjct: 1   MAENPKMGVRNPSKIPPSSAPRSPNSPRFPSHLDLPDISPAAKTICEVLIKVPRNEVDAA 60

Query: 65  LSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMVDLLGKNELFDQ 124
           LSATG+ PSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLM+DLLGKNELF++
Sbjct: 61  LSATGLAPSPELVQEVLRVSYNYPSSAIKFFRWARQLAKQSAYSWNLMIDLLGKNELFEE 120

Query: 125 MWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFEVMDRYDVEKDVVAVNSLLSA 184
           MWN IRTMR+EK+LSLPTFVSVFGSYCSAGRFK+A  SFEVMDRY+VEKDVVAVNSLLSA
Sbjct: 121 MWNGIRTMRQEKILSLPTFVSVFGSYCSAGRFKEATMSFEVMDRYEVEKDVVAVNSLLSA 180

Query: 185 ICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEGNVEKAEVTFDEMVKRVGWNP 244
           IC+EENQTSKAWEFF+KHKEKIPLDGDS+AILLEGWEKEGNVEKAEVTFDEMVKR+GWNP
Sbjct: 181 ICSEENQTSKAWEFFEKHKEKIPLDGDSFAILLEGWEKEGNVEKAEVTFDEMVKRIGWNP 240

Query: 245 ENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKFLSNALDSLINQNDATHAILL 304
           ENVS+YDAFLITLVR G+S +A+KVLLE+K N C PGLKFLSNALDSLI QNDA HAILL
Sbjct: 241 ENVSSYDAFLITLVRGGRSEDAIKVLLELKKNRCLPGLKFLSNALDSLIQQNDANHAILL 300

Query: 305 WDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMVFHGAFPNSLTYNLIFSCLIK 364
           WDI VG+GLVPNLI+YNAIIGLLSE  KI+D+FRLLD+MVFHGAFPNS+TYNLIFSCLIK
Sbjct: 301 WDIVVGSGLVPNLIVYNAIIGLLSENSKIDDTFRLLDSMVFHGAFPNSVTYNLIFSCLIK 360

Query: 365 NKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNFMVDNNIKPMDAS 424
           NKKVKE SQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWN+M +N+I+PMDAS
Sbjct: 361 NKKVKEVSQFFREMVKNECPPTPSNCAAAITMLFDGYDPETAIDIWNYMDENHIEPMDAS 420

Query: 425 ANALLIGLCDLDRLTE 440
           ANALLIGLC+L+RLTE
Sbjct: 421 ANALLIGLCNLNRLTE 436

BLAST of Cp4.1LG06g05950 vs. TAIR 10
Match: AT1G77360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 192.6 bits (488), Expect = 1.1e-48
Identity = 119/402 (29.60%), Postives = 216/402 (53.73%), Query Frame = 0

Query: 41  DISPAAKTVCKVLVRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWARQ 100
           D++  AK + KVL+ +    +++AL  +G+  S E+V++VL    N      +FF+W+ +
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 101 LA--KQSAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKD 160
               + S  ++++M++   K   +  MW+ I  MR++K+L++ TF  V   Y  A +  +
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 161 AIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLE 220
           AI +F VM++YD+  ++VA N LLSA+C  +N   KA E F+  +++   D  +Y+ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKN-VRKAQEVFENMRDRFTPDSKTYSILLE 246

Query: 221 GWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGC 280
           GW KE N+ KA   F EM+   G +P+ V+ Y   +  L + G+  EA+ ++  +  + C
Sbjct: 247 GWGKEPNLPKAREVFREMID-AGCHPDIVT-YSIMVDILCKAGRVDEALGIVRSMDPSIC 306

Query: 281 SPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFR 340
            P     S  + +   +N    A+  +     +G+  ++ ++N++IG   +  ++++ +R
Sbjct: 307 KPTTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYR 366

Query: 341 LLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLF 400
           +L  M   G  PNS + N+I   LI+  +  EA   FR+M+K  C P        I M  
Sbjct: 367 VLKEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIK-VCEPDADTYTMVIKMFC 426

Query: 401 DGYDPETAIDIWNFMVDNNIKPMDASANALLIGLCDLDRLTE 441
           +  + ETA  +W +M    + P   + + L+ GLC+ +R T+
Sbjct: 427 EKKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCE-ERTTQ 463

BLAST of Cp4.1LG06g05950 vs. TAIR 10
Match: AT5G45320.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: inflorescence meristem, root, flower; EXPRESSED DURING: petal differentiation and expansion stage; CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G26350.1); Has 253 Blast hits to 253 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 253; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 186.4 bits (472), Expect = 8.1e-47
Identity = 95/192 (49.48%), Postives = 132/192 (68.75%), Query Frame = 0

Query: 545 RH-TRPLIWMAGIICATIAVAVIIAGIVNFIGYVTIRPTVPSISVTYGHLDRIRNSRIGL 604
           RH T P IW A IICA I++ VI+ GI+ F+GY+ I P VP ISV   HLD ++   +G+
Sbjct: 7   RHGTSPFIWCAAIICAIISIVVIVGGIIVFVGYLVIHPRVPIISVADAHLDFLKYDIVGV 66

Query: 605 LEVQMKIVVRAENQNARAQASFSHTDFVLIFDGIEIASLMAHRPFKVNKMSYLDLHFLVE 664
           L+ Q+ IV+R EN NA+A A F  T+F L ++G  IA L A   F+V K   + L +LV+
Sbjct: 67  LQTQLTIVIRVENDNAKAHALFDETEFKLSYEGKPIAILKAPE-FEVVKEKSMFLPYLVQ 126

Query: 665 SSAIPLNPMQMQHLSWSLNRNLMQFDLKGSSRTRWRVGVLGPLKFWCHLNCRLRFYPRNG 724
           S  IPLNP  MQ + +++ ++++ F+LKG SRTRWRVG LG +KF C+L+C+LRF P + 
Sbjct: 127 SYPIPLNPTMMQAVDYAVKKDVITFELKGGSRTRWRVGPLGSVKFECNLSCQLRFRPSDH 186

Query: 725 SYIPAPCSSKDK 736
           SYIP+PC+S  K
Sbjct: 187 SYIPSPCTSAHK 197

BLAST of Cp4.1LG06g05950 vs. TAIR 10
Match: AT1G71060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 172.9 bits (437), Expect = 9.3e-43
Identity = 111/406 (27.34%), Postives = 199/406 (49.01%), Query Frame = 0

Query: 37  LDAPDISPAAKTVCKVLVRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFR 96
           + A D S  A+ +CK+L + + + VE  L+   V  SP L++EVL+   N    A+  F+
Sbjct: 57  VSANDASQDAERICKILTKFTDSKVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFK 116

Query: 97  WARQLA--KQSAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAG 156
           WA      K +  ++N +++ LGK + F  +W+ +  M+ +K+LS  TF  +   Y  A 
Sbjct: 117 WAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARAR 176

Query: 157 RFKDAIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYA 216
           + K+AI +F  M+ +  + +    N +L  +    N       F K  K++   D  SY 
Sbjct: 177 KVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYT 236

Query: 217 ILLEGWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIK 276
           ILLEGW +E N+ + +    EM K  G+ P+ V AY   +    +  +  EA++   E++
Sbjct: 237 ILLEGWGQELNLLRVDEVNREM-KDEGFEPD-VVAYGIIINAHCKAKKYEEAIRFFNEME 296

Query: 277 NNGCSPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIE 336
              C P      + ++ L ++     A+  ++ +  +G       YNA++G      ++E
Sbjct: 297 QRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRME 356

Query: 337 DSFRLLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAI 396
           D+++ +D M   G  PN+ TY++I   LI+ ++ KEA + ++ M    C PT S     +
Sbjct: 357 DAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEPTVSTYEIMV 416

Query: 397 TMLFDGYDPETAIDIWNFMVDNNIKPMDASANALLIGLCDLDRLTE 441
            M  +    + AI IW+ M    + P     ++L+  LC  ++L E
Sbjct: 417 RMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDE 457

BLAST of Cp4.1LG06g05950 vs. TAIR 10
Match: AT3G62470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 141.0 bits (354), Expect = 3.9e-33
Identity = 101/388 (26.03%), Postives = 187/388 (48.20%), Query Frame = 0

Query: 49  VCKVL--VRASPNDVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWA--RQLAKQ 108
           VCKV+  + A   ++EA L    +  S +L+ EVL    +    A +FF WA  RQ    
Sbjct: 134 VCKVIDELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAH 193

Query: 109 SAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLPTFVSVFGSYCSAGRFKDAIKSFE 168
            + ++N M+ +L K   F+ M + +  M  + +L++ TF     ++ +A   K A+  FE
Sbjct: 194 DSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFE 253

Query: 169 VMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAILLEGWEKEG 228
           +M +Y  +  V  +N LL ++        +A   F K KE+   +  +Y +LL GW +  
Sbjct: 254 LMKKYKFKIGVETINCLLDSL-GRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVR 313

Query: 229 NVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKNNGCSPGLKF 288
           N+ +A   +++M+ + G  P+ + A++  L  L+R  + ++A+K+   +K+ G  P ++ 
Sbjct: 314 NLIEAARIWNDMIDQ-GLKPD-IVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRS 373

Query: 289 LSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIEDSFRLLDAMV 348
            +  +     Q+    AI  +D  V +GL P+  +Y  +I     + K++  + LL  M 
Sbjct: 374 YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQ 433

Query: 349 FHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAITMLFDGYDPE 408
             G  P+  TYN +   +   K  + A++ + +M++NE  P+       +   F   + E
Sbjct: 434 EKGHPPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFMARNYE 493

Query: 409 TAIDIWNFMVDNNIKPMDASANALLIGL 433
               +W  M+   I P D S   L+ GL
Sbjct: 494 MGRAVWEEMIKKGICPDDNSYTVLIRGL 518

BLAST of Cp4.1LG06g05950 vs. TAIR 10
Match: AT1G74900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 140.2 bits (352), Expect = 6.7e-33
Identity = 103/397 (25.94%), Postives = 182/397 (45.84%), Query Frame = 0

Query: 44  PAAKTVCKVLVRASPN----DVEAALSATGVVPSPELVQEVLRVSYNYPSSAIKFFRWAR 103
           PA       L+ +SPN    D +  LS      +P LV  VL+  +N+   A++FF +  
Sbjct: 22  PADSAAIAKLILSSPNTTHQDDQFLLSTKTTPWTPNLVNSVLKRLWNHGPKALQFFHFLD 81

Query: 104 QLAKQ---SAYSWNLMVDLLGKNELFDQMWNAIRTMREEKVLSLP-TFVSVFGSYCSAGR 163
              ++    A S++L +D+  +  L   +W+ I  MR  ++   P TF  V   Y SAG+
Sbjct: 82  NHHREYVHDASSFDLAIDIAARLHLHPTVWSLIHRMRSLRIGPSPKTFAIVAERYASAGK 141

Query: 164 FKDAIKSFEVMDRYDVEKDVVAVNSLLSAICTEENQTSKAWEFFKKHKEKIPLDGDSYAI 223
              A+K F  M  +   +D+ + N++L  +C +  +  KA+E F+  + +  +D  +Y +
Sbjct: 142 PDKAVKLFLNMHEHGCFQDLASFNTILDVLC-KSKRVEKAYELFRALRGRFSVDTVTYNV 201

Query: 224 LLEGWEKEGNVEKAEVTFDEMVKRVGWNPENVSAYDAFLITLVRRGQSAEAVKVLLEIKN 283
           +L GW       KA     EMV+R G NP N++ Y+  L    R GQ   A +  LE+K 
Sbjct: 202 ILNGWCLIKRTPKALEVLKEMVER-GINP-NLTTYNTMLKGFFRAGQIRHAWEFFLEMKK 261

Query: 284 NGCSPGLKFLSNALDSLINQNDATHAILLWDIAVGNGLVPNLIMYNAIIGLLSEKGKIED 343
             C   +   +  +       +   A  ++D  +  G++P++  YNA+I +L +K  +E+
Sbjct: 262 RDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGVLPSVATYNAMIQVLCKKDNVEN 321

Query: 344 SFRLLDAMVFHGAFPNSLTYNLIFSCLIKNKKVKEASQFFREMVKNECPPTPSNCAAAIT 403
           +  + + MV  G  PN  TYN++   L    +     +  + M    C P        I 
Sbjct: 322 AVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEELMQRMENEGCEPNFQTYNMMIR 381

Query: 404 MLFDGYDPETAIDIWNFMVDNNIKPMDASANALLIGL 433
              +  + E A+ ++  M   +  P   + N L+ G+
Sbjct: 382 YYSECSEVEKALGLFEKMGSGDCLPNLDTYNILISGM 415

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FVX21.6e-4729.60Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop... [more]
Q9C9A21.3e-4127.34Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop... [more]
Q9LZP35.5e-3226.03Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop... [more]
Q9S7R49.4e-3225.94Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidop... [more]
Q9SSR61.2e-3127.68Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
KGN53998.20.077.03hypothetical protein Csa_011831 [Cucumis sativus][more]
XP_023536455.10.095.11pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucur... [more]
XP_022976764.12.05e-31498.64pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucur... [more]
KAG6591855.12.99e-31493.83Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022937011.11.21e-31393.83pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
A0A6J1IPL09.91e-31598.64pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cuc... [more]
A0A6J1F9375.87e-31493.83pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cuc... [more]
A0A6J1CG537.87e-27782.03pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Mom... [more]
A0A5A7UDH96.96e-27284.28Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B9W95.29e-27184.63pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT1G77360.11.1e-4829.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G45320.18.1e-4749.48FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
AT1G71060.19.3e-4327.34Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G62470.13.9e-3326.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G74900.16.7e-3325.94Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 354..386
e-value: 1.6E-7
score: 29.0
coord: 212..244
e-value: 3.7E-4
score: 18.5
coord: 318..351
e-value: 0.0029
score: 15.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 107..136
e-value: 0.57
score: 10.6
coord: 212..239
e-value: 0.028
score: 14.7
coord: 142..169
e-value: 0.09
score: 13.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 315..363
e-value: 1.8E-9
score: 37.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 316..350
score: 9.898111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 246..280
score: 9.624079
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 139..173
score: 8.6266
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 209..239
score: 8.801982
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 351..385
score: 11.772493
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 189..284
e-value: 3.5E-10
score: 41.8
coord: 285..447
e-value: 3.7E-26
score: 94.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 75..188
e-value: 1.7E-16
score: 62.1
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 87..269
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 26..442
NoneNo IPR availablePANTHERPTHR47942:SF28OSJNBA0019D11.15 PROTEINcoord: 26..442

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG06g05950.1Cp4.1LG06g05950.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006886 intracellular protein transport
biological_process GO:0006898 receptor-mediated endocytosis
cellular_component GO:0030124 AP-4 adaptor complex
cellular_component GO:0005794 Golgi apparatus
molecular_function GO:0140312 cargo adaptor activity
molecular_function GO:0005515 protein binding