Cp4.1LG04g01560 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g01560
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing family protein
Location: Cp4.1LG04 : 550468 .. 552389 (+)
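The location field can be split into chromosome, start, end and strand with a short script. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Pattern for strings such as "Cp4.1LG04 : 550468 .. 552389 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and length.

    Assumption: coordinates are 1-based and inclusive, so the feature
    length is end - start + 1.
    """
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG04 : 550468 .. 552389 (+)")
print(loc["seqid"], loc["length"], loc["strand"])
```

Under the inclusive-coordinate assumption, the feature spans 1922 bases on the forward strand.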
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCGACTTCGGATCTCGGCCGTTCATCAGATTTTCCCACCCAACGCCCACAATTCCAATTCCAATTTCCTCTCCAGAAAGCATCAATTCCTCTCCCTTATTAAGCTCTGTTCTTCACCAAATCATCTATTTCAAATCCATTCTCAAATCATCGTCTCTGGCCTCCAAAATGACTCATTTCTCACCACTGAACTCCTCCGCTTTGCTGCTCTATCGCCTTCCAGAAATCTTAGCTATGCCCGCTCTCTCCTCTTCCGTTACAACCTTCATTTCTCTCCTTTTCCATGGAATTGCATCATCAGAGGATATGCCTCGAGCGATTCTCCACGAGAGGCCATTTGGGTATTTGAGGACATGCGAAGACGAGGAATCAGACCCAATAATCTCACCTTCCCCTTCCTTATCAAAGCCTGCGCCACGCTCACGACGCTCCAAGAAGGTAAGAAATTTCATGCTGATGCCATTAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTTTGATTAATTTCTATGGGTCCTGTAAAAGAATGTCTGGTGCGCGGAAGGTATTCGACGAAATGTCTGTAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCATGTGTTGAGAATTTTTGCTTTGATAAAGCTATTGAGTACTTTTTGAAAATGGGTAACCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTAAGCTTAGGAAGATGGGTTCATTCTCAAGTGGTGGAAAGGGGGATGGTTTTGAATGTTCAATTGGGCACTGCCCTCGTTGATATGTATGCAAAATCTGGCGATGTTGGATGTGCTAGACTTGTATTCAATTGTTTGAAACAGAGAAGTGTATGGACGTGGAGTGCAATGATTTTGGGGTTAGCCCAACATGGATTTGCCAATGAAGCCATTGAGCTTTTCACAAATATGATGAGCTCCTCTGTGACGCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCTTGCAGCCATGCTGGATTGGTGGATAAAGGATACCATTACTTCAACATTATGGAGAGAGTGTACGGGATTAAGCCGATGATGATACATTATGGGTCGATGGTGGATGTTTTATGTCGTGCAAGTAGAGTCAAGGAGGCTTATGAGTTCATCATGAGGATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGCGATGTAGATGGTGGGGCTCAGGTTGTGGAGGAGGCGAGGAAGAGGCTGCTTGAGCTCGAGCCGAAGAGGGGCGGGAATGTGGTGATGGTTGCAAACATGTTTGCGGAAGTTGGGATGTGGAAACAGGCAGCAGATTGCCGGAGGGCCATGAAAGATGGAGGGATGAAAAAGATGGCAGGGGAGAGTTGCGTGGAAGTTGGTGGCTCTTTGCGCAAATTCTTCTCAGGTTTTGATGGTCGGGCTGATTCTGATGGCATCTATGATTTGCTTGATGGATTGAACCTGCATATGCAAATGGTTAACTTCTAATTACTTCATTTACAAATTTTCTTTTTTTCCTTTTTCTTCAAGTTTCCGTCTTCTAAGGCTCTCAATAAAATCTCATAAGTTCAAAGATACAAAAAGAAAAGTAATGAAGCATTCATACTTGTCATGCTCGGGTGTTACTAATGATTAGTATGATGCATCATGTATTATAATTTGATGTTTTCTACTCATAGGATTTCATAATATGTGTATAATTTATATAGGTATGAGAAATAATATATTCATTTTTCTTTCTTTCTATGACTCTACAAGGATCTCCATCAATATATTGTGAGGTCGGTGTGTAAGAATTTGACGATACATATGAAATCGATGCATACCCACATGGTCTAGTGATTACAAAATCGGTAGGAGATTAG

mRNA sequence

ATGGTTCGACTTCGGATCTCGGCCGTTCATCAGATTTTCCCACCCAACGCCCACAATTCCAATTCCAATTTCCTCTCCAGAAAGCATCAATTCCTCTCCCTTATTAAGCTCTGTTCTTCACCAAATCATCTATTTCAAATCCATTCTCAAATCATCGTCTCTGGCCTCCAAAATGACTCATTTCTCACCACTGAACTCCTCCGCTTTGCTGCTCTATCGCCTTCCAGAAATCTTAGCTATGCCCGCTCTCTCCTCTTCCGTTACAACCTTCATTTCTCTCCTTTTCCATGGAATTGCATCATCAGAGGATATGCCTCGAGCGATTCTCCACGAGAGGCCATTTGGGTATTTGAGGACATGCGAAGACGAGGAATCAGACCCAATAATCTCACCTTCCCCTTCCTTATCAAAGCCTGCGCCACGCTCACGACGCTCCAAGAAGGTAAGAAATTTCATGCTGATGCCATTAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTTTGATTAATTTCTATGGGTCCTGTAAAAGAATGTCTGGTGCGCGGAAGGTATTCGACGAAATGTCTGTAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCATGTGTTGAGAATTTTTGCTTTGATAAAGCTATTGAGTACTTTTTGAAAATGGGTAACCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTAAGCTTAGGAAGATGGGTTCATTCTCAAGTGGTGGAAAGGGGGATGGTTTTGAATGTTCAATTGGGCACTGCCCTCGTTGATATGTATGCAAAATCTGGCGATGTTGGATGTGCTAGACTTGTATTCAATTGTTTGAAACAGAGAAGTGTATGGACGTGGAGTGCAATGATTTTGGGGTTAGCCCAACATGGATTTGCCAATGAAGCCATTGAGCTTTTCACAAATATGATGAGCTCCTCTGTGACGCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCTTGCAGCCATGCTGGATTGGTGGATAAAGGATACCATTACTTCAACATTATGGAGAGAGTGTACGGGATTAAGCCGATGATGATACATTATGGGTCGATGGTGGATGTTTTATGTCGTGCAAGTAGAGTCAAGGAGGCTTATGAGTTCATCATGAGGATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGCGATGTAGATGGTGGGGCTCAGGTTGTGGAGGAGGCGAGGAAGAGGCTGCTTGAGCTCGAGCCGAAGAGGGGCGGGAATGTGGTGATGGTTGCAAACATGTTTGCGGAAGTTGGGATGTGGAAACAGGCAGCAGATTGCCGGAGGGCCATGAAAGATGGAGGGATGAAAAAGATGGCAGGGGAGAGTTGCGTGGAAGTTGGTGGCTCTTTGCGCAAATTCTTCTCAGGAGATTAG
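Because the gene sequence contains an intron and the mRNA sequence does not, the intron position can be estimated by comparing the two strings. The sketch below assumes exactly one intron and an otherwise identical gene and mRNA; the function name `find_single_intron` is hypothetical. When several placements reproduce the mRNA equally well, it prefers one with the canonical GT...AG boundaries, which is a heuristic rather than evidence of the real splice site.

```python
def find_single_intron(gene, mrna):
    """Return (start, end) of a single intron as 0-based, half-open indices.

    Assumes gene == exon1 + intron + exon2 and mrna == exon1 + exon2.
    Returns None if the sequences cannot be explained by one intron.
    """
    intron_len = len(gene) - len(mrna)
    if intron_len <= 0:
        return None

    # Length of the longest common prefix of gene and mRNA.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Length of the longest common suffix of gene and mRNA.
    suffix = 0
    while suffix < len(mrna) and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    # An intron starting at i is valid if the first i bases and the
    # remaining len(mrna) - i bases both agree with the mRNA.
    lowest = max(0, len(mrna) - suffix)
    highest = min(prefix, len(mrna))
    candidates = list(range(lowest, highest + 1))
    if not candidates:
        return None

    # Prefer a placement with canonical GT...AG intron boundaries.
    for i in candidates:
        intron = gene[i:i + intron_len]
        if intron.startswith("GT") and intron.endswith("AG"):
            return (i, i + intron_len)
    return (candidates[0], candidates[0] + intron_len)


# Toy example (invented sequences, not taken from this record).
exon1, intron, exon2 = "ATGGCTCAG", "GTAAGTTTTCAG", "GAGATTAG"
toy_gene = exon1 + intron + exon2
toy_mrna = exon1 + exon2
print(find_single_intron(toy_gene, toy_mrna))
```

Applied to the gene and mRNA sequences of this record, the same function would report where the two diverge; the result should be checked against the annotated CDS coordinates before it is relied on.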

Coding sequence (CDS)

ATGGTTCGACTTCGGATCTCGGCCGTTCATCAGATTTTCCCACCCAACGCCCACAATTCCAATTCCAATTTCCTCTCCAGAAAGCATCAATTCCTCTCCCTTATTAAGCTCTGTTCTTCACCAAATCATCTATTTCAAATCCATTCTCAAATCATCGTCTCTGGCCTCCAAAATGACTCATTTCTCACCACTGAACTCCTCCGCTTTGCTGCTCTATCGCCTTCCAGAAATCTTAGCTATGCCCGCTCTCTCCTCTTCCGTTACAACCTTCATTTCTCTCCTTTTCCATGGAATTGCATCATCAGAGGATATGCCTCGAGCGATTCTCCACGAGAGGCCATTTGGGTATTTGAGGACATGCGAAGACGAGGAATCAGACCCAATAATCTCACCTTCCCCTTCCTTATCAAAGCCTGCGCCACGCTCACGACGCTCCAAGAAGGTAAGAAATTTCATGCTGATGCCATTAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTTTGATTAATTTCTATGGGTCCTGTAAAAGAATGTCTGGTGCGCGGAAGGTATTCGACGAAATGTCTGTAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCATGTGTTGAGAATTTTTGCTTTGATAAAGCTATTGAGTACTTTTTGAAAATGGGTAACCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTAAGCTTAGGAAGATGGGTTCATTCTCAAGTGGTGGAAAGGGGGATGGTTTTGAATGTTCAATTGGGCACTGCCCTCGTTGATATGTATGCAAAATCTGGCGATGTTGGATGTGCTAGACTTGTATTCAATTGTTTGAAACAGAGAAGTGTATGGACGTGGAGTGCAATGATTTTGGGGTTAGCCCAACATGGATTTGCCAATGAAGCCATTGAGCTTTTCACAAATATGATGAGCTCCTCTGTGACGCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCTTGCAGCCATGCTGGATTGGTGGATAAAGGATACCATTACTTCAACATTATGGAGAGAGTGTACGGGATTAAGCCGATGATGATACATTATGGGTCGATGGTGGATGTTTTATGTCGTGCAAGTAGAGTCAAGGAGGCTTATGAGTTCATCATGAGGATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGCGATGTAGATGGTGGGGCTCAGGTTGTGGAGGAGGCGAGGAAGAGGCTGCTTGAGCTCGAGCCGAAGAGGGGCGGGAATGTGGTGATGGTTGCAAACATGTTTGCGGAAGTTGGGATGTGGAAACAGGCAGCAGATTGCCGGAGGGCCATGAAAGATGGAGGGATGAAAAAGATGGCAGGGGAGAGTTGCGTGGAAGTTGGTGGCTCTTTGCGCAAATTCTTCTCAGGAGATTAG

Protein sequence

MVRLRISAVHQIFPPNAHNSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSGD
BLAST of Cp4.1LG04g01560 vs. Swiss-Prot
Match: PP188_ARATH (Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana GN=PCMP-E44 PE=3 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 4.4e-160
Identity = 280/471 (59.45%), Positives = 360/471 (76.43%), Query Frame = 1

Query: 19  NSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNL 78
           +S+S F SRKHQ L  +KLCSS  HL QIH QI +S LQNDSF+ +EL+R ++LS +++L
Sbjct: 4   SSDSCFKSRKHQCLIFLKLCSSIKHLLQIHGQIHLSSLQNDSFIISELVRVSSLSLAKDL 63

Query: 79  SYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKA 138
           ++AR+LL  ++   +P  WN + RGY+SSDSP E+IWV+ +M+RRGI+PN LTFPFL+KA
Sbjct: 64  AFARTLLL-HSSDSTPSTWNMLSRGYSSSDSPVESIWVYSEMKRRGIKPNKLTFPFLLKA 123

Query: 139 CATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSW 198
           CA+   L  G++   + +K G D DVYV N LI+ YG+CK+ S ARKVFDEM+ R +VSW
Sbjct: 124 CASFLGLTAGRQIQVEVLKHGFDFDVYVGNNLIHLYGTCKKTSDARKVFDEMTERNVVSW 183

Query: 199 NAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVE 258
           N+++TA VEN   +   E F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ 
Sbjct: 184 NSIMTALVENGKLNLVFECFCEMIGKRFCPDETTMVVLLSACG--GNLSLGKLVHSQVMV 243

Query: 259 RGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL 318
           R + LN +LGTALVDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++L
Sbjct: 244 RELELNCRLGTALVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQL 303

Query: 319 FTNMMS-SSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVL 378
           F+ MM  SSV PNYVTF+GVLCACSH GLVD GY YF+ ME+++ IKPMMIHYG+MVD+L
Sbjct: 304 FSKMMKESSVRPNYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDIL 363

Query: 379 CRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGG 438
            RA R+ EAY+FI +MP EPD +VWRTLLSACS    +    + E+ +KRL+ELEPKR G
Sbjct: 364 GRAGRLNEAYDFIKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRSG 423

Query: 439 NVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
           N+V+VAN FAE  MW +AA+ RR MK+  MKK+AGESC+E+GGS  +FFSG
Sbjct: 424 NLVIVANRFAEARMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSG 471
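The identity and positives percentages in each HSP summary are simply the counts divided by the alignment length. A minimal sketch for recomputing them from a summary line is shown below; the function name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of these reports use.

```python
import re

# Matches e.g. "Identity = 280/471 (59.45%), Positives = 360/471 (76.43%)".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


summary = "Identity = 280/471 (59.45%), Positives = 360/471 (76.43%), Query Frame = 1"
print(hsp_percentages(summary))
```

For the PP188_ARATH hit this reproduces the reported 59.45% identity and 76.43% positives over 471 aligned positions.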

BLAST of Cp4.1LG04g01560 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 334.0 bits (855), Expect = 2.8e-90
Identity = 175/455 (38.46%), Positives = 280/455 (61.54%), Query Frame = 1

Query: 39  SSPNHLFQIHSQIIVSGLQ-NDSFLTTELLRFAALSPSRN-LSYARSLLFRYNLHFSPFP 98
           SS   L QIH+  I  G+  +D+ L   L+ +    PS   +SYA  +  +     + F 
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 99  WNCIIRGYASSDSPREAIWVFEDMRRRG-IRPNNLTFPFLIKACATLTTLQEGKKFHADA 158
           WN +IRGYA   +   A  ++ +MR  G + P+  T+PFLIKA  T+  ++ G+  H+  
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 159 IKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAI 218
           I+ G    +YV+N+L++ Y +C  ++ A KVFD+M  + LV+WN+VI    EN   ++A+
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 219 EYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMY 278
             + +M + G +PD  T+V +LSACA++G L+LG+ VH  +++ G+  N+     L+D+Y
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 279 AKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVTPNYVTF 338
           A+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAIELF  M S+  + P  +TF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 339 IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMP 398
           +G+L ACSH G+V +G+ YF  M   Y I+P + H+G MVD+L RA +VK+AYE+I  MP
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 399 VEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQ 458
           ++P+ ++WRTLL AC+   V G + + E AR ++L+LEP   G+ V+++NM+A    W  
Sbjct: 388 MQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

Query: 459 AADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
               R+ M   G+KK+ G S VEVG  + +F  GD
Sbjct: 448 VQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGD 479

BLAST of Cp4.1LG04g01560 vs. Swiss-Prot
Match: PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 331.3 bits (848), Expect = 1.8e-89
Identity = 175/457 (38.29%), Positives = 274/457 (59.96%), Query Frame = 1

Query: 34  LIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRN-LSYARSLLFRYNLHF 93
           LI  C+S   L QI +  I S +++ SF+  +L+ F   SP+ + +SYAR L F      
Sbjct: 35  LISKCNSLRELMQIQAYAIKSHIEDVSFVA-KLINFCTESPTESSMSYARHL-FEAMSEP 94

Query: 94  SPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFH 153
               +N + RGY+   +P E   +F ++   GI P+N TFP L+KACA    L+EG++ H
Sbjct: 95  DIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLH 154

Query: 154 ADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFD 213
             ++K GLD +VYV  TLIN Y  C+ +  AR VFD +    +V +NA+IT        +
Sbjct: 155 CLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPN 214

Query: 214 KAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALV 273
           +A+  F +M     +P+E T++ +LS+CA LG+L LG+W+H    +      V++ TAL+
Sbjct: 215 EALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALI 274

Query: 274 DMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVTPNYV 333
           DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ +F  M S +V P+ +
Sbjct: 275 DMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEI 334

Query: 334 TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMR 393
           TF+G+L ACSH G V++G  YF+ M   +GI P + HYGSMVD+L RA  +++AYEFI +
Sbjct: 335 TFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDK 394

Query: 394 MPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMW 453
           +P+ P P++WR LL+ACS+ +      + E+  +R+ EL+   GG+ V+++N++A    W
Sbjct: 395 LPISPTPMLWRILLAACSSHN---NLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKW 454

Query: 454 KQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
           +     R+ MKD    K+ G S +EV   + +FFSGD
Sbjct: 455 EYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGD 486

BLAST of Cp4.1LG04g01560 vs. Swiss-Prot
Match: PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=PCMP-H51 PE=2 SV=2)

HSP 1 Score: 318.9 bits (816), Expect = 9.5e-86
Identity = 186/501 (37.13%), Positives = 287/501 (57.29%), Query Frame = 1

Query: 9   VHQIFP--PNAHNSNSNFLSRKHQ-FLSLIKLCSSPNHLFQIHSQIIVSGLQNDS---FL 68
           VH + P  P A + +++     HQ   SL + CS  + L Q+H+  + +    +    FL
Sbjct: 26  VHPLSPHIPPASSPSASTAGNHHQRIFSLAETCSDMSQLKQLHAFTLRTTYPEEPATLFL 85

Query: 69  TTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPRE-AIWVFEDMR 128
             ++L+ +  S   +++YA  +      H S F WN +IR  A   S +E A  ++  M 
Sbjct: 86  YGKILQLS--SSFSDVNYAFRVFDSIENH-SSFMWNTLIRACAHDVSRKEEAFMLYRKML 145

Query: 129 RRG-IRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRM 188
            RG   P+  TFPF++KACA +    EGK+ H   +K G   DVYV N LI+ YGSC  +
Sbjct: 146 ERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCL 205

Query: 189 SGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSAC 248
             ARKVFDEM  R+LVSWN++I A V    +D A++ F +M    FEPD  TM  +LSAC
Sbjct: 206 DLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLFREM-QRSFEPDGYTMQSVLSAC 265

Query: 249 AELGNLSLGRWVHSQVVER---GMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVW 308
           A LG+LSLG W H+ ++ +    + ++V +  +L++MY K G +  A  VF  +++R + 
Sbjct: 266 AGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLA 325

Query: 309 TWSAMILGLAQHGFANEAIELFTNMMSS--SVTPNYVTFIGVLCACSHAGLVDKGYHYFN 368
           +W+AMILG A HG A EA+  F  M+    +V PN VTF+G+L AC+H G V+KG  YF+
Sbjct: 326 SWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFD 385

Query: 369 IMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVD 428
           +M R Y I+P + HYG +VD++ RA  + EA + +M MP++PD ++WR+LL AC  +   
Sbjct: 386 MMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKK--G 445

Query: 429 GGAQVVEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDGGMK 488
              ++ EE  + ++    + E   G   G  V+++ ++A    W      R+ M + G++
Sbjct: 446 ASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIR 505

Query: 489 KMAGESCVEVGGSLRKFFSGD 490
           K  G S +E+ G   +FF+GD
Sbjct: 506 KEPGCSSIEINGISHEFFAGD 520

BLAST of Cp4.1LG04g01560 vs. Swiss-Prot
Match: PP182_ARATH (Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana GN=PCMP-H6 PE=3 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 8.9e-84
Identity = 169/451 (37.47%), Positives = 256/451 (56.76%), Query Frame = 1

Query: 44  LFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRG 103
           L Q+H+ +IV+G      L T+L+  A    +R ++Y   L     L    F +N +I+ 
Sbjct: 25  LQQVHAHLIVTGYGRSRSLLTKLITLAC--SARAIAYTHLLFLSVPLP-DDFLFNSVIKS 84

Query: 104 YASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLD 163
            +    P   +  +  M    + P+N TF  +IK+CA L+ L+ GK  H  A+  G  LD
Sbjct: 85  TSKLRLPLHCVAYYRRMLSSNVSPSNYTFTSVIKSCADLSALRIGKGVHCHAVVSGFGLD 144

Query: 164 VYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGN 223
            YV+  L+ FY  C  M GAR+VFD M  +++V+WN++++   +N   D+AI+ F +M  
Sbjct: 145 TYVQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFEQNGLADEAIQVFYQMRE 204

Query: 224 HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGC 283
            GFEPD  T V +LSACA+ G +SLG WVH  ++  G+ LNV+LGTAL+++Y++ GDVG 
Sbjct: 205 SGFEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVKLGTALINLYSRCGDVGK 264

Query: 284 ARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVTPNYVTFIGVLCACS 343
           AR VF+ +K+ +V  W+AMI     HG+  +A+ELF  M       PN VTF+ VL AC+
Sbjct: 265 AREVFDKMKETNVAAWTAMISAYGTHGYGQQAVELFNKMEDDCGPIPNNVTFVAVLSACA 324

Query: 344 HAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPV---EPDP 403
           HAGLV++G   +  M + Y + P + H+  MVD+L RA  + EAY+FI ++        P
Sbjct: 325 HAGLVEEGRSVYKRMTKSYRLIPGVEHHVCMVDMLGRAGFLDEAYKFIHQLDATGKATAP 384

Query: 404 IVWRTLLSACSA-RDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADC 463
            +W  +L AC   R+ D G ++     KRL+ LEP   G+ VM++N++A  G   + +  
Sbjct: 385 ALWTAMLGACKMHRNYDLGVEIA----KRLIALEPDNPGHHVMLSNIYALSGKTDEVSHI 444

Query: 464 RRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
           R  M    ++K  G S +EV      F  GD
Sbjct: 445 RDGMMRNNLRKQVGYSVIEVENKTYMFSMGD 468

BLAST of Cp4.1LG04g01560 vs. TrEMBL
Match: A0A0A0K153_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G007910 PE=4 SV=1)

HSP 1 Score: 847.4 bits (2188), Expect = 8.5e-243
Identity = 416/490 (84.90%), Positives = 444/490 (90.61%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNAHNSNSN--FLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQN 60
           MVRL ISAVHQ FP N HN +S   FLS KHQ LSL+  CSS NHLF+IH+QI+VSGLQN
Sbjct: 1   MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLNHCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFE 120
           DSF TTELLR AALSPSRNLSY  SLLF  + H +  PWN IIRGY+SSDSP+EAI +F 
Sbjct: 61  DSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFG 120

Query: 121 DMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180
           +MRRRG+RPNNLTFPFL+KACATL TLQEGK+FHA AIKCGLDLDVYVRNTLI FYGSCK
Sbjct: 121 EMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLIYFYGSCK 180

Query: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILS 240
           RMSGARKVFDEM+ RTLVSWNAVITACVENFCFD+AI+YFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360
           WSAMILGLAQHGFANEAIELFTNMMSS + PN+VTFIGVLCACSHAGLVDK YHYFN+ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLME 360

Query: 361 RVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGSMVDVL RA +VKEAYE IM MPVEPDPIVWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGA 420

Query: 421 QVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEV 480
           +V EEARKRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RR MKD G+KKMAGESC+E+
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

Query: 481 GGSLRKFFSG 489
           GGSLRKFFSG
Sbjct: 481 GGSLRKFFSG 490

BLAST of Cp4.1LG04g01560 vs. TrEMBL
Match: M5WZ10_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004522mg PE=4 SV=1)

HSP 1 Score: 646.7 bits (1667), Expect = 2.2e-182
Identity = 315/488 (64.55%), Positives = 387/488 (79.30%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNAHNSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDS 60
           MVRL I        P A+N NSNF S+K Q L L+ LC +   L Q+H+QI VSG Q D 
Sbjct: 1   MVRLPI--------PTANNCNSNFGSKKQQCLYLLNLCFTFKQLSQVHAQIQVSGFQRDH 60

Query: 61  FLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDM 120
           FL T+L+RF ALSPS+N +YAR+LL  ++    P  WN +IRGYASSD+PREAIW F  M
Sbjct: 61  FLLTQLIRFCALSPSKNFNYARTLL-DHSESSPPSSWNFLIRGYASSDTPREAIWAFRAM 120

Query: 121 RRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRM 180
             RGIRPN LTFPFLIK+CA+   L+EG++ H   +KCGLD DVYV+N L++FYG+CK++
Sbjct: 121 LGRGIRPNQLTFPFLIKSCASAAALKEGRQVHVGVVKCGLDCDVYVQNNLVHFYGACKKI 180

Query: 181 SGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSAC 240
             A++VFD MSVRT+VSWNAV+TACVENF  D+ I YF+KM + GFEPDETTMVV+L+A 
Sbjct: 181 KDAQRVFDGMSVRTVVSWNAVLTACVENFWLDEGIGYFVKMRDCGFEPDETTMVVMLNAS 240

Query: 241 AELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWS 300
           +ELGNLSLG+WVHSQV+E+G++LN QLGTALVDMYAKSG +  ARLVF+ ++ R+VWTWS
Sbjct: 241 SELGNLSLGKWVHSQVIEKGLILNCQLGTALVDMYAKSGALVYARLVFDRMELRNVWTWS 300

Query: 301 AMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERV 360
           AMILGLAQHGFA EA+ELF  M++ SV PNYVTF+GVLCACSHAG VD GY YF+ ME V
Sbjct: 301 AMILGLAQHGFAKEALELFPKMLNFSVRPNYVTFLGVLCACSHAGQVDDGYQYFHDMEHV 360

Query: 361 YGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQV 420
           +GIKPMMIHYG+MVD+L RA R+ EAY FIM MP +PDPIVWRTLLSAC+ RD +    V
Sbjct: 361 HGIKPMMIHYGAMVDILGRAGRLNEAYSFIMSMPFDPDPIVWRTLLSACNTRDANDDEGV 420

Query: 421 VEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGG 480
             +  ++LLELEP RGGN+VMVANM+AEVGMW++AA+ R+ MK+  +KK AGESCVE+GG
Sbjct: 421 GNKVSEKLLELEPSRGGNLVMVANMYAEVGMWEKAANLRKVMKERRVKKTAGESCVELGG 479

Query: 481 SLRKFFSG 489
           S+ KFFSG
Sbjct: 481 SIHKFFSG 479

BLAST of Cp4.1LG04g01560 vs. TrEMBL
Match: B9T0U0_RICCO (Cell division protein ftsH, putative OS=Ricinus communis GN=RCOM_0340700 PE=3 SV=1)

HSP 1 Score: 634.8 bits (1636), Expect = 8.7e-179
Identity = 301/458 (65.72%), Positives = 373/458 (81.44%), Query Frame = 1

Query: 32   LSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSYARSLLFRYNLH 91
            LSL+KLCSS  HL+QIHSQI VSGLQ D+FL T+L++F++LSPS++LSYA+S+L  +++H
Sbjct: 668  LSLLKLCSSIKHLYQIHSQIQVSGLQGDTFLVTQLIKFSSLSPSKDLSYAQSIL-DHSVH 727

Query: 92   FSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKF 151
              P PWN +IRGYA S++P++A++V+ +MR  GIRPN+LTFPFL+KACA     +EGK+ 
Sbjct: 728  PVPLPWNILIRGYADSNTPKDALFVYRNMRNEGIRPNSLTFPFLLKACAACFATKEGKQV 787

Query: 152  HADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCF 211
            H + IK GLD DVYV N L+NFYGSCK++  A KVFDEM  RT+VSWNAVIT+CVE+   
Sbjct: 788  HVEVIKYGLDCDVYVNNNLVNFYGSCKKILDACKVFDEMPERTVVSWNAVITSCVESLKL 847

Query: 212  DKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTAL 271
             +AI YFLKM + GFEPD TTMV++L  CAE+GNL LGRW+HSQV+ERG+VLN QLGTAL
Sbjct: 848  GEAIRYFLKMRDFGFEPDGTTMVLMLVICAEMGNLGLGRWIHSQVIERGLVLNYQLGTAL 907

Query: 272  VDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSS-VTPN 331
            VDMYAKSG VG A+LVF+ +K+++VWTWSAMILGLAQHGFA E +ELF +MM SS + PN
Sbjct: 908  VDMYAKSGAVGYAKLVFDRMKEKNVWTWSAMILGLAQHGFAKEGLELFLDMMRSSLIHPN 967

Query: 332  YVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFI 391
            YVTF+GVLCACSHAGLV  G+ YF+ M   YGIKPMM+HYG+MVD+L RA  +KEAY FI
Sbjct: 968  YVTFLGVLCACSHAGLVSDGFRYFHEMGHTYGIKPMMVHYGAMVDILGRAGLLKEAYNFI 1027

Query: 392  MRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVG 451
             +MP +PDPIVWRTLLSACS  DV     V  + RKRLLELEP+R GN VMVANM+A+ G
Sbjct: 1028 TKMPFQPDPIVWRTLLSACSIHDVKDSTGVAYKVRKRLLELEPRRSGNFVMVANMYADAG 1087

Query: 452  MWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
            MW++AA  RR M+DGG+KK AGESCVE+ GS+ +FFSG
Sbjct: 1088 MWEKAAKVRRVMRDGGLKKKAGESCVELSGSIHRFFSG 1124

BLAST of Cp4.1LG04g01560 vs. TrEMBL
Match: A0A0S3RS08_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G050100 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 1.7e-174
Identity = 301/470 (64.04%), Positives = 369/470 (78.51%), Query Frame = 1

Query: 22  SNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSYA 81
           + FLS+KHQ L L+ LC S   L QI +QI +SGL  D+   +EL+ F +LSPS+NL +A
Sbjct: 9   TQFLSKKHQCLFLLNLCGSMEQLHQIQAQIHLSGLYQDTHTLSELVYFCSLSPSKNLRHA 68

Query: 82  RSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACAT 141
           R+L+  +    SP  WN +IRGYA+SDSP EA WVF+ MR RG  PN LTFPFLIK+CA 
Sbjct: 69  RALV-HHAATPSPISWNILIRGYAASDSPLEAFWVFQKMRERGAMPNKLTFPFLIKSCAA 128

Query: 142 LTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAV 201
            T L EGK+ HADA KCGLD DVYV N LINFYG CKR+  ARKVFDEM  RT+VSWN+V
Sbjct: 129 ATALGEGKQVHADAFKCGLDSDVYVGNNLINFYGCCKRIVDARKVFDEMPERTVVSWNSV 188

Query: 202 ITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGM 261
           ITACVE+   D+ IEYF +M   GFEPDET+MV++LSACAELG LSLGRW HSQ+V RGM
Sbjct: 189 ITACVESLWLDEGIEYFFRMWGCGFEPDETSMVLLLSACAELGYLSLGRWAHSQLVLRGM 248

Query: 262 VLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTN 321
           VL+VQLGTALVDMY KSG +G AR VF  +++R+VWTWSAMILGLAQHGFA EA+ LF  
Sbjct: 249 VLSVQLGTALVDMYGKSGALGYARFVFERMEKRNVWTWSAMILGLAQHGFAEEALALFAM 308

Query: 322 MM---SSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLC 381
           M    +  + PNYVT++GVLCACSHAG+VD+G  YF+ ME V+GIKP+M+HYG MVDVL 
Sbjct: 309 MSINNNHDICPNYVTYLGVLCACSHAGMVDEGCQYFHDMECVHGIKPLMMHYGVMVDVLG 368

Query: 382 RASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGN 441
           RA R++EAY FI  MP+EPDP+VWRTLLSAC+  DV   A + E  RKRLL +EP+RGGN
Sbjct: 369 RAGRLEEAYWFIQMMPIEPDPVVWRTLLSACAIHDVHDHAGIGERVRKRLLRMEPRRGGN 428

Query: 442 VVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
           +V+VANM+AEVGMW++A + RR M++GGMKK+AGESCV++GGS+ +FF+G
Sbjct: 429 LVIVANMYAEVGMWEKATNVRRVMRNGGMKKLAGESCVDLGGSMHRFFAG 477

BLAST of Cp4.1LG04g01560 vs. TrEMBL
Match: A0A061EGE7_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_047037 PE=4 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 1.2e-172
Identity = 294/469 (62.69%), Positives = 376/469 (80.17%), Query Frame = 1

Query: 21  NSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSY 80
           N NFLSRK+QFL  +KLCSS  HL Q+H+QI++S L  DSFL TEL+RF++LSP +NLSY
Sbjct: 3   NQNFLSRKNQFLVFLKLCSSIKHLSQVHAQILISNLHQDSFLLTELVRFSSLSPYKNLSY 62

Query: 81  ARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACA 140
             +LL   +L+ +P  WN +IRGYASSD+P++AIWV ++MR+RG++ N LT+PF++KACA
Sbjct: 63  THTLLVN-SLNSTPSTWNILIRGYASSDTPQKAIWVLKEMRKRGLQRNKLTYPFVLKACA 122

Query: 141 TLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNA 200
               L EG++ H +  K GLD DVYV N L++FYG CK++  A++VFD M  RT+VSWNA
Sbjct: 123 RGEALAEGRQVHGEIFKHGLDDDVYVENNLVHFYGCCKKIIDAKQVFDGMGERTVVSWNA 182

Query: 201 VITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERG 260
           V++ACVENFC + AI YF KM N G   DETT+V++LSACAELG+LS GR +H QVVERG
Sbjct: 183 VLSACVENFCIEDAIGYFDKMRNCGL--DETTIVIMLSACAELGSLSFGRLLHLQVVERG 242

Query: 261 MVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFT 320
           ++LN QLGTALVDMYAKSG VG A  VF+ +++++VWTWSAMILG AQHGFA EA+E+F 
Sbjct: 243 LILNCQLGTALVDMYAKSGYVGYASRVFDRMEEKNVWTWSAMILGFAQHGFAKEALEIFV 302

Query: 321 NMMSSS-VTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCR 380
            MM SS + PNYVT++GVLCACSH+GLVD GY YF+ ME V+GIKPMM+HYG+MVD L R
Sbjct: 303 KMMKSSCIRPNYVTYLGVLCACSHSGLVDDGYRYFHEMEYVHGIKPMMVHYGAMVDALGR 362

Query: 381 ASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNV 440
           A R+K+AY FIM MP+EPDPI+WRTLLSAC+  +V+    V +  RKRLLELEP+R GN+
Sbjct: 363 AGRLKDAYTFIMNMPIEPDPILWRTLLSACTIHNVNDTDGVSDRVRKRLLELEPRRSGNL 422

Query: 441 VMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
           VMVANM+AE GMW +AA+ R+ M+DG +KKMAGESC+E+ GS+ +FFSG
Sbjct: 423 VMVANMYAEAGMWDRAANVRKVMRDGRLKKMAGESCLELNGSIYQFFSG 468

BLAST of Cp4.1LG04g01560 vs. TAIR10
Match: AT2G36730.1 (AT2G36730.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 565.8 bits (1457), Expect = 2.5e-161
Identity = 280/471 (59.45%), Positives = 360/471 (76.43%), Query Frame = 1

Query: 19  NSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNL 78
           +S+S F SRKHQ L  +KLCSS  HL QIH QI +S LQNDSF+ +EL+R ++LS +++L
Sbjct: 4   SSDSCFKSRKHQCLIFLKLCSSIKHLLQIHGQIHLSSLQNDSFIISELVRVSSLSLAKDL 63

Query: 79  SYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKA 138
           ++AR+LL  ++   +P  WN + RGY+SSDSP E+IWV+ +M+RRGI+PN LTFPFL+KA
Sbjct: 64  AFARTLLL-HSSDSTPSTWNMLSRGYSSSDSPVESIWVYSEMKRRGIKPNKLTFPFLLKA 123

Query: 139 CATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSW 198
           CA+   L  G++   + +K G D DVYV N LI+ YG+CK+ S ARKVFDEM+ R +VSW
Sbjct: 124 CASFLGLTAGRQIQVEVLKHGFDFDVYVGNNLIHLYGTCKKTSDARKVFDEMTERNVVSW 183

Query: 199 NAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVE 258
           N+++TA VEN   +   E F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ 
Sbjct: 184 NSIMTALVENGKLNLVFECFCEMIGKRFCPDETTMVVLLSACG--GNLSLGKLVHSQVMV 243

Query: 259 RGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL 318
           R + LN +LGTALVDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++L
Sbjct: 244 RELELNCRLGTALVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQL 303

Query: 319 FTNMMS-SSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVL 378
           F+ MM  SSV PNYVTF+GVLCACSH GLVD GY YF+ ME+++ IKPMMIHYG+MVD+L
Sbjct: 304 FSKMMKESSVRPNYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDIL 363

Query: 379 CRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGG 438
            RA R+ EAY+FI +MP EPD +VWRTLLSACS    +    + E+ +KRL+ELEPKR G
Sbjct: 364 GRAGRLNEAYDFIKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRSG 423

Query: 439 NVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
           N+V+VAN FAE  MW +AA+ RR MK+  MKK+AGESC+E+GGS  +FFSG
Sbjct: 424 NLVIVANRFAEARMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSG 471

BLAST of Cp4.1LG04g01560 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 334.0 bits (855), Expect = 1.6e-91
Identity = 175/455 (38.46%), Positives = 280/455 (61.54%), Query Frame = 1

Query: 39  SSPNHLFQIHSQIIVSGLQ-NDSFLTTELLRFAALSPSRN-LSYARSLLFRYNLHFSPFP 98
           SS   L QIH+  I  G+  +D+ L   L+ +    PS   +SYA  +  +     + F 
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 99  WNCIIRGYASSDSPREAIWVFEDMRRRG-IRPNNLTFPFLIKACATLTTLQEGKKFHADA 158
           WN +IRGYA   +   A  ++ +MR  G + P+  T+PFLIKA  T+  ++ G+  H+  
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 159 IKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAI 218
           I+ G    +YV+N+L++ Y +C  ++ A KVFD+M  + LV+WN+VI    EN   ++A+
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 219 EYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMY 278
             + +M + G +PD  T+V +LSACA++G L+LG+ VH  +++ G+  N+     L+D+Y
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 279 AKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVTPNYVTF 338
           A+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAIELF  M S+  + P  +TF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 339 IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMP 398
           +G+L ACSH G+V +G+ YF  M   Y I+P + H+G MVD+L RA +VK+AYE+I  MP
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 399 VEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQ 458
           ++P+ ++WRTLL AC+   V G + + E AR ++L+LEP   G+ V+++NM+A    W  
Sbjct: 388 MQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

Query: 459 AADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
               R+ M   G+KK+ G S VEVG  + +F  GD
Sbjct: 448 VQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGD 479

BLAST of Cp4.1LG04g01560 vs. TAIR10
Match: AT2G02980.1 (AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 331.3 bits (848), Expect = 1.0e-90
Identity = 175/457 (38.29%), Positives = 274/457 (59.96%), Query Frame = 1

Query: 34  LIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRN-LSYARSLLFRYNLHF 93
           LI  C+S   L QI +  I S +++ SF+  +L+ F   SP+ + +SYAR L F      
Sbjct: 35  LISKCNSLRELMQIQAYAIKSHIEDVSFVA-KLINFCTESPTESSMSYARHL-FEAMSEP 94

Query: 94  SPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFH 153
               +N + RGY+   +P E   +F ++   GI P+N TFP L+KACA    L+EG++ H
Sbjct: 95  DIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLH 154

Query: 154 ADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFD 213
             ++K GLD +VYV  TLIN Y  C+ +  AR VFD +    +V +NA+IT        +
Sbjct: 155 CLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPN 214

Query: 214 KAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALV 273
           +A+  F +M     +P+E T++ +LS+CA LG+L LG+W+H    +      V++ TAL+
Sbjct: 215 EALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALI 274

Query: 274 DMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVTPNYV 333
           DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ +F  M S +V P+ +
Sbjct: 275 DMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEI 334

Query: 334 TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMR 393
           TF+G+L ACSH G V++G  YF+ M   +GI P + HYGSMVD+L RA  +++AYEFI +
Sbjct: 335 TFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDK 394

Query: 394 MPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMW 453
           +P+ P P++WR LL+ACS+ +      + E+  +R+ EL+   GG+ V+++N++A    W
Sbjct: 395 LPISPTPMLWRILLAACSSHN---NLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKW 454

Query: 454 KQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
           +     R+ MKD    K+ G S +EV   + +FFSGD
Sbjct: 455 EYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGD 486

BLAST of Cp4.1LG04g01560 vs. TAIR10
Match: AT1G59720.1 (AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 318.9 bits (816), Expect = 5.3e-87
Identity = 186/501 (37.13%), Positives = 287/501 (57.29%), Query Frame = 1

Query: 9   VHQIFP--PNAHNSNSNFLSRKHQ-FLSLIKLCSSPNHLFQIHSQIIVSGLQNDS---FL 68
           VH + P  P A + +++     HQ   SL + CS  + L Q+H+  + +    +    FL
Sbjct: 26  VHPLSPHIPPASSPSASTAGNHHQRIFSLAETCSDMSQLKQLHAFTLRTTYPEEPATLFL 85

Query: 69  TTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPRE-AIWVFEDMR 128
             ++L+ +  S   +++YA  +      H S F WN +IR  A   S +E A  ++  M 
Sbjct: 86  YGKILQLS--SSFSDVNYAFRVFDSIENH-SSFMWNTLIRACAHDVSRKEEAFMLYRKML 145

Query: 129 RRG-IRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRM 188
            RG   P+  TFPF++KACA +    EGK+ H   +K G   DVYV N LI+ YGSC  +
Sbjct: 146 ERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCL 205

Query: 189 SGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSAC 248
             ARKVFDEM  R+LVSWN++I A V    +D A++ F +M    FEPD  TM  +LSAC
Sbjct: 206 DLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLFREM-QRSFEPDGYTMQSVLSAC 265

Query: 249 AELGNLSLGRWVHSQVVER---GMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVW 308
           A LG+LSLG W H+ ++ +    + ++V +  +L++MY K G +  A  VF  +++R + 
Sbjct: 266 AGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLA 325

Query: 309 TWSAMILGLAQHGFANEAIELFTNMMSS--SVTPNYVTFIGVLCACSHAGLVDKGYHYFN 368
           +W+AMILG A HG A EA+  F  M+    +V PN VTF+G+L AC+H G V+KG  YF+
Sbjct: 326 SWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFD 385

Query: 369 IMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVD 428
           +M R Y I+P + HYG +VD++ RA  + EA + +M MP++PD ++WR+LL AC  +   
Sbjct: 386 MMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKK--G 445

Query: 429 GGAQVVEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDGGMK 488
              ++ EE  + ++    + E   G   G  V+++ ++A    W      R+ M + G++
Sbjct: 446 ASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIR 505

Query: 489 KMAGESCVEVGGSLRKFFSGD 490
           K  G S +E+ G   +FF+GD
Sbjct: 506 KEPGCSSIEINGISHEFFAGD 520

BLAST of Cp4.1LG04g01560 vs. TAIR10
Match: AT2G33760.1 (AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 312.4 bits (799), Expect = 5.0e-85
Identity = 169/451 (37.47%), Positives = 256/451 (56.76%), Query Frame = 1

Query: 44  LFQIHSQIIVSGLQNDSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRG 103
           L Q+H+ +IV+G      L T+L+  A    +R ++Y   L     L    F +N +I+ 
Sbjct: 25  LQQVHAHLIVTGYGRSRSLLTKLITLAC--SARAIAYTHLLFLSVPLP-DDFLFNSVIKS 84

Query: 104 YASSDSPREAIWVFEDMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLD 163
            +    P   +  +  M    + P+N TF  +IK+CA L+ L+ GK  H  A+  G  LD
Sbjct: 85  TSKLRLPLHCVAYYRRMLSSNVSPSNYTFTSVIKSCADLSALRIGKGVHCHAVVSGFGLD 144

Query: 164 VYVRNTLINFYGSCKRMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGN 223
            YV+  L+ FY  C  M GAR+VFD M  +++V+WN++++   +N   D+AI+ F +M  
Sbjct: 145 TYVQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFEQNGLADEAIQVFYQMRE 204

Query: 224 HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGC 283
            GFEPD  T V +LSACA+ G +SLG WVH  ++  G+ LNV+LGTAL+++Y++ GDVG 
Sbjct: 205 SGFEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVKLGTALINLYSRCGDVGK 264

Query: 284 ARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVTPNYVTFIGVLCACS 343
           AR VF+ +K+ +V  W+AMI     HG+  +A+ELF  M       PN VTF+ VL AC+
Sbjct: 265 AREVFDKMKETNVAAWTAMISAYGTHGYGQQAVELFNKMEDDCGPIPNNVTFVAVLSACA 324

Query: 344 HAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPV---EPDP 403
           HAGLV++G   +  M + Y + P + H+  MVD+L RA  + EAY+FI ++        P
Sbjct: 325 HAGLVEEGRSVYKRMTKSYRLIPGVEHHVCMVDMLGRAGFLDEAYKFIHQLDATGKATAP 384

Query: 404 IVWRTLLSACSA-RDVDGGAQVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADC 463
            +W  +L AC   R+ D G ++     KRL+ LEP   G+ VM++N++A  G   + +  
Sbjct: 385 ALWTAMLGACKMHRNYDLGVEIA----KRLIALEPDNPGHHVMLSNIYALSGKTDEVSHI 444

Query: 464 RRAMKDGGMKKMAGESCVEVGGSLRKFFSGD 490
           R  M    ++K  G S +EV      F  GD
Sbjct: 445 RDGMMRNNLRKQVGYSVIEVENKTYMFSMGD 468

BLAST of Cp4.1LG04g01560 vs. NCBI nr
Match: gi|449461643|ref|XP_004148551.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cucumis sativus])

HSP 1 Score: 847.4 bits (2188), Expect = 1.2e-242
Identity = 416/490 (84.90%), Positives = 444/490 (90.61%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNAHNSNSN--FLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQN 60
           MVRL ISAVHQ FP N HN +S   FLS KHQ LSL+  CSS NHLF+IH+QI+VSGLQN
Sbjct: 1   MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLNHCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFE 120
           DSF TTELLR AALSPSRNLSY  SLLF  + H +  PWN IIRGY+SSDSP+EAI +F 
Sbjct: 61  DSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFG 120

Query: 121 DMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180
           +MRRRG+RPNNLTFPFL+KACATL TLQEGK+FHA AIKCGLDLDVYVRNTLI FYGSCK
Sbjct: 121 EMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLIYFYGSCK 180

Query: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILS 240
           RMSGARKVFDEM+ RTLVSWNAVITACVENFCFD+AI+YFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360
           WSAMILGLAQHGFANEAIELFTNMMSS + PN+VTFIGVLCACSHAGLVDK YHYFN+ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLME 360

Query: 361 RVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGSMVDVL RA +VKEAYE IM MPVEPDPIVWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGA 420

Query: 421 QVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEV 480
           +V EEARKRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RR MKD G+KKMAGESC+E+
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

Query: 481 GGSLRKFFSG 489
           GGSLRKFFSG
Sbjct: 481 GGSLRKFFSG 490

BLAST of Cp4.1LG04g01560 vs. NCBI nr
Match: gi|659094369|ref|XP_008448023.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cucumis melo])

HSP 1 Score: 842.8 bits (2176), Expect = 3.0e-241
Identity = 413/490 (84.29%), Positives = 442/490 (90.20%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNAHN--SNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQN 60
           MVRL ISAVHQ FP NAH+  S   FLS KHQFLSL+K CSS NHLF+IH+QI+VSG QN
Sbjct: 1   MVRLWISAVHQFFPINAHSYISKPKFLSTKHQFLSLLKHCSSTNHLFEIHAQILVSGRQN 60

Query: 61  DSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFE 120
           DSFLTTELLR AALSPSRNLSY  SLLF  + H +  PWN IIRGY+SSDSPREAI +F 
Sbjct: 61  DSFLTTELLRVAALSPSRNLSYGCSLLFHCHFHSATLPWNLIIRGYSSSDSPREAISLFG 120

Query: 121 DMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180
           +MRRRG+ PNNLTFPFL+KACATL TLQEGK+FHA  IKCGLDLDVYVRNTLI+FYGSCK
Sbjct: 121 EMRRRGVIPNNLTFPFLLKACATLATLQEGKQFHAIVIKCGLDLDVYVRNTLIHFYGSCK 180

Query: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILS 240
           RMSGARKVFDEM+ RTLVSWNAVITACVENF FD+AI+YFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAVITACVENFFFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVV RGMVLN+QLGTA VDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNIQLGTAFVDMYAKSGDVGCARRVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360
           WSAMILGLAQHGFANEAIELFTNM SS + PNYVTF+GVLCACSHAGLVDK YHYFN+ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMKSSPIVPNYVTFVGVLCACSHAGLVDKSYHYFNVME 360

Query: 361 RVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYG MVDVL RA +VKEAYE IM MPVEPDP+VWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGLMVDVLGRAGQVKEAYELIMSMPVEPDPVVWRTLLSACSGRDVNGGA 420

Query: 421 QVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEV 480
           +V EEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAAD RR MKD G+KKMAGESC+E+
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

Query: 481 GGSLRKFFSG 489
           GGSLRKFFSG
Sbjct: 481 GGSLRKFFSG 490

BLAST of Cp4.1LG04g01560 vs. NCBI nr
Match: gi|1000941847|ref|XP_015582500.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 [Ricinus communis])

HSP 1 Score: 664.8 bits (1714), Expect = 1.1e-187
Identity = 319/489 (65.24%), Positives = 392/489 (80.16%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNAHNSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDS 60
           MVR  I     IFPP   +SNSNFLS KHQ LSL+KLCSS  HL+QIHSQI VSGLQ D+
Sbjct: 1   MVRFPIPTATPIFPPEPISSNSNFLSIKHQCLSLLKLCSSIKHLYQIHSQIQVSGLQGDT 60

Query: 61  FLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDM 120
           FL T+L++F++LSPS++LSYA+S+L  +++H  P PWN +IRGYA S++P++A++V+ +M
Sbjct: 61  FLVTQLIKFSSLSPSKDLSYAQSIL-DHSVHPVPLPWNILIRGYADSNTPKDALFVYRNM 120

Query: 121 RRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRM 180
           R  GIRPN+LTFPFL+KACA     +EGK+ H + IK GLD DVYV N L+NFYGSCK++
Sbjct: 121 RNEGIRPNSLTFPFLLKACAACFATKEGKQVHVEVIKYGLDCDVYVNNNLVNFYGSCKKI 180

Query: 181 SGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSAC 240
             A KVFDEM  RT+VSWNAVIT+CVE+    +AI YFLKM + GFEPD TTMV++L  C
Sbjct: 181 LDACKVFDEMPERTVVSWNAVITSCVESLKLGEAIRYFLKMRDFGFEPDGTTMVLMLVIC 240

Query: 241 AELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWS 300
           AE+GNL LGRW+HSQV+ERG+VLN QLGTALVDMYAKSG VG A+LVF+ +K+++VWTWS
Sbjct: 241 AEMGNLGLGRWIHSQVIERGLVLNYQLGTALVDMYAKSGAVGYAKLVFDRMKEKNVWTWS 300

Query: 301 AMILGLAQHGFANEAIELFTNMMSSS-VTPNYVTFIGVLCACSHAGLVDKGYHYFNIMER 360
           AMILGLAQHGFA E +ELF +MM SS + PNYVTF+GVLCACSHAGLV  G+ YF+ M  
Sbjct: 301 AMILGLAQHGFAKEGLELFLDMMRSSLIHPNYVTFLGVLCACSHAGLVSDGFRYFHEMGH 360

Query: 361 VYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQ 420
            YGIKPMM+HYG+MVD+L RA  +KEAY FI +MP +PDPIVWRTLLSACS  DV     
Sbjct: 361 TYGIKPMMVHYGAMVDILGRAGLLKEAYNFITKMPFQPDPIVWRTLLSACSIHDVKDSTG 420

Query: 421 VVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVG 480
           V  + RKRLLELEP+R GN VMVANM+A+ GMW++AA  RR M+DGG+KK AGESCVE+ 
Sbjct: 421 VAYKVRKRLLELEPRRSGNFVMVANMYADAGMWEKAAKVRRVMRDGGLKKKAGESCVELS 480

Query: 481 GSLRKFFSG 489
           GS+ +FFSG
Sbjct: 481 GSIHRFFSG 488

BLAST of Cp4.1LG04g01560 vs. NCBI nr
Match: gi|1009125132|ref|XP_015879446.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730-like [Ziziphus jujuba])

HSP 1 Score: 664.5 bits (1713), Expect = 1.5e-187
Identity = 319/490 (65.10%), Positives = 391/490 (79.80%), Query Frame = 1

Query: 1   MVRLRISAVHQIFPPNA--HNSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQN 60
           MVRL+    +++F P    H  +S+F+S+K Q LSL K+CSS   L QIH+Q+ +SGLQ 
Sbjct: 1   MVRLQTQTANRVFAPKTYHHPDSSDFVSKKQQCLSLFKICSSIKQLSQIHAQLHLSGLQG 60

Query: 61  DSFLTTELLRFAALSPSRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFE 120
           D+FL T+L+RF ALSPS++L++AR++L R + H  P  WN +IRGYASSDSP EAIWVF 
Sbjct: 61  DTFLLTQLVRFCALSPSKDLNHARTILHRSD-HSPPSSWNILIRGYASSDSPTEAIWVFR 120

Query: 121 DMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180
           +MR RGIRPN LTFPFL+KACAT+  L+ G++ HAD  K GLD DVYV+N LI+FYG CK
Sbjct: 121 EMRCRGIRPNKLTFPFLLKACATIMALKVGRQVHADVFKRGLDGDVYVQNNLIHFYGCCK 180

Query: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILS 240
           ++S A+K+FD MSVRTLVSWN++ITACVEN CFD  I YFL+M N GF+PDETTMVV+L+
Sbjct: 181 KISNAQKLFDAMSVRTLVSWNSIITACVENSCFDNGIGYFLRMRNCGFQPDETTMVVVLN 240

Query: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQ ++R + LN QLGT+LVDMYAKSG +  A  VF+ L +R+VWT
Sbjct: 241 ACAELGNLSLGRWVHSQTIQRELGLNCQLGTSLVDMYAKSGALDYATKVFDSLGERNVWT 300

Query: 301 WSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360
           WSAMILGLAQHGF NE +ELF  MM SS+ PNYVTF+GVLCACSHAGLV  GY YF  ME
Sbjct: 301 WSAMILGLAQHGFGNEGLELFAKMMKSSICPNYVTFLGVLCACSHAGLVQDGYQYFYDME 360

Query: 361 RVYGIKPMMIHYGSMVDVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGA 420
            V+GIKPMMIHYG+MVD+L RA R+ EAY FI  MP EPDPI+WRTLLS C   DV    
Sbjct: 361 HVHGIKPMMIHYGAMVDILARAGRLSEAYAFINNMPFEPDPIIWRTLLSVCCNCDVKDKE 420

Query: 421 QVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEV 480
            + ++ RKRLL LEP+RGGN+VMVA M+AEVGMW++AA+ RR M+ GG+KK AGESC+E+
Sbjct: 421 GIGDKVRKRLLNLEPRRGGNLVMVAKMYAEVGMWEKAANVRRFMRSGGLKKSAGESCIEL 480

Query: 481 GGSLRKFFSG 489
           GGS+R+FFSG
Sbjct: 481 GGSIRRFFSG 489

BLAST of Cp4.1LG04g01560 vs. NCBI nr
Match: gi|657975030|ref|XP_008379357.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 [Malus domestica])

HSP 1 Score: 656.8 bits (1693), Expect = 3.1e-185
Identity = 313/474 (66.03%), Positives = 382/474 (80.59%), Query Frame = 1

Query: 15  PNAHNSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVSGLQNDSFLTTELLRFAALSP 74
           P A+N NSNF S+K Q L L+  CS+  HL QIH+QI VSG QND FL T+L+RF A SP
Sbjct: 7   PTANNCNSNFGSKKEQCLHLLSRCSTFKHLSQIHAQIQVSGFQNDHFLLTQLIRFCASSP 66

Query: 75  SRNLSYARSLLFRYNLHFSPFPWNCIIRGYASSDSPREAIWVFEDMRRRGIRPNNLTFPF 134
           ++N +YAR+LL  ++    P  WN +IRG ASSDS REAIWVF  M  RG+RPN LTFPF
Sbjct: 67  AKNFAYARNLL-DHSESSPPSSWNFLIRGCASSDSXREAIWVFRAMLARGVRPNQLTFPF 126

Query: 135 LIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMSVRT 194
           LIK+CA+   L+EG++ H   +KCGLD DVYV+N L++FYG CK++  A KVFDEMS R+
Sbjct: 127 LIKSCASAAALKEGRQVHVGVVKCGLDCDVYVQNNLVHFYGECKKIKDAXKVFDEMSERS 186

Query: 195 LVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHS 254
           +VSWNA+ITACVENF  D+ IEYF+KM   GFEPDETTMVV+L+A +ELGNLS+GRWVHS
Sbjct: 187 VVSWNAIITACVENFWLDEGIEYFMKMRGCGFEPDETTMVVVLNASSELGNLSIGRWVHS 246

Query: 255 QVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANE 314
           QV+ERG+ LN QLGTALVDMYAKSGD+G AR+VFN ++ R+VWT SAMILGLAQHGFA E
Sbjct: 247 QVIERGLALNCQLGTALVDMYAKSGDLGYARIVFNKMETRNVWTXSAMILGLAQHGFAKE 306

Query: 315 AIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMV 374
           A+E+F  M+SSSV PNYVTF+GVLCACSHAGLV+ GY YFN ME V+GIKPMM+HYG+MV
Sbjct: 307 ALEIFRKMLSSSVRPNYVTFLGVLCACSHAGLVEDGYRYFNDMEHVHGIKPMMVHYGAMV 366

Query: 375 DVLCRASRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGAQVVEEARKRLLELEPK 434
           D+L RA R+ EAY F+  MP++PDPIVWRTLLSAC+         V  + R++LLELEPK
Sbjct: 367 DILGRAGRLNEAYSFMXSMPLDPDPIVWRTLLSACTTHSAKDNEGVGNKVREKLLELEPK 426

Query: 435 RGGNVVMVANMFAEVGMWKQAADCRRAMKDGGMKKMAGESCVEVGGSLRKFFSG 489
           RGGN+VMVANM+AEVGMW++AA+ R+ MK+  MKKMAGESC+E+GGS+ KFFSG
Sbjct: 427 RGGNLVMVANMYAEVGMWEKAANLRKVMKERRMKKMAGESCIELGGSVHKFFSG 479
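
The percentages in each HSP header are the identical or positive-scoring column counts divided by the alignment length. A minimal Python sketch of that arithmetic follows; the function names `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tool, and the pattern assumes the header layout shown in the reports above (it accepts either spelling of the positives label).

```python
import re

# Matches a header such as:
#   Identity = 175/457 (38.29%), Positives = 274/457 (59.96%), Query Frame = 1
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts plus the reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


if __name__ == "__main__":
    header = "Identity = 175/457 (38.29%), Positives = 274/457 (59.96%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    for label, (count, length, reported) in stats.items():
        # Recompute the percentage and compare it with the reported value.
        print(label, count, length, reported, percent(count, length))
```

Recomputing the value this way is a quick consistency check when the same hit appears both in an alignment block and in a summary table.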

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP188_ARATH  4.4e-160  59.45         Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana GN...
PP330_ARATH  2.8e-90   38.46         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP145_ARATH  1.8e-89   38.29         Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
PPR85_ARATH  9.5e-86   37.13         Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
PP182_ARATH  8.9e-84   37.47         Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity (%)  Description
A0A0A0K153_CUCSA  8.5e-243  84.90         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G007910 PE=4 SV=1
M5WZ10_PRUPE      2.2e-182  64.55         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004522mg PE=4 SV=1
B9T0U0_RICCO      8.7e-179  65.72         Cell division protein ftsH, putative OS=Ricinus communis GN=RCOM_0340700 PE=3 SV...
A0A0S3RS08_PHAAN  1.7e-174  64.04         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G050100 PE=...
A0A061EGE7_THECC  1.2e-172  62.69         Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_047...

Match Name   E-value   Identity (%)  Description
AT2G36730.1  2.5e-161  59.45         Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1  1.6e-91   38.46         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1  1.0e-90   38.29         Pentatricopeptide repeat (PPR) superfamily protein
AT1G59720.1  5.3e-87   37.13         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G33760.1  5.0e-85   37.47         Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value   Identity (%)  Description
gi|449461643|ref|XP_004148551.1|   1.2e-242  84.90         PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cuc...
gi|659094369|ref|XP_008448023.1|   3.0e-241  84.29         PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cuc...
gi|1000941847|ref|XP_015582500.1|  1.1e-187  65.24         PREDICTED: pentatricopeptide repeat-containing protein At2g36730 [Ricinus commun...
gi|1009125132|ref|XP_015879446.1|  1.5e-187  65.10         PREDICTED: pentatricopeptide repeat-containing protein At2g36730-like [Ziziphus ...
gi|657975030|ref|XP_008379357.1|   3.1e-185  66.03         PREDICTED: pentatricopeptide repeat-containing protein At2g36730 [Malus domestic...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0051301      cell division
biological_process  GO:0009451      RNA modification
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0006508      proteolysis
biological_process  GO:0031425      chloroplast RNA processing
biological_process  GO:0016556      mRNA modification
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016020      membrane
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004222      metalloendopeptidase activity
molecular_function  GO:0008568      microtubule-severing ATPase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding
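
Each GO row above carries a category, an accession, and a term name separated by whitespace, so the rows can be grouped by category with a few lines of Python. The sketch below is illustrative only: `group_go_terms` is a hypothetical helper, and `GO_ROWS` holds a handful of rows copied from the table rather than the full list.

```python
from collections import defaultdict

# A few rows copied from the GO table above (not the complete list).
GO_ROWS = """\
biological_process GO:0009451 RNA modification
biological_process GO:0031425 chloroplast RNA processing
cellular_component GO:0009507 chloroplast
molecular_function GO:0003723 RNA binding
molecular_function GO:0004519 endonuclease activity
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split on the first two runs of whitespace; the term name keeps its spaces.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, len(terms))
        for accession, name in terms:
            print("   ", accession, name)
```

Splitting on the first two whitespace runs works whether the columns are separated by single spaces or padded for alignment.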

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g01560.1  Cp4.1LG04g01560.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR
    coord: 168..191  score: 0.0035
    coord: 298..325  score: 7.2E-5
    coord: 269..295  score: 0.88
    coord: 369..394  score: 0.
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2
    coord: 97..140   score: 1.3E-10
    coord: 195..241  score: 9.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756
    coord: 298..330  score: 6.3E-5
    coord: 196..230  score: 8.5E-7
    coord: 97..128   score: 1.3E-6
    coord: 168..191  score: 4.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR
    coord: 398..431  score: 5.064
    coord: 194..228  score: 10.139
    coord: 163..193  score: 8.035
    coord: 93..127   score: 12.386
    coord: 366..396  score: 7.081
    coord: 330..365  score: 6.467
    coord: 229..263  score: 7.87
    coord: 264..294  score: 6.018
    coord: 128..162  score: 5.952
    coord: 295..329  score: 10
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED
    coord: 17..476   score: 8.2E
None       No IPR available          PANTHER   PTHR24015:SF36   SUBFAMILY NOT NAMED
    coord: 17..476   score: 8.2E
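
The `coord` values above are inclusive start..end positions on the protein, so a hit's length is end - start + 1. As a rough illustration, the Python sketch below sorts the PROFILE PS51375 hits listed above, computes their lengths, and reports any positions left uncovered between consecutive hits. The helper names `parse_coord` and `summarize` are hypothetical, and treating the coordinates as 1-based inclusive is an assumption.

```python
def parse_coord(text):
    """Turn a 'start..end' string into an (int, int) tuple."""
    start, end = text.split("..")
    return int(start), int(end)


def summarize(coords):
    """Return sorted spans, their inclusive lengths, and uncovered gaps."""
    spans = sorted(parse_coord(c) for c in coords)
    lengths = [end - start + 1 for start, end in spans]
    gaps = []
    # Compare each span with the next one to find uncovered positions.
    for (_, prev_end), (next_start, _) in zip(spans, spans[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return spans, lengths, gaps


# PROFILE PS51375 coordinates, copied in the order they are listed above.
PS51375_HITS = [
    "398..431", "194..228", "163..193", "93..127", "366..396",
    "330..365", "229..263", "264..294", "128..162", "295..329",
]

if __name__ == "__main__":
    spans, lengths, gaps = summarize(PS51375_HITS)
    for (start, end), length in zip(spans, lengths):
        print(start, end, length)
    print("gaps:", gaps)
```

For these ten hits the spans run back to back from position 93 to 431, with position 397 as the only one not covered.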

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG04g01560  Cp4.1LG03g08090  Cucurbita pepo (Zucchini)  cpecpeB475

The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG04g01560  Cucurbita pepo (Zucchini)   cpecpeB221
Cp4.1LG04g01560  Cucumber (Gy14) v1          cgycpeB0893
Cp4.1LG04g01560  Wild cucumber (PI 183967)   cpecpiB673
Cp4.1LG04g01560  Cucumber (Chinese Long) v2  cpecuB670
Cp4.1LG04g01560  Melon (DHL92) v3.5.1        cpemeB597
Cp4.1LG04g01560  Melon (DHL92) v3.5.1        cpemeB631
Cp4.1LG04g01560  Cucumber (Gy14) v2          cgybcpeB415
Cp4.1LG04g01560  Cucumber (Gy14) v2          cgybcpeB684
Cp4.1LG04g01560  Melon (DHL92) v3.6.1        cpemedB704
Cp4.1LG04g01560  Melon (DHL92) v3.6.1        cpemedB753
Cp4.1LG04g01560  Silver-seed gourd           carcpeB0380
Cp4.1LG04g01560  Cucumber (Chinese Long) v3  cpecucB0838
Cp4.1LG04g01560  Cucumber (Chinese Long) v3  cpecucB0853
Cp4.1LG04g01560  Wax gourd                   cpewgoB0832