Cp4.1LG02g14250 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g14250
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG02 : 14112097 .. 14115057 (+)
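
The Location field packs the chromosome, start, end and strand into one string. The sketch below shows one way to unpack it and compute the genomic span. It assumes the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state), and `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

# Pattern for strings such as "Cp4.1LG02 : 14112097 .. 14115057 (+)".
LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Span in bp, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG02 : 14112097 .. 14115057 (+)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the feature spans 2,961 bp on the plus strand of Cp4.1LG02. With 0-based half-open coordinates the span would be one base shorter.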
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGTATGCATCTTCTTATCAGCGATTCAATTCAGCTTCGTTTTGTTTTCCACGATTCACAATCAATTTCCCCTCTCATTTTCGATACTATTTCAACTCCCTTTTTTCCTCAATCACGTATGACGACGAGCTTCTCGATTCTTTCGATCGTCTTCTTCGGCAATGCAATGGGATTCGACATTGCAAACAAGTTCATTCCGCCACTGTTGTCACTGGCGCCTGTTCGTCGGCGTTCGTTGCCGCCCGGCTTGTGTCCGTCTATGCCCGTTCTGGTTTTGTTTTCGATGCTCGGAAAGTGTTTGATACTGTGCCATTTGAAGGTTTGTCGAACTTGCTGTTGTGGAATTCGATTATAAGAGCAAATGTAGATGGTTATAGTAGAGAAGCCCTTCAACTTTATGGGAAAATGAGAAATTTTGGGGTTTTGGCTGATGGGTTTACTTTTCCTCTGGTTTTGAGGGCTTCTTCCAATTTGGGTATTTTCAATTTGTGCAAGAGTCTTCATTGTCATGTTGTACAATTTGGATTCCAGAATCATTTGCACGTTGTGAATGAATTGATGGGAATGTATGTGAAACTCCGACGAATGGATGATGCTCGAAAAGTGTTTGACAAAATGCGTGTTAAAAGTGTAATTTCTTGGAACACTATGGTTTCTGGCTATGCCTATAATTATGATGTTAATGGTGCTTCTAGGATGTTCCTTCAAATGGAGTTGGAAGGGGTCGAGCCGAACCCTGTAACTTGGACTTCTTTGCTGTCAAGTCATGCTCGGTGCGGTCATCTTGAAGAAACTATTGCCTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTGATTTAGATACATTTGACAGGGGTCAGATGGTTCATGGATATATAGTAAAGGGAGGTTTCGAAGATTACTTGTTTGCTAAAAATGCGCTTATAACTGTATATGGAAAAGGAGGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGAAAGTGAAGAATCTTGTGAGTTGGAATTCTCTTATATCCTCTTATGCTGAATCTGGATTATATGACAAAGCTTTTGAAGCGTTTTCTGAGCTTGAGAAAATGGAAGGCTGTCCAGAGATGAAACCTAGTGTCATAACTTGGAGCGCAGTCATTTGTGGATTTGCTTCTAATGGATTTGGAGAAGAATCTTTGGAAGTTTTTCGCCAAATGCAGCTTGCAAATGTAAAAGCGAACTCAGTGACTATATCTAGTGTTCTATCAATTTGTGCTATGCTAGCAGCTCTGAATCTTGGTAGGGAAATGCACGGTCATGTGATTAGAGCTCGGATGGAAGATAACATACTGGTGGGAAATGGATTGATTAACATGTATACAAAATGTGGAAGTTTCAAGCCAGGCTGTTTGGTGTTCGGAAAACTAGAAAATCGAGATTTAATCTCATGGAACTCACTGATTGCAGGATATGGAATGCATGGACTTGGTAAAGATGCTCTCGTAACTTTTGACGAGATGATCAAATCAGGATTTAGACCAGATGATGTTACCTTTATTGCTGCTCTTTCTGCTTGTAGTCATGCCGGTCTTGTTGCCGAAGGCCGTTGGCTTTTCGATCAGATGCTACAGAACTTCAAGATCAAACCTCAGATGGAGCACTATGCGTGCATGGTCGATCTTCTAGGTCGTGCTGGGCTCGTGGAAGAAGCAAGTAACATAGTCAAGAGCATGCCAATCAAACCCAATGCTTATATCTGGAGTGCTCTTCTCAACTCTTGCAGGATGCACAAGGATACAGATCTAGCAGAAGAAACTGGCTCTCAGATTTTAAGTCTGGATTCCGAGATAACAGGAAGCCATATGTTGCTGTCGAATATTTATTCTGCAAGCTGTAGATGGGAGGACTCTGCGAGGGTCAGGATCTCGGCAAGGATGAAGGGTTTAAAAAAAGTTCCTGGGTGCAGCTGGATTGAGGT
GAAGAAAAAGGTTTATATGTTCAAAGCAGGAAACTCAATGCAAGAAGGTTTAGAGAGAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGATTGAAAGTCGAGATTTTGATGATAGTATTATTGAATAGAATGTTCAAGAACAACATTACTGAAATCTTTTACTAGAAAGCTCTCTCCATATGTGTGAAGCTGATGATGGTTTATCCTTAATTAAGGAAATTCAAAATGAAAAATGGTCACTTAAGGCTGCATTTAGCTAATGATATAACAGTGGAGCATCAATCCAAAATTAACATTTATCTACTGTTCATTAATTTTAGAAGTCAACTGTTGGAATAACTGTACAAGATACGTTAGAGGTAACAGAAGTTTCTGAAAAAGTCGAAGAGAACAGAGAGGATTGAATTCAACTGTGCATCATATCATCGTGTTATGAAATGCAGGGAGGCAGATTAGATAAAAGCCCCTCCATGGAATTGCAAGACCCGGTGAAAGCAGCTCTTTTCACATCCCTTCAAAACAGGTACATGATGCTAGCCAGGCCTGAAATGGTGTAGAATTCATAAGTATGTACTTCAAGTTTCTTTTTCCTTTCTTTTGATAAGAGACAATTTAAATGAATTTGAACCAAAATTGAAGGAAAGCACAGATGGCGTTGAATCGATTGAAAATTCATATCCATATACAAGCGTTTGGTATACTCTCTCCGAAATGTAACTGTACATTCCTTTTTCTCGCATCCCAAAACATGTCAACAAAAAGTGTGGATCGACTATTATGCAGCTAACACCTAAGACATCTAGAACCATACATAACAGATCAATACACACAATATCTGAATTGTATTCAACTTCGAACCCACTTGTAGACCAGAAACGCTTACGTTTCGAGGGAATGTGGGAGAGGAAGTAGCAGAGAAACTCTATGACCCTCAACTCAGTCCTAAGGTGATATCATGA

mRNA sequence

ATGCTGTATGCATCTTCTTATCAGCGATTCAATTCAGCTTCGTTTTGTTTTCCACGATTCACAATCAATTTCCCCTCTCATTTTCGATACTATTTCAACTCCCTTTTTTCCTCAATCACGTATGACGACGAGCTTCTCGATTCTTTCGATCGTCTTCTTCGGCAATGCAATGGGATTCGACATTGCAAACAAGTTCATTCCGCCACTGTTGTCACTGGCGCCTGTTCGTCGGCGTTCGTTGCCGCCCGGCTTGTGTCCGTCTATGCCCGTTCTGGTTTTGTTTTCGATGCTCGGAAAGTGTTTGATACTGTGCCATTTGAAGGTTTGTCGAACTTGCTGTTGTGGAATTCGATTATAAGAGCAAATGTAGATGGTTATAGTAGAGAAGCCCTTCAACTTTATGGGAAAATGAGAAATTTTGGGGTTTTGGCTGATGGGTTTACTTTTCCTCTGGTTTTGAGGGCTTCTTCCAATTTGGGTATTTTCAATTTGTGCAAGAGTCTTCATTGTCATGTTGTACAATTTGGATTCCAGAATCATTTGCACGTTGTGAATGAATTGATGGGAATGTATGTGAAACTCCGACGAATGGATGATGCTCGAAAAGTGTTTGACAAAATGCGTGTTAAAAGTGTAATTTCTTGGAACACTATGGTTTCTGGCTATGCCTATAATTATGATGTTAATGGTGCTTCTAGGATGTTCCTTCAAATGGAGTTGGAAGGGGTCGAGCCGAACCCTGTAACTTGGACTTCTTTGCTGTCAAGTCATGCTCGGTGCGGTCATCTTGAAGAAACTATTGCCTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTGATTTAGATACATTTGACAGGGGTCAGATGGTTCATGGATATATAGTAAAGGGAGGTTTCGAAGATTACTTGTTTGCTAAAAATGCGCTTATAACTGTATATGGAAAAGGAGGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGAAAGTGAAGAATCTTGTGAGTTGGAATTCTCTTATATCCTCTTATGCTGAATCTGGATTATATGACAAAGCTTTTGAAGCGTTTTCTGAGCTTGAGAAAATGGAAGGCTGTCCAGAGATGAAACCTAGTGTCATAACTTGGAGCGCAGTCATTTGTGGATTTGCTTCTAATGGATTTGGAGAAGAATCTTTGGAAGTTTTTCGCCAAATGCAGCTTGCAAATGGAGGCAGATTAGATAAAAGCCCCTCCATGGAATTGCAAGACCCGGTGAAAGCAGCTCTTTTCACATCCCTTCAAAACAGACCAGAAACGCTTACGTTTCGAGGGAATGTGGGAGAGGAAGTAGCAGAGAAACTCTATGACCCTCAACTCAGTCCTAAGGTGATATCATGA

Coding sequence (CDS)

ATGCTGTATGCATCTTCTTATCAGCGATTCAATTCAGCTTCGTTTTGTTTTCCACGATTCACAATCAATTTCCCCTCTCATTTTCGATACTATTTCAACTCCCTTTTTTCCTCAATCACGTATGACGACGAGCTTCTCGATTCTTTCGATCGTCTTCTTCGGCAATGCAATGGGATTCGACATTGCAAACAAGTTCATTCCGCCACTGTTGTCACTGGCGCCTGTTCGTCGGCGTTCGTTGCCGCCCGGCTTGTGTCCGTCTATGCCCGTTCTGGTTTTGTTTTCGATGCTCGGAAAGTGTTTGATACTGTGCCATTTGAAGGTTTGTCGAACTTGCTGTTGTGGAATTCGATTATAAGAGCAAATGTAGATGGTTATAGTAGAGAAGCCCTTCAACTTTATGGGAAAATGAGAAATTTTGGGGTTTTGGCTGATGGGTTTACTTTTCCTCTGGTTTTGAGGGCTTCTTCCAATTTGGGTATTTTCAATTTGTGCAAGAGTCTTCATTGTCATGTTGTACAATTTGGATTCCAGAATCATTTGCACGTTGTGAATGAATTGATGGGAATGTATGTGAAACTCCGACGAATGGATGATGCTCGAAAAGTGTTTGACAAAATGCGTGTTAAAAGTGTAATTTCTTGGAACACTATGGTTTCTGGCTATGCCTATAATTATGATGTTAATGGTGCTTCTAGGATGTTCCTTCAAATGGAGTTGGAAGGGGTCGAGCCGAACCCTGTAACTTGGACTTCTTTGCTGTCAAGTCATGCTCGGTGCGGTCATCTTGAAGAAACTATTGCCTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTGATTTAGATACATTTGACAGGGGTCAGATGGTTCATGGATATATAGTAAAGGGAGGTTTCGAAGATTACTTGTTTGCTAAAAATGCGCTTATAACTGTATATGGAAAAGGAGGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGAAAGTGAAGAATCTTGTGAGTTGGAATTCTCTTATATCCTCTTATGCTGAATCTGGATTATATGACAAAGCTTTTGAAGCGTTTTCTGAGCTTGAGAAAATGGAAGGCTGTCCAGAGATGAAACCTAGTGTCATAACTTGGAGCGCAGTCATTTGTGGATTTGCTTCTAATGGATTTGGAGAAGAATCTTTGGAAGTTTTTCGCCAAATGCAGCTTGCAAATGGAGGCAGATTAGATAAAAGCCCCTCCATGGAATTGCAAGACCCGGTGAAAGCAGCTCTTTTCACATCCCTTCAAAACAGACCAGAAACGCTTACGTTTCGAGGGAATGTGGGAGAGGAAGTAGCAGAGAAACTCTATGACCCTCAACTCAGTCCTAAGGTGATATCATGA

Protein sequence

MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIRANVDGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLANGGRLDKSPSMELQDPVKAALFTSLQNRPETLTFRGNVGEEVAEKLYDPQLSPKVIS
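
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name a translation table) and using a hypothetical `translate` helper, the first codons of the CDS can be checked against the start of the protein:

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS shown above.
print(translate("ATGCTGTATGCATCTTCTTATCAG"))  # MLYASSYQ
```

The result, MLYASSYQ, matches the first eight residues of the protein sequence. The same function can be run on the full CDS to reproduce the complete protein.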
BLAST of Cp4.1LG02g14250 vs. Swiss-Prot
Match: PPR47_ARATH (Putative pentatricopeptide repeat-containing protein At1g17630 OS=Arabidopsis thaliana GN=PCMP-E72 PE=3 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 1.2e-93
Identity = 187/413 (45.28%), Positives = 262/413 (63.44%), Query Frame = 1

Query: 14  SFCF-----PRFTINFP---SHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIRHCKQV 73
           +FCF     P  +I+ P   S   YY  SL S+   D  L   FD LL  C   + C+QV
Sbjct: 20  NFCFLTSQCPYTSISSPDTVSVSSYY--SLTSN--NDQSLFHYFDHLLGLCLTAQQCRQV 79

Query: 74  HSATVVTGAC-SSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIRANVD 133
           H+  +++     S  +AA L+SVYAR G + DAR VF+TV    LS+L LWNSI++ANV 
Sbjct: 80  HAQVLLSDFIFRSGSLAANLISVYARLGLLLDARNVFETVSLVLLSDLRLWNSILKANVS 139

Query: 134 -GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQNHLHV 193
            G    AL+LY  MR  G+  DG+  PL+LRA   LG F LC++ H  V+Q G + +LHV
Sbjct: 140 HGLYENALELYRGMRQRGLTGDGYILPLILRACRYLGRFGLCRAFHTQVIQIGLKENLHV 199

Query: 194 VNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQMELEGV 253
           VNEL+ +Y K  RM DA  +F +M V++ +SWN M+ G++  YD   A ++F  M+ E  
Sbjct: 200 VNELLTLYPKAGRMGDAYNLFVEMPVRNRMSWNVMIKGFSQEYDCESAVKIFEWMQREEF 259

Query: 254 EPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQM 313
           +P+ VTWTS+LS H++CG  E+ +  F  MRM G   + E LAV  SVCA+L+     + 
Sbjct: 260 KPDEVTWTSVLSCHSQCGKFEDVLKYFHLMRMSGNAVSGEALAVFFSVCAELEALSIAEK 319

Query: 314 VHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESGL 373
           VHGY++KGGFE+YL ++NALI VYGK G ++DAE LF +++ K + SWNSLI+S+ ++G 
Sbjct: 320 VHGYVIKGGFEEYLPSRNALIHVYGKQGKVKDAEHLFRQIRNKGIESWNSLITSFVDAGK 379

Query: 374 YDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLA 417
            D+A   FSELE+M     +K +V+TW++VI G    G G++SLE FRQMQ +
Sbjct: 380 LDEALSLFSELEEMNHVCNVKANVVTWTSVIKGCNVQGRGDDSLEYFRQMQFS 428
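
The percentages in each HSP summary line are the reported counts divided by the alignment length (which includes gap columns). The sketch below recomputes them for the first hit; `percent` is a hypothetical helper, and rounding to two decimals is assumed from how the figures are printed.

```python
def percent(count, alignment_length):
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP summary of the PPR47_ARATH hit above.
identical, similar, length = 187, 262, 413
print(percent(identical, length))  # 45.28
print(percent(similar, length))    # 63.44
```

The second figure counts identical residues plus conservative substitutions (the `+` marks in the alignment midline), so it is always at least as large as the identity figure.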

BLAST of Cp4.1LG02g14250 vs. Swiss-Prot
Match: PPR52_ARATH (Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana GN=DYW7 PE=2 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 5.2e-49
Identity = 120/402 (29.85%), Positives = 204/402 (50.75%), Query Frame = 1

Query: 51  RLLRQC--NGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEG 110
           +LL  C  +G  H  ++  A          FV  +L+S+YA+ G + DARKVFD++    
Sbjct: 86  KLLESCIDSGSIHLGRILHARFGLFTEPDVFVETKLLSMYAKCGCIADARKVFDSMRER- 145

Query: 111 LSNLLLWNSIIRA-NVDGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKS 170
             NL  W+++I A + +   RE  +L+  M   GVL D F FP +L+  +N G     K 
Sbjct: 146 --NLFTWSAMIGAYSRENRWREVAKLFRLMMKDGVLPDDFLFPKILQGCANCGDVEAGKV 205

Query: 171 LHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYN-- 230
           +H  V++ G  + L V N ++ +Y K   +D A K F +MR + VI+WN+++  Y  N  
Sbjct: 206 IHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKFFRRMRERDVIAWNSVLLAYCQNGK 265

Query: 231 -----------------------------YDVNG----ASRMFLQMELEGVEPNPVTWTS 290
                                        Y+  G    A  +  +ME  G+  +  TWT+
Sbjct: 266 HEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMETFGITADVFTWTA 325

Query: 291 LLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQMVHGYIVKGG 350
           ++S     G   + + +F KM + GV   A  +   +S C+ L   ++G  VH   VK G
Sbjct: 326 MISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVKMG 385

Query: 351 FEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESGLYDKAFEAFS 410
           F D +   N+L+ +Y K G + DA K+F  +K K++ +WNS+I+ Y ++G   KA+E F+
Sbjct: 386 FIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKAYELFT 445

Query: 411 ELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQ 415
            ++       ++P++ITW+ +I G+  NG   E++++F++M+
Sbjct: 446 RMQD----ANLRPNIITWNTMISGYIKNGDEGEAMDLFQRME 480

BLAST of Cp4.1LG02g14250 vs. Swiss-Prot
Match: PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 6.6e-44
Identity = 121/378 (32.01%), Positives = 194/378 (51.32%), Query Frame = 1

Query: 48  SFDRLLRQCNGIRHCK---QVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTV 107
           SF  +L  C+G+       QVHS    +   S  ++ + LV +Y++ G V DA++VFD +
Sbjct: 154 SFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEM 213

Query: 108 PFEGLSNLLLWNSIIRA-NVDGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFN 167
              G  N++ WNS+I     +G + EAL ++  M    V  D  T   V+ A ++L    
Sbjct: 214 ---GDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIK 273

Query: 168 LCKSLHCHVVQFG-FQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGY 227
           + + +H  VV+    +N + + N  + MY K  R+ +AR +FD M +++VI+  +M+SGY
Sbjct: 274 VGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGY 333

Query: 228 AYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATA 287
           A       A  MF +M     E N V+W +L++ + + G  EE ++LF  ++ + V  T 
Sbjct: 334 AMAASTKAARLMFTKM----AERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTH 393

Query: 288 EMLAVVLSVCADLDTFDRGQMVHGYIVKGGF------EDYLFAKNALITVYGKGGDIRDA 347
              A +L  CADL     G   H +++K GF      ED +F  N+LI +Y K G + + 
Sbjct: 394 YSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEG 453

Query: 348 EKLFHEMKVKNLVSWNSLISSYAESGLYDKAFEAFSE-LEKMEGCPEMKPSVITWSAVIC 407
             +F +M  ++ VSWN++I  +A++G  ++A E F E LE  E     KP  IT   V+ 
Sbjct: 454 YLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGE-----KPDHITMIGVLS 513

Query: 408 GFASNGFGEESLEVFRQM 414
                GF EE    F  M
Sbjct: 514 ACGHAGFVEEGRHYFSSM 519

BLAST of Cp4.1LG02g14250 vs. Swiss-Prot
Match: PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 4.3e-43
Identity = 119/411 (28.95%), Positives = 214/411 (52.07%), Query Frame = 1

Query: 44  ELLDSFDRLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDT 103
           E+ + F   L++       K+++++ ++ G   S+F+  ++V    +   +  A ++F+ 
Sbjct: 8   EVENYFIPFLQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQ 67

Query: 104 VPFEGLSNLLLWNSIIRANV-DGYSREALQLYGKM-RNFGVLADGFTFPLVLRASSNLGI 163
           V      N+ L+NSIIRA   +    + +++Y ++ R    L D FTFP + ++ ++LG 
Sbjct: 68  V---SNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGS 127

Query: 164 FNLCKSLHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSG 223
             L K +H H+ +FG + H+   N L+ MY+K   + DA KVFD+M  + VISWN+++SG
Sbjct: 128 CYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWNSLLSG 187

Query: 224 YAYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGAT 283
           YA    +  A  +F  M    ++   V+WT+++S +   G   E +  F +M++ G+   
Sbjct: 188 YARLGQMKKAKGLFHLM----LDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAGIEPD 247

Query: 284 AEMLAVVLSVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFH 343
              L  VL  CA L + + G+ +H Y  + GF       NALI +Y K G I  A +LF 
Sbjct: 248 EISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFG 307

Query: 344 EMKVKNLVSWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNG 403
           +M+ K+++SW+++IS YA  G    A E F+E+++     ++KP+ IT+  ++   +  G
Sbjct: 308 QMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQR----AKVKPNGITFLGLLSACSHVG 367

Query: 404 FGEESLEVFRQMQ------------------LANGGRLDKSPSMELQDPVK 435
             +E L  F  M+                  LA  G+L+++  +    P+K
Sbjct: 368 MWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMK 407

BLAST of Cp4.1LG02g14250 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 176.0 bits (445), Expect = 9.6e-43
Identity = 102/354 (28.81%), Positives = 187/354 (52.82%), Query Frame = 1

Query: 64  QVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIRANV 123
           Q H+  + +GA +  +++A+L++ Y+      DA  V  ++P   + +   ++S+I A  
Sbjct: 36  QAHARILKSGAQNDGYISAKLIASYSNYNCFNDADLVLQSIPDPTIYS---FSSLIYALT 95

Query: 124 DG-YSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQNHLH 183
                 +++ ++ +M + G++ D    P + +  + L  F + K +HC     G      
Sbjct: 96  KAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQIHCVSCVSGLDMDAF 155

Query: 184 VVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQMELEG 243
           V   +  MY++  RM DARKVFD+M  K V++ + ++  YA    +    R+  +ME  G
Sbjct: 156 VQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESSG 215

Query: 244 VEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQ 303
           +E N V+W  +LS   R G+ +E + +F K+   G       ++ VL    D +  + G+
Sbjct: 216 IEANIVSWNGILSGFNRSGYHKEAVVMFQKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGR 275

Query: 304 MVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESG 363
           ++HGY++K G        +A+I +YGK G +     LF++ ++      N+ I+  + +G
Sbjct: 276 LIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNG 335

Query: 364 LYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLA 417
           L DKA E F EL K +    M+ +V++W+++I G A NG   E+LE+FR+MQ+A
Sbjct: 336 LVDKALEMF-ELFKEQ---TMELNVVSWTSIIAGCAQNGKDIEALELFREMQVA 382

BLAST of Cp4.1LG02g14250 vs. TrEMBL
Match: A0A0A0LFT1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G848280 PE=4 SV=1)

HSP 1 Score: 671.4 bits (1731), Expect = 8.1e-190
Identity = 335/418 (80.14%), Positives = 364/418 (87.08%), Query Frame = 1

Query: 1   MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIR 60
           ML ASSYQRF S SFCFP  +INF        +S FSSITYD++L D FD LLRQCNGI+
Sbjct: 1   MLCASSYQRFKSVSFCFPPLSINF--------HSQFSSITYDEDLPDFFDHLLRQCNGIQ 60

Query: 61  HCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIR 120
           H KQVHSATVVTGA  SAFV+ARLVS+Y+R G V DARKVF + PFE  SN LLWNSIIR
Sbjct: 61  HSKQVHSATVVTGAYCSAFVSARLVSIYSRYGLVSDARKVFGSAPFECYSNFLLWNSIIR 120

Query: 121 ANV-DGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQN 180
           ANV  GY  EALQLYGKMRN+GVL DGFTFPL+LRASSNLG FN+CK+LHCHVVQFGFQN
Sbjct: 121 ANVYHGYCIEALQLYGKMRNYGVLGDGFTFPLLLRASSNLGAFNMCKNLHCHVVQFGFQN 180

Query: 181 HLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQME 240
           HLHV NEL+GMY KL RMDDARKVFDKMR+KSV+SWNTMVSGYAYNYDVNGASRMF QME
Sbjct: 181 HLHVGNELIGMYAKLERMDDARKVFDKMRIKSVVSWNTMVSGYAYNYDVNGASRMFHQME 240

Query: 241 LEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFD 300
           LEGVEPNPVTWTSLLSSHARCGHLEET+ LF KMRMKGVG TAEMLAVVLSVCADL T +
Sbjct: 241 LEGVEPNPVTWTSLLSSHARCGHLEETMVLFCKMRMKGVGPTAEMLAVVLSVCADLATLN 300

Query: 301 RGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYA 360
            GQM+HGY+VKGGF DYLFAKNALIT+YGKGG + DAEKLFHEMKVKNLVSWN+LISS+A
Sbjct: 301 SGQMIHGYMVKGGFNDYLFAKNALITLYGKGGGVGDAEKLFHEMKVKNLVSWNALISSFA 360

Query: 361 ESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLAN 418
           ESG+YDKA E  S+LEKME  PEMKP+VITWSA+ICGFAS G GEESLEVFR+MQLAN
Sbjct: 361 ESGVYDKALELLSQLEKMEAYPEMKPNVITWSAIICGFASKGLGEESLEVFRKMQLAN 410

BLAST of Cp4.1LG02g14250 vs. TrEMBL
Match: W9S1P1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005106 PE=4 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 5.2e-136
Identity = 256/427 (59.95%), Positives = 316/427 (74.00%), Query Frame = 1

Query: 1   MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYD----------DELLDSFD 60
           ML AS+ Q F S    F    ++F S+    F+   S  ++D          +E+LD FD
Sbjct: 1   MLNASA-QCFTSTFSRFRHSHLSFRSNSTSSFSFAHSIHSHDLQPHPSRATHNEVLDFFD 60

Query: 61  RLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLS 120
             L+QC   +HCKQ+HS  +V+GA  S F+A+RLVSVY+R G V DA+KVFD  P E  S
Sbjct: 61  SFLKQCTKTQHCKQLHSQVIVSGAHRSGFLASRLVSVYSRLGLVGDAQKVFDEFPVENCS 120

Query: 121 NLLLWNSIIRANVD-GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLH 180
           NLLLWNSI RANV  G  +EALQL+ KMR  GV  DGFTFPL++RA + +G   LC+ +H
Sbjct: 121 NLLLWNSIARANVSHGLYKEALQLFDKMRKLGVWPDGFTFPLIIRACAFIGSLALCRRVH 180

Query: 181 CHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVN 240
             V+Q GF+NHLH VNEL+GMY KL RMDDA  +FD+M V+S +SWNTM+SGYAYNYD  
Sbjct: 181 GLVLQMGFRNHLHAVNELLGMYGKLERMDDACLLFDRMPVRSYVSWNTMISGYAYNYDCV 240

Query: 241 GASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVL 300
           G+S+MF +M+LEG EPN VTWTSLLSSHARCG  +E + LFS MR  GVG TAE LAVVL
Sbjct: 241 GSSKMFERMDLEGFEPNSVTWTSLLSSHARCGRRDEAVELFSLMRSTGVGPTAEALAVVL 300

Query: 301 SVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLV 360
           SVCADL + D+G+M+HGY+VKGGFEDYLFAKNALI +YGK G +  A+K F EMK KNLV
Sbjct: 301 SVCADLASADKGKMIHGYVVKGGFEDYLFAKNALICMYGKCGLLEHAQKAFLEMKTKNLV 360

Query: 361 SWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEV 417
           SWN+LISSYAESGL D+AFE F++LEK  G P ++P++I+WSA ICGFA  G GEESLE+
Sbjct: 361 SWNTLISSYAESGLCDEAFEVFTQLEKSCGYPMVRPNIISWSAAICGFALKGRGEESLEL 420

BLAST of Cp4.1LG02g14250 vs. TrEMBL
Match: M5WX31_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001759mg PE=4 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 5.7e-135
Identity = 257/425 (60.47%), Positives = 317/425 (74.59%), Query Frame = 1

Query: 1   MLYASSYQRFNSAS-------FCFP-RFTINF--PSHFRYYFNSLFSSITYDDELLDSFD 60
           ML+ASS QRF S S       F FP + + +F  P H          S T  +E LD F+
Sbjct: 1   MLHASS-QRFISTSSRLHHNHFRFPPKLSRSFSKPGHIHQTQIISHPSCTTHNEFLDFFE 60

Query: 61  RLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLS 120
            +LRQC G + CKQVH+  + TG   S F+AA+LV+ YAR G +FDA+KVFDT P EG S
Sbjct: 61  HILRQCTGNKQCKQVHAQIITTGTYQSEFLAAKLVTAYARIGLIFDAQKVFDTGPVEGRS 120

Query: 121 NLLLWNSIIRANVD-GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLH 180
           NLLLWNSI+RANV  G+  +AL+LY KM N GVL DGFTFPLV+RA + +    L K++H
Sbjct: 121 NLLLWNSILRANVSHGFYEQALKLYDKMTNLGVLGDGFTFPLVIRACAFMDRLKLSKNVH 180

Query: 181 CHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVN 240
            HV+Q GFQNHLHVVNEL+GMY K+ RMD AR +FD+MRV+S +SWNTMVS YA+NYD +
Sbjct: 181 SHVLQMGFQNHLHVVNELIGMYGKVGRMDCARLLFDRMRVRSYVSWNTMVSSYAFNYDCD 240

Query: 241 GASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVL 300
           GA+ MF +MELEG+EPNPVTWTSLLSS AR G  EETI LF  MR++GVG TAE+LAVVL
Sbjct: 241 GATEMFRRMELEGLEPNPVTWTSLLSSRARRGRREETIQLFGMMRVRGVGTTAEVLAVVL 300

Query: 301 SVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLV 360
           SVCADL   D+G+M+HGY+++GGF+DYLF +NALI +YGK G + DA+KLF  M+ KNLV
Sbjct: 301 SVCADLAVVDKGKMIHGYVIRGGFKDYLFVENALICMYGKCGHVEDADKLFLGMESKNLV 360

Query: 361 SWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEV 415
           SWN+LIS YAESGL D+AF  FS+L      P M+P++I+WSAVI GF+S G GEESLE+
Sbjct: 361 SWNALISCYAESGLCDEAFTIFSQLNDH---PFMRPNIISWSAVIGGFSSKGRGEESLEL 420

BLAST of Cp4.1LG02g14250 vs. TrEMBL
Match: D7U009_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_09s0002g02250 PE=4 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 3.3e-127
Identity = 231/376 (61.44%), Positives = 293/376 (77.93%), Query Frame = 1

Query: 42  DDELLDSFDRLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVF 101
           ++++LD F+ LL+QC+     +Q+HS  +VTG+  SAF+AAR+VSVYA  G V DA++VF
Sbjct: 30  NNDVLDFFNDLLQQCSKSHLSQQIHSQIIVTGSHRSAFLAARVVSVYAGFGLVSDAQRVF 89

Query: 102 DTVPFEGLSNLLLWNSIIRANV-DGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLG 161
           +  P E  SNLLLWNSI+RANV  GY  EAL++Y +MR  GV ADGFTFPLV+RA + +G
Sbjct: 90  EVSPIECFSNLLLWNSILRANVAHGYCEEALEIYCRMRKLGVSADGFTFPLVIRACALMG 149

Query: 162 IFNLCKSLHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVS 221
              LC+S+H HVV+ GFQ +LHV NELMGMY K+ RMDDARKVF++M V+S +SWNTMVS
Sbjct: 150 SRKLCRSVHGHVVEMGFQWNLHVGNELMGMYGKIGRMDDARKVFERMAVRSCVSWNTMVS 209

Query: 222 GYAYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGA 281
           GYA NYD +GAS MF  M   G+EPN VTWTSLLSSHARCG   ET+ LF +MRM+G+GA
Sbjct: 210 GYALNYDCHGASEMFRMMGSAGLEPNLVTWTSLLSSHARCGQHVETMELFGRMRMRGIGA 269

Query: 282 TAEMLAVVLSVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLF 341
           TAE LAVVLSV  DL  FD G+++HGY+VKGGFE+YLF KN+LI +YGK G++  A  LF
Sbjct: 270 TAEALAVVLSVSVDLAAFDEGKVIHGYVVKGGFENYLFVKNSLICLYGKHGNVNAARILF 329

Query: 342 HEMKVKNLVSWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASN 401
            E+K KN+VSWN+LISSYA+ G  D+AF  F +LEK +  P ++P+V++WSAVI GFAS 
Sbjct: 330 LEIKTKNIVSWNALISSYADLGWCDEAFAIFLQLEKTDEYPMVRPNVVSWSAVIGGFASK 389

Query: 402 GFGEESLEVFRQMQLA 417
           G GEE+LE+FR+MQLA
Sbjct: 390 GQGEEALELFRRMQLA 405

BLAST of Cp4.1LG02g14250 vs. TrEMBL
Match: A5B4B4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031739 PE=4 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 3.3e-127
Identity = 231/376 (61.44%), Positives = 293/376 (77.93%), Query Frame = 1

Query: 42  DDELLDSFDRLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVF 101
           ++++LD F+ LL+QC+     +Q+HS  +VTG+  SAF+AAR+VSVYA  G V DA++VF
Sbjct: 30  NNDVLDFFNDLLQQCSKSHLSQQIHSQIIVTGSHRSAFLAARVVSVYAGFGLVSDAQRVF 89

Query: 102 DTVPFEGLSNLLLWNSIIRANV-DGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLG 161
           +  P E  SNLLLWNSI+RANV  GY  EAL++Y +MR  GV ADGFTFPLV+RA + +G
Sbjct: 90  EVSPIECFSNLLLWNSILRANVAHGYCEEALEIYCRMRKLGVSADGFTFPLVIRACALMG 149

Query: 162 IFNLCKSLHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVS 221
              LC+S+H HVV+ GFQ +LHV NELMGMY K+ RMDDARKVF++M V+S +SWNTMVS
Sbjct: 150 SRKLCRSVHGHVVEMGFQWNLHVGNELMGMYGKIGRMDDARKVFERMAVRSCVSWNTMVS 209

Query: 222 GYAYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGA 281
           GYA NYD +GAS MF  M   G+EPN VTWTSLLSSHARCG   ET+ LF +MRM+G+GA
Sbjct: 210 GYALNYDCHGASEMFRMMGSAGLEPNLVTWTSLLSSHARCGQHVETMELFGRMRMRGIGA 269

Query: 282 TAEMLAVVLSVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLF 341
           TAE LAVVLSV  DL  FD G+++HGY+VKGGFE+YLF KN+LI +YGK G++  A  LF
Sbjct: 270 TAEALAVVLSVSVDLAAFDEGKVIHGYVVKGGFENYLFVKNSLICLYGKHGNVNAARILF 329

Query: 342 HEMKVKNLVSWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASN 401
            E+K KN+VSWN+LISSYA+ G  D+AF  F +LEK +  P ++P+V++WSAVI GFAS 
Sbjct: 330 LEIKTKNIVSWNALISSYADLGWCDEAFAIFLQLEKTDEYPMVRPNVVSWSAVIGGFASK 389

Query: 402 GFGEESLEVFRQMQLA 417
           G GEE+LE+FR+MQLA
Sbjct: 390 GQGEEALELFRRMQLA 405

BLAST of Cp4.1LG02g14250 vs. TAIR10
Match: AT1G17630.1 (AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 345.1 bits (884), Expect = 6.7e-95
Identity = 187/413 (45.28%), Positives = 262/413 (63.44%), Query Frame = 1

Query: 14  SFCF-----PRFTINFP---SHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIRHCKQV 73
           +FCF     P  +I+ P   S   YY  SL S+   D  L   FD LL  C   + C+QV
Sbjct: 20  NFCFLTSQCPYTSISSPDTVSVSSYY--SLTSN--NDQSLFHYFDHLLGLCLTAQQCRQV 79

Query: 74  HSATVVTGAC-SSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIRANVD 133
           H+  +++     S  +AA L+SVYAR G + DAR VF+TV    LS+L LWNSI++ANV 
Sbjct: 80  HAQVLLSDFIFRSGSLAANLISVYARLGLLLDARNVFETVSLVLLSDLRLWNSILKANVS 139

Query: 134 -GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQNHLHV 193
            G    AL+LY  MR  G+  DG+  PL+LRA   LG F LC++ H  V+Q G + +LHV
Sbjct: 140 HGLYENALELYRGMRQRGLTGDGYILPLILRACRYLGRFGLCRAFHTQVIQIGLKENLHV 199

Query: 194 VNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQMELEGV 253
           VNEL+ +Y K  RM DA  +F +M V++ +SWN M+ G++  YD   A ++F  M+ E  
Sbjct: 200 VNELLTLYPKAGRMGDAYNLFVEMPVRNRMSWNVMIKGFSQEYDCESAVKIFEWMQREEF 259

Query: 254 EPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQM 313
           +P+ VTWTS+LS H++CG  E+ +  F  MRM G   + E LAV  SVCA+L+     + 
Sbjct: 260 KPDEVTWTSVLSCHSQCGKFEDVLKYFHLMRMSGNAVSGEALAVFFSVCAELEALSIAEK 319

Query: 314 VHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESGL 373
           VHGY++KGGFE+YL ++NALI VYGK G ++DAE LF +++ K + SWNSLI+S+ ++G 
Sbjct: 320 VHGYVIKGGFEEYLPSRNALIHVYGKQGKVKDAEHLFRQIRNKGIESWNSLITSFVDAGK 379

Query: 374 YDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLA 417
            D+A   FSELE+M     +K +V+TW++VI G    G G++SLE FRQMQ +
Sbjct: 380 LDEALSLFSELEEMNHVCNVKANVVTWTSVIKGCNVQGRGDDSLEYFRQMQFS 428

BLAST of Cp4.1LG02g14250 vs. TAIR10
Match: AT1G19720.1 (AT1G19720.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 196.8 bits (499), Expect = 3.0e-50
Identity = 120/402 (29.85%), Positives = 204/402 (50.75%), Query Frame = 1

Query: 51  RLLRQC--NGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEG 110
           +LL  C  +G  H  ++  A          FV  +L+S+YA+ G + DARKVFD++    
Sbjct: 86  KLLESCIDSGSIHLGRILHARFGLFTEPDVFVETKLLSMYAKCGCIADARKVFDSMRER- 145

Query: 111 LSNLLLWNSIIRA-NVDGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKS 170
             NL  W+++I A + +   RE  +L+  M   GVL D F FP +L+  +N G     K 
Sbjct: 146 --NLFTWSAMIGAYSRENRWREVAKLFRLMMKDGVLPDDFLFPKILQGCANCGDVEAGKV 205

Query: 171 LHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYN-- 230
           +H  V++ G  + L V N ++ +Y K   +D A K F +MR + VI+WN+++  Y  N  
Sbjct: 206 IHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKFFRRMRERDVIAWNSVLLAYCQNGK 265

Query: 231 -----------------------------YDVNG----ASRMFLQMELEGVEPNPVTWTS 290
                                        Y+  G    A  +  +ME  G+  +  TWT+
Sbjct: 266 HEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMETFGITADVFTWTA 325

Query: 291 LLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQMVHGYIVKGG 350
           ++S     G   + + +F KM + GV   A  +   +S C+ L   ++G  VH   VK G
Sbjct: 326 MISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVKMG 385

Query: 351 FEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESGLYDKAFEAFS 410
           F D +   N+L+ +Y K G + DA K+F  +K K++ +WNS+I+ Y ++G   KA+E F+
Sbjct: 386 FIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKAYELFT 445

Query: 411 ELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQ 415
            ++       ++P++ITW+ +I G+  NG   E++++F++M+
Sbjct: 446 RMQD----ANLRPNIITWNTMISGYIKNGDEGEAMDLFQRME 480

BLAST of Cp4.1LG02g14250 vs. TAIR10
Match: AT2G13600.1 (AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 179.9 bits (455), Expect = 3.7e-45
Identity = 121/378 (32.01%), Positives = 194/378 (51.32%), Query Frame = 1

Query: 48  SFDRLLRQCNGIRHCK---QVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTV 107
           SF  +L  C+G+       QVHS    +   S  ++ + LV +Y++ G V DA++VFD +
Sbjct: 154 SFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEM 213

Query: 108 PFEGLSNLLLWNSIIRA-NVDGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFN 167
              G  N++ WNS+I     +G + EAL ++  M    V  D  T   V+ A ++L    
Sbjct: 214 ---GDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIK 273

Query: 168 LCKSLHCHVVQFG-FQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGY 227
           + + +H  VV+    +N + + N  + MY K  R+ +AR +FD M +++VI+  +M+SGY
Sbjct: 274 VGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGY 333

Query: 228 AYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATA 287
           A       A  MF +M     E N V+W +L++ + + G  EE ++LF  ++ + V  T 
Sbjct: 334 AMAASTKAARLMFTKM----AERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTH 393

Query: 288 EMLAVVLSVCADLDTFDRGQMVHGYIVKGGF------EDYLFAKNALITVYGKGGDIRDA 347
              A +L  CADL     G   H +++K GF      ED +F  N+LI +Y K G + + 
Sbjct: 394 YSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEG 453

Query: 348 EKLFHEMKVKNLVSWNSLISSYAESGLYDKAFEAFSE-LEKMEGCPEMKPSVITWSAVIC 407
             +F +M  ++ VSWN++I  +A++G  ++A E F E LE  E     KP  IT   V+ 
Sbjct: 454 YLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGE-----KPDHITMIGVLS 513

Query: 408 GFASNGFGEESLEVFRQM 414
                GF EE    F  M
Sbjct: 514 ACGHAGFVEEGRHYFSSM 519

BLAST of Cp4.1LG02g14250 vs. TAIR10
Match: AT2G20540.1 (AT2G20540.1 mitochondrial editing factor 21)

HSP 1 Score: 177.2 bits (448), Expect = 2.4e-44
Identity = 119/411 (28.95%), Positives = 214/411 (52.07%), Query Frame = 1

Query: 44  ELLDSFDRLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDT 103
           E+ + F   L++       K+++++ ++ G   S+F+  ++V    +   +  A ++F+ 
Sbjct: 8   EVENYFIPFLQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQ 67

Query: 104 VPFEGLSNLLLWNSIIRANV-DGYSREALQLYGKM-RNFGVLADGFTFPLVLRASSNLGI 163
           V      N+ L+NSIIRA   +    + +++Y ++ R    L D FTFP + ++ ++LG 
Sbjct: 68  V---SNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGS 127

Query: 164 FNLCKSLHCHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSG 223
             L K +H H+ +FG + H+   N L+ MY+K   + DA KVFD+M  + VISWN+++SG
Sbjct: 128 CYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWNSLLSG 187

Query: 224 YAYNYDVNGASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGAT 283
           YA    +  A  +F  M    ++   V+WT+++S +   G   E +  F +M++ G+   
Sbjct: 188 YARLGQMKKAKGLFHLM----LDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAGIEPD 247

Query: 284 AEMLAVVLSVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFH 343
              L  VL  CA L + + G+ +H Y  + GF       NALI +Y K G I  A +LF 
Sbjct: 248 EISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFG 307

Query: 344 EMKVKNLVSWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNG 403
           +M+ K+++SW+++IS YA  G    A E F+E+++     ++KP+ IT+  ++   +  G
Sbjct: 308 QMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQR----AKVKPNGITFLGLLSACSHVG 367

Query: 404 FGEESLEVFRQMQ------------------LANGGRLDKSPSMELQDPVK 435
             +E L  F  M+                  LA  G+L+++  +    P+K
Sbjct: 368 MWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMK 407

BLAST of Cp4.1LG02g14250 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 176.0 bits (445), Expect = 5.4e-44
Identity = 102/354 (28.81%), Positives = 187/354 (52.82%), Query Frame = 1

Query: 64  QVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIRANV 123
           Q H+  + +GA +  +++A+L++ Y+      DA  V  ++P   + +   ++S+I A  
Sbjct: 36  QAHARILKSGAQNDGYISAKLIASYSNYNCFNDADLVLQSIPDPTIYS---FSSLIYALT 95

Query: 124 DG-YSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQNHLH 183
                 +++ ++ +M + G++ D    P + +  + L  F + K +HC     G      
Sbjct: 96  KAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQIHCVSCVSGLDMDAF 155

Query: 184 VVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQMELEG 243
           V   +  MY++  RM DARKVFD+M  K V++ + ++  YA    +    R+  +ME  G
Sbjct: 156 VQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESSG 215

Query: 244 VEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFDRGQ 303
           +E N V+W  +LS   R G+ +E + +F K+   G       ++ VL    D +  + G+
Sbjct: 216 IEANIVSWNGILSGFNRSGYHKEAVVMFQKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGR 275

Query: 304 MVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYAESG 363
           ++HGY++K G        +A+I +YGK G +     LF++ ++      N+ I+  + +G
Sbjct: 276 LIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNG 335

Query: 364 LYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLA 417
           L DKA E F EL K +    M+ +V++W+++I G A NG   E+LE+FR+MQ+A
Sbjct: 336 LVDKALEMF-ELFKEQ---TMELNVVSWTSIIAGCAQNGKDIEALELFREMQVA 382
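
The percentages on an HSP statistics line follow directly from the residue counts, so they can be cross-checked. The following is a minimal sketch, not part of the original record: the function name `recompute_hsp_percentages` and the regular expression are illustrative assumptions, and the pattern only covers statistics lines laid out as in this listing.

```python
import re

# Matches the identity/positives part of an HSP statistics line.
# "Pos\w*tives" also tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_hsp_percentages(line):
    """Return reported and recomputed percentages for one statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_recomputed": round(100 * int(pos) / int(pos_len), 2),
    }


# Statistics line of the AT1G20230.1 hit.
stats = recompute_hsp_percentages(
    "Identity = 102/354 (28.81%), Positives = 187/354 (52.82%), Query Frame = 1"
)
print(stats)
```

For the AT1G20230.1 hit, 102/354 and 187/354 reproduce the reported 28.81% and 52.82% after rounding to two decimals.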

BLAST of Cp4.1LG02g14250 vs. NCBI nr
Match: gi|449458231|ref|XP_004146851.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Cucumis sativus])

HSP 1 Score: 671.4 bits (1731), Expect = 1.2e-189
Identity = 335/418 (80.14%), Positives = 364/418 (87.08%), Query Frame = 1

Query: 1   MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIR 60
           ML ASSYQRF S SFCFP  +INF        +S FSSITYD++L D FD LLRQCNGI+
Sbjct: 1   MLCASSYQRFKSVSFCFPPLSINF--------HSQFSSITYDEDLPDFFDHLLRQCNGIQ 60

Query: 61  HCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIR 120
           H KQVHSATVVTGA  SAFV+ARLVS+Y+R G V DARKVF + PFE  SN LLWNSIIR
Sbjct: 61  HSKQVHSATVVTGAYCSAFVSARLVSIYSRYGLVSDARKVFGSAPFECYSNFLLWNSIIR 120

Query: 121 ANV-DGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQN 180
           ANV  GY  EALQLYGKMRN+GVL DGFTFPL+LRASSNLG FN+CK+LHCHVVQFGFQN
Sbjct: 121 ANVYHGYCIEALQLYGKMRNYGVLGDGFTFPLLLRASSNLGAFNMCKNLHCHVVQFGFQN 180

Query: 181 HLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQME 240
           HLHV NEL+GMY KL RMDDARKVFDKMR+KSV+SWNTMVSGYAYNYDVNGASRMF QME
Sbjct: 181 HLHVGNELIGMYAKLERMDDARKVFDKMRIKSVVSWNTMVSGYAYNYDVNGASRMFHQME 240

Query: 241 LEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFD 300
           LEGVEPNPVTWTSLLSSHARCGHLEET+ LF KMRMKGVG TAEMLAVVLSVCADL T +
Sbjct: 241 LEGVEPNPVTWTSLLSSHARCGHLEETMVLFCKMRMKGVGPTAEMLAVVLSVCADLATLN 300

Query: 301 RGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYA 360
            GQM+HGY+VKGGF DYLFAKNALIT+YGKGG + DAEKLFHEMKVKNLVSWN+LISS+A
Sbjct: 301 SGQMIHGYMVKGGFNDYLFAKNALITLYGKGGGVGDAEKLFHEMKVKNLVSWNALISSFA 360

Query: 361 ESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLAN 418
           ESG+YDKA E  S+LEKME  PEMKP+VITWSA+ICGFAS G GEESLEVFR+MQLAN
Sbjct: 361 ESGVYDKALELLSQLEKMEAYPEMKPNVITWSAIICGFASKGLGEESLEVFRKMQLAN 410

BLAST of Cp4.1LG02g14250 vs. NCBI nr
Match: gi|659093593|ref|XP_008447611.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Cucumis melo])

HSP 1 Score: 662.9 bits (1709), Expect = 4.1e-187
Identity = 331/418 (79.19%), Positives = 363/418 (86.84%), Query Frame = 1

Query: 1   MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYDDELLDSFDRLLRQCNGIR 60
           ML A SYQRF S SFCFP  +INF        +S FSSITYD++L + FD LLRQCNGI+
Sbjct: 1   MLCAYSYQRFKSVSFCFPPLSINF--------HSQFSSITYDEDLPEFFDHLLRQCNGIQ 60

Query: 61  HCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLSNLLLWNSIIR 120
           H KQVHSATVVTGA  SAFV+ARLVS+Y+R G V DARKVF + PFE LSN LLWNSIIR
Sbjct: 61  HSKQVHSATVVTGAYCSAFVSARLVSIYSRYGLVSDARKVFGSAPFECLSNFLLWNSIIR 120

Query: 121 ANV-DGYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLHCHVVQFGFQN 180
           ANV  GY  EAL LYGKMRN+GVL DGFTFPLVLRASSNLG  ++CK+LHCHVVQFGFQN
Sbjct: 121 ANVYHGYCIEALHLYGKMRNYGVLGDGFTFPLVLRASSNLGTSDVCKNLHCHVVQFGFQN 180

Query: 181 HLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVNGASRMFLQME 240
           HLHV NEL+GMY KL RMDDARKVFDKMR+KSV+SWNTMVSGYAYNYDVNGASRMF QME
Sbjct: 181 HLHVGNELIGMYAKLERMDDARKVFDKMRIKSVVSWNTMVSGYAYNYDVNGASRMFHQME 240

Query: 241 LEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVLSVCADLDTFD 300
           LEGVEPNPVTWTSLLSSHARCGHL ET+ LF KMRMKGVGATAEMLAVVLSVCADL T +
Sbjct: 241 LEGVEPNPVTWTSLLSSHARCGHLVETMVLFCKMRMKGVGATAEMLAVVLSVCADLATLN 300

Query: 301 RGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLVSWNSLISSYA 360
            GQM+HGY+VKGGF DYLFAKNALIT+YGKGGD+ DAEKLFHEMKVKNLVSWN+LISS+A
Sbjct: 301 SGQMIHGYMVKGGFNDYLFAKNALITLYGKGGDVGDAEKLFHEMKVKNLVSWNALISSFA 360

Query: 361 ESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEVFRQMQLAN 418
           ESG+YDKA E  S+LEKME  PEMKP+VITWS++ICGF+S G GEESLEVFR+MQLAN
Sbjct: 361 ESGVYDKALELLSQLEKMEAYPEMKPNVITWSSIICGFSSKGLGEESLEVFRKMQLAN 410

BLAST of Cp4.1LG02g14250 vs. NCBI nr
Match: gi|645244349|ref|XP_008228390.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Prunus mume])

HSP 1 Score: 500.0 bits (1286), Expect = 4.6e-138
Identity = 261/425 (61.41%), Positives = 320/425 (75.29%), Query Frame = 1

Query: 1   MLYASSYQRFNSAS-------FCFP-RFTINF--PSHFRYYFNSLFSSITYDDELLDSFD 60
           ML+ASS QRF S S       F FP + + +F  P H          S T  +E LD F+
Sbjct: 1   MLHASS-QRFISTSSRLHHTHFRFPPKLSRSFSKPRHIHQTQIISHPSCTTHNEFLDFFE 60

Query: 61  RLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLS 120
            +LRQC G + CKQVH+  + TG   S F+AA+LV+ YAR G +FDA+KVFDT P EG S
Sbjct: 61  HILRQCTGHKQCKQVHAQIITTGTSQSEFLAAKLVTAYARIGLIFDAQKVFDTGPVEGRS 120

Query: 121 NLLLWNSIIRANVD-GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLH 180
           NLLLWNSI+RANV  G+  +AL+LY KM+N GVL DGFTFPLV+RA + +    L K++H
Sbjct: 121 NLLLWNSILRANVSHGFYEQALKLYDKMKNLGVLGDGFTFPLVIRACAFMDRLKLSKNVH 180

Query: 181 CHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVN 240
            HV+Q GFQNHLHVVNEL+GMY KL RMD AR++FD+MRV+S +SWNTMVS YA+NYD +
Sbjct: 181 SHVLQMGFQNHLHVVNELIGMYGKLGRMDCARRLFDRMRVRSYVSWNTMVSSYAFNYDCD 240

Query: 241 GASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVL 300
           GA+ MF +MELEG+EPNPVTWTSLLSSHARCG  EETI LF  MR++GVG TAE+LAVVL
Sbjct: 241 GATEMFRRMELEGLEPNPVTWTSLLSSHARCGRREETIQLFGMMRVRGVGTTAEVLAVVL 300

Query: 301 SVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLV 360
           SVCADL   DRG+M+HGY+++GGF+DYLF +NALI +YGK G   DA+KLF  M+ KNLV
Sbjct: 301 SVCADLAVVDRGKMIHGYVIRGGFKDYLFVENALICMYGKCGHEEDADKLFLGMESKNLV 360

Query: 361 SWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEV 415
           SWN+LIS YAESGL D+AF  FS+L      P M+P++I+WSAVI GF+S G GEESLE+
Sbjct: 361 SWNALISCYAESGLCDEAFTIFSQLNNH---PFMRPNIISWSAVIGGFSSKGRGEESLEL 420

BLAST of Cp4.1LG02g14250 vs. NCBI nr
Match: gi|703136748|ref|XP_010106241.1| (hypothetical protein L484_005106 [Morus notabilis])

HSP 1 Score: 492.7 bits (1267), Expect = 7.4e-136
Identity = 256/427 (59.95%), Positives = 316/427 (74.00%), Query Frame = 1

Query: 1   MLYASSYQRFNSASFCFPRFTINFPSHFRYYFNSLFSSITYD----------DELLDSFD 60
           ML AS+ Q F S    F    ++F S+    F+   S  ++D          +E+LD FD
Sbjct: 1   MLNASA-QCFTSTFSRFRHSHLSFRSNSTSSFSFAHSIHSHDLQPHPSRATHNEVLDFFD 60

Query: 61  RLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLS 120
             L+QC   +HCKQ+HS  +V+GA  S F+A+RLVSVY+R G V DA+KVFD  P E  S
Sbjct: 61  SFLKQCTKTQHCKQLHSQVIVSGAHRSGFLASRLVSVYSRLGLVGDAQKVFDEFPVENCS 120

Query: 121 NLLLWNSIIRANVD-GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLH 180
           NLLLWNSI RANV  G  +EALQL+ KMR  GV  DGFTFPL++RA + +G   LC+ +H
Sbjct: 121 NLLLWNSIARANVSHGLYKEALQLFDKMRKLGVWPDGFTFPLIIRACAFIGSLALCRRVH 180

Query: 181 CHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVN 240
             V+Q GF+NHLH VNEL+GMY KL RMDDA  +FD+M V+S +SWNTM+SGYAYNYD  
Sbjct: 181 GLVLQMGFRNHLHAVNELLGMYGKLERMDDACLLFDRMPVRSYVSWNTMISGYAYNYDCV 240

Query: 241 GASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVL 300
           G+S+MF +M+LEG EPN VTWTSLLSSHARCG  +E + LFS MR  GVG TAE LAVVL
Sbjct: 241 GSSKMFERMDLEGFEPNSVTWTSLLSSHARCGRRDEAVELFSLMRSTGVGPTAEALAVVL 300

Query: 301 SVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLV 360
           SVCADL + D+G+M+HGY+VKGGFEDYLFAKNALI +YGK G +  A+K F EMK KNLV
Sbjct: 301 SVCADLASADKGKMIHGYVVKGGFEDYLFAKNALICMYGKCGLLEHAQKAFLEMKTKNLV 360

Query: 361 SWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEV 417
           SWN+LISSYAESGL D+AFE F++LEK  G P ++P++I+WSA ICGFA  G GEESLE+
Sbjct: 361 SWNTLISSYAESGLCDEAFEVFTQLEKSCGYPMVRPNIISWSAAICGFALKGRGEESLEL 420

BLAST of Cp4.1LG02g14250 vs. NCBI nr
Match: gi|595966256|ref|XP_007217158.1| (hypothetical protein PRUPE_ppa001759mg [Prunus persica])

HSP 1 Score: 489.2 bits (1258), Expect = 8.2e-135
Identity = 257/425 (60.47%), Positives = 317/425 (74.59%), Query Frame = 1

Query: 1   MLYASSYQRFNSAS-------FCFP-RFTINF--PSHFRYYFNSLFSSITYDDELLDSFD 60
           ML+ASS QRF S S       F FP + + +F  P H          S T  +E LD F+
Sbjct: 1   MLHASS-QRFISTSSRLHHNHFRFPPKLSRSFSKPGHIHQTQIISHPSCTTHNEFLDFFE 60

Query: 61  RLLRQCNGIRHCKQVHSATVVTGACSSAFVAARLVSVYARSGFVFDARKVFDTVPFEGLS 120
            +LRQC G + CKQVH+  + TG   S F+AA+LV+ YAR G +FDA+KVFDT P EG S
Sbjct: 61  HILRQCTGNKQCKQVHAQIITTGTYQSEFLAAKLVTAYARIGLIFDAQKVFDTGPVEGRS 120

Query: 121 NLLLWNSIIRANVD-GYSREALQLYGKMRNFGVLADGFTFPLVLRASSNLGIFNLCKSLH 180
           NLLLWNSI+RANV  G+  +AL+LY KM N GVL DGFTFPLV+RA + +    L K++H
Sbjct: 121 NLLLWNSILRANVSHGFYEQALKLYDKMTNLGVLGDGFTFPLVIRACAFMDRLKLSKNVH 180

Query: 181 CHVVQFGFQNHLHVVNELMGMYVKLRRMDDARKVFDKMRVKSVISWNTMVSGYAYNYDVN 240
            HV+Q GFQNHLHVVNEL+GMY K+ RMD AR +FD+MRV+S +SWNTMVS YA+NYD +
Sbjct: 181 SHVLQMGFQNHLHVVNELIGMYGKVGRMDCARLLFDRMRVRSYVSWNTMVSSYAFNYDCD 240

Query: 241 GASRMFLQMELEGVEPNPVTWTSLLSSHARCGHLEETIALFSKMRMKGVGATAEMLAVVL 300
           GA+ MF +MELEG+EPNPVTWTSLLSS AR G  EETI LF  MR++GVG TAE+LAVVL
Sbjct: 241 GATEMFRRMELEGLEPNPVTWTSLLSSRARRGRREETIQLFGMMRVRGVGTTAEVLAVVL 300

Query: 301 SVCADLDTFDRGQMVHGYIVKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMKVKNLV 360
           SVCADL   D+G+M+HGY+++GGF+DYLF +NALI +YGK G + DA+KLF  M+ KNLV
Sbjct: 301 SVCADLAVVDKGKMIHGYVIRGGFKDYLFVENALICMYGKCGHVEDADKLFLGMESKNLV 360

Query: 361 SWNSLISSYAESGLYDKAFEAFSELEKMEGCPEMKPSVITWSAVICGFASNGFGEESLEV 415
           SWN+LIS YAESGL D+AF  FS+L      P M+P++I+WSAVI GF+S G GEESLE+
Sbjct: 361 SWNALISCYAESGLCDEAFTIFSQLNDH---PFMRPNIISWSAVIGGFSSKGRGEESLEL 420

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PPR47_ARATH  1.2e-93  45.28     Putative pentatricopeptide repeat-containing protein At1g17630 OS=Arabidopsis th...
PPR52_ARATH  5.2e-49  29.85     Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana GN...
PP151_ARATH  6.6e-44  32.01     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN...
PP165_ARATH  4.3e-43  28.95     Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN...
PPR53_ARATH  9.6e-43  28.81     Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0LFT1_CUCSA  8.1e-190  80.14     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G848280 PE=4 SV=1
W9S1P1_9ROSA      5.2e-136  59.95     Uncharacterized protein OS=Morus notabilis GN=L484_005106 PE=4 SV=1
M5WX31_PRUPE      5.7e-135  60.47     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001759mg PE=4 SV=1
D7U009_VITVI      3.3e-127  61.44     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_09s0002g02250 PE=4 SV=...
A5B4B4_VITVI      3.3e-127  61.44     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031739 PE=4 SV=1

Match Name   E-value  Identity  Description
AT1G17630.1  6.7e-95  45.28     Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G19720.1  3.0e-50  29.85     Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G13600.1  3.7e-45  32.01     Pentatricopeptide repeat (PPR) superfamily protein
AT2G20540.1  2.4e-44  28.95     mitochondrial editing factor 21
AT1G20230.1  5.4e-44  28.81     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|449458231|ref|XP_004146851.1|  1.2e-189  80.14     PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Cucum...
gi|659093593|ref|XP_008447611.1|  4.1e-187  79.19     PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Cucum...
gi|645244349|ref|XP_008228390.1|  4.6e-138  61.41     PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Prunu...
gi|703136748|ref|XP_010106241.1|  7.4e-136  59.95     hypothetical protein L484_005106 [Morus notabilis]
gi|595966256|ref|XP_007217158.1|  8.2e-135  60.47     hypothetical protein PRUPE_ppa001759mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006457 protein folding
biological_process GO:0031930 mitochondria-nucleus signaling pathway
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g14250.1  Cp4.1LG02g14250.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 321..347  score: 3.6E-6
    coord: 349..376  score: 1.2E-6
    coord: 388..414  score: 1.9E-5
    coord: 115..142  score:
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 210..258  score: 9.2
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 248..278  score: 2.0E-5
    coord: 349..380  score: 7.9E-6
    coord: 321..348  score: 5.9E-6
    coord: 388..414  score: 9.0E-5
    coord: 213..246  score: 7.7E-6
    coord: 185..210  score: 3.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 145..179  score: 5.415
    coord: 386..416  score: 8.561
    coord: 211..245  score: 10.249
    coord: 347..381  score: 11.038
    coord: 77..111  score: 6.533
    coord: 180..210  score: 7.826
    coord: 246..280  score: 11.268
    coord: 316..346  score: 8
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 313..377  score: 7.9E-6
    coord: 180..276  score: 7.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 1..34  score: 3.0E-175
    coord: 300..417  score: 3.0E-175
    coord: 64..264  score: 3.0E
None  No IPR available  PANTHER  PTHR24015:SF617  SUBFAMILY NOT NAMED
    coord: 64..264  score: 3.0E-175
    coord: 1..34  score: 3.0E-175
    coord: 300..417  score: 3.0E
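
The PS51375 coordinates listed above can be summarized to see how the predicted PPR motifs are arranged along the protein. The sketch below is an illustration only and not part of the original annotation: `summarize_hits` is a hypothetical helper, the coordinates are copied by hand from the PS51375 rows, and the expectation of roughly 35-residue motifs comes from general background on PPR proteins rather than from this page.

```python
# PS51375 hit coordinates (1-based, inclusive), copied from the table above.
PS51375_HITS = [
    (145, 179), (386, 416), (211, 245), (347, 381),
    (77, 111), (180, 210), (246, 280), (316, 346),
]


def summarize_hits(hits):
    """Sort hits by start and report (start, end, length, gap_to_previous).

    gap_to_previous is the number of residues between this hit and the
    previous one (0 means directly adjacent); it is None for the first hit.
    """
    rows = []
    previous_end = None
    for start, end in sorted(hits):
        length = end - start + 1
        gap = None if previous_end is None else start - previous_end - 1
        rows.append((start, end, length, gap))
        previous_end = end
    return rows


for start, end, length, gap in summarize_hits(PS51375_HITS):
    print(start, end, length, gap)
```

In this record the hits are 31 or 35 residues long, and several follow one another with no gap (for example 145..179, 180..210, 211..245, 246..280), which is consistent with tandemly arranged repeats.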

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG02g14250  Cp4.1LG17g04980  Cucurbita pepo (Zucchini)  cpecpeB332
The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG02g14250  Cucumber (Chinese Long) v3    cpecucB0661
Cp4.1LG02g14250  Cucumber (Chinese Long) v3    cpecucB0678
Cp4.1LG02g14250  Cucumber (Chinese Long) v3    cpecucB0734
Cp4.1LG02g14250  Wax gourd                     cpewgoB0730
Cp4.1LG02g14250  Wax gourd                     cpewgoB0740
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB122
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB181
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB355
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB451
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB453
Cp4.1LG02g14250  Cucurbita pepo (Zucchini)     cpecpeB473
Cp4.1LG02g14250  Cucumber (Gy14) v1            cgycpeB0277
Cp4.1LG02g14250  Cucumber (Gy14) v1            cgycpeB0941
Cp4.1LG02g14250  Cucurbita maxima (Rimu)       cmacpeB282
Cp4.1LG02g14250  Cucurbita maxima (Rimu)       cmacpeB399
Cp4.1LG02g14250  Cucurbita maxima (Rimu)       cmacpeB921
Cp4.1LG02g14250  Cucurbita moschata (Rifu)     cmocpeB244
Cp4.1LG02g14250  Cucurbita moschata (Rifu)     cmocpeB361
Cp4.1LG02g14250  Cucurbita moschata (Rifu)     cmocpeB857
Cp4.1LG02g14250  Wild cucumber (PI 183967)     cpecpiB545
Cp4.1LG02g14250  Wild cucumber (PI 183967)     cpecpiB595
Cp4.1LG02g14250  Cucumber (Chinese Long) v2    cpecuB543
Cp4.1LG02g14250  Cucumber (Chinese Long) v2    cpecuB594
Cp4.1LG02g14250  Bottle gourd (USVL1VR-Ls)     cpelsiB450
Cp4.1LG02g14250  Watermelon (Charleston Gray)  cpewcgB481
Cp4.1LG02g14250  Watermelon (Charleston Gray)  cpewcgB494
Cp4.1LG02g14250  Watermelon (Charleston Gray)  cpewcgB505
Cp4.1LG02g14250  Watermelon (97103) v1         cpewmB534
Cp4.1LG02g14250  Watermelon (97103) v1         cpewmB554
Cp4.1LG02g14250  Watermelon (97103) v1         cpewmB583
Cp4.1LG02g14250  Melon (DHL92) v3.5.1          cpemeB504
Cp4.1LG02g14250  Melon (DHL92) v3.5.1          cpemeB518
Cp4.1LG02g14250  Melon (DHL92) v3.5.1          cpemeB527
Cp4.1LG02g14250  Cucumber (Gy14) v2            cgybcpeB094
Cp4.1LG02g14250  Cucumber (Gy14) v2            cgybcpeB225
Cp4.1LG02g14250  Cucumber (Gy14) v2            cgybcpeB953
Cp4.1LG02g14250  Melon (DHL92) v3.6.1          cpemedB596
Cp4.1LG02g14250  Melon (DHL92) v3.6.1          cpemedB601
Cp4.1LG02g14250  Melon (DHL92) v3.6.1          cpemedB626
Cp4.1LG02g14250  Silver-seed gourd             carcpeB0140
Cp4.1LG02g14250  Silver-seed gourd             carcpeB0175
Cp4.1LG02g14250  Silver-seed gourd             carcpeB0813
Cp4.1LG02g14250  Silver-seed gourd             carcpeB1429