Cp4.1LG03g03520 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g03520
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing family protein
LocationCp4.1LG03 : 109631 .. 111118 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAGCATCAGGGTTGTGAAGAGGAGCGACCACTTCCTCAGAAAACACAGGAAGTGCCCATTTTCACCCTTCAAAACCAAATGGCATCAAACCTTCAATCAGGATGAAGCTCTGCGAGCCATTAAACAAGCTGCAAATTCGCCGAAACAACCCAATTCCGACGAACCCTACCTTCTTTCGGTTCTTATAAGCTCCTTCAGAGCCTATTGCTGCGATCCAACCCCCAATGCCTACCACTTCGTCCTCAAAACCTTTGTTCGAAGCTCACAGTTCCACTACATTGCCGCTGTTCTCGATCGTCTCGAACGCGTCGAGAAATTTGAAACCCCAGAGTACATATTTGTCGACCTCCTCAAAGTTTACGGACGAGTCAACCGAATTCAAGATGCCATCACTCTGTTCCGTAGGATTCCCATGTTTAGGTGCGTTCCTTCTGCGCTCTCACTCAACTCTCTACTTTTCCTGCTCTGTAGAAATGGCGAAGGTCTTCGAATAGTTCCTGAGATCATATTGAGTAGCCAGACTATGGGTATTAGGCTTGAAGAGTCCACTTTTCGGATACTAATTACTGCGTTATGTAAAATTTATAAAGTTGGGCATGCAATGGAGCTTTTCAATTATATGATAACTGAAGGGTATGGTCTGAACCCTGGAATCTGCTCTTTGATATTGGCATCGCTATGTGAGCATAAGAAATCCACTGGTAATGTGGTTCTTACCTTTCTGGAACAAATGAGGCTAAAAGGATTCTGTCCCGGTGTTGTGGATTATTCTAATGTAATTAAGTTCTTGGTTAGGAGAGGGCTGAGTTCAGACGCTCTTGATCTGTTGAATAAAATGAAGGCCGATGGTTTCAAGCCTGACATTGTTTGTTATACTATGGTCTTGAATGGGGTGATTGCGGATGGGGATTATAAGATGGCAGATGAACTGTTTGATGAATTGCTTCTCTTTGGTTTAGTTCCTGATATTTATACATACAACGTGTACATACATGGATTATGCAAACAAGGCAATTGGGAAGCAGGGATTCAAATGATTTTGCATATGGAGGAATTGGGGTGCAAACCTGACGTGATCACTTACAATATTCTCTTGGAATGTTTGTGTAAAATCGGTGAACTTGACGAGGCAAGGAAGCTTCGAGGTAACATGCAACTAAAGGGTCTGGCAAAAAACGTGCGGACTTTTAGGATTATGATCAGTGGGTTATTTAATAACGGTGATGTAATTGAGGCTTGTATCCTATTGGAGGAAATGCTAGTATGTCGTTTCCCTCCTCAGATTTCAACTTTTGGTGAGATACTTTCTTGGCTGTGCAAAAGGGACATGGTAGGCAAAGCACTTGAGCTGCTCACGTTAATGGTTGGCATGAACTTTTCCCCTGGTCCTAAGGCTTGGGAAACACTGCTCCTGAGCTCTGAAAGTGAATTACCTTCTGTTAAGAGCCTTGAAACTACTTTAGAAGATTTAGTAGGCATCTGA

mRNA sequence

ATGGAAAGCATCAGGGTTGTGAAGAGGAGCGACCACTTCCTCAGAAAACACAGGAAGTGCCCATTTTCACCCTTCAAAACCAAATGGCATCAAACCTTCAATCAGGATGAAGCTCTGCGAGCCATTAAACAAGCTGCAAATTCGCCGAAACAACCCAATTCCGACGAACCCTACCTTCTTTCGGTTCTTATAAGCTCCTTCAGAGCCTATTGCTGCGATCCAACCCCCAATGCCTACCACTTCGTCCTCAAAACCTTTGTTCGAAGCTCACAGTTCCACTACATTGCCGCTGTTCTCGATCGTCTCGAACGCGTCGAGAAATTTGAAACCCCAGAGTACATATTTGTCGACCTCCTCAAAGTTTACGGACGAGTCAACCGAATTCAAGATGCCATCACTCTGTTCCGTAGGATTCCCATGTTTAGGTGCGTTCCTTCTGCGCTCTCACTCAACTCTCTACTTTTCCTGCTCTGTAGAAATGGCGAAGGTCTTCGAATAGTTCCTGAGATCATATTGAGTAGCCAGACTATGGGTATTAGGCTTGAAGAGTCCACTTTTCGGATACTAATTACTGCGTTATGTAAAATTTATAAAGTTGGGCATGCAATGGAGCTTTTCAATTATATGATAACTGAAGGGTATGGTCTGAACCCTGGAATCTGCTCTTTGATATTGGCATCGCTATGTGAGCATAAGAAATCCACTGGTAATGTGGTTCTTACCTTTCTGGAACAAATGAGGCTAAAAGGATTCTGTCCCGGTGTTGTGGATTATTCTAATGTAATTAAGTTCTTGGTTAGGAGAGGGCTGAGTTCAGACGCTCTTGATCTGTTGAATAAAATGAAGGCCGATGGTTTCAAGCCTGACATTGTTTGTTATACTATGGTCTTGAATGGGGTGATTGCGGATGGGGATTATAAGATGGCAGATGAACTGTTTGATGAATTGCTTCTCTTTGGTTTAGTTCCTGATATTTATACATACAACGTGTACATACATGGATTATGCAAACAAGGCAATTGGGAAGCAGGGATTCAAATGATTTTGCATATGGAGGAATTGGGGTGCAAACCTGACGTGATCACTTACAATATTCTCTTGGAATGTTTGTGTAAAATCGGTGAACTTGACGAGGCAAGGAAGCTTCGAGGTAACATGCAACTAAAGGGTCTGGCAAAAAACGTGCGGACTTTTAGGATTATGATCAGTGGGTTATTTAATAACGGTGATGTAATTGAGGCTTGTATCCTATTGGAGGAAATGCTAGTATGTCGTTTCCCTCCTCAGATTTCAACTTTTGGTGAGATACTTTCTTGGCTGTGCAAAAGGGACATGGTAGGCAAAGCACTTGAGCTGCTCACGTTAATGGTTGGCATGAACTTTTCCCCTGGTCCTAAGGCTTGGGAAACACTGCTCCTGAGCTCTGAAAGTGAATTACCTTCTGTTAAGAGCCTTGAAACTACTTTAGAAGATTTAGTAGGCATCTGA

Coding sequence (CDS)

ATGGAAAGCATCAGGGTTGTGAAGAGGAGCGACCACTTCCTCAGAAAACACAGGAAGTGCCCATTTTCACCCTTCAAAACCAAATGGCATCAAACCTTCAATCAGGATGAAGCTCTGCGAGCCATTAAACAAGCTGCAAATTCGCCGAAACAACCCAATTCCGACGAACCCTACCTTCTTTCGGTTCTTATAAGCTCCTTCAGAGCCTATTGCTGCGATCCAACCCCCAATGCCTACCACTTCGTCCTCAAAACCTTTGTTCGAAGCTCACAGTTCCACTACATTGCCGCTGTTCTCGATCGTCTCGAACGCGTCGAGAAATTTGAAACCCCAGAGTACATATTTGTCGACCTCCTCAAAGTTTACGGACGAGTCAACCGAATTCAAGATGCCATCACTCTGTTCCGTAGGATTCCCATGTTTAGGTGCGTTCCTTCTGCGCTCTCACTCAACTCTCTACTTTTCCTGCTCTGTAGAAATGGCGAAGGTCTTCGAATAGTTCCTGAGATCATATTGAGTAGCCAGACTATGGGTATTAGGCTTGAAGAGTCCACTTTTCGGATACTAATTACTGCGTTATGTAAAATTTATAAAGTTGGGCATGCAATGGAGCTTTTCAATTATATGATAACTGAAGGGTATGGTCTGAACCCTGGAATCTGCTCTTTGATATTGGCATCGCTATGTGAGCATAAGAAATCCACTGGTAATGTGGTTCTTACCTTTCTGGAACAAATGAGGCTAAAAGGATTCTGTCCCGGTGTTGTGGATTATTCTAATGTAATTAAGTTCTTGGTTAGGAGAGGGCTGAGTTCAGACGCTCTTGATCTGTTGAATAAAATGAAGGCCGATGGTTTCAAGCCTGACATTGTTTGTTATACTATGGTCTTGAATGGGGTGATTGCGGATGGGGATTATAAGATGGCAGATGAACTGTTTGATGAATTGCTTCTCTTTGGTTTAGTTCCTGATATTTATACATACAACGTGTACATACATGGATTATGCAAACAAGGCAATTGGGAAGCAGGGATTCAAATGATTTTGCATATGGAGGAATTGGGGTGCAAACCTGACGTGATCACTTACAATATTCTCTTGGAATGTTTGTGTAAAATCGGTGAACTTGACGAGGCAAGGAAGCTTCGAGGTAACATGCAACTAAAGGGTCTGGCAAAAAACGTGCGGACTTTTAGGATTATGATCAGTGGGTTATTTAATAACGGTGATGTAATTGAGGCTTGTATCCTATTGGAGGAAATGCTAGTATGTCGTTTCCCTCCTCAGATTTCAACTTTTGGTGAGATACTTTCTTGGCTGTGCAAAAGGGACATGGTAGGCAAAGCACTTGAGCTGCTCACGTTAATGGTTGGCATGAACTTTTCCCCTGGTCCTAAGGCTTGGGAAACACTGCTCCTGAGCTCTGAAAGTGAATTACCTTCTGTTAAGAGCCTTGAAACTACTTTAGAAGATTTAGTAGGCATCTGA

Protein sequence

MESIRVVKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVLISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLETTLEDLVGI
BLAST of Cp4.1LG03g03520 vs. Swiss-Prot
Match: PP193_ARATH (Pentatricopeptide repeat-containing protein At2g38420, mitochondrial OS=Arabidopsis thaliana GN=At2g38420 PE=2 SV=2)

HSP 1 Score: 398.7 bits (1023), Expect = 9.5e-110
Identity = 207/449 (46.10%), Postives = 298/449 (66.37%), Query Frame = 1

Query: 9   RSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVLISSFR 68
           R  +F+RK+RK P S FKTKW++   Q  A+  ++    S    +S+   ++  L+SSF+
Sbjct: 9   RMSNFMRKYRKIPHSSFKTKWNENLKQKYAMEELR----SNLLTDSENASVMRTLLSSFQ 68

Query: 69  AYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRI 128
            + C+PTP AY FV+KT  +SSQ   I++VL  LE  EKF+TPE IF D++  YG   RI
Sbjct: 69  LHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRI 128

Query: 129 QDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRI 188
           ++AI +F +IP FRCVPSA +LN+LL +L R  + L +VPEI++ +  MG+RLEESTF I
Sbjct: 129 EEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFGI 188

Query: 189 LITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRL 248
           LI ALC+I +V  A EL  YM  +   ++P + S +L+S+C+HK S+   V+ +LE +R 
Sbjct: 189 LIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRK 248

Query: 249 KGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKM 308
             F PG+ DY+ V++FLV  G   + + +LN+MK D  +PD+VCYT+VL GVIAD DY  
Sbjct: 249 TRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPK 308

Query: 309 ADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLE 368
           AD+LFDELLL GL PD+YTYNVYI+GLCKQ + E  ++M+  M +LG +P+V+TYNIL++
Sbjct: 309 ADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIK 368

Query: 369 CLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPP 428
            L K G+L  A+ L   M+  G+ +N  TF IMIS      +V+ A  LLEE        
Sbjct: 369 ALVKAGDLSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFV 428

Query: 429 QISTFGEILSWLCKRDMVGKALELLTLMV 458
           + S   E++S LC++ ++ +A+ELL  +V
Sbjct: 429 KSSRIEEVISRLCEKGLMDQAVELLAHLV 453

BLAST of Cp4.1LG03g03520 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 5.5e-41
Identity = 111/419 (26.49%), Postives = 200/419 (47.73%), Query Frame = 1

Query: 74  PTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYI-FVDLLKVYGRVNRIQDAI 133
           P    Y+ ++  + ++ + +   +VLDR+       +P+ + +  +L+      +++ A+
Sbjct: 170 PDVITYNVMISGYCKAGEINNALSVLDRMS-----VSPDVVTYNTILRSLCDSGKLKQAM 229

Query: 134 TLFRRIPMFRCVPSALSLNSLLFLLCRN---GEGLRIVPEIILSSQTMGIRLEESTFRIL 193
            +  R+    C P  ++   L+   CR+   G  ++++ E+    +  G   +  T+ +L
Sbjct: 230 EVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEM----RDRGCTPDVVTYNVL 289

Query: 194 ITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLT--FLEQMR 253
           +  +CK  ++  A++  N M + G   N    ++IL S+C    STG  +     L  M 
Sbjct: 290 VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMC----STGRWMDAEKLLADML 349

Query: 254 LKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYK 313
            KGF P VV ++ +I FL R+GL   A+D+L KM   G +P+ + Y  +L+G   +    
Sbjct: 350 RKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMD 409

Query: 314 MADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILL 373
            A E  + ++  G  PDI TYN  +  LCK G  E  ++++  +   GC P +ITYN ++
Sbjct: 410 RAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVI 469

Query: 374 ECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFP 433
           + L K G+  +A KL   M+ K L  +  T+  ++ GL   G V EA     E       
Sbjct: 470 DGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIR 529

Query: 434 PQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLE 487
           P   TF  I+  LCK     +A++ L  M+     P   ++  L+     E  + ++LE
Sbjct: 530 PNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALE 575

BLAST of Cp4.1LG03g03520 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 9.4e-41
Identity = 102/402 (25.37%), Postives = 195/402 (48.51%), Query Frame = 1

Query: 74  PTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQDAIT 133
           P  + ++ ++K   R+ Q      +L+ +         E  F  +++ Y     +  A+ 
Sbjct: 187 PDVSTFNVLIKALCRAHQLRPAILMLEDMPSYG-LVPDEKTFTTVMQGYIEEGDLDGALR 246

Query: 134 LFRRIPMFRCVPSALSLNSLLFLLCRNG---EGLRIVPEIILSSQTMGIRLEESTFRILI 193
           +  ++  F C  S +S+N ++   C+ G   + L  + E+   S   G   ++ TF  L+
Sbjct: 247 IREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEM---SNQDGFFPDQYTFNTLV 306

Query: 194 TALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKG 253
             LCK   V HA+E+ + M+ EGY  +    + +++ LC  K       +  L+QM  + 
Sbjct: 307 NGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLC--KLGEVKEAVEVLDQMITRD 366

Query: 254 FCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMAD 313
             P  V Y+ +I  L +     +A +L   + + G  PD+  +  ++ G+    ++++A 
Sbjct: 367 CSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAM 426

Query: 314 ELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECL 373
           ELF+E+   G  PD +TYN+ I  LC +G  +  + M+  ME  GC   VITYN L++  
Sbjct: 427 ELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGF 486

Query: 374 CKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQI 433
           CK  +  EA ++   M++ G+++N  T+  +I GL  +  V +A  L+++M++    P  
Sbjct: 487 CKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDK 546

Query: 434 STFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLL 473
            T+  +L+  C+   + KA +++  M      P    + TL+
Sbjct: 547 YTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLI 582

BLAST of Cp4.1LG03g03520 vs. Swiss-Prot
Match: PPR90_ARATH (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 1.6e-40
Identity = 107/381 (28.08%), Postives = 185/381 (48.56%), Query Frame = 1

Query: 79  YHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQDAITLFRRI 138
           Y+ ++  F R SQ     A+L ++ ++  +E        LL  Y    RI DA+ L  ++
Sbjct: 123 YNILINCFCRRSQISLALALLGKMMKLG-YEPSIVTLSSLLNGYCHGKRISDAVALVDQM 182

Query: 139 PMFRCVPSALSLNSL---LFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRILITALCK 198
                 P  ++  +L   LFL  +  E + +V  ++      G +    T+ +++  LCK
Sbjct: 183 VEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMV----QRGCQPNLVTYGVVVNGLCK 242

Query: 199 IYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKGFCPGV 258
                 A+ L N M       +  I + I+ SLC+++       L   ++M  KG  P V
Sbjct: 243 RGDTDLALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDD--ALNLFKEMETKGIRPNV 302

Query: 259 VDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFDE 318
           V YS++I  L   G  SDA  LL+ M      P++V +  +++  + +G +  A++L+D+
Sbjct: 303 VTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDD 362

Query: 319 LLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECLCKIGE 378
           ++   + PDI+TYN  ++G C     +   QM   M    C PDV+TYN L++  CK   
Sbjct: 363 MIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKR 422

Query: 379 LDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQISTFGE 438
           +++  +L   M  +GL  +  T+  +I GLF++GD   A  + ++M+    PP I T+  
Sbjct: 423 VEDGTELFREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSI 482

Query: 439 ILSWLCKRDMVGKALELLTLM 457
           +L  LC    + KALE+   M
Sbjct: 483 LLDGLCNNGKLEKALEVFDYM 496

BLAST of Cp4.1LG03g03520 vs. Swiss-Prot
Match: PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 167.9 bits (424), Expect = 2.7e-40
Identity = 98/346 (28.32%), Postives = 174/346 (50.29%), Query Frame = 1

Query: 127 RIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTF 186
           ++ DA+ LF  +   R  PS +  + LL  + +  +   +V  +    Q +GI     T+
Sbjct: 61  KLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNK-FDVVISLGEQMQNLGIPHNHYTY 120

Query: 187 RILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQM 246
            ILI   C+  ++  A+ +   M+  GY  N    S +L   C  K+ +  V L  ++QM
Sbjct: 121 SILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVAL--VDQM 180

Query: 247 RLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDY 306
            + G+ P  V ++ +I  L     +S+A+ L+++M A G +PD+V Y +V+NG+   GD 
Sbjct: 181 FVTGYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDT 240

Query: 307 KMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNIL 366
            +A  L +++    L P +  YN  I GLCK  + +  + +   ME  G +P+V+TY+ L
Sbjct: 241 DLAFNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSL 300

Query: 367 LECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRF 426
           + CLC  G   +A +L  +M  + +  +V TF  +I      G ++EA  L +EM+    
Sbjct: 301 ISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSI 360

Query: 427 PPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLL 473
            P I T+  +++  C  D + +A ++   MV  +  P    + TL+
Sbjct: 361 DPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLI 403

BLAST of Cp4.1LG03g03520 vs. TrEMBL
Match: A0A0A0LA35_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G730720 PE=4 SV=1)

HSP 1 Score: 782.3 bits (2019), Expect = 3.4e-223
Identity = 389/493 (78.90%), Postives = 434/493 (88.03%), Query Frame = 1

Query: 3   SIRVVKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSV 62
           S  VVK+S++FLRKHRK P S  KTKWHQTF+QDEALR +KQAAN P QP+     LLS 
Sbjct: 4   SFSVVKKSNNFLRKHRKWPLSSHKTKWHQTFDQDEALRILKQAAN-PDQPH----LLLSA 63

Query: 63  LISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVY 122
           L++SF AY C PTPNAY+FVLKT  R+SQFH+I  VL RL+ +E F+TPEYIFVDL+K+Y
Sbjct: 64  LVTSFTAYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLY 123

Query: 123 GRVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLE 182
           GR+NRIQDA+TLFRRIPMFRCVPS LSLNSLL  L RN +GL I+P+IIL+S +MGIRLE
Sbjct: 124 GRMNRIQDAVTLFRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLE 183

Query: 183 ESTFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTF 242
            STF+ILITALCK+ KVGHAMELFNYMITEGYGLNP ICSLILASLC+ KKS+G+VVL F
Sbjct: 184 HSTFQILITALCKVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGF 243

Query: 243 LEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIA 302
           LE+MR KGFCP VVDYSNVIKF V RG+ SDA+DLLNKMKADGFKPDIVCYTMVLNGVIA
Sbjct: 244 LEEMRQKGFCPAVVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIA 303

Query: 303 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVIT 362
           DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQG+  AG+QMI HME LGC+P+VIT
Sbjct: 304 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVIT 363

Query: 363 YNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEML 422
           YN++L+ LCK GELDEARKLR  MQLKGLA+N+RTFRIMI GLF+NG+VIEAC+LLEEML
Sbjct: 364 YNVILKSLCKTGELDEARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEML 423

Query: 423 VCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSV 482
             RFPPQISTF EILSWLCKR MVGKA+ELL LMVG NFSPGPKAWE LLLSSESEL SV
Sbjct: 424 GSRFPPQISTFSEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTSV 483

Query: 483 KSLETTLEDLVGI 496
           KSLETTL+DLVGI
Sbjct: 484 KSLETTLKDLVGI 491

BLAST of Cp4.1LG03g03520 vs. TrEMBL
Match: W9QV00_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021993 PE=4 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 5.2e-147
Identity = 262/489 (53.58%), Postives = 356/489 (72.80%), Query Frame = 1

Query: 4   IRVVKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVL 63
           IR    ++ FLRKHR+ P SP+KTKWH+TFNQ +AL+ +K+  N  + PN     LLS+L
Sbjct: 3   IRPFSLTNKFLRKHREFPISPYKTKWHETFNQTQALQTLKRHQN--ENPNR----LLSLL 62

Query: 64  ISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYG 123
           ++SF +Y C+PTP AYHFVLKT +++SQF +I +VLDR+E VEKFETPEY F  ++  YG
Sbjct: 63  LNSFNSYDCNPTPEAYHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYG 122

Query: 124 RVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEE 183
            ++RI+DAI +F RIP FRCVPS+ SLNSLL++LCR  EGLR VPE+++ S+ M IRLEE
Sbjct: 123 FLDRIEDAIDIFWRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRLEE 182

Query: 184 STFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKS---TGNVVL 243
           ++FRILITALCKI KVG+A+E+ + MI++GY ++  ICSLIL+ LC   K     G  VL
Sbjct: 183 ASFRILITALCKIGKVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDVL 242

Query: 244 TFLEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGV 303
             L++M   GFCP + DYS VI+ LVR     +ALD+L +MKADG KPD+VCYTMVL+G+
Sbjct: 243 ELLQKMEKMGFCPRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHGI 302

Query: 304 IADGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDV 363
           +A+G+Y  ADE+FDE+L+ GLVPD+YTYN YI+GLCKQ + +  +  IL MEELGCKP++
Sbjct: 303 VAEGEYSKADEMFDEMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPNL 362

Query: 364 ITYNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEE 423
           ITYN++L  LCK GE   A++L   M LKG    ++T+ IM+  L   G+++EAC L+EE
Sbjct: 363 ITYNLILRALCKNGEFGRAKELVAEMSLKGFEDYLQTYIIMLDVLLGKGEIVEACGLMEE 422

Query: 424 MLVCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELP 483
           ML      + S + EI+  LC+R +  KA E+L  MVG N +PG +AW+ LLLSS SEL 
Sbjct: 423 MLDKLLCRRCSMYDEIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALLLSSGSELT 482

Query: 484 SVKSLETTL 490
             +++ ++L
Sbjct: 483 LPEAIWSSL 485

BLAST of Cp4.1LG03g03520 vs. TrEMBL
Match: A0A067JJ28_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00390 PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 3.4e-146
Identity = 259/464 (55.82%), Postives = 334/464 (71.98%), Query Frame = 1

Query: 13  FLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPK--QPNSDEPYLLSVLISSFRAY 72
           FLRKHR+ P SP+K KWH  FNQ +A++ +KQ A S +  Q NS    LLS LI SF  Y
Sbjct: 13  FLRKHRRWPHSPYKAKWHHIFNQQQAMQNLKQEATSLQNLQQNSKSSNLLSSLIHSFSVY 72

Query: 73  CCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQD 132
             +PTP A+HF++KT   ++Q HYI  VLD LE++E FETPE+I   L+K YG  N IQ 
Sbjct: 73  NSEPTPQAFHFLIKTLTETTQLHYIPLVLDHLEKIENFETPEFILAHLIKFYGNANEIQK 132

Query: 133 AITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRILI 192
           AI LF RIP FRC+PS  SLN+LL +LCR+ +GL  VPEI+L S+ MGIRLE+S+FR+L 
Sbjct: 133 AIELFYRIPKFRCLPSVYSLNTLLSVLCRSSQGLERVPEILLRSRVMGIRLEDSSFRLLT 192

Query: 193 TALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKG 252
            A+CKI KVG+A+E+FN MI +G+ ++  ICSL+L+S+CE    +   V+ FL Q+R  G
Sbjct: 193 AAICKIKKVGYAVEIFNCMINDGFDVDTKICSLLLSSVCEQADVSRVDVMGFLGQLRKLG 252

Query: 253 FCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMAD 312
           FCPG+VDYSNVI+FLVR G+  DALD+LN+MK DG KPD+VCYTMVL GVIA+G Y  AD
Sbjct: 253 FCPGMVDYSNVIRFLVRGGMGMDALDVLNQMKIDGIKPDVVCYTMVLKGVIANGFYSKAD 312

Query: 313 ELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECL 372
           +LFDELL+FGLVPD+YTYN Y+ GLCKQ N  AGI+MI  MEELGCKP++ITYN+LLE L
Sbjct: 313 DLFDELLVFGLVPDVYTYNAYVDGLCKQENLHAGIKMIASMEELGCKPNLITYNLLLEAL 372

Query: 373 CKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQI 432
           CK GE+  AR L   M  KG+  +++T+++MI G   +G +IEAC LL+E+L      + 
Sbjct: 373 CKSGEISRARDLMKEMGKKGIGPSMQTYKVMIDGSTCSGKIIEACALLDEVLDKGLCAES 432

Query: 433 STFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLS 475
             F EI+  LC+   + KALELL  M   N +PG + W+ LL S
Sbjct: 433 LIFDEIICGLCQIGSISKALELLEKMALKNVAPGVRVWKVLLSS 476

BLAST of Cp4.1LG03g03520 vs. TrEMBL
Match: V4UT39_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10013613mg PE=4 SV=1)

HSP 1 Score: 525.0 bits (1351), Expect = 9.8e-146
Identity = 264/491 (53.77%), Postives = 353/491 (71.89%), Query Frame = 1

Query: 7   VKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSP----KQPNSDEPYLLSV 66
           +KR++  LRKHRK P SP+K KWHQT +Q +A + +KQ+  +P    +Q    +P++LS 
Sbjct: 7   LKRANLHLRKHRKWPLSPYKAKWHQTLDQQQAKQNVKQSLTTPPTKQQQQIPKQPHILSS 66

Query: 67  LISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVY 126
           L+ SF  Y C+P P AYHFV+KT   +SQF  I++VLD +E+ E FETPE+IF+DL+K Y
Sbjct: 67  LLHSFSIYNCEPPPEAYHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTY 126

Query: 127 GRVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLE 186
              +R QD++ LF +IP FRCVPS  SLN+LL +LCRN E +++VP+I+L SQ M IR+E
Sbjct: 127 ADAHRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIE 186

Query: 187 ESTFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTF 246
           ES+FRILI+ LC+I +VG A+E+ N MI +G+ ++   CS IL+S+CE +  + + +L F
Sbjct: 187 ESSFRILISTLCRINRVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGF 246

Query: 247 LEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIA 306
           +++M+  GFC G+VDY+NVI+ LV++    DAL +LN+MK+DG KPDIVCYTMVLNGVI 
Sbjct: 247 VQEMKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIV 306

Query: 307 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVIT 366
             DY  A+ELFDELL+ GLVPD+YTYNVYI+GLCKQ N EAGI+MI  MEELG KPDVIT
Sbjct: 307 QEDYVKAEELFDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVIT 366

Query: 367 YNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEML 426
           YN LL+ LCK+ EL+  R+L   M+ KG+  N++T+ IMI GL + GD+IEAC LLEE L
Sbjct: 367 YNTLLQALCKVRELNRLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEAL 426

Query: 427 VCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSV 486
                 Q S F E +  LC+R +V KALELL  M   + SPG + WE LLLSS S+L  V
Sbjct: 427 NKGLCTQSSMFDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLSSVSKLDFV 486

Query: 487 KSLETTLEDLV 494
            +    L D +
Sbjct: 487 NTSFIRLVDQI 497

BLAST of Cp4.1LG03g03520 vs. TrEMBL
Match: V7B2Q8_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G089500g PE=4 SV=1)

HSP 1 Score: 521.5 bits (1342), Expect = 1.1e-144
Identity = 253/490 (51.63%), Postives = 346/490 (70.61%), Query Frame = 1

Query: 8   KRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAA---NSPKQPNSDEPYLLSVLI 67
           KR++ +LRK RK P SP+KT WH  F + +A+  +KQA      P+ PN   P+LLS L+
Sbjct: 8   KRANKYLRKFRKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTPNLPHPFLLSTLL 67

Query: 68  SSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGR 127
            +F+AY CDPTP AY+FV+KT   +S    I  VLD LE++E FETPE+I V L++ YG 
Sbjct: 68  DAFKAYSCDPTPKAYYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGL 127

Query: 128 VNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEES 187
            +R+QDA+ LF RIP FRC P+  SLN +L LLCR  E L++VPEI+L SQ M IR+EES
Sbjct: 128 SDRVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVEES 187

Query: 188 TFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLE 247
           TF++LI ALC+I +VG+A+++ NYMI  GYGL+  ICSLI++SLCE +  T    L    
Sbjct: 188 TFQVLIEALCRIKRVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVIWR 247

Query: 248 QMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADG 307
            MR  GFCPGV+DY+N+I+FLV+ G  +DALD+LN+ K DG KPD+VCYTMVL+G++A+G
Sbjct: 248 DMRKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVAEG 307

Query: 308 DYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYN 367
           +Y   +ELFDE+L+FGLVPD+YTYNVYI+GLCKQ N +  ++++  MEEL C+P+V+T N
Sbjct: 308 EYVKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVTCN 367

Query: 368 ILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVC 427
            LL  LC  G+L +AR +   M  KG+  N+ ++RIM+ GL   G++ EAC LLEEML  
Sbjct: 368 TLLGALCVAGDLRKARGVMKEMGWKGVGLNLHSYRIMLDGLVGKGEIGEACFLLEEMLEK 427

Query: 428 RFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKS 487
            F P+ STF  I+  +C++ ++ +A+EL   +V  +F PG +AWE LLL S S+L     
Sbjct: 428 CFFPRSSTFDHIIFQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALLLKSGSKL---GF 487

Query: 488 LETTLEDLVG 495
            ETT   L+G
Sbjct: 488 SETTFSGLLG 494

BLAST of Cp4.1LG03g03520 vs. TAIR10
Match: AT2G38420.1 (AT2G38420.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 398.7 bits (1023), Expect = 5.4e-111
Identity = 207/449 (46.10%), Postives = 298/449 (66.37%), Query Frame = 1

Query: 9   RSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVLISSFR 68
           R  +F+RK+RK P S FKTKW++   Q  A+  ++    S    +S+   ++  L+SSF+
Sbjct: 9   RMSNFMRKYRKIPHSSFKTKWNENLKQKYAMEELR----SNLLTDSENASVMRTLLSSFQ 68

Query: 69  AYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRI 128
            + C+PTP AY FV+KT  +SSQ   I++VL  LE  EKF+TPE IF D++  YG   RI
Sbjct: 69  LHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRI 128

Query: 129 QDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRI 188
           ++AI +F +IP FRCVPSA +LN+LL +L R  + L +VPEI++ +  MG+RLEESTF I
Sbjct: 129 EEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFGI 188

Query: 189 LITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRL 248
           LI ALC+I +V  A EL  YM  +   ++P + S +L+S+C+HK S+   V+ +LE +R 
Sbjct: 189 LIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRK 248

Query: 249 KGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKM 308
             F PG+ DY+ V++FLV  G   + + +LN+MK D  +PD+VCYT+VL GVIAD DY  
Sbjct: 249 TRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPK 308

Query: 309 ADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLE 368
           AD+LFDELLL GL PD+YTYNVYI+GLCKQ + E  ++M+  M +LG +P+V+TYNIL++
Sbjct: 309 ADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIK 368

Query: 369 CLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPP 428
            L K G+L  A+ L   M+  G+ +N  TF IMIS      +V+ A  LLEE        
Sbjct: 369 ALVKAGDLSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFV 428

Query: 429 QISTFGEILSWLCKRDMVGKALELLTLMV 458
           + S   E++S LC++ ++ +A+ELL  +V
Sbjct: 429 KSSRIEEVISRLCEKGLMDQAVELLAHLV 453

BLAST of Cp4.1LG03g03520 vs. TAIR10
Match: AT1G09900.1 (AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 170.2 bits (430), Expect = 3.1e-42
Identity = 111/419 (26.49%), Postives = 200/419 (47.73%), Query Frame = 1

Query: 74  PTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYI-FVDLLKVYGRVNRIQDAI 133
           P    Y+ ++  + ++ + +   +VLDR+       +P+ + +  +L+      +++ A+
Sbjct: 170 PDVITYNVMISGYCKAGEINNALSVLDRMS-----VSPDVVTYNTILRSLCDSGKLKQAM 229

Query: 134 TLFRRIPMFRCVPSALSLNSLLFLLCRN---GEGLRIVPEIILSSQTMGIRLEESTFRIL 193
            +  R+    C P  ++   L+   CR+   G  ++++ E+    +  G   +  T+ +L
Sbjct: 230 EVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEM----RDRGCTPDVVTYNVL 289

Query: 194 ITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLT--FLEQMR 253
           +  +CK  ++  A++  N M + G   N    ++IL S+C    STG  +     L  M 
Sbjct: 290 VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMC----STGRWMDAEKLLADML 349

Query: 254 LKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYK 313
            KGF P VV ++ +I FL R+GL   A+D+L KM   G +P+ + Y  +L+G   +    
Sbjct: 350 RKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMD 409

Query: 314 MADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILL 373
            A E  + ++  G  PDI TYN  +  LCK G  E  ++++  +   GC P +ITYN ++
Sbjct: 410 RAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVI 469

Query: 374 ECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFP 433
           + L K G+  +A KL   M+ K L  +  T+  ++ GL   G V EA     E       
Sbjct: 470 DGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIR 529

Query: 434 PQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLE 487
           P   TF  I+  LCK     +A++ L  M+     P   ++  L+     E  + ++LE
Sbjct: 530 PNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALE 575

BLAST of Cp4.1LG03g03520 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 169.5 bits (428), Expect = 5.3e-42
Identity = 102/402 (25.37%), Postives = 195/402 (48.51%), Query Frame = 1

Query: 74  PTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQDAIT 133
           P  + ++ ++K   R+ Q      +L+ +         E  F  +++ Y     +  A+ 
Sbjct: 187 PDVSTFNVLIKALCRAHQLRPAILMLEDMPSYG-LVPDEKTFTTVMQGYIEEGDLDGALR 246

Query: 134 LFRRIPMFRCVPSALSLNSLLFLLCRNG---EGLRIVPEIILSSQTMGIRLEESTFRILI 193
           +  ++  F C  S +S+N ++   C+ G   + L  + E+   S   G   ++ TF  L+
Sbjct: 247 IREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEM---SNQDGFFPDQYTFNTLV 306

Query: 194 TALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKG 253
             LCK   V HA+E+ + M+ EGY  +    + +++ LC  K       +  L+QM  + 
Sbjct: 307 NGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLC--KLGEVKEAVEVLDQMITRD 366

Query: 254 FCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMAD 313
             P  V Y+ +I  L +     +A +L   + + G  PD+  +  ++ G+    ++++A 
Sbjct: 367 CSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAM 426

Query: 314 ELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECL 373
           ELF+E+   G  PD +TYN+ I  LC +G  +  + M+  ME  GC   VITYN L++  
Sbjct: 427 ELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGF 486

Query: 374 CKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQI 433
           CK  +  EA ++   M++ G+++N  T+  +I GL  +  V +A  L+++M++    P  
Sbjct: 487 CKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDK 546

Query: 434 STFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLL 473
            T+  +L+  C+   + KA +++  M      P    + TL+
Sbjct: 547 YTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLI 582

BLAST of Cp4.1LG03g03520 vs. TAIR10
Match: AT1G62590.1 (AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 168.7 bits (426), Expect = 9.0e-42
Identity = 107/381 (28.08%), Postives = 185/381 (48.56%), Query Frame = 1

Query: 79  YHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQDAITLFRRI 138
           Y+ ++  F R SQ     A+L ++ ++  +E        LL  Y    RI DA+ L  ++
Sbjct: 123 YNILINCFCRRSQISLALALLGKMMKLG-YEPSIVTLSSLLNGYCHGKRISDAVALVDQM 182

Query: 139 PMFRCVPSALSLNSL---LFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRILITALCK 198
                 P  ++  +L   LFL  +  E + +V  ++      G +    T+ +++  LCK
Sbjct: 183 VEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMV----QRGCQPNLVTYGVVVNGLCK 242

Query: 199 IYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLKGFCPGV 258
                 A+ L N M       +  I + I+ SLC+++       L   ++M  KG  P V
Sbjct: 243 RGDTDLALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDD--ALNLFKEMETKGIRPNV 302

Query: 259 VDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFDE 318
           V YS++I  L   G  SDA  LL+ M      P++V +  +++  + +G +  A++L+D+
Sbjct: 303 VTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDD 362

Query: 319 LLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLECLCKIGE 378
           ++   + PDI+TYN  ++G C     +   QM   M    C PDV+TYN L++  CK   
Sbjct: 363 MIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKR 422

Query: 379 LDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQISTFGE 438
           +++  +L   M  +GL  +  T+  +I GLF++GD   A  + ++M+    PP I T+  
Sbjct: 423 VEDGTELFREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSI 482

Query: 439 ILSWLCKRDMVGKALELLTLM 457
           +L  LC    + KALE+   M
Sbjct: 483 LLDGLCNNGKLEKALEVFDYM 496

BLAST of Cp4.1LG03g03520 vs. TAIR10
Match: AT1G62670.1 (AT1G62670.1 rna processing factor 2)

HSP 1 Score: 167.9 bits (424), Expect = 1.5e-41
Identity = 98/346 (28.32%), Postives = 174/346 (50.29%), Query Frame = 1

Query: 127 RIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTF 186
           ++ DA+ LF  +   R  PS +  + LL  + +  +   +V  +    Q +GI     T+
Sbjct: 61  KLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNK-FDVVISLGEQMQNLGIPHNHYTY 120

Query: 187 RILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQM 246
            ILI   C+  ++  A+ +   M+  GY  N    S +L   C  K+ +  V L  ++QM
Sbjct: 121 SILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVAL--VDQM 180

Query: 247 RLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDY 306
            + G+ P  V ++ +I  L     +S+A+ L+++M A G +PD+V Y +V+NG+   GD 
Sbjct: 181 FVTGYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDT 240

Query: 307 KMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNIL 366
            +A  L +++    L P +  YN  I GLCK  + +  + +   ME  G +P+V+TY+ L
Sbjct: 241 DLAFNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSL 300

Query: 367 LECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRF 426
           + CLC  G   +A +L  +M  + +  +V TF  +I      G ++EA  L +EM+    
Sbjct: 301 ISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSI 360

Query: 427 PPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLL 473
            P I T+  +++  C  D + +A ++   MV  +  P    + TL+
Sbjct: 361 DPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLI 403

BLAST of Cp4.1LG03g03520 vs. NCBI nr
Match: gi|449440241|ref|XP_004137893.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial [Cucumis sativus])

HSP 1 Score: 782.3 bits (2019), Expect = 4.9e-223
Identity = 389/493 (78.90%), Postives = 434/493 (88.03%), Query Frame = 1

Query: 3   SIRVVKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSV 62
           S  VVK+S++FLRKHRK P S  KTKWHQTF+QDEALR +KQAAN P QP+     LLS 
Sbjct: 4   SFSVVKKSNNFLRKHRKWPLSSHKTKWHQTFDQDEALRILKQAAN-PDQPH----LLLSA 63

Query: 63  LISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVY 122
           L++SF AY C PTPNAY+FVLKT  R+SQFH+I  VL RL+ +E F+TPEYIFVDL+K+Y
Sbjct: 64  LVTSFTAYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLY 123

Query: 123 GRVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLE 182
           GR+NRIQDA+TLFRRIPMFRCVPS LSLNSLL  L RN +GL I+P+IIL+S +MGIRLE
Sbjct: 124 GRMNRIQDAVTLFRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLE 183

Query: 183 ESTFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTF 242
            STF+ILITALCK+ KVGHAMELFNYMITEGYGLNP ICSLILASLC+ KKS+G+VVL F
Sbjct: 184 HSTFQILITALCKVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGF 243

Query: 243 LEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIA 302
           LE+MR KGFCP VVDYSNVIKF V RG+ SDA+DLLNKMKADGFKPDIVCYTMVLNGVIA
Sbjct: 244 LEEMRQKGFCPAVVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIA 303

Query: 303 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVIT 362
           DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQG+  AG+QMI HME LGC+P+VIT
Sbjct: 304 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVIT 363

Query: 363 YNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEML 422
           YN++L+ LCK GELDEARKLR  MQLKGLA+N+RTFRIMI GLF+NG+VIEAC+LLEEML
Sbjct: 364 YNVILKSLCKTGELDEARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEML 423

Query: 423 VCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSV 482
             RFPPQISTF EILSWLCKR MVGKA+ELL LMVG NFSPGPKAWE LLLSSESEL SV
Sbjct: 424 GSRFPPQISTFSEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTSV 483

Query: 483 KSLETTLEDLVGI 496
           KSLETTL+DLVGI
Sbjct: 484 KSLETTLKDLVGI 491

BLAST of Cp4.1LG03g03520 vs. NCBI nr
Match: gi|659083416|ref|XP_008442345.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial [Cucumis melo])

HSP 1 Score: 776.2 bits (2003), Expect = 3.5e-221
Identity = 390/495 (78.79%), Postives = 432/495 (87.27%), Query Frame = 1

Query: 1   MESIRVVKRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLL 60
           M S  VVK S++FLRKHRK PFS  KT WHQTF+QDEALR +KQ+ANS  QP+     LL
Sbjct: 1   MGSFSVVKNSNNFLRKHRKWPFSSHKTTWHQTFDQDEALRILKQSANS-HQPH----LLL 60

Query: 61  SVLISSFRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLK 120
           S LI+SF +Y C PTPNAY+FVLKT  R+SQFH+I  VL RL+ VEKF+TPEYIFVDL+K
Sbjct: 61  STLITSFTSYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFVEKFQTPEYIFVDLIK 120

Query: 121 VYGRVNRIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIR 180
           +YGR+NRIQDA+TLFRRIPMFRC PS LSLNSLL LL RN +GL ++P+I+L+S +MGIR
Sbjct: 121 LYGRMNRIQDAVTLFRRIPMFRCFPSTLSLNSLLSLLSRNAQGLPMIPDILLNSHSMGIR 180

Query: 181 LEESTFRILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVL 240
           LE STFRILITALCK+ KVGHAME+FNYMITEGYGLNP ICSLIL SLC+ K S+G+VVL
Sbjct: 181 LEHSTFRILITALCKVNKVGHAMEIFNYMITEGYGLNPQICSLILGSLCQQKNSSGDVVL 240

Query: 241 TFLEQMRLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGV 300
           +FLE+ R KGFCP VVDYSNVIKF V RG+SSDA+DLLNKMKADGFKPDIVCYTMVLNGV
Sbjct: 241 SFLEETRQKGFCPAVVDYSNVIKFFVTRGMSSDAIDLLNKMKADGFKPDIVCYTMVLNGV 300

Query: 301 IADGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDV 360
           IADGDY+MADELFDELLLFGLVPDIYTYNVYIHGLCKQGN  AG+QMI HME LGCKP+V
Sbjct: 301 IADGDYQMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNSAAGLQMISHMEALGCKPNV 360

Query: 361 ITYNILLECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEE 420
           ITYNILL+ LC  GELDEARKLR  MQLKGLA+N+RTFRIMI GLF+NG+VIEAC LLEE
Sbjct: 361 ITYNILLKSLCITGELDEARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACALLEE 420

Query: 421 MLVCRFPPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELP 480
           ML  RFPPQISTF EILS LCKR MVGKALELLTLMVG NFSPGPKAWE LLLSSESEL 
Sbjct: 421 MLRSRFPPQISTFSEILSRLCKRHMVGKALELLTLMVGKNFSPGPKAWEILLLSSESELT 480

Query: 481 SVKSLETTLEDLVGI 496
           SVKSLETTL+DLVGI
Sbjct: 481 SVKSLETTLKDLVGI 490

BLAST of Cp4.1LG03g03520 vs. NCBI nr
Match: gi|645279621|ref|XP_008244810.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial [Prunus mume])

HSP 1 Score: 564.7 bits (1454), Expect = 1.6e-157
Identity = 289/487 (59.34%), Postives = 367/487 (75.36%), Query Frame = 1

Query: 10  SDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVLISSFRA 69
           S  FLRKHRK P SP  TKWHQTFNQ++A +++K+++    QP     +LLS LI SF +
Sbjct: 11  SKFFLRKHRKWPVSPHNTKWHQTFNQNQAFQSLKKSSPPQNQPQ----HLLSTLIYSFNS 70

Query: 70  YCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQ 129
           Y C+P P AY+FV+KT  ++SQF+ IA+VLDRLE VEKF TPEYIF  L+  YG+  R Q
Sbjct: 71  YNCEPNPEAYNFVIKTLTKTSQFNDIASVLDRLEFVEKFNTPEYIFAHLISFYGQSKRTQ 130

Query: 130 DAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRIL 189
            AI LF RIP FRCVPSA SLNSLL++LC + EGL++VPEI+L S  M IRLEES+F+IL
Sbjct: 131 GAIDLFYRIPKFRCVPSAHSLNSLLYVLCGSREGLKMVPEILLRSHIMSIRLEESSFQIL 190

Query: 190 ITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLK 249
           + +LC I KVG+A+E+ N MI+ GYGLN  +CSLIL+SLCE K S+G  VL F+E+M+  
Sbjct: 191 VNSLCAIGKVGYAIEIMNCMISYGYGLNAKMCSLILSSLCEQKDSSGFEVLGFVEEMKKL 250

Query: 250 GFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMA 309
           GFCPG++DYSNVI+++V++G   DAL++L KMK +G KPDIVCYTMVL+GVIA+GDY+ A
Sbjct: 251 GFCPGMMDYSNVIRYMVKQGRGLDALNVLVKMKVEGIKPDIVCYTMVLHGVIAEGDYENA 310

Query: 310 DELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLEC 369
           DELFDELLL GLVPD+YTYNVY++GLCKQ   + G++MI  MEELGCKP++ITYNILL+ 
Sbjct: 311 DELFDELLLLGLVPDVYTYNVYVNGLCKQNKVKDGLKMISSMEELGCKPNLITYNILLKG 370

Query: 370 LCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEML---VCRF 429
           LC  GEL  AR+L   M LKG+  N++T RIM+ GLF  GD+ EACI ++EML   +CRF
Sbjct: 371 LCNNGELSRARELVSEMTLKGIGVNLQTHRIMLDGLFGQGDIDEACIFMDEMLDKFLCRF 430

Query: 430 PPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLE 489
               S+F E++  LC++  V KA++LL  MV  N +PG KAWE LLLSS SE P     E
Sbjct: 431 ---CSSFDEVIYGLCRKGSVCKAMDLLKKMVDKNVAPGAKAWEALLLSSGSE-PGF--AE 487

Query: 490 TTLEDLV 494
           TT  DLV
Sbjct: 491 TTWTDLV 487

BLAST of Cp4.1LG03g03520 vs. NCBI nr
Match: gi|694395598|ref|XP_009373117.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g38420, mitochondrial [Pyrus x bretschneideri])

HSP 1 Score: 542.0 bits (1395), Expect = 1.1e-150
Identity = 276/479 (57.62%), Postives = 347/479 (72.44%), Query Frame = 1

Query: 10  SDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSDEPYLLSVLISSFRA 69
           S  FLRKHRK P SP+ T+W Q FNQ +A +++K++  +P        +LL  LI SF+ 
Sbjct: 11  SKFFLRKHRKWPVSPYNTRWQQIFNQSQAFQSLKKSLPAPC-------HLLPTLIYSFKT 70

Query: 70  YCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVNRIQ 129
           Y  DPTP AYHF LKT  ++SQF +I  VL +LE+VEKF TPE+IF  L+  Y R +R Q
Sbjct: 71  YNADPTPEAYHFFLKTLTKTSQFDHIVPVLPQLEKVEKFGTPEHIFSHLIGFYDRWSRTQ 130

Query: 130 DAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTFRIL 189
           DAI LF RIP FRCVPSA SLN+LL +LC + EGL++VPEI L S  MGIRLEES FRIL
Sbjct: 131 DAIDLFYRIPKFRCVPSAHSLNALLCVLCGSCEGLKLVPEIFLRSHVMGIRLEESRFRIL 190

Query: 190 ITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQMRLK 249
           + ALC I+KVG+A+E+ N M+  GYGLN  IC+LIL+SLCE K S   VV+ FL +M++ 
Sbjct: 191 VNALCGIWKVGYAIEIMNCMMNNGYGLNVKICALILSSLCEQKDSECVVVMGFLGEMQIL 250

Query: 250 GFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMA 309
           GFCPG++DYSNVI+F+V++G   DAL++L KMK +G KPDIVCYTMVL+GVIA+GD+K  
Sbjct: 251 GFCPGMIDYSNVIRFMVKQGRGLDALNVLVKMKEEGIKPDIVCYTMVLHGVIAEGDFKKV 310

Query: 310 DELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNILLEC 369
           D++FDELL+FGLVPD+YTYNVYI+GLCKQ   E+G+ MI  MEELGCKP++ITYNILL+ 
Sbjct: 311 DQVFDELLVFGLVPDVYTYNVYINGLCKQNRVESGLMMISSMEELGCKPNLITYNILLKA 370

Query: 370 LCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRFPPQ 429
           LC  GEL  AR L   M L G+  N++T+RIM+ GLF  GD+ EAC+ +EEML       
Sbjct: 371 LCNSGELRRARNLMREMTLNGVGVNLQTYRIMLEGLFGKGDIEEACVFMEEMLDKVLVCF 430

Query: 430 ISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLETT 489
            S+F  ++  LC+R +V KA ELL  MV     PG KAWE LLLSS SE     SLE T
Sbjct: 431 CSSFDVVIYGLCQRGLVCKATELLKKMVAKKVDPGAKAWEALLLSSGSE----HSLEET 478

BLAST of Cp4.1LG03g03520 vs. NCBI nr
Match: gi|743811538|ref|XP_011019040.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial [Populus euphratica])

HSP 1 Score: 539.3 bits (1388), Expect = 7.2e-150
Identity = 278/487 (57.08%), Postives = 356/487 (73.10%), Query Frame = 1

Query: 8   KRSDHFLRKHRKCPFSPFKTKWHQTFNQDEALRAIKQAANSPKQPNSD-EPYLLSVLISS 67
           K +  FLRKHRK P+SP+K +WH+ FNQ +A++++KQ+A  P Q  S  +P+LLS LI S
Sbjct: 8   KSASFFLRKHRKWPYSPYKARWHRIFNQQQAMQSLKQSALKPPQQESPHKPHLLSSLIHS 67

Query: 68  FRAYCCDPTPNAYHFVLKTFVRSSQFHYIAAVLDRLERVEKFETPEYIFVDLLKVYGRVN 127
           F  Y  +PTP A+ F+ KT V++SQFH+I +VLD LE+VE FE PE  F  L++VYGR N
Sbjct: 68  FGIYDVEPTPKAFDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTN 127

Query: 128 RIQDAITLFRRIPMFRCVPSALSLNSLLFLLCRNGEGLRIVPEIILSSQTMGIRLEESTF 187
           + Q+AI LF RIP FRCVPS  SLN+L+ +LCRN +GL+ VPEI+L SQ M IR+EESTF
Sbjct: 128 KTQEAIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKSVPEILLKSQVMNIRVEESTF 187

Query: 188 RILITALCKIYKVGHAMELFNYMITEGYGLNPGICSLILASLCEHKKSTGNVVLTFLEQM 247
           ++LITALC+I KVG A+E+ N M+ +G+ +N  I SL+L+SLCE K +T   V+ FLEQ+
Sbjct: 188 QVLITALCRIRKVGFAIEMLNCMVNDGFIVNAEIYSLLLSSLCEQKDATKFEVMGFLEQL 247

Query: 248 RLKGFCPGVVDYSNVIKFLVRRGLSSDALDLLNKMKADGFKPDIVCYTMVLNGVIADGDY 307
           R  GF PG+VDYSNVI+FLV+     DAL +LN MK+D  KPDI CYTMVL+GVI   DY
Sbjct: 248 RKLGFFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIEAKDY 307

Query: 308 KMADELFDELLLFGLVPDIYTYNVYIHGLCKQGNWEAGIQMILHMEELGCKPDVITYNIL 367
             ADELFDELL++GLVPD YTYNVYI+GLCKQ N +AGI+M+  MEELGCKP++ITYN+L
Sbjct: 308 LKADELFDELLVYGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLITYNML 367

Query: 368 LECLCKIGELDEARKLRGNMQLKGLAKNVRTFRIMISGLFNNGDVIEACILLEEMLVCRF 427
           ++ LCK+GEL +A  L   M +KG+  N++T+RIMI GL +NG ++EAC L EE L  R 
Sbjct: 368 VKQLCKVGELSKAGDLVREMGVKGIGLNMQTYRIMIDGLASNGKIVEACGLFEEALDKRL 427

Query: 428 PPQISTFGEILSWLCKRDMVGKALELLTLMVGMNFSPGPKAWETLLLSSESELPSVKSLE 487
             Q     EI+  L  RD+  KALELL  MVG N SPG +AW+ LLLSS  +L  V   E
Sbjct: 428 CTQRLLLDEIICGLGDRDLSCKALELLEKMVGKNVSPGARAWKALLLSSGFKLDCV---E 487

Query: 488 TTLEDLV 494
           T L  LV
Sbjct: 488 TKLFSLV 491

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP193_ARATH9.5e-11046.10Pentatricopeptide repeat-containing protein At2g38420, mitochondrial OS=Arabidop... [more]
PPR28_ARATH5.5e-4126.49Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN... [more]
PP281_ARATH9.4e-4125.37Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
PPR90_ARATH1.6e-4028.08Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana GN... [more]
PPR91_ARATH2.7e-4028.32Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LA35_CUCSA3.4e-22378.90Uncharacterized protein OS=Cucumis sativus GN=Csa_3G730720 PE=4 SV=1[more]
W9QV00_9ROSA5.2e-14753.58Uncharacterized protein OS=Morus notabilis GN=L484_021993 PE=4 SV=1[more]
A0A067JJ28_JATCU3.4e-14655.82Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00390 PE=4 SV=1[more]
V4UT39_9ROSI9.8e-14653.77Uncharacterized protein OS=Citrus clementina GN=CICLE_v10013613mg PE=4 SV=1[more]
V7B2Q8_PHAVU1.1e-14451.63Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G089500g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G38420.15.4e-11146.10 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G09900.13.1e-4226.49 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT3G53700.15.3e-4225.37 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G62590.19.0e-4228.08 pentatricopeptide (PPR) repeat-containing protein[more]
AT1G62670.11.5e-4128.32 rna processing factor 2[more]
Match NameE-valueIdentityDescription
gi|449440241|ref|XP_004137893.1|4.9e-22378.90PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial ... [more]
gi|659083416|ref|XP_008442345.1|3.5e-22178.79PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial ... [more]
gi|645279621|ref|XP_008244810.1|1.6e-15759.34PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial ... [more]
gi|694395598|ref|XP_009373117.1|1.1e-15057.62PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g... [more]
gi|743811538|ref|XP_011019040.1|7.2e-15057.08PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g03520.1Cp4.1LG03g03520.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 185..213
score: 0.0035coord: 436..460
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 389..421
score: 4.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 323..372
score: 2.4E-16coord: 253..300
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 397..428
score: 0.0015coord: 326..360
score: 1.9E-9coord: 432..463
score: 0.0018coord: 361..392
score: 1.2E-6coord: 185..216
score: 9.0E-4coord: 291..325
score: 1.6E-4coord: 258..290
score: 7.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 111..145
score: 7.585coord: 394..428
score: 9.767coord: 146..181
score: 5.667coord: 359..393
score: 11.542coord: 429..463
score: 8.331coord: 324..358
score: 12.704coord: 217..253
score: 7.048coord: 289..323
score: 9.668coord: 182..216
score: 8.714coord: 254..288
score: 9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 6..472
score: 5.1E
NoneNo IPR availablePANTHERPTHR24015:SF316SUBFAMILY NOT NAMEDcoord: 6..472
score: 5.1E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG03g03520Wax gourdcpewgoB0744
Cp4.1LG03g03520Wax gourdcpewgoB0812
Cp4.1LG03g03520Cucurbita pepo (Zucchini)cpecpeB093
Cp4.1LG03g03520Cucurbita pepo (Zucchini)cpecpeB487
Cp4.1LG03g03520Cucumber (Gy14) v1cgycpeB0347
Cp4.1LG03g03520Cucumber (Gy14) v1cgycpeB0488
Cp4.1LG03g03520Cucurbita maxima (Rimu)cmacpeB283
Cp4.1LG03g03520Cucurbita maxima (Rimu)cmacpeB745
Cp4.1LG03g03520Cucurbita maxima (Rimu)cmacpeB746
Cp4.1LG03g03520Cucurbita maxima (Rimu)cmacpeB830
Cp4.1LG03g03520Cucurbita moschata (Rifu)cmocpeB245
Cp4.1LG03g03520Cucurbita moschata (Rifu)cmocpeB634
Cp4.1LG03g03520Cucurbita moschata (Rifu)cmocpeB699
Cp4.1LG03g03520Cucurbita moschata (Rifu)cmocpeB783
Cp4.1LG03g03520Wild cucumber (PI 183967)cpecpiB600
Cp4.1LG03g03520Wild cucumber (PI 183967)cpecpiB620
Cp4.1LG03g03520Cucumber (Chinese Long) v2cpecuB598
Cp4.1LG03g03520Cucumber (Chinese Long) v2cpecuB618
Cp4.1LG03g03520Bottle gourd (USVL1VR-Ls)cpelsiB492
Cp4.1LG03g03520Bottle gourd (USVL1VR-Ls)cpelsiB503
Cp4.1LG03g03520Watermelon (Charleston Gray)cpewcgB546
Cp4.1LG03g03520Watermelon (Charleston Gray)cpewcgB553
Cp4.1LG03g03520Melon (DHL92) v3.6.1cpemedB678
Cp4.1LG03g03520Watermelon (97103) v1cpewmB597
Cp4.1LG03g03520Watermelon (97103) v1cpewmB618
Cp4.1LG03g03520Melon (DHL92) v3.5.1cpemeB571
Cp4.1LG03g03520Cucumber (Gy14) v2cgybcpeB395
Cp4.1LG03g03520Cucumber (Gy14) v2cgybcpeB555
Cp4.1LG03g03520Silver-seed gourdcarcpeB0230
Cp4.1LG03g03520Silver-seed gourdcarcpeB0944
Cp4.1LG03g03520Silver-seed gourdcarcpeB1391
Cp4.1LG03g03520Cucumber (Chinese Long) v3cpecucB0747
Cp4.1LG03g03520Cucumber (Chinese Long) v3cpecucB0771