Cp4.1LG09g00920.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG09g00920.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG09 : 531877 .. 533304 (-)
Sequence length1428
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACGCTTTATCGCTTCTGAGTGCGCGGATAGTCTGCAGATTGGTTTCAAATTCTTTCATTGTGCGTCGAACAATTTGGAATCGGAGTTTTGGTAGTGATGACCGATTTGAGTTTGTTATGGAACCCTATAAGAAGGGGTTCGAGAGATCAGAGTCTTGTGAGTTTGAGAGTCTTAGATGCGATGATTTGTCTGTGAGAAGGGGTTTTCTAGAGGATTCGAAAACTGATGCTGGAAAAGTTCTTGAGGTTCTCAAACAGGATGGGCCTGGATTTGACACATTAGTGGCTTTGGATGAACTCCAACTAAAGGTCTCAGGAGTTCTTGTTAGGGAAGTTTTGAGGGGGATTTTGAGAAGCATAAATGTTCTAAACAAAACTCAGTGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCTGGTAAGATTGAGAATTTCAACCACACCGCGAGTTCGTATCATATGATCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTAGTTGATGAGATGACTGAGAAAGGGTACCCTGTTACTGCAAGGACATTCACCTTATTGATATGTACTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCGAAAACGTTCAATTACAGGCCGTTTAAGCACTCGTATAATGCCATTCTCCATGCACTTCTTGGTATAAAACAGTACAAGTTGATTGAATGGGTGTATCAGCAAATGCTGCTTGATGGTCATAGCTCGGACATGCTGACATATAATTTGCTATTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGGTTGCTTGATGAAATGGCCAGGAATGGGTTCTCTCCTGATTATCATACTTATAACATTCTTCTCTATGTTCTTGGCAAAGGAGACAAGCCTCTTGCAGCTTTAAACCTTTTGAACCACATGAGGGAAGTGGGCTCTGATCCGAGCATTCTTCACTTCACGACGTTGATAAATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTTTTTGATGAACTTTCTAATAACGGTTGCACTCCTGATGTCGTTTGCTACACTGTGATGATCACAAGCTATACAGTAGCTGGGGAGCACGAAAAGGCTCGAGAACTTTTCGATGAGATGGTAATAAAGGGCCAACTCCCTAATGTATTTACATACAACTCCATGATTCGTGGGTTTTGTATGGTCGGCGAATTCGAGGAAGCTTATACTATGCTCGAAGAAATGGAATCCCGAGGCTGCAAACCAAATTTTCTTGTATATAGTACCCTTGTAAGTTATTTGCGAAACGCTGGACAGCTTTCCGAGGCTCACAAAGTTATTACACATATGGTCGAGAAGGGGCAATACGCGCATTTAGTGACGAAATTCAAGGGATATAGAAGATGCTAG

mRNA sequence

ATGAACGCTTTATCGCTTCTGAGTGCGCGGATAGTCTGCAGATTGGTTTCAAATTCTTTCATTGTGCGTCGAACAATTTGGAATCGGAGTTTTGGTAGTGATGACCGATTTGAGTTTGTTATGGAACCCTATAAGAAGGGGTTCGAGAGATCAGAGTCTTGTGAGTTTGAGAGTCTTAGATGCGATGATTTGTCTGTGAGAAGGGGTTTTCTAGAGGATTCGAAAACTGATGCTGGAAAAGTTCTTGAGGTTCTCAAACAGGATGGGCCTGGATTTGACACATTAGTGGCTTTGGATGAACTCCAACTAAAGGTCTCAGGAGTTCTTGTTAGGGAAGTTTTGAGGGGGATTTTGAGAAGCATAAATGTTCTAAACAAAACTCAGTGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCTGGTAAGATTGAGAATTTCAACCACACCGCGAGTTCGTATCATATGATCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTAGTTGATGAGATGACTGAGAAAGGGTACCCTGTTACTGCAAGGACATTCACCTTATTGATATGTACTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCGAAAACGTTCAATTACAGGCCGTTTAAGCACTCGTATAATGCCATTCTCCATGCACTTCTTGGTATAAAACAGTACAAGTTGATTGAATGGGTGTATCAGCAAATGCTGCTTGATGGTCATAGCTCGGACATGCTGACATATAATTTGCTATTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGGTTGCTTGATGAAATGGCCAGGAATGGGTTCTCTCCTGATTATCATACTTATAACATTCTTCTCTATGTTCTTGGCAAAGGAGACAAGCCTCTTGCAGCTTTAAACCTTTTGAACCACATGAGGGAAGTGGGCTCTGATCCGAGCATTCTTCACTTCACGACGTTGATAAATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTTTTTGATGAACTTTCTAATAACGGTTGCACTCCTGATGTCGTTTGCTACACTGTGATGATCACAAGCTATACAGTAGCTGGGGAGCACGAAAAGGCTCGAGAACTTTTCGATGAGATGGTAATAAAGGGCCAACTCCCTAATGTATTTACATACAACTCCATGATTCGTGGGTTTTGTATGGTCGGCGAATTCGAGGAAGCTTATACTATGCTCGAAGAAATGGAATCCCGAGGCTGCAAACCAAATTTTCTTGTATATAGTACCCTTGTAAGTTATTTGCGAAACGCTGGACAGCTTTCCGAGGCTCACAAAGTTATTACACATATGGTCGAGAAGGGGCAATACGCGCATTTAGTGACGAAATTCAAGGGATATAGAAGATGCTAG

Coding sequence (CDS)

ATGAACGCTTTATCGCTTCTGAGTGCGCGGATAGTCTGCAGATTGGTTTCAAATTCTTTCATTGTGCGTCGAACAATTTGGAATCGGAGTTTTGGTAGTGATGACCGATTTGAGTTTGTTATGGAACCCTATAAGAAGGGGTTCGAGAGATCAGAGTCTTGTGAGTTTGAGAGTCTTAGATGCGATGATTTGTCTGTGAGAAGGGGTTTTCTAGAGGATTCGAAAACTGATGCTGGAAAAGTTCTTGAGGTTCTCAAACAGGATGGGCCTGGATTTGACACATTAGTGGCTTTGGATGAACTCCAACTAAAGGTCTCAGGAGTTCTTGTTAGGGAAGTTTTGAGGGGGATTTTGAGAAGCATAAATGTTCTAAACAAAACTCAGTGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCTGGTAAGATTGAGAATTTCAACCACACCGCGAGTTCGTATCATATGATCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTAGTTGATGAGATGACTGAGAAAGGGTACCCTGTTACTGCAAGGACATTCACCTTATTGATATGTACTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCGAAAACGTTCAATTACAGGCCGTTTAAGCACTCGTATAATGCCATTCTCCATGCACTTCTTGGTATAAAACAGTACAAGTTGATTGAATGGGTGTATCAGCAAATGCTGCTTGATGGTCATAGCTCGGACATGCTGACATATAATTTGCTATTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGGTTGCTTGATGAAATGGCCAGGAATGGGTTCTCTCCTGATTATCATACTTATAACATTCTTCTCTATGTTCTTGGCAAAGGAGACAAGCCTCTTGCAGCTTTAAACCTTTTGAACCACATGAGGGAAGTGGGCTCTGATCCGAGCATTCTTCACTTCACGACGTTGATAAATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTTTTTGATGAACTTTCTAATAACGGTTGCACTCCTGATGTCGTTTGCTACACTGTGATGATCACAAGCTATACAGTAGCTGGGGAGCACGAAAAGGCTCGAGAACTTTTCGATGAGATGGTAATAAAGGGCCAACTCCCTAATGTATTTACATACAACTCCATGATTCGTGGGTTTTGTATGGTCGGCGAATTCGAGGAAGCTTATACTATGCTCGAAGAAATGGAATCCCGAGGCTGCAAACCAAATTTTCTTGTATATAGTACCCTTGTAAGTTATTTGCGAAACGCTGGACAGCTTTCCGAGGCTCACAAAGTTATTACACATATGGTCGAGAAGGGGCAATACGCGCATTTAGTGACGAAATTCAAGGGATATAGAAGATGCTAG

Protein sequence

MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFERSESCEFESLRCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC
BLAST of Cp4.1LG09g00920.1 vs. Swiss-Prot
Match: PPR81_ARATH (Pentatricopeptide repeat-containing protein At1g55630 OS=Arabidopsis thaliana GN=At1g55630 PE=2 SV=1)

HSP 1 Score: 610.9 bits (1574), Expect = 1.2e-173
Identity = 300/476 (63.03%), Postives = 367/476 (77.10%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFERSE-SCEFESL 60
           MN++   S  +  R  S      R   N S G D       EP K  +E SE  CEF+  
Sbjct: 1   MNSVIHYSTSVAVRKASRFLFTSRKFCNGSIGGDVTDNGTEEPLKITWESSEMDCEFDQE 60

Query: 61  RCDD-LSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGIL 120
              + +SVR+ F+E +K  A +VL+ L+QD PGF+T  ALDEL + +SG+LVREVL GIL
Sbjct: 61  ENGEKISVRKRFMESTKLSASRVLDTLQQDCPGFNTKSALDELNVSISGLLVREVLVGIL 120

Query: 121 RSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTE 180
           R+++  NKT+CAKL YKFFVW G  ENF HTA+ YH++MKIFAEC E+KAM RL+DEM +
Sbjct: 121 RTLSFDNKTRCAKLAYKFFVWCGGQENFRHTANCYHLLMKIFAECGEYKAMCRLIDEMIK 180

Query: 181 KGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKL 240
            GYP TA TF LLICTCG+AGLAR VVE+FIKSKTFNYRP+KHSYNAILH+LLG+KQYKL
Sbjct: 181 DGYPTTACTFNLLICTCGEAGLARDVVEQFIKSKTFNYRPYKHSYNAILHSLLGVKQYKL 240

Query: 241 IEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLY 300
           I+WVY+QML DG + D+LTYN++++A  +LGK D+ +RLLDEM ++GFSPD +TYNILL+
Sbjct: 241 IDWVYEQMLEDGFTPDVLTYNIVMFANFRLGKTDRLYRLLDEMVKDGFSPDLYTYNILLH 300

Query: 301 VLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTP 360
            L  G+KPLAALNLLNHMREVG +P ++HFTTLI+GLSRAG L+ACKYF DE    GCTP
Sbjct: 301 HLATGNKPLAALNLLNHMREVGVEPGVIHFTTLIDGLSRAGKLEACKYFMDETVKVGCTP 360

Query: 361 DVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTML 420
           DVVCYTVMIT Y   GE EKA E+F EM  KGQLPNVFTYNSMIRGFCM G+F+EA  +L
Sbjct: 361 DVVCYTVMITGYISGGELEKAEEMFKEMTEKGQLPNVFTYNSMIRGFCMAGKFKEACALL 420

Query: 421 EEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRR 475
           +EMESRGC PNF+VYSTLV+ L+NAG++ EAH+V+  MVEKG Y HL++K K YRR
Sbjct: 421 KEMESRGCNPNFVVYSTLVNNLKNAGKVLEAHEVVKDMVEKGHYVHLISKLKKYRR 476

BLAST of Cp4.1LG09g00920.1 vs. Swiss-Prot
Match: PP288_ARATH (Pentatricopeptide repeat-containing protein At3g60050 OS=Arabidopsis thaliana GN=At3g60050 PE=2 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 6.0e-170
Identity = 298/475 (62.74%), Postives = 367/475 (77.26%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGF-ERSESCEFESL 60
           MN   +L   +V +     FI R+   N +FG ++  +        GF E S   E  S+
Sbjct: 1   MNLALVLGTNVVRKAYRFLFISRK-FCNGNFGGNE-IDNGFPDLDCGFDEDSNISELRSI 60

Query: 61  RCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILR 120
             + +SVR  FLE +   A +VL  L+ D  GF++   LDEL ++VSG+LVREVL GILR
Sbjct: 61  DREVISVRSRFLESANHSASRVLVTLQLDESGFNSKSVLDELNVRVSGLLVREVLVGILR 120

Query: 121 SINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEK 180
           +++  NK +CAKL Y+FF+WSG+ E F HT +SYH++MKIFAEC E+KAMWRLVDEM + 
Sbjct: 121 NLSYDNKARCAKLAYRFFLWSGEQECFRHTVNSYHLLMKIFAECGEYKAMWRLVDEMVQD 180

Query: 181 GYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLI 240
           G+P TARTF LLIC+CG+AGLA++ V +F+KSKTFNYRPFKHSYNAIL++LLG+KQYKLI
Sbjct: 181 GFPTTARTFNLLICSCGEAGLAKQAVVQFMKSKTFNYRPFKHSYNAILNSLLGVKQYKLI 240

Query: 241 EWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYV 300
           EWVY+QML DG S D+LTYN+LL+   +LGK+D+F RL DEMAR+GFSPD +TYNILL++
Sbjct: 241 EWVYKQMLEDGFSPDVLTYNILLWTNYRLGKMDRFDRLFDEMARDGFSPDSYTYNILLHI 300

Query: 301 LGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPD 360
           LGKG+KPLAAL  LNHM+EVG DPS+LH+TTLI+GLSRAGNL+ACKYF DE+   GC PD
Sbjct: 301 LGKGNKPLAALTTLNHMKEVGIDPSVLHYTTLIDGLSRAGNLEACKYFLDEMVKAGCRPD 360

Query: 361 VVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLE 420
           VVCYTVMIT Y V+GE +KA+E+F EM +KGQLPNVFTYNSMIRG CM GEF EA  +L+
Sbjct: 361 VVCYTVMITGYVVSGELDKAKEMFREMTVKGQLPNVFTYNSMIRGLCMAGEFREACWLLK 420

Query: 421 EMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRR 475
           EMESRGC PNF+VYSTLVSYLR AG+LSEA KVI  MV+KG Y HLV K   YRR
Sbjct: 421 EMESRGCNPNFVVYSTLVSYLRKAGKLSEARKVIREMVKKGHYVHLVPKMMKYRR 473

BLAST of Cp4.1LG09g00920.1 vs. Swiss-Prot
Match: PP236_ARATH (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.5e-38
Identity = 110/373 (29.49%), Postives = 179/373 (47.99%), Query Frame = 1

Query: 91  GFDTLVALDELQLKVSGVLVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTA 150
           G D   AL+ L+LKV   LVR +L  I   INV           +FF W+GK  NF H  
Sbjct: 77  GPDAEKALEVLKLKVDHRLVRSILE-IDVEINVK---------IQFFKWAGKRRNFQHDC 136

Query: 151 SSYHMIMKIFAECEEFKAMWRLVDEMTEKGY-PVTARTFTLLICTCGDAGLARKVVERFI 210
           S+Y  +++   E   +  M+R + E+    Y  V+    + L+   G A +  K +  F 
Sbjct: 137 STYMTLIRCLEEARLYGEMYRTIQEVVRNTYVSVSPAVLSELVKALGRAKMVSKALSVFY 196

Query: 211 KSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQQMLLDGHS-SDMLTYNLLLYARCKL 270
           ++K    +P   +YN+++  L+   Q++ +  VY +M  +G    D +TY+ L+ +  KL
Sbjct: 197 QAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTEMCNEGDCFPDTITYSALISSYEKL 256

Query: 271 GKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHF 330
           G+ D   RL DEM  N   P    Y  LL +  K  K   AL+L   M+  G  P++  +
Sbjct: 257 GRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKVEKALDLFEEMKRAGCSPTVYTY 316

Query: 331 TTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVI 390
           T LI GL +AG +D    F+ ++  +G TPDVV    ++      G  E+   +F EM +
Sbjct: 317 TELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVVFLNNLMNILGKVGRVEELTNVFSEMGM 376

Query: 391 KGQLPNVFTYNSMIRG-FCMVGEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLS 450
               P V +YN++I+  F       E  +  ++M++    P+   YS L+       ++ 
Sbjct: 377 WRCTPTVVSYNTVIKALFESKAHVSEVSSWFDKMKADSVSPSEFTYSILIDGYCKTNRVE 436

Query: 451 EAHKVITHMVEKG 461
           +A  ++  M EKG
Sbjct: 437 KALLLLEEMDEKG 439

BLAST of Cp4.1LG09g00920.1 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 6.0e-37
Identity = 104/391 (26.60%), Postives = 185/391 (47.31%), Query Frame = 1

Query: 72  EDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILRSINVLNKT-QCA 131
           ++  +D  K   +L++    F + V   EL L  SGV +R    G++    VLN+     
Sbjct: 77  DEFASDVEKSYRILRK----FHSRVPKLELALNESGVELRP---GLIE--RVLNRCGDAG 136

Query: 132 KLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEM-TEKGYPVTARTFT 191
            LGY+FFVW+ K   + H+   Y  ++KI ++  +F A+W L++EM  E    +    F 
Sbjct: 137 NLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFV 196

Query: 192 LLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQQMLLD 251
           +L+     A + +K +E   +   F + P ++ +  +L AL      K    +++ M + 
Sbjct: 197 VLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRM- 256

Query: 252 GHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAA 311
               ++  +  LLY  C++GK+ +   +L +M   GF PD   Y  LL       K   A
Sbjct: 257 RFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADA 316

Query: 312 LNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITS 371
            +LL  MR  G +P+   +T LI  L +   ++     F E+    C  DVV YT +++ 
Sbjct: 317 YDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSG 376

Query: 372 YTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESRGCKPN 431
           +   G+ +K   + D+M+ KG +P+  TY  ++        FEE   ++E+M      P+
Sbjct: 377 FCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPD 436

Query: 432 FLVYSTLVSYLRNAGQLSEAHKVITHMVEKG 461
             +Y+ ++      G++ EA ++   M E G
Sbjct: 437 IGIYNVVIRLACKLGEVKEAVRLWNEMEENG 457

BLAST of Cp4.1LG09g00920.1 vs. Swiss-Prot
Match: PP298_ARATH (Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidopsis thaliana GN=At4g01400 PE=2 SV=2)

HSP 1 Score: 156.0 bits (393), Expect = 1.0e-36
Identity = 101/335 (30.15%), Postives = 159/335 (47.46%), Query Frame = 1

Query: 132 LGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEKGYPVTARTFTLL 191
           L  + F ++ +  NF H+ SS+ +++        F  +  ++ +    GYP+T   FT L
Sbjct: 66  LAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYL 125

Query: 192 ICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQY--KLIEWVYQQMLLD 251
           I    +A L  KV+  F K   FN+ P     N IL  L+  + Y  K  E +++   L 
Sbjct: 126 IKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFE-LFKSSRLH 185

Query: 252 GHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAA 311
           G   +  +YNLL+ A C    L   ++L  +M      PD  +Y IL+    +  +   A
Sbjct: 186 GVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGA 245

Query: 312 LNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITS 371
           + LL+ M   G  P  L +TTL+N L R   L         +   GC PD+V Y  MI  
Sbjct: 246 MELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILG 305

Query: 372 YTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESRGCKPN 431
           +        AR++ D+M+  G  PN  +Y ++I G C  G F+E    LEEM S+G  P+
Sbjct: 306 FCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPH 365

Query: 432 FLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAH 465
           F V + LV    + G++ EA  V+  +++ G+  H
Sbjct: 366 FSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLH 399

BLAST of Cp4.1LG09g00920.1 vs. TrEMBL
Match: A0A0A0L8P4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G182100 PE=4 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 7.5e-228
Identity = 381/476 (80.04%), Postives = 431/476 (90.55%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFE-RSESCEFESL 60
           MN+LSL+S+RIVCRL S SFIVRRTIWNR+F SDDRF+F +EP+    +  S+S E +S 
Sbjct: 1   MNSLSLVSSRIVCRLFSTSFIVRRTIWNRNFCSDDRFQFFVEPFSYLADGNSDSFETDSR 60

Query: 61  RCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILR 120
           R DD S RR FL+D+K DA KV+E+LKQDGPGFDT +ALDELQLKVSGVLV EVL+GIL+
Sbjct: 61  RWDDFSFRRSFLKDAKIDAEKVIEILKQDGPGFDTFLALDELQLKVSGVLVGEVLKGILK 120

Query: 121 SINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEK 180
           S +VLNKTQCAKLGYKFF+WSG+IEN+ HT +SYH+IMKIFAECEEFKAMWR++DEMTEK
Sbjct: 121 SKSVLNKTQCAKLGYKFFIWSGRIENYRHTVNSYHIIMKIFAECEEFKAMWRVLDEMTEK 180

Query: 181 GYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLI 240
           GYPVTARTF +LICTCG+AGLA++VVERFIKSKTFN+RP+KHSYNAILH L+ +KQYKLI
Sbjct: 181 GYPVTARTFMILICTCGEAGLAKRVVERFIKSKTFNFRPYKHSYNAILHGLVIVKQYKLI 240

Query: 241 EWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYV 300
            WVY QMLLD HS D+LTYN+LL++ CKLGKLDQFHRLLDEMAR GFSPD+HTYNILLYV
Sbjct: 241 GWVYDQMLLDDHSPDILTYNVLLFSSCKLGKLDQFHRLLDEMARKGFSPDFHTYNILLYV 300

Query: 301 LGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPD 360
           LGKGDKPLAALNLLNHMREVG  P++LHFTTLINGLSRAGNLDACKYFFDEL NNGC PD
Sbjct: 301 LGKGDKPLAALNLLNHMREVGFGPNVLHFTTLINGLSRAGNLDACKYFFDELGNNGCIPD 360

Query: 361 VVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLE 420
           VVCYTVMITS+T AG+HEKAR  FDEM++KGQLPNVFTYNSMIRGFCMVG+F+EAY+ML 
Sbjct: 361 VVCYTVMITSFTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAYSMLS 420

Query: 421 EMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           EMESRGC+PNFLVYSTLVSYLRNAG+L EAHKVI  MVE GQYAHL+TKFKGYRRC
Sbjct: 421 EMESRGCRPNFLVYSTLVSYLRNAGKLGEAHKVIKQMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG09g00920.1 vs. TrEMBL
Match: M5XQE6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025794mg PE=4 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 4.8e-190
Identity = 313/427 (73.30%), Postives = 376/427 (88.06%), Query Frame = 1

Query: 49  ERSESCEFESLRCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGV 108
           E+    E+E+   +  S+RR F E++K    +VLEVL+QDGPGFDT  ALDEL ++VSG+
Sbjct: 20  EKQNVPEYENRPREYFSLRRSFFENAKIHTRRVLEVLQQDGPGFDTKAALDELHIEVSGL 79

Query: 109 LVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKA 168
           LVREVL  IL+ +N  +K +CAKLGYKFFVWSG++EN+ HTA++YH++MKIFA+CEEFKA
Sbjct: 80  LVREVLFKILKQVNYASKMRCAKLGYKFFVWSGQLENYRHTANTYHLMMKIFADCEEFKA 139

Query: 169 MWRLVDEMTEKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILH 228
           MWRLVDEM EKGYP TA+TF +LICTCG+AGLA+KVVERFIKSKTFNYRPFKHSYNAILH
Sbjct: 140 MWRLVDEMIEKGYPTTAQTFNILICTCGEAGLAKKVVERFIKSKTFNYRPFKHSYNAILH 199

Query: 229 ALLGIKQYKLIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSP 288
           +L+ +KQYKLIEWVYQQML DGH +D+LTYN+++YA+ +LGKLDQFHRLL+EM R+GF+P
Sbjct: 200 SLVVVKQYKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFAP 259

Query: 289 DYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFF 348
           D HTYNILL+VLGKGDKPLAALNLLNHM+EVG DPS+LHFTTLI+GLSR+GNLDACKYFF
Sbjct: 260 DLHTYNILLHVLGKGDKPLAALNLLNHMKEVGLDPSVLHFTTLIDGLSRSGNLDACKYFF 319

Query: 349 DELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMV 408
           DE+  + C PDVVCYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNVFTYN+MIRG CM 
Sbjct: 320 DEMIKHECFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCMA 379

Query: 409 GEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTK 468
           G+FEEA +ML++MESRGC PNF VYSTLVSYLRNAG+L++AH+VITHMVEKGQY HL++K
Sbjct: 380 GKFEEACSMLKDMESRGCNPNFTVYSTLVSYLRNAGKLAKAHEVITHMVEKGQYTHLLSK 439

Query: 469 FKGYRRC 476
           FKGYRRC
Sbjct: 440 FKGYRRC 446

BLAST of Cp4.1LG09g00920.1 vs. TrEMBL
Match: W9S7I4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015785 PE=4 SV=1)

HSP 1 Score: 669.8 bits (1727), Expect = 2.4e-189
Identity = 324/478 (67.78%), Postives = 388/478 (81.17%), Query Frame = 1

Query: 11  IVCRLVSNSFIVRRTIWNRSF---GSDDRFEFVMEPYKKGFERS-------ESCEFESLR 70
           +V ++  +S +V R   NRSF     ++ ++   EP K+ ++ S       E  +F S  
Sbjct: 21  VVSKVTLSSLVVIRNFCNRSFDGVNGENGYDCFEEPLKRMWKSSYFDSDMDEQSDFYSRE 80

Query: 71  CD---DLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGI 130
                + SVR+ F E ++ DAG+VLEVL+QDGPGFD   ALDEL ++VSG+LVR+VL GI
Sbjct: 81  MPARGNFSVRQSFFETARIDAGRVLEVLQQDGPGFDAKPALDELNIRVSGLLVRKVLLGI 140

Query: 131 LRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMT 190
           L +I+  NK +CAKLG+KFF WSG+  N+ HTA+SYH+++KIFAECEEFKAMWRLVDEM 
Sbjct: 141 LSNISHTNKIRCAKLGFKFFTWSGQQGNYRHTANSYHLLIKIFAECEEFKAMWRLVDEMI 200

Query: 191 EKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYK 250
           E+G+P TART  +LICTCG+AGLARKVVERFIKSKTFN+RPFKHSYNAILH LL   QYK
Sbjct: 201 ERGFPTTARTLNILICTCGEAGLARKVVERFIKSKTFNFRPFKHSYNAILHCLLVTNQYK 260

Query: 251 LIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 310
           LIEWVYQQML DG S D+LTYN+L+  + +LGKLDQFHRLLDEM R GFSPD HTYNILL
Sbjct: 261 LIEWVYQQMLADGFSPDILTYNVLMLTKYRLGKLDQFHRLLDEMGRRGFSPDLHTYNILL 320

Query: 311 YVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCT 370
           +VLGKGDKPLAALNLLNHM+E G DP +LHFTTLI+GLSRAGNLDAC++FFDE+  NGC 
Sbjct: 321 HVLGKGDKPLAALNLLNHMKETGIDPGVLHFTTLIDGLSRAGNLDACRFFFDEMPKNGCI 380

Query: 371 PDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTM 430
           PDVVCYTVMIT Y VAGE EKA+ +FDEM+ KGQ+PNVFTYNSMIRG CM G+FEEA +M
Sbjct: 381 PDVVCYTVMITGYVVAGELEKAQSIFDEMITKGQIPNVFTYNSMIRGLCMAGKFEEACSM 440

Query: 431 LEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           L++MESRGC PNFLVY+TLVS LRNAG+LSEAH+V+ HMVEKGQY HL++KFKGYRRC
Sbjct: 441 LKDMESRGCNPNFLVYTTLVSNLRNAGKLSEAHQVVKHMVEKGQYVHLLSKFKGYRRC 498

BLAST of Cp4.1LG09g00920.1 vs. TrEMBL
Match: V4S4A4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004849mg PE=4 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 1.5e-188
Identity = 312/411 (75.91%), Postives = 367/411 (89.29%), Query Frame = 1

Query: 65  SVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILRSINVL 124
           S+RR FL++ K DA +VLE+L+ DGPGFD  + L E  ++VS +LVREVL GILRS+N  
Sbjct: 77  SIRRMFLDNVKFDASRVLEILQNDGPGFDAKLVLSESGIRVSEILVREVLSGILRSVNYA 136

Query: 125 NKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEKGYPVT 184
           +KT+CAKLGYKFFVWSG+ ENF HTA+SYH+IMKIFA+CEEFKAMWRLVDEM E G+P T
Sbjct: 137 DKTKCAKLGYKFFVWSGQQENFRHTANSYHLIMKIFADCEEFKAMWRLVDEMIENGFPTT 196

Query: 185 ARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQ 244
           ARTF +LICTCG+ GLARKVVERFIKSK FN+RPFK+SYNAILHALLGI+QYKLIEWVYQ
Sbjct: 197 ARTFNILICTCGEVGLARKVVERFIKSKLFNFRPFKNSYNAILHALLGIRQYKLIEWVYQ 256

Query: 245 QMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGD 304
           QM  +G++ D+LTYN+++ A+ +LGKLDQFHRLLDEM R+GFSPD+HTYNILL+VLGKGD
Sbjct: 257 QMSDEGYAPDILTYNIVMCAKYRLGKLDQFHRLLDEMGRSGFSPDFHTYNILLHVLGKGD 316

Query: 305 KPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYT 364
           KPLAALNLLNHM+EVG DPS+LHFTTL++GLSRAGNLDACKYFFDE++N GC PDVVCYT
Sbjct: 317 KPLAALNLLNHMKEVGFDPSVLHFTTLMDGLSRAGNLDACKYFFDEMANKGCMPDVVCYT 376

Query: 365 VMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESR 424
           VMITSY  AGE EKA++LFD M+ KGQLPNVFTYNSMIRGFCM G+F+EA TM++EMESR
Sbjct: 377 VMITSYIAAGELEKAQDLFDGMITKGQLPNVFTYNSMIRGFCMAGKFDEACTMMKEMESR 436

Query: 425 GCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           GC PNFLVY+TLVS LRNAG+L+EAH+VI HMVEKG+Y HLV+KFK Y+RC
Sbjct: 437 GCNPNFLVYNTLVSNLRNAGKLAEAHEVIRHMVEKGKYIHLVSKFKRYKRC 487

BLAST of Cp4.1LG09g00920.1 vs. TrEMBL
Match: A0A059DJY2_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02787 PE=4 SV=1)

HSP 1 Score: 660.2 bits (1702), Expect = 1.9e-186
Identity = 327/487 (67.15%), Postives = 387/487 (79.47%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSD---DRFEFVMEPYKKGFERSES---- 60
           MN+ +L   RIV + +S+  I  RT+ +R F      + FE V EP KK +         
Sbjct: 1   MNSRALFGPRIVHK-ISHLLISCRTLCDRGFRGGKIKEGFECVEEPLKKMWVNDSDGFMD 60

Query: 61  -----CEFESLRCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGV 120
                 E+ES   +  S R+ F  +++  A KVL VL+QDGPGFD   +L+EL+++VSG+
Sbjct: 61  DARILSEYESSDEEYYSTRQNFCANARVGAEKVLRVLQQDGPGFDVKTSLEELRVRVSGL 120

Query: 121 LVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKA 180
           LVREVL GILRS    NK++C KLGYKFFVWSG  EN+ HT  +YH +MKIFAEC+EFKA
Sbjct: 121 LVREVLLGILRSTEFANKSRCVKLGYKFFVWSGMQENYRHTTDAYHSMMKIFAECDEFKA 180

Query: 181 MWRLVDEMTEKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILH 240
           MWRLVDEM EKG  +TARTF +LICTCG+ GLAR+VVERFIKSK FNYRPFKHSYNAILH
Sbjct: 181 MWRLVDEMIEKGLFMTARTFNILICTCGEVGLARRVVERFIKSKAFNYRPFKHSYNAILH 240

Query: 241 ALLGIKQYKLIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSP 300
           +L+ +KQYKLIEWVYQQML DG S D+LTYN+L+ A+ +LGKLDQFHRLLDEM R GFSP
Sbjct: 241 SLVALKQYKLIEWVYQQMLADGLSPDILTYNILMCAKYRLGKLDQFHRLLDEMGRRGFSP 300

Query: 301 DYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFF 360
           D+HTYN+LL+VLGKGDKPLAAL+LLNHMREVG DPS+LHFTTL++GLSRAGNLDACKYFF
Sbjct: 301 DFHTYNLLLHVLGKGDKPLAALSLLNHMREVGIDPSVLHFTTLMDGLSRAGNLDACKYFF 360

Query: 361 DELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMV 420
           DE+  N C PDVVCYTV+IT Y VAGE EKA E+F++M+I GQLPNVFTYNSMIRG+CM 
Sbjct: 361 DEMIKNNCKPDVVCYTVLITGYVVAGELEKAIEMFNKMMIDGQLPNVFTYNSMIRGYCMA 420

Query: 421 GEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTK 476
           G F EA +ML EME+RGC PNFLVYSTLVSYLRNAG+LS+AH VI  MVEKGQY HLV+K
Sbjct: 421 GNFAEACSMLNEMEARGCNPNFLVYSTLVSYLRNAGKLSQAHSVIRRMVEKGQYVHLVSK 480

BLAST of Cp4.1LG09g00920.1 vs. TAIR10
Match: AT1G55630.1 (AT1G55630.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 610.9 bits (1574), Expect = 6.6e-175
Identity = 300/476 (63.03%), Postives = 367/476 (77.10%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFERSE-SCEFESL 60
           MN++   S  +  R  S      R   N S G D       EP K  +E SE  CEF+  
Sbjct: 1   MNSVIHYSTSVAVRKASRFLFTSRKFCNGSIGGDVTDNGTEEPLKITWESSEMDCEFDQE 60

Query: 61  RCDD-LSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGIL 120
              + +SVR+ F+E +K  A +VL+ L+QD PGF+T  ALDEL + +SG+LVREVL GIL
Sbjct: 61  ENGEKISVRKRFMESTKLSASRVLDTLQQDCPGFNTKSALDELNVSISGLLVREVLVGIL 120

Query: 121 RSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTE 180
           R+++  NKT+CAKL YKFFVW G  ENF HTA+ YH++MKIFAEC E+KAM RL+DEM +
Sbjct: 121 RTLSFDNKTRCAKLAYKFFVWCGGQENFRHTANCYHLLMKIFAECGEYKAMCRLIDEMIK 180

Query: 181 KGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKL 240
            GYP TA TF LLICTCG+AGLAR VVE+FIKSKTFNYRP+KHSYNAILH+LLG+KQYKL
Sbjct: 181 DGYPTTACTFNLLICTCGEAGLARDVVEQFIKSKTFNYRPYKHSYNAILHSLLGVKQYKL 240

Query: 241 IEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLY 300
           I+WVY+QML DG + D+LTYN++++A  +LGK D+ +RLLDEM ++GFSPD +TYNILL+
Sbjct: 241 IDWVYEQMLEDGFTPDVLTYNIVMFANFRLGKTDRLYRLLDEMVKDGFSPDLYTYNILLH 300

Query: 301 VLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTP 360
            L  G+KPLAALNLLNHMREVG +P ++HFTTLI+GLSRAG L+ACKYF DE    GCTP
Sbjct: 301 HLATGNKPLAALNLLNHMREVGVEPGVIHFTTLIDGLSRAGKLEACKYFMDETVKVGCTP 360

Query: 361 DVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTML 420
           DVVCYTVMIT Y   GE EKA E+F EM  KGQLPNVFTYNSMIRGFCM G+F+EA  +L
Sbjct: 361 DVVCYTVMITGYISGGELEKAEEMFKEMTEKGQLPNVFTYNSMIRGFCMAGKFKEACALL 420

Query: 421 EEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRR 475
           +EMESRGC PNF+VYSTLV+ L+NAG++ EAH+V+  MVEKG Y HL++K K YRR
Sbjct: 421 KEMESRGCNPNFVVYSTLVNNLKNAGKVLEAHEVVKDMVEKGHYVHLISKLKKYRR 476

BLAST of Cp4.1LG09g00920.1 vs. TAIR10
Match: AT3G60050.1 (AT3G60050.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 598.6 bits (1542), Expect = 3.4e-171
Identity = 298/475 (62.74%), Postives = 367/475 (77.26%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGF-ERSESCEFESL 60
           MN   +L   +V +     FI R+   N +FG ++  +        GF E S   E  S+
Sbjct: 1   MNLALVLGTNVVRKAYRFLFISRK-FCNGNFGGNE-IDNGFPDLDCGFDEDSNISELRSI 60

Query: 61  RCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILR 120
             + +SVR  FLE +   A +VL  L+ D  GF++   LDEL ++VSG+LVREVL GILR
Sbjct: 61  DREVISVRSRFLESANHSASRVLVTLQLDESGFNSKSVLDELNVRVSGLLVREVLVGILR 120

Query: 121 SINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEK 180
           +++  NK +CAKL Y+FF+WSG+ E F HT +SYH++MKIFAEC E+KAMWRLVDEM + 
Sbjct: 121 NLSYDNKARCAKLAYRFFLWSGEQECFRHTVNSYHLLMKIFAECGEYKAMWRLVDEMVQD 180

Query: 181 GYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLI 240
           G+P TARTF LLIC+CG+AGLA++ V +F+KSKTFNYRPFKHSYNAIL++LLG+KQYKLI
Sbjct: 181 GFPTTARTFNLLICSCGEAGLAKQAVVQFMKSKTFNYRPFKHSYNAILNSLLGVKQYKLI 240

Query: 241 EWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYV 300
           EWVY+QML DG S D+LTYN+LL+   +LGK+D+F RL DEMAR+GFSPD +TYNILL++
Sbjct: 241 EWVYKQMLEDGFSPDVLTYNILLWTNYRLGKMDRFDRLFDEMARDGFSPDSYTYNILLHI 300

Query: 301 LGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPD 360
           LGKG+KPLAAL  LNHM+EVG DPS+LH+TTLI+GLSRAGNL+ACKYF DE+   GC PD
Sbjct: 301 LGKGNKPLAALTTLNHMKEVGIDPSVLHYTTLIDGLSRAGNLEACKYFLDEMVKAGCRPD 360

Query: 361 VVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLE 420
           VVCYTVMIT Y V+GE +KA+E+F EM +KGQLPNVFTYNSMIRG CM GEF EA  +L+
Sbjct: 361 VVCYTVMITGYVVSGELDKAKEMFREMTVKGQLPNVFTYNSMIRGLCMAGEFREACWLLK 420

Query: 421 EMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRR 475
           EMESRGC PNF+VYSTLVSYLR AG+LSEA KVI  MV+KG Y HLV K   YRR
Sbjct: 421 EMESRGCNPNFVVYSTLVSYLRKAGKLSEARKVIREMVKKGHYVHLVPKMMKYRR 473

BLAST of Cp4.1LG09g00920.1 vs. TAIR10
Match: AT3G60040.1 (AT3G60040.1 F-box family protein)

HSP 1 Score: 169.9 bits (429), Expect = 3.9e-42
Identity = 85/153 (55.56%), Postives = 104/153 (67.97%), Query Frame = 1

Query: 322 DPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITSYTVAGEHEKARE 381
           D + L    L++ L +     A     + +   G  P V+ YT +I  Y V+GE +KA+E
Sbjct: 686 DGTQLLLIILLHILGKGNKPLAALTTLNHMKEVGIDPSVLHYTTLIDGYVVSGELDKAKE 745

Query: 382 LFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLR 441
           +F EM +KGQLPNVFTYNSMIRG CM GEF EA  +L+EMESRGC PNF+VYSTLV YLR
Sbjct: 746 MFREMTVKGQLPNVFTYNSMIRGLCMAGEFREACWLLKEMESRGCNPNFVVYSTLVGYLR 805

Query: 442 NAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRR 475
            AG+LSEA KVI  MV+KG Y HLV+K   YRR
Sbjct: 806 KAGKLSEARKVIKEMVKKGHYVHLVSKMMKYRR 838

BLAST of Cp4.1LG09g00920.1 vs. TAIR10
Match: AT3G16010.1 (AT3G16010.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 161.4 bits (407), Expect = 1.4e-39
Identity = 110/373 (29.49%), Postives = 179/373 (47.99%), Query Frame = 1

Query: 91  GFDTLVALDELQLKVSGVLVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTA 150
           G D   AL+ L+LKV   LVR +L  I   INV           +FF W+GK  NF H  
Sbjct: 77  GPDAEKALEVLKLKVDHRLVRSILE-IDVEINVK---------IQFFKWAGKRRNFQHDC 136

Query: 151 SSYHMIMKIFAECEEFKAMWRLVDEMTEKGY-PVTARTFTLLICTCGDAGLARKVVERFI 210
           S+Y  +++   E   +  M+R + E+    Y  V+    + L+   G A +  K +  F 
Sbjct: 137 STYMTLIRCLEEARLYGEMYRTIQEVVRNTYVSVSPAVLSELVKALGRAKMVSKALSVFY 196

Query: 211 KSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQQMLLDGHS-SDMLTYNLLLYARCKL 270
           ++K    +P   +YN+++  L+   Q++ +  VY +M  +G    D +TY+ L+ +  KL
Sbjct: 197 QAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTEMCNEGDCFPDTITYSALISSYEKL 256

Query: 271 GKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHF 330
           G+ D   RL DEM  N   P    Y  LL +  K  K   AL+L   M+  G  P++  +
Sbjct: 257 GRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKVEKALDLFEEMKRAGCSPTVYTY 316

Query: 331 TTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVI 390
           T LI GL +AG +D    F+ ++  +G TPDVV    ++      G  E+   +F EM +
Sbjct: 317 TELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVVFLNNLMNILGKVGRVEELTNVFSEMGM 376

Query: 391 KGQLPNVFTYNSMIRG-FCMVGEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLS 450
               P V +YN++I+  F       E  +  ++M++    P+   YS L+       ++ 
Sbjct: 377 WRCTPTVVSYNTVIKALFESKAHVSEVSSWFDKMKADSVSPSEFTYSILIDGYCKTNRVE 436

Query: 451 EAHKVITHMVEKG 461
           +A  ++  M EKG
Sbjct: 437 KALLLLEEMDEKG 439

BLAST of Cp4.1LG09g00920.1 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 156.8 bits (395), Expect = 3.4e-38
Identity = 104/391 (26.60%), Postives = 185/391 (47.31%), Query Frame = 1

Query: 72  EDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILRSINVLNKT-QCA 131
           ++  +D  K   +L++    F + V   EL L  SGV +R    G++    VLN+     
Sbjct: 77  DEFASDVEKSYRILRK----FHSRVPKLELALNESGVELRP---GLIE--RVLNRCGDAG 136

Query: 132 KLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEM-TEKGYPVTARTFT 191
            LGY+FFVW+ K   + H+   Y  ++KI ++  +F A+W L++EM  E    +    F 
Sbjct: 137 NLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFV 196

Query: 192 LLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLIEWVYQQMLLD 251
           +L+     A + +K +E   +   F + P ++ +  +L AL      K    +++ M + 
Sbjct: 197 VLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRM- 256

Query: 252 GHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYVLGKGDKPLAA 311
               ++  +  LLY  C++GK+ +   +L +M   GF PD   Y  LL       K   A
Sbjct: 257 RFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADA 316

Query: 312 LNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPDVVCYTVMITS 371
            +LL  MR  G +P+   +T LI  L +   ++     F E+    C  DVV YT +++ 
Sbjct: 317 YDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSG 376

Query: 372 YTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLEEMESRGCKPN 431
           +   G+ +K   + D+M+ KG +P+  TY  ++        FEE   ++E+M      P+
Sbjct: 377 FCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPD 436

Query: 432 FLVYSTLVSYLRNAGQLSEAHKVITHMVEKG 461
             +Y+ ++      G++ EA ++   M E G
Sbjct: 437 IGIYNVVIRLACKLGEVKEAVRLWNEMEENG 457

BLAST of Cp4.1LG09g00920.1 vs. NCBI nr
Match: gi|659077478|ref|XP_008439226.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis melo])

HSP 1 Score: 810.1 bits (2091), Expect = 2.1e-231
Identity = 388/476 (81.51%), Postives = 435/476 (91.39%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFE-RSESCEFESL 60
           MN+LSL+S+RIVCRL S SFIVRRTIWNRSF SDDRF+FV+EP+    +  S+S E +S 
Sbjct: 1   MNSLSLVSSRIVCRLFSTSFIVRRTIWNRSFCSDDRFQFVVEPFSYFADGNSDSFETDSR 60

Query: 61  RCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILR 120
           R DD S+RR FL+D+K DA KV+E+LKQDGPGFDT +ALDEL+L+VSGVLV EVL+GIL+
Sbjct: 61  RWDDFSLRRSFLKDAKIDAEKVIEILKQDGPGFDTFLALDELKLQVSGVLVGEVLKGILK 120

Query: 121 SINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEK 180
           S +VLNKTQCAKLGYKFFVWS +IEN+ HT  SYHMIMKIFAECEEFKAMWRL+DEMTEK
Sbjct: 121 STSVLNKTQCAKLGYKFFVWSSRIENYRHTVKSYHMIMKIFAECEEFKAMWRLLDEMTEK 180

Query: 181 GYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLI 240
           GYPVTARTF +LICTCGDAGLA+KVVERFIKSKTFN+RP+KHSYNAILH L+ +KQYKLI
Sbjct: 181 GYPVTARTFMILICTCGDAGLAKKVVERFIKSKTFNFRPYKHSYNAILHGLVVVKQYKLI 240

Query: 241 EWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYV 300
           EWVY+QMLLDGH  D+LTYN+LL++RCKLGKLDQFHRLLDEMAR GFSPD+HTYNILLYV
Sbjct: 241 EWVYEQMLLDGHGPDILTYNVLLFSRCKLGKLDQFHRLLDEMARKGFSPDFHTYNILLYV 300

Query: 301 LGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPD 360
           LGKGDKPLAALNLLNHMREVG  P+ILHFTTLINGLSRAGNLDACKYFFDEL N+ C PD
Sbjct: 301 LGKGDKPLAALNLLNHMREVGFGPNILHFTTLINGLSRAGNLDACKYFFDELGNHDCIPD 360

Query: 361 VVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLE 420
           VVCYTVMITSYT AG+HEKAR  FDEM++KGQLPNVFTYNSMIRGFCMVG+F+EAY+ML 
Sbjct: 361 VVCYTVMITSYTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAYSMLS 420

Query: 421 EMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           EMESRGC+PNFLVYSTLVSYLRNAG+L+EAHKVIT MVE GQYAHL+TKFKGYRRC
Sbjct: 421 EMESRGCRPNFLVYSTLVSYLRNAGKLAEAHKVITRMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG09g00920.1 vs. NCBI nr
Match: gi|449446161|ref|XP_004140840.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis sativus])

HSP 1 Score: 797.7 bits (2059), Expect = 1.1e-227
Identity = 381/476 (80.04%), Postives = 431/476 (90.55%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGSDDRFEFVMEPYKKGFE-RSESCEFESL 60
           MN+LSL+S+RIVCRL S SFIVRRTIWNR+F SDDRF+F +EP+    +  S+S E +S 
Sbjct: 1   MNSLSLVSSRIVCRLFSTSFIVRRTIWNRNFCSDDRFQFFVEPFSYLADGNSDSFETDSR 60

Query: 61  RCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGILR 120
           R DD S RR FL+D+K DA KV+E+LKQDGPGFDT +ALDELQLKVSGVLV EVL+GIL+
Sbjct: 61  RWDDFSFRRSFLKDAKIDAEKVIEILKQDGPGFDTFLALDELQLKVSGVLVGEVLKGILK 120

Query: 121 SINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMTEK 180
           S +VLNKTQCAKLGYKFF+WSG+IEN+ HT +SYH+IMKIFAECEEFKAMWR++DEMTEK
Sbjct: 121 SKSVLNKTQCAKLGYKFFIWSGRIENYRHTVNSYHIIMKIFAECEEFKAMWRVLDEMTEK 180

Query: 181 GYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYKLI 240
           GYPVTARTF +LICTCG+AGLA++VVERFIKSKTFN+RP+KHSYNAILH L+ +KQYKLI
Sbjct: 181 GYPVTARTFMILICTCGEAGLAKRVVERFIKSKTFNFRPYKHSYNAILHGLVIVKQYKLI 240

Query: 241 EWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLYV 300
            WVY QMLLD HS D+LTYN+LL++ CKLGKLDQFHRLLDEMAR GFSPD+HTYNILLYV
Sbjct: 241 GWVYDQMLLDDHSPDILTYNVLLFSSCKLGKLDQFHRLLDEMARKGFSPDFHTYNILLYV 300

Query: 301 LGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCTPD 360
           LGKGDKPLAALNLLNHMREVG  P++LHFTTLINGLSRAGNLDACKYFFDEL NNGC PD
Sbjct: 301 LGKGDKPLAALNLLNHMREVGFGPNVLHFTTLINGLSRAGNLDACKYFFDELGNNGCIPD 360

Query: 361 VVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTMLE 420
           VVCYTVMITS+T AG+HEKAR  FDEM++KGQLPNVFTYNSMIRGFCMVG+F+EAY+ML 
Sbjct: 361 VVCYTVMITSFTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAYSMLS 420

Query: 421 EMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           EMESRGC+PNFLVYSTLVSYLRNAG+L EAHKVI  MVE GQYAHL+TKFKGYRRC
Sbjct: 421 EMESRGCRPNFLVYSTLVSYLRNAGKLGEAHKVIKQMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG09g00920.1 vs. NCBI nr
Match: gi|645257085|ref|XP_008234250.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g60050-like [Prunus mume])

HSP 1 Score: 683.3 bits (1762), Expect = 3.0e-193
Identity = 329/488 (67.42%), Postives = 405/488 (82.99%), Query Frame = 1

Query: 1   MNALSLLSARIVCRLVSNSFIVRRTIWNRSFGS---DDRFEFVMEPYKKGF--------- 60
           MN +SL   R++ + +S  F+V R + +  FG    D+  EF+ EP K+ +         
Sbjct: 1   MNCISLFGPRVIQK-ISCYFVVARKLSDGCFGGNKGDNGHEFIEEPLKRIWRSSDFDSVL 60

Query: 61  -ERSESCEFESLRCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSG 120
            E+    E+E+   +  S+RR F E++K    +VLEVL+QDGPGFDT  ALDEL ++VSG
Sbjct: 61  DEKQNVHEYENRPREYFSLRRSFFENAKIHTRRVLEVLQQDGPGFDTKAALDELHIEVSG 120

Query: 121 VLVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFK 180
           +LVREVL  IL+ +N  +K +CAKLGYKFFVWSG++EN+ HTA++YH++MKIFA+CEEFK
Sbjct: 121 LLVREVLFNILKQVNYASKMRCAKLGYKFFVWSGQLENYRHTANTYHLMMKIFADCEEFK 180

Query: 181 AMWRLVDEMTEKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAIL 240
           AMWRLVDEM EKGYP TA+TF++LI TCG+AGLA+KVVERFIKSKTFNYRPFKHSYNAIL
Sbjct: 181 AMWRLVDEMIEKGYPTTAQTFSILIRTCGEAGLAKKVVERFIKSKTFNYRPFKHSYNAIL 240

Query: 241 HALLGIKQYKLIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFS 300
           H+L+ +KQYKLIEWVYQQML DGH +D+LTYN+++YA+ +LGKLDQFHRLL+EM R+GF+
Sbjct: 241 HSLVVVKQYKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFA 300

Query: 301 PDYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYF 360
           PD+HTYNILL+VLGKGDKPLAALNLLNHM+EVG DPS+LHFTTLI+GLSRAGNLDACKYF
Sbjct: 301 PDFHTYNILLHVLGKGDKPLAALNLLNHMKEVGFDPSVLHFTTLIDGLSRAGNLDACKYF 360

Query: 361 FDELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCM 420
           FDE+  + C PDVVCYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNVFTYN+MIRG CM
Sbjct: 361 FDEMIKHECFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCM 420

Query: 421 VGEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVT 476
            G+FEEA +ML++MESRGC PNF VYSTLVSYLRNAG+ ++AH+VITHMVEKGQY HL++
Sbjct: 421 AGKFEEACSMLKDMESRGCNPNFTVYSTLVSYLRNAGKRAKAHEVITHMVEKGQYTHLLS 480

BLAST of Cp4.1LG09g00920.1 vs. NCBI nr
Match: gi|596041207|ref|XP_007220061.1| (hypothetical protein PRUPE_ppa025794mg [Prunus persica])

HSP 1 Score: 672.2 bits (1733), Expect = 6.8e-190
Identity = 313/427 (73.30%), Postives = 376/427 (88.06%), Query Frame = 1

Query: 49  ERSESCEFESLRCDDLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGV 108
           E+    E+E+   +  S+RR F E++K    +VLEVL+QDGPGFDT  ALDEL ++VSG+
Sbjct: 20  EKQNVPEYENRPREYFSLRRSFFENAKIHTRRVLEVLQQDGPGFDTKAALDELHIEVSGL 79

Query: 109 LVREVLRGILRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKA 168
           LVREVL  IL+ +N  +K +CAKLGYKFFVWSG++EN+ HTA++YH++MKIFA+CEEFKA
Sbjct: 80  LVREVLFKILKQVNYASKMRCAKLGYKFFVWSGQLENYRHTANTYHLMMKIFADCEEFKA 139

Query: 169 MWRLVDEMTEKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILH 228
           MWRLVDEM EKGYP TA+TF +LICTCG+AGLA+KVVERFIKSKTFNYRPFKHSYNAILH
Sbjct: 140 MWRLVDEMIEKGYPTTAQTFNILICTCGEAGLAKKVVERFIKSKTFNYRPFKHSYNAILH 199

Query: 229 ALLGIKQYKLIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSP 288
           +L+ +KQYKLIEWVYQQML DGH +D+LTYN+++YA+ +LGKLDQFHRLL+EM R+GF+P
Sbjct: 200 SLVVVKQYKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFAP 259

Query: 289 DYHTYNILLYVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFF 348
           D HTYNILL+VLGKGDKPLAALNLLNHM+EVG DPS+LHFTTLI+GLSR+GNLDACKYFF
Sbjct: 260 DLHTYNILLHVLGKGDKPLAALNLLNHMKEVGLDPSVLHFTTLIDGLSRSGNLDACKYFF 319

Query: 349 DELSNNGCTPDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMV 408
           DE+  + C PDVVCYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNVFTYN+MIRG CM 
Sbjct: 320 DEMIKHECFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCMA 379

Query: 409 GEFEEAYTMLEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTK 468
           G+FEEA +ML++MESRGC PNF VYSTLVSYLRNAG+L++AH+VITHMVEKGQY HL++K
Sbjct: 380 GKFEEACSMLKDMESRGCNPNFTVYSTLVSYLRNAGKLAKAHEVITHMVEKGQYTHLLSK 439

Query: 469 FKGYRRC 476
           FKGYRRC
Sbjct: 440 FKGYRRC 446

BLAST of Cp4.1LG09g00920.1 vs. NCBI nr
Match: gi|703141203|ref|XP_010107444.1| (hypothetical protein L484_015785 [Morus notabilis])

HSP 1 Score: 669.8 bits (1727), Expect = 3.4e-189
Identity = 324/478 (67.78%), Postives = 388/478 (81.17%), Query Frame = 1

Query: 11  IVCRLVSNSFIVRRTIWNRSF---GSDDRFEFVMEPYKKGFERS-------ESCEFESLR 70
           +V ++  +S +V R   NRSF     ++ ++   EP K+ ++ S       E  +F S  
Sbjct: 21  VVSKVTLSSLVVIRNFCNRSFDGVNGENGYDCFEEPLKRMWKSSYFDSDMDEQSDFYSRE 80

Query: 71  CD---DLSVRRGFLEDSKTDAGKVLEVLKQDGPGFDTLVALDELQLKVSGVLVREVLRGI 130
                + SVR+ F E ++ DAG+VLEVL+QDGPGFD   ALDEL ++VSG+LVR+VL GI
Sbjct: 81  MPARGNFSVRQSFFETARIDAGRVLEVLQQDGPGFDAKPALDELNIRVSGLLVRKVLLGI 140

Query: 131 LRSINVLNKTQCAKLGYKFFVWSGKIENFNHTASSYHMIMKIFAECEEFKAMWRLVDEMT 190
           L +I+  NK +CAKLG+KFF WSG+  N+ HTA+SYH+++KIFAECEEFKAMWRLVDEM 
Sbjct: 141 LSNISHTNKIRCAKLGFKFFTWSGQQGNYRHTANSYHLLIKIFAECEEFKAMWRLVDEMI 200

Query: 191 EKGYPVTARTFTLLICTCGDAGLARKVVERFIKSKTFNYRPFKHSYNAILHALLGIKQYK 250
           E+G+P TART  +LICTCG+AGLARKVVERFIKSKTFN+RPFKHSYNAILH LL   QYK
Sbjct: 201 ERGFPTTARTLNILICTCGEAGLARKVVERFIKSKTFNFRPFKHSYNAILHCLLVTNQYK 260

Query: 251 LIEWVYQQMLLDGHSSDMLTYNLLLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 310
           LIEWVYQQML DG S D+LTYN+L+  + +LGKLDQFHRLLDEM R GFSPD HTYNILL
Sbjct: 261 LIEWVYQQMLADGFSPDILTYNVLMLTKYRLGKLDQFHRLLDEMGRRGFSPDLHTYNILL 320

Query: 311 YVLGKGDKPLAALNLLNHMREVGSDPSILHFTTLINGLSRAGNLDACKYFFDELSNNGCT 370
           +VLGKGDKPLAALNLLNHM+E G DP +LHFTTLI+GLSRAGNLDAC++FFDE+  NGC 
Sbjct: 321 HVLGKGDKPLAALNLLNHMKETGIDPGVLHFTTLIDGLSRAGNLDACRFFFDEMPKNGCI 380

Query: 371 PDVVCYTVMITSYTVAGEHEKARELFDEMVIKGQLPNVFTYNSMIRGFCMVGEFEEAYTM 430
           PDVVCYTVMIT Y VAGE EKA+ +FDEM+ KGQ+PNVFTYNSMIRG CM G+FEEA +M
Sbjct: 381 PDVVCYTVMITGYVVAGELEKAQSIFDEMITKGQIPNVFTYNSMIRGLCMAGKFEEACSM 440

Query: 431 LEEMESRGCKPNFLVYSTLVSYLRNAGQLSEAHKVITHMVEKGQYAHLVTKFKGYRRC 476
           L++MESRGC PNFLVY+TLVS LRNAG+LSEAH+V+ HMVEKGQY HL++KFKGYRRC
Sbjct: 441 LKDMESRGCNPNFLVYTTLVSNLRNAGKLSEAHQVVKHMVEKGQYVHLLSKFKGYRRC 498

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR81_ARATH1.2e-17363.03Pentatricopeptide repeat-containing protein At1g55630 OS=Arabidopsis thaliana GN... [more]
PP288_ARATH6.0e-17062.74Pentatricopeptide repeat-containing protein At3g60050 OS=Arabidopsis thaliana GN... [more]
PP236_ARATH2.5e-3829.49Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana GN... [more]
PP447_ARATH6.0e-3726.60Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
PP298_ARATH1.0e-3630.15Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0L8P4_CUCSA7.5e-22880.04Uncharacterized protein OS=Cucumis sativus GN=Csa_3G182100 PE=4 SV=1[more]
M5XQE6_PRUPE4.8e-19073.30Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025794mg PE=4 SV=1[more]
W9S7I4_9ROSA2.4e-18967.78Uncharacterized protein OS=Morus notabilis GN=L484_015785 PE=4 SV=1[more]
V4S4A4_9ROSI1.5e-18875.91Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004849mg PE=4 SV=1[more]
A0A059DJY2_EUCGR1.9e-18667.15Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02787 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G55630.16.6e-17563.03 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G60050.13.4e-17162.74 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G60040.13.9e-4255.56 F-box family protein[more]
AT3G16010.11.4e-3929.49 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT5G65820.13.4e-3826.60 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659077478|ref|XP_008439226.1|2.1e-23181.51PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis m... [more]
gi|449446161|ref|XP_004140840.1|1.1e-22780.04PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis s... [more]
gi|645257085|ref|XP_008234250.1|3.0e-19367.42PREDICTED: pentatricopeptide repeat-containing protein At3g60050-like [Prunus mu... [more]
gi|596041207|ref|XP_007220061.1|6.8e-19073.30hypothetical protein PRUPE_ppa025794mg [Prunus persica][more]
gi|703141203|ref|XP_010107444.1|3.4e-18967.78hypothetical protein L484_015785 [Morus notabilis][more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG09g00920Cp4.1LG09g00920gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG09g00920.1:cds:001Cp4.1LG09g00920.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG09g00920.1Cp4.1LG09g00920.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 328..356
score: 2.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 254..300
score: 3.3E-10coord: 358..406
score: 1.7
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 417..460
score: 0
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 152..184
score: 0.0029coord: 328..360
score: 5.7E-8coord: 361..394
score: 3.2E-7coord: 291..324
score: 2.6E-4coord: 396..429
score: 1.2E-12coord: 257..289
score: 3.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 219..253
score: 6.697coord: 429..463
score: 8.583coord: 149..183
score: 8.791coord: 394..428
score: 14.754coord: 359..393
score: 11.498coord: 184..218
score: 6.621coord: 289..323
score: 9.46coord: 254..288
score: 11.794coord: 324..358
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 223..291
score: 6.7E-5coord: 362..458
score: 6.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 124..463
score: 2.6E-211coord: 81..100
score: 2.6E
NoneNo IPR availablePANTHERPTHR24015:SF361SUBFAMILY NOT NAMEDcoord: 124..463
score: 2.6E-211coord: 81..100
score: 2.6E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 223..423
score: 2.8