Cp4.1LG01g24300 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g24300
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG01 : 19152530 .. 19155006 (+)
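As a quick sanity check, the span of the feature can be derived from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the common GFF-style convention; this record does not state it), and the helper names `parse_location` and `feature_length` are hypothetical, chosen only for illustration.

```python
import re

def parse_location(text):
    """Split a 'SEQID : START .. END (STRAND)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 19152530 .. 19155006 (+)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene spans 2477 bp on the forward strand of Cp4.1LG01.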
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGATGAATCAACTCCCATTAAAAAGTGTTCTTGTTCATATTGGACGTCATGGGTCCATACTTCAAGCTGTTGCTCTATCATCTTCAACACCTGATAATCTCATTACCACTGTACTTAACTGCAAAAGCCCCAAAAAGGCACTTGAATTGTTCAATGCGGCACCCGAAAAGAATACTCGGCTTTACTCGGCTATCATTCATGTCTTAGTAGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTAAAAGAGCTCATACAAGACCTCCTCGTAAAATCTCGCAGGCCATACCATGTATGTCAGTTGGCATTCAATGCGCTGAGTAGCTTAAAAACCTCAAAATTTTCTCCAAATGTATATAGCGAGTTAATTATTGTCTTATCTAAGATGGGACTTGTAGATGAAGCTTTGTGGATGTACCGCAAGGTTGGGGTGGCGGTTGCAAGGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGAAGGTTTGAATTGTTGTGGGGGATTTATGAAGAAATGGTTTCCAATGGGCTGTCTCCTGATGTTATCACTTACGGCATCCTAATTGATGGTCGCTGCCGGCAGGGCGATCTTTTAAGGGCGCATGAAATATTCGATGAAATGAGAGTGAAAGGAATTGAGCCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTCTGCTCCGAAAACAAAATGGAGGAAGCAGAGAGTATACATAGATTGATGAGGGAATTAGGGGTACTTCCAAATGTGTACACTTACAACACTTTGATGAATGGGCACTGCAAGGTGGCCAATGTAAAACAGGCTCTTAGATTGTATCATAACATGCTGGGTGAAGATCTAGTGCCAGACAATGTTACATTCGGCATTTTAATTGACGGGCTCTGCAAATTTGGCGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATGGTGAAGTTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAAAGCAGGGGATATTTCTGAAGCAATGGCTTTCCTTTCGGAGCTGGAAAGGTTTAAGGTTTCGCCAGACGTCGTTACGTACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTAGAATTGAAGAAGCAGATAACATGCTTGAGAAAATGATGAAAGAGGGAATTCCTGCAAACTCTGTTACATATAATTCACTTATTGATGGATGCTGCAAAGAAGGCAACATGAATAAGGCATTGGAAATATGCTCCCGAATGATCGAAAACGGTGTAGAACCAAATGTGATCACGTTCTCAATGCTGATTGATGGTTATTGCAAGATAAGGAACGTAGAAGCTGCTATGGGCATATACTCAGAAATGGGTATCAAAAGCCTTTCTCCTGATGTAGTTGCTTATACAGCTATGATAGATGGGCATTGCAAGCATGGTAGCATGAAAGAGGCTCTAAAACTCTACAATGATATGCTGGATAATGGTCTTACTCCAAATTCTTACACTCTTAGTTGCTTATTAGATGGACTTTGTAAAGACGGCAGAGTCTCGGACGCACTCGAACTTTTCACGGAAAAGGCTGAATTTGGGACTACAAAATGCAAGCTTTCCTTCACAAATCATGTAGTGTATACAGCTTTAATCCATGGATTGTGTGAGGATGGACAAATTTTCAAGGCAGCAAAGTTGTTTTCAGACATGAGAAGCTACGGTTTGCAACCAGATGAAGTAATTTACGTGGTCATGTTAAAAGGTTACTTCCAAGTTAAACGCATCCTCGACATGACGATGCTACATGCCGATATGTTGAAGTTTGGTATTATCCCAAACTCGGCCATCTACTCGACATTGTCTAAGGGTTATCGAGAGAGTGGATTTCTGAAATCGGCTCTGAATTGTTCGAAGGAACTGAAGGAACTATATTGTTGAAGCTATCAATGGGGAGCCTTTTACACAAGTCATCCAGTTGATTGCAATGAGTTGACGGAGAACTTGAATCC
ACATTCTTTTCTCCCTGGAGTGAGTGGAAACTGCTCTTTCAAGTTCTGATCATGTAAGTTATCACCAAATTTAATAGTAGCTTCAATGGAAACTGCTTCCAACTTCGAAGCTAACATTATCTTCTGCTGTTCTTGATTGATTTGTTTTATGTTCATTATTTTGAAATGCAGAATTGGGTAATTGGGGAAGAGGGGAACTGGTTTTCTGCTGTGATCAGTGATTAGAGAATGGCAGAAGAGAACAAGAGAAGAGCTTGAGAAACAGATATGCTTTTGGATTCCTCGTTGTATTTCTCTTGAACTTGATCATTTTGGTGCTCACATTTGAGCAGCACAAATTATGGCTATCATAATTTGATGTTTTATCGTTAAGATCTAACCGCGAATTTTAGGTTAAATTATACAAAGTACTCTTGAACTTTCGAGTAAGCTTCAATTATACCCTTCAATTTTAAACGAATTTTTCTTACTTTTATG

mRNA sequence

ATGTTGATGAATCAACTCCCATTAAAAAGTGTTCTTGTTCATATTGGACGTCATGGGTCCATACTTCAAGCTGTTGCTCTATCATCTTCAACACCTGATAATCTCATTACCACTGTACTTAACTGCAAAAGCCCCAAAAAGGCACTTGAATTGTTCAATGCGGCACCCGAAAAGAATACTCGGCTTTACTCGGCTATCATTCATGTCTTAGTAGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTAAAAGAGCTCATACAAGACCTCCTCGTAAAATCTCGCAGGCCATACCATGTATGTCAGTTGGCATTCAATGCGCTGAGTAGCTTAAAAACCTCAAAATTTTCTCCAAATGTATATAGCGAGTTAATTATTGTCTTATCTAAGATGGGACTTGTAGATGAAGCTTTGTGGATGTACCGCAAGGTTGGGGTGGCGGTTGCAAGGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGAAGGTTTGAATTGTTGTGGGGGATTTATGAAGAAATGGTTTCCAATGGGCTGTCTCCTGATGTTATCACTTACGGCATCCTAATTGATGGTCGCTGCCGGCAGGGCGATCTTTTAAGGGCGCATGAAATATTCGATGAAATGAGAGTGAAAGGAATTGAGCCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTCTGCTCCGAAAACAAAATGGAGGAAGCAGAGAGTATACATAGATTGATGAGGGAATTAGGGGTACTTCCAAATGTGTACACTTACAACACTTTGATGAATGGGCACTGCAAGGTGGCCAATGTAAAACAGGCTCTTAGATTGTATCATAACATGCTGGGTGAAGATCTAGTGCCAGACAATGTTACATTCGGCATTTTAATTGACGGGCTCTGCAAATTTGGCGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATGGTGAAGTTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAAAGCAGGGGATATTTCTGAAGCAATGGCTTTCCTTTCGGAGCTGGAAAGGTTTAAGGTTTCGCCAGACGTCGTTACGTACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTAGAATTGAAGAAGCAGATAACATGCTTGAGAAAATGATGAAAGAGGGAATTCCTGCAAACTCTGTTACATATAATTCACTTATTGATGGATGCTGCAAAGAAGGCAACATGAATAAGGCATTGGAAATATGCTCCCGAATGATCGAAAACGGTGTAGAACCAAATGTGATCACGTTCTCAATGCTGATTGATGGTTATTGCAAGATAAGGAACGTAGAAGCTGCTATGGGCATATACTCAGAAATGGGTATCAAAAGCCTTTCTCCTGATGTAGTTGCTTATACAGCTATGATAGATGGGCATTGCAAGCATGGTAGCATGAAAGAGGCTCTAAAACTCTACAATGATATGCTGGATAATGGTCTTACTCCAAATTCTTACACTCTTAGTTGCTTATTAGATGGACTTTGTAAAGACGGCAGAGTCTCGGACGCACTCGAACTTTTCACGGAAAAGGCTGAATTTGGGACTACAAAATGCAAGCTTTCCTTCACAAATCATGTAGTGTATACAGCTTTAATCCATGGATTGTGTGAGGATGGACAAATTTTCAAGGCAGCAAAGTTGTTTTCAGACATGAGAAGCTACGGTTTGCAACCAGATGAAGTAATTTACGTGGTCATGTTAAAAGGTTACTTCCAAGTTAAACGCATCCTCGACATGACGATGCTACATGCCGATATGTTGAAGTTTGAATTGGGTAATTGGGGAAGAGGGGAACTGGTTTTCTGCTGTGATCAGTGATTAGAGAATGGCAGAAGAGAACAAGAGAAGAGCTTGAGAAACAGATATGCTTTTGGATTCCTCGTTGTATTTCTCTTGAACTTGATCATTTTGGTGCTCACATTTGAGCAGCACAAATTATGGCTATCATAATT
TGATGTTTTATCGTTAAGATCTAACCGCGAATTTTAGGTTAAATTATACAAAGTACTCTTGAACTTTCGAGTAAGCTTCAATTATACCCTTCAATTTTAAACGAATTTTTCTTACTTTTATG

Coding sequence (CDS)

ATGTTGATGAATCAACTCCCATTAAAAAGTGTTCTTGTTCATATTGGACGTCATGGGTCCATACTTCAAGCTGTTGCTCTATCATCTTCAACACCTGATAATCTCATTACCACTGTACTTAACTGCAAAAGCCCCAAAAAGGCACTTGAATTGTTCAATGCGGCACCCGAAAAGAATACTCGGCTTTACTCGGCTATCATTCATGTCTTAGTAGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTAAAAGAGCTCATACAAGACCTCCTCGTAAAATCTCGCAGGCCATACCATGTATGTCAGTTGGCATTCAATGCGCTGAGTAGCTTAAAAACCTCAAAATTTTCTCCAAATGTATATAGCGAGTTAATTATTGTCTTATCTAAGATGGGACTTGTAGATGAAGCTTTGTGGATGTACCGCAAGGTTGGGGTGGCGGTTGCAAGGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGAAGGTTTGAATTGTTGTGGGGGATTTATGAAGAAATGGTTTCCAATGGGCTGTCTCCTGATGTTATCACTTACGGCATCCTAATTGATGGTCGCTGCCGGCAGGGCGATCTTTTAAGGGCGCATGAAATATTCGATGAAATGAGAGTGAAAGGAATTGAGCCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTCTGCTCCGAAAACAAAATGGAGGAAGCAGAGAGTATACATAGATTGATGAGGGAATTAGGGGTACTTCCAAATGTGTACACTTACAACACTTTGATGAATGGGCACTGCAAGGTGGCCAATGTAAAACAGGCTCTTAGATTGTATCATAACATGCTGGGTGAAGATCTAGTGCCAGACAATGTTACATTCGGCATTTTAATTGACGGGCTCTGCAAATTTGGCGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATGGTGAAGTTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAAAGCAGGGGATATTTCTGAAGCAATGGCTTTCCTTTCGGAGCTGGAAAGGTTTAAGGTTTCGCCAGACGTCGTTACGTACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTAGAATTGAAGAAGCAGATAACATGCTTGAGAAAATGATGAAAGAGGGAATTCCTGCAAACTCTGTTACATATAATTCACTTATTGATGGATGCTGCAAAGAAGGCAACATGAATAAGGCATTGGAAATATGCTCCCGAATGATCGAAAACGGTGTAGAACCAAATGTGATCACGTTCTCAATGCTGATTGATGGTTATTGCAAGATAAGGAACGTAGAAGCTGCTATGGGCATATACTCAGAAATGGGTATCAAAAGCCTTTCTCCTGATGTAGTTGCTTATACAGCTATGATAGATGGGCATTGCAAGCATGGTAGCATGAAAGAGGCTCTAAAACTCTACAATGATATGCTGGATAATGGTCTTACTCCAAATTCTTACACTCTTAGTTGCTTATTAGATGGACTTTGTAAAGACGGCAGAGTCTCGGACGCACTCGAACTTTTCACGGAAAAGGCTGAATTTGGGACTACAAAATGCAAGCTTTCCTTCACAAATCATGTAGTGTATACAGCTTTAATCCATGGATTGTGTGAGGATGGACAAATTTTCAAGGCAGCAAAGTTGTTTTCAGACATGAGAAGCTACGGTTTGCAACCAGATGAAGTAATTTACGTGGTCATGTTAAAAGGTTACTTCCAAGTTAAACGCATCCTCGACATGACGATGCTACATGCCGATATGTTGAAGTTTGAATTGGGTAATTGGGGAAGAGGGGAACTGGTTTTCTGCTGTGATCAGTGA

Protein sequence

MLMNQLPLKSVLVHIGRHGSILQAVALSSSTPDNLITTVLNCKSPKKALELFNAAPEKNTRLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLKFELGNWGRGELVFCCDQ
BLAST of Cp4.1LG01g24300 vs. Swiss-Prot
Match: PP440_ARATH (Pentatricopeptide repeat-containing protein At5g61400 OS=Arabidopsis thaliana GN=At5g61400 PE=2 SV=1)

HSP 1 Score: 539.7 bits (1389), Expect = 4.3e-152
Identity = 274/583 (47.00%), Positives = 390/583 (66.90%), Query Frame = 1

Query: 28  SSSTPDNLITTVLNCKSPKKALELFNAAPEKNT------RLYSAIIHVLVGSKLFSHARC 87
           SS +  +L   +L C+S ++A +LF  +           + +SA+IHVL G+  ++ ARC
Sbjct: 37  SSFSSSSLAEAILKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLARC 96

Query: 88  LLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMY 147
           L+K LI+ L   S  P ++    FNAL  +++ KFS  V+S LI+   +MGL +EALW+ 
Sbjct: 97  LIKSLIERLKRHSE-PSNMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEEALWVS 156

Query: 148 RKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDL 207
           R++  +   +AC  +L+ LV+  RF+ +W  Y+ M+S GL PDV  Y +L     +QG  
Sbjct: 157 REMKCSPDSKACLSILNGLVRRRRFDSVWVDYQLMISRGLVPDVHIYFVLFQCCFKQGLY 216

Query: 208 LRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTL 267
            +  ++ DEM   GI+P V +YTI I  LC +NKMEEAE +  LM++ GVLPN+YTY+ +
Sbjct: 217 SKKEKLLDEMTSLGIKPNVYIYTIYILDLCRDNKMEEAEKMFELMKKHGVLPNLYTYSAM 276

Query: 268 MNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSV 327
           ++G+CK  NV+QA  LY  +L  +L+P+ V FG L+DG CK  ++  AR+L V+MVKF V
Sbjct: 277 IDGYCKTGNVRQAYGLYKEILVAELLPNVVVFGTLVDGFCKARELVTARSLFVHMVKFGV 336

Query: 328 TPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADN 387
            P++ VYN LI G+CK+G++ EA+  LSE+E   +SPDV TY+ILI GLC   ++ EA+ 
Sbjct: 337 DPNLYVYNCLIHGHCKSGNMLEAVGLLSEMESLNLSPDVFTYTILINGLCIEDQVAEANR 396

Query: 388 MLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYC 447
           + +KM  E I  +S TYNSLI G CKE NM +AL++CS M  +GVEPN+ITFS LIDGYC
Sbjct: 397 LFQKMKNERIFPSSATYNSLIHGYCKEYNMEQALDLCSEMTASGVEPNIITFSTLIDGYC 456

Query: 448 KIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSY 507
            +R+++AAMG+Y EM IK + PDVV YTA+ID H K  +MKEAL+LY+DML+ G+ PN +
Sbjct: 457 NVRDIKAAMGLYFEMTIKGIVPDVVTYTALIDAHFKEANMKEALRLYSDMLEAGIHPNDH 516

Query: 508 TLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAA 567
           T +CL+DG  K+GR+S A++ + E  +      + S  NHV +T LI GLC++G I +A+
Sbjct: 517 TFACLVDGFWKEGRLSVAIDFYQENNQ------QRSCWNHVGFTCLIEGLCQNGYILRAS 576

Query: 568 KLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           + FSDMRS G+ PD   YV MLKG+ Q KRI D  ML  DM+K
Sbjct: 577 RFFSDMRSCGITPDICSYVSMLKGHLQEKRITDTMMLQCDMIK 612
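
The percentages in an HSP statistics line are the match counts divided by the alignment length. The following is a minimal sketch that re-derives them for the hit above; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)} for an HSP stats line."""
    # Matches fields of the form 'Label = N/TOTAL (PCT%)'; 'Query Frame = 1' is skipped.
    pattern = r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in re.findall(pattern, line)
    }

stats = parse_hsp_stats(
    "Identity = 274/583 (47.00%), Positives = 390/583 (66.90%), Query Frame = 1"
)
for label, (n, total, pct) in stats.items():
    recomputed = 100 * n / total
    # Reported values are rounded to two decimals, so allow a small tolerance.
    print(label, f"{recomputed:.2f}", "matches reported" if abs(recomputed - pct) < 0.01 else "differs")
```

The same check applies to every HSP in this record, since each statistics line follows the same layout.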

BLAST of Cp4.1LG01g24300 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 1.3e-79
Identity = 176/577 (30.50%), Positives = 295/577 (51.13%), Query Frame = 1

Query: 43  KSPKKALELFNAAPEKN-----TRLYSAIIHVLVGSKLFSHARCLLKELIQ--------D 102
           + PK A + F  +  +N        Y  + H+L  ++++  A  +LKE++         D
Sbjct: 120 EDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVLSKADCDVFD 179

Query: 103 LLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVG---V 162
           +L  +R   +VC   F              V+  L  VL  +G+++EA+  + K+    V
Sbjct: 180 VLWSTR---NVCVPGFG-------------VFDALFSVLIDLGMLEEAIQCFSKMKRFRV 239

Query: 163 AVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHE 222
               ++CN LL    K G+ + +   +++M+  G  P V TY I+ID  C++GD+  A  
Sbjct: 240 FPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARG 299

Query: 223 IFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHC 282
           +F+EM+ +G+ P  V Y  +I G     ++++       M+++   P+V TYN L+N  C
Sbjct: 300 LFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFC 359

Query: 283 KVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIA 342
           K   +   L  Y  M G  L P+ V++  L+D  CK G ++ A    V+M +  + P+  
Sbjct: 360 KFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEY 419

Query: 343 VYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKM 402
            Y SLID  CK G++S+A    +E+ +  V  +VVTY+ LI GLC A R++EA+ +  KM
Sbjct: 420 TYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKM 479

Query: 403 MKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNV 462
              G+  N  +YN+LI G  K  NM++ALE+ + +   G++P+++ +   I G C +  +
Sbjct: 480 DTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKI 539

Query: 463 EAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCL 522
           EAA  + +EM    +  + + YT ++D + K G+  E L L ++M +  +     T   L
Sbjct: 540 EAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVL 599

Query: 523 LDGLCKDGRVSDALELFTE-KAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAAKLFS 582
           +DGLCK+  VS A++ F     +FG         N  ++TA+I GLC+D Q+  A  LF 
Sbjct: 600 IDGLCKNKLVSKAVDYFNRISNDFGLQ------ANAAIFTAMIDGLCKDNQVEAATTLFE 659

Query: 583 DMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADM 603
            M   GL PD   Y  ++ G F+   +L+   L   M
Sbjct: 660 QMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKM 674

BLAST of Cp4.1LG01g24300 vs. Swiss-Prot
Match: PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.1e-75
Identity = 167/485 (34.43%), Positives = 256/485 (52.78%), Query Frame = 1

Query: 121 YSELIIVLSKMGLVDEALWMYRKV---GVAVARQACNVLLDVLVKTGRFELLWGIYEEMV 180
           Y+ L+  L++ GLVDE   +Y ++    V       N +++   K G  E       ++V
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIV 245

Query: 181 SNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKME 240
             GL PD  TY  LI G C++ DL  A ++F+EM +KG     V YT LI GLC   +++
Sbjct: 246 EAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRID 305

Query: 241 EAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILI 300
           EA  +   M++    P V TY  L+   C      +AL L   M    + P+  T+ +LI
Sbjct: 306 EAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLI 365

Query: 301 DGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVS 360
           D LC     + AR L   M++  + P++  YN+LI+GYCK G I +A+  +  +E  K+S
Sbjct: 366 DSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLS 425

Query: 361 PDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEI 420
           P+  TY+ LI+G C +  + +A  +L KM++  +  + VTYNSLIDG C+ GN + A  +
Sbjct: 426 PNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRL 485

Query: 421 CSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCK 480
            S M + G+ P+  T++ +ID  CK + VE A  ++  +  K ++P+VV YTA+IDG+CK
Sbjct: 486 LSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCK 545

Query: 481 HGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLS 540
            G + EA  +   ML     PNS T + L+ GLC DG++ +A  L  +  + G      +
Sbjct: 546 AGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVST 605

Query: 541 FTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTM 600
                  T LIH L +DG    A   F  M S G +PD   Y   ++ Y +  R+LD   
Sbjct: 606 ------DTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAED 663

Query: 601 LHADM 603
           + A M
Sbjct: 666 MMAKM 663

BLAST of Cp4.1LG01g24300 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 3.3e-75
Identity = 149/460 (32.39%), Positives = 254/460 (55.22%), Query Frame = 1

Query: 154 NVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRV 213
           N+L+      G  ++   ++++M + G  P+V+TY  LIDG C+   +    ++   M +
Sbjct: 209 NILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMAL 268

Query: 214 KGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQ 273
           KG+EP ++ Y ++I GLC E +M+E   +   M   G   +  TYNTL+ G+CK  N  Q
Sbjct: 269 KGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQ 328

Query: 274 ALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLID 333
           AL ++  ML   L P  +T+  LI  +CK G++  A      M    + P+   Y +L+D
Sbjct: 329 ALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVD 388

Query: 334 GYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPA 393
           G+ + G ++EA   L E+     SP VVTY+ LI G C  G++E+A  +LE M ++G+  
Sbjct: 389 GFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSP 448

Query: 394 NSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIY 453
           + V+Y++++ G C+  ++++AL +   M+E G++P+ IT+S LI G+C+ R  + A  +Y
Sbjct: 449 DVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLY 508

Query: 454 SEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKD 513
            EM    L PD   YTA+I+ +C  G +++AL+L+N+M++ G+ P+  T S L++GL K 
Sbjct: 509 EEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQ 568

Query: 514 GRVSDA----LELFTEKA-----EFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAAKLF 573
            R  +A    L+LF E++      + T     S        +LI G C  G + +A ++F
Sbjct: 569 SRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMKGMMTEADQVF 628

Query: 574 SDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
             M     +PD   Y +M+ G+ +   I     L+ +M+K
Sbjct: 629 ESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVK 668

BLAST of Cp4.1LG01g24300 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 4.3e-75
Identity = 185/597 (30.99%), Positives = 294/597 (49.25%), Query Frame = 1

Query: 25  VALSSSTPDNLITTVL-------NCKSPKKALELFNAAP-----EKNTRLYSAIIHVLVG 84
           +ALSS      + TV            PK  L  FN        + +T  +  +IH LV 
Sbjct: 57  IALSSELVSRRLKTVHVEEILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVK 116

Query: 85  SKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELII--VLSK 144
           + LF  A  LL    Q LL+++ +P  V  + F+     K S  S     +L+I   +  
Sbjct: 117 ANLFWPASSLL----QTLLLRALKPSDVFNVLFSCYEKCKLSSSSS---FDLLIQHYVRS 176

Query: 145 MGLVDEAL---WMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVIT 204
             ++D  L    M  KV +    +  + LL  LVK   F L   ++ +MVS G+ PDV  
Sbjct: 177 RRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYI 236

Query: 205 YGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMR 264
           Y  +I   C   DL RA E+   M   G +  +V Y +LI GLC + K+ EA  I + + 
Sbjct: 237 YTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLA 296

Query: 265 ELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIK 324
              + P+V TY TL+ G CKV   +  L +   ML     P       L++GL K G I+
Sbjct: 297 GKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIE 356

Query: 325 AARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILI 384
            A NL   +V F V+P++ VYN+LID  CK     EA      + +  + P+ VTYSILI
Sbjct: 357 EALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILI 416

Query: 385 RGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVE 444
              C  G+++ A + L +M+  G+  +   YNSLI+G CK G+++ A    + MI   +E
Sbjct: 417 DMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLE 476

Query: 445 PNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKL 504
           P V+T++ L+ GYC    +  A+ +Y EM  K ++P +  +T ++ G  + G +++A+KL
Sbjct: 477 PTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKL 536

Query: 505 YNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTAL 564
           +N+M +  + PN  T + +++G C++G +S A E   E  E G      S      Y  L
Sbjct: 537 FNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYS------YRPL 596

Query: 565 IHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           IHGLC  GQ  +A      +     + +E+ Y  +L G+ +  ++ +   +  +M++
Sbjct: 597 IHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQ 640

BLAST of Cp4.1LG01g24300 vs. TrEMBL
Match: A0A0A0KS30_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G496480 PE=4 SV=1)

HSP 1 Score: 958.4 bits (2476), Expect = 4.4e-276
Identity = 484/612 (79.08%), Positives = 532/612 (86.93%), Query Frame = 1

Query: 1   MLMNQLPLKSVLVHIGRHGSILQAVALSSSTPDNLITTVLNCKSPKKALELFNAAPEKNT 60
           MLM Q PLKSVLV IG +G++LQ V+LSS TPD+LITTVLNC+SP KALE FNAAPEKN 
Sbjct: 12  MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTPDSLITTVLNCRSPWKALEFFNAAPEKNI 71

Query: 61  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120
           +LYSAIIHVLVGSKL SHAR LL +L+Q+L VKS +PYH CQLAF+ LS LK+SKF+PNV
Sbjct: 72  QLYSAIIHVLVGSKLLSHARYLLNDLVQNL-VKSHKPYHACQLAFSELSRLKSSKFTPNV 131

Query: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180
           Y ELIIVL KM LV+EAL MY KVG A+  QACNVLL VLVKTGRFELLW IYEEM+SNG
Sbjct: 132 YGELIIVLCKMELVEEALSMYHKVGAALTIQACNVLLYVLVKTGRFELLWRIYEEMISNG 191

Query: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDG CRQGDLLRA E+FDEMRVKGI PTV+VYTILIRGLCS+NK+EEAE
Sbjct: 192 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVIVYTILIRGLCSDNKIEEAE 251

Query: 241 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGL 300
           S+HR MRE+GV PNVYTYNTLM+G+CK+AN KQALRLY +MLGE LVPD VTFGILIDGL
Sbjct: 252 SMHRAMREVGVYPNVYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 311

Query: 301 CKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360
           CKFG++KAARNL VNM+KFSVTP+IAVYNSLID YCK GD+SEAMA   ELERF+VSPDV
Sbjct: 312 CKFGEMKAARNLFVNMIKFSVTPNIAVYNSLIDAYCKVGDVSEAMALFLELERFEVSPDV 371

Query: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420
            TYSILIRGLCS  R EEA N+ EKM KEGI ANSVTYNSLIDGCCKEG M+KALEICS+
Sbjct: 372 FTYSILIRGLCSVSRTEEAGNIFEKMTKEGILANSVTYNSLIDGCCKEGKMDKALEICSQ 431

Query: 421 MIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 480
           M ENGVEPNVITFS LIDGYCKIRN++AAMGIYSEM IKSLSPDVV YTAMIDGHCK+GS
Sbjct: 432 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 491

Query: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 540
           MKEALKLY+DMLDNG+TPN YT+SCLLDGLCKDG++SDALELFTEK EF T +C      
Sbjct: 492 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGKISDALELFTEKIEFQTPRCNVDAGG 551

Query: 541 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600
            K S TNHV YTALIHGLC+DGQ  KA KLFSDMR YGLQPDEVIYVVML+G FQVK IL
Sbjct: 552 SKPSLTNHVAYTALIHGLCQDGQFSKAVKLFSDMRRYGLQPDEVIYVVMLRGLFQVKYIL 611

Query: 601 DMTMLHADMLKF 606
              MLHADMLKF
Sbjct: 612 --MMLHADMLKF 620

BLAST of Cp4.1LG01g24300 vs. TrEMBL
Match: A5AF05_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031722 PE=4 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 9.9e-196
Identity = 346/622 (55.63%), Positives = 458/622 (73.63%), Query Frame = 1

Query: 2   LMNQLPLKSVLVHIGRHGSILQAVALSS-------STPDNLITTVLNCKSPKKALELFNA 61
           ++   P KS  ++  +H S +     SS       S+P +L  ++L C++  +ALELF++
Sbjct: 1   MLKSFPPKSRRIY-AKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHS 60

Query: 62  APE-----KNTRLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALS 121
                   KN +LYSAIIHVL G+KL++ ARCL+++LIQ  L KSRR   +C   FN LS
Sbjct: 61  VSRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQ-CLQKSRRS-RICCSVFNVLS 120

Query: 122 SLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELL 181
            L++SKF+PNV+  LII  S+MGLV+EALW+Y K+ V  A QACN++LD LVK GRF+ +
Sbjct: 121 RLESSKFTPNVFGVLIIAFSEMGLVEEALWVYYKMDVLPAMQACNMVLDGLVKKGRFDTM 180

Query: 182 WGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRG 241
           W +Y +MV+ G SP+V+TYG LIDG CRQGD L+A  +FDEM  K I PTVV+YTILIRG
Sbjct: 181 WKVYGDMVARGASPNVVTYGTLIDGCCRQGDFLKAFRLFDEMIEKKIFPTVVIYTILIRG 240

Query: 242 LCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPD 301
           LC E+++ EAES+ R MR  G+LPN+YTYNT+M+G+CK+A+VK+AL LY  MLG+ L+P+
Sbjct: 241 LCGESRISEAESMFRTMRNSGMLPNLYTYNTMMDGYCKIAHVKKALELYXEMLGDGLLPN 300

Query: 302 NVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLS 361
            VTFGILIDGLCK  ++ +AR   ++M  F V P+I VYN LIDGYCKAG++SEA++  S
Sbjct: 301 VVTFGILIDGLCKTDEMVSARKFLIDMASFGVVPNIFVYNCLIDGYCKAGNLSEALSLHS 360

Query: 362 ELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEG 421
           E+E+ ++ PDV TYSILI+GLC   R+EEAD +L++M K+G   N+VTYN+LIDG CKEG
Sbjct: 361 EIEKHEILPDVFTYSILIKGLCGVDRMEEADGLLQEMKKKGFLPNAVTYNTLIDGYCKEG 420

Query: 422 NMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYT 481
           NM KA+E+CS+M E G+EPN+ITFS LIDGYCK   +EAAMG+Y+EM IK L PDVVAYT
Sbjct: 421 NMEKAIEVCSQMTEKGIEPNIITFSTLIDGYCKAGKMEAAMGLYTEMVIKGLLPDVVAYT 480

Query: 482 AMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEF 541
           A+IDGH K G+ KEA +L+ +M + GL PN +TLSCL+DGLCKDGR+SDA++LF  K   
Sbjct: 481 ALIDGHFKDGNTKEAFRLHKEMQEAGLHPNVFTLSCLIDGLCKDGRISDAIKLFLAKTGT 540

Query: 542 GTTKCK-------LSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVM 601
            TT  K       L   NHV+YTALI GLC DG+IFKA+K FSDMR  GL+PD    +V+
Sbjct: 541 DTTGSKTNELDRSLCSPNHVMYTALIQGLCTDGRIFKASKFFSDMRCSGLRPDVFTCIVI 600

Query: 602 LKGYFQVKRILDMTMLHADMLK 605
           ++G+F+   + D+ ML AD+LK
Sbjct: 601 IQGHFRAMHLRDVMMLQADILK 619

BLAST of Cp4.1LG01g24300 vs. TrEMBL
Match: V4TQ94_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024595mg PE=4 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 7.4e-191
Identity = 336/584 (57.53%), Positives = 441/584 (75.51%), Query Frame = 1

Query: 28  SSSTP--DNLITTVLNCKSPKKALELFNAA-----PEKNTRLYSAIIHVLVGSKLFSHAR 87
           SSS P   NL   +LN K+P +AL LFN++     P K+   ++AI +VL  +KL+ +AR
Sbjct: 48  SSSLPPRSNLTNAILNSKTPNQALVLFNSSSKKLNPTKSLAPFAAIFYVLANAKLYKNAR 107

Query: 88  CLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWM 147
           CL+K++ ++LL KSR+P+HVC   FNAL+SL+  KF+P+V+S LII  S+MG ++EALW+
Sbjct: 108 CLIKDVTENLL-KSRKPHHVCYSVFNALNSLEIPKFNPSVFSTLIIAFSEMGHIEEALWV 167

Query: 148 YRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGD 207
           YRK+ V  A QACN LL+ L+K G+F+ +W  YEEMV  GL  DV+TYG+LID  C QGD
Sbjct: 168 YRKIEVLPAIQACNALLNGLIKKGKFDSVWEFYEEMVLCGLVADVVTYGVLIDCCCGQGD 227

Query: 208 LLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNT 267
           +++A  +FDEM  KGIEPTVV+YTILI GLC+ENKM EAES+ R MRE GV+PN+YTYN 
Sbjct: 228 VMKALNLFDEMIDKGIEPTVVIYTILIHGLCNENKMVEAESMFRSMRECGVVPNLYTYNA 287

Query: 268 LMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFS 327
           LM+G+CKVA+V +AL  YH ML  +L P+ VTFG+L+DGLCK G+++AA N  V+M KF 
Sbjct: 288 LMDGYCKVADVNRALEFYHEMLHHNLQPNVVTFGVLMDGLCKVGELRAAGNFFVHMAKFG 347

Query: 328 VTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEAD 387
           V P+I VYN LIDG+CKAG++ EAM+  SE+E+F++SPDV TY+ILI+GLC  G++E A+
Sbjct: 348 VFPNIFVYNCLIDGHCKAGNLFEAMSLCSEMEKFEISPDVFTYNILIKGLCGVGQLEGAE 407

Query: 388 NMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGY 447
            +L+KM KEGI AN VTYNSLIDG CKEG+M KAL +CS+M E GVEPNV+TFS LIDG 
Sbjct: 408 GLLQKMYKEGILANVVTYNSLIDGYCKEGDMEKALSVCSQMTEKGVEPNVVTFSSLIDGQ 467

Query: 448 CKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNS 507
           CK  N++AAMG+Y+EM IKSL PDVV +TA+IDG  K G+MKE L+LY +ML+  +TP+ 
Sbjct: 468 CKAGNIDAAMGLYTEMVIKSLVPDVVVFTALIDGLSKDGNMKETLRLYKEMLEAKITPSV 527

Query: 508 YTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKA 567
           +T+S L+ GL K+GR+S+AL  F EK +   T       NHV+Y A+I  LC DGQI KA
Sbjct: 528 FTVSSLIHGLFKNGRISNALNFFLEKTD--KTDGGYCSPNHVLYAAIIQALCYDGQILKA 587

Query: 568 AKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           +KLFSDMRS  L+PD   Y  ML+G  + KR+LD+ ML ADM+K
Sbjct: 588 SKLFSDMRSDNLRPDNCTYTTMLRGLLRAKRMLDVMMLLADMIK 628

BLAST of Cp4.1LG01g24300 vs. TrEMBL
Match: A0A061G4F9_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_014098 PE=4 SV=1)

HSP 1 Score: 674.1 bits (1738), Expect = 1.6e-190
Identity = 332/586 (56.66%), Positives = 436/586 (74.40%), Query Frame = 1

Query: 34  NLITTVLNCKSPKKALELFNAA-----PEKNTRLYSAIIHVLVGSKLFSHARCLLKELIQ 93
           NL   +LN ++P +AL LFN+      P KN   YSAIIHVL G+KL++ ARCL+K LI+
Sbjct: 35  NLTKAILNSQTPHQALNLFNSNIKLINPSKNLEPYSAIIHVLTGAKLYTDARCLIKYLIK 94

Query: 94  DLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVGVAV 153
            L   S +P   C L FNALS L+TSKF+PNV+  LII  S+MGL++EALW+YRK+    
Sbjct: 95  TLQ-SSLKPRRACHLIFNALSKLQTSKFTPNVFGSLIIAFSEMGLIEEALWVYRKIRTFP 154

Query: 154 ARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIF 213
             QACN LLD LVK GRF+ +W +Y +++S G  P+V+TYG+LI+G C QGD  +A E+F
Sbjct: 155 PMQACNSLLDGLVKMGRFDSMWDVYYDLLSRGFLPNVVTYGVLINGCCCQGDASKARELF 214

Query: 214 DEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKV 273
            E+ +KGI+P VV++T +I+ LCSE +M EAE + RL+++L  LPN+YT+N LMNG+CK+
Sbjct: 215 HELLMKGIQPNVVIFTTVIKILCSEGQMLEAECMFRLIKDLYFLPNLYTFNVLMNGYCKM 274

Query: 274 ANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVY 333
            NV++A  +Y  M+G+ L P+ VTFGILIDGLCK G +  ARN  V MVK+ V P++ VY
Sbjct: 275 DNVERAFEIYWMMIGDGLRPNVVTFGILIDGLCKMGALVVARNYFVCMVKYGVFPNVFVY 334

Query: 334 NSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMK 393
           N LIDGYCKAG++SEA+   SE+E+ K+ PDV TYSILI+GLCS GR+EE   +L+KM+K
Sbjct: 335 NCLIDGYCKAGNVSEAVELSSEMEKLKILPDVFTYSILIKGLCSVGRVEEGSFLLQKMIK 394

Query: 394 EGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEA 453
           +G+ ANSVTYNSLIDG C+ GNM KALEICS+M E GVEPNVITFS LIDGYCK  N++A
Sbjct: 395 DGVLANSVTYNSLIDGYCRVGNMEKALEICSQMTEKGVEPNVITFSTLIDGYCKAGNMQA 454

Query: 454 AMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLD 513
           AMG YSEM IKS+ PDVVAYTA+I+G CK+G++KEAL+L+  ML +GLTPN++TLSCL+D
Sbjct: 455 AMGFYSEMVIKSIVPDVVAYTALINGCCKNGNVKEALRLHKVMLGSGLTPNAFTLSCLVD 514

Query: 514 GLCKDGRVSDALELFTEKAEFGTTK----------CKLSFTNHVVYTALIHGLCEDGQIF 573
           GLCKDG V +A  +F EK   G ++          C  +   +++YT LI  LC+DGQIF
Sbjct: 515 GLCKDGIVFEAFSVFLEKTRAGISENGINEMDGLFCLPNHVMYMIYTTLIQALCKDGQIF 574

Query: 574 KAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           KA K+FSD+R   L  D   Y+VML+G+FQ K ++D+ MLHADM+K
Sbjct: 575 KANKIFSDIRCIDLIADVPSYIVMLEGHFQAKNMIDVMMLHADMIK 619

BLAST of Cp4.1LG01g24300 vs. TrEMBL
Match: A0A067K4Z7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11657 PE=4 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 6.2e-190
Identity = 329/592 (55.57%), Positives = 441/592 (74.49%), Query Frame = 1

Query: 28  SSSTPDNLITTVLNCKSPKKALELFNAA-------PEKNTRLYSAIIHVLVGSKLFSHAR 87
           SS +  +L T +L+ ++P++AL+ F          P KN  LYSA+IHVL  +++++ AR
Sbjct: 26  SSRSSSDLTTAILDSETPEQALQFFTNVLNQNPKNPTKNLHLYSAVIHVLTSARIYTTAR 85

Query: 88  CLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWM 147
           CL K+LIQ LL +SR+PY +  L FNAL+ L+  KFSPNV+  LII  S++GL+DEAL +
Sbjct: 86  CLTKDLIQTLL-QSRKPYRISSLVFNALNQLQGPKFSPNVFGVLIIAFSELGLLDEALSV 145

Query: 148 YRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGD 207
           YRK G+  A QACN LL+ LVK G F+ LW +Y++MVS GL P V+TY +L+D  C QGD
Sbjct: 146 YRKTGIFPAVQACNALLNGLVKKGSFDSLWELYKDMVSRGLVPSVVTYNVLVDACCSQGD 205

Query: 208 LLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNT 267
           + +A  + +EM  KGIEPTVV+Y+ L+RGLCSE+K+ EA+ + R M+E GVLPN+YTYN 
Sbjct: 206 IWKAKSLINEMEKKGIEPTVVIYSTLMRGLCSESKLTEAQDMLRQMKESGVLPNLYTYNV 265

Query: 268 LMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFS 327
           LM+G+CK+A +KQ L L+ ++L + L P+ VTFGIL+D LCK G + AARNL V M K  
Sbjct: 266 LMDGYCKIAKIKQVLDLFQDLLNDGLQPNVVTFGILVDALCKVGKLLAARNLFVQMAKLG 325

Query: 328 VTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEAD 387
           V P++ VYNSLI+GY KAG++ +AM  L E+E+FK+ PDV TYSILI+ +CS   ++EAD
Sbjct: 326 VVPNVLVYNSLINGYSKAGNLPKAMDLLLEMEKFKIVPDVFTYSILIKSVCSLSTVKEAD 385

Query: 388 NMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGY 447
            +L+KM KEG+PANSV YNS+IDG CK+GNM KALE+C+ M + GVEPNVITFS LIDGY
Sbjct: 386 RILKKMEKEGVPANSVIYNSMIDGYCKKGNMEKALEVCAEMTKKGVEPNVITFSTLIDGY 445

Query: 448 CKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDM-LDNGLTPN 507
           CK  N+++AMG+YSEM IKSL PDVVA+TA+IDGHCK G+MKEAL+LY  M  D GL+PN
Sbjct: 446 CKEGNMQSAMGLYSEMLIKSLVPDVVAFTALIDGHCKSGNMKEALRLYKHMQQDAGLSPN 505

Query: 508 SYTLSCLLDGLCKDGRVSDALELFTEKA-------EFGTTKCKLSFTNHVVYTALIHGLC 567
            +T S L+DGLCK GRVSDAL+LF +K        +   T  +L   N+V+YT+LI  LC
Sbjct: 506 VFTFSSLIDGLCKAGRVSDALKLFLDKTRGYCSRNKINGTDSRLYSPNYVIYTSLIQALC 565

Query: 568 EDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           ++GQ+FKA+KLF DMR   L+PD + Y V+L+G+  VK ++D+ +LHADM+K
Sbjct: 566 KEGQMFKASKLFFDMRCNDLRPDALAYTVILQGHLNVKHVIDVMILHADMIK 616

BLAST of Cp4.1LG01g24300 vs. TAIR10
Match: AT5G61400.1 (AT5G61400.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 539.7 bits (1389), Expect = 2.4e-153
Identity = 274/583 (47.00%), Positives = 390/583 (66.90%), Query Frame = 1

Query: 28  SSSTPDNLITTVLNCKSPKKALELFNAAPEKNT------RLYSAIIHVLVGSKLFSHARC 87
           SS +  +L   +L C+S ++A +LF  +           + +SA+IHVL G+  ++ ARC
Sbjct: 37  SSFSSSSLAEAILKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLARC 96

Query: 88  LLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMY 147
           L+K LI+ L   S  P ++    FNAL  +++ KFS  V+S LI+   +MGL +EALW+ 
Sbjct: 97  LIKSLIERLKRHSE-PSNMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEEALWVS 156

Query: 148 RKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDL 207
           R++  +   +AC  +L+ LV+  RF+ +W  Y+ M+S GL PDV  Y +L     +QG  
Sbjct: 157 REMKCSPDSKACLSILNGLVRRRRFDSVWVDYQLMISRGLVPDVHIYFVLFQCCFKQGLY 216

Query: 208 LRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTL 267
            +  ++ DEM   GI+P V +YTI I  LC +NKMEEAE +  LM++ GVLPN+YTY+ +
Sbjct: 217 SKKEKLLDEMTSLGIKPNVYIYTIYILDLCRDNKMEEAEKMFELMKKHGVLPNLYTYSAM 276

Query: 268 MNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSV 327
           ++G+CK  NV+QA  LY  +L  +L+P+ V FG L+DG CK  ++  AR+L V+MVKF V
Sbjct: 277 IDGYCKTGNVRQAYGLYKEILVAELLPNVVVFGTLVDGFCKARELVTARSLFVHMVKFGV 336

Query: 328 TPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADN 387
            P++ VYN LI G+CK+G++ EA+  LSE+E   +SPDV TY+ILI GLC   ++ EA+ 
Sbjct: 337 DPNLYVYNCLIHGHCKSGNMLEAVGLLSEMESLNLSPDVFTYTILINGLCIEDQVAEANR 396

Query: 388 MLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYC 447
           + +KM  E I  +S TYNSLI G CKE NM +AL++CS M  +GVEPN+ITFS LIDGYC
Sbjct: 397 LFQKMKNERIFPSSATYNSLIHGYCKEYNMEQALDLCSEMTASGVEPNIITFSTLIDGYC 456

Query: 448 KIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSY 507
            +R+++AAMG+Y EM IK + PDVV YTA+ID H K  +MKEAL+LY+DML+ G+ PN +
Sbjct: 457 NVRDIKAAMGLYFEMTIKGIVPDVVTYTALIDAHFKEANMKEALRLYSDMLEAGIHPNDH 516

Query: 508 TLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAA 567
           T +CL+DG  K+GR+S A++ + E  +      + S  NHV +T LI GLC++G I +A+
Sbjct: 517 TFACLVDGFWKEGRLSVAIDFYQENNQ------QRSCWNHVGFTCLIEGLCQNGYILRAS 576

Query: 568 KLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           + FSDMRS G+ PD   YV MLKG+ Q KRI D  ML  DM+K
Sbjct: 577 RFFSDMRSCGITPDICSYVSMLKGHLQEKRITDTMMLQCDMIK 612
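
Each HSP header in these reports gives identities and positives as counts over the alignment length, followed by the percentage. The arithmetic can be checked with a small parser. This is a minimal sketch, not part of the database's own tooling: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the header layout shown on this page (it accepts either spelling of "Positives").

```python
import re

# Pattern for lines such as:
#   Identity = 274/583 (47.00%), Positives = 390/583 (66.90%), Query Frame = 1
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP header and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, align_len, _, positives, pos_len, _ = m.groups()
    identities, align_len = int(identities), int(align_len)
    positives, pos_len = int(positives), int(pos_len)
    return {
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Percentages are recomputed from the counts, rounded to 2 places.
        "identity_pct": round(100.0 * identities / align_len, 2),
        "positive_pct": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    header = "Identity = 274/583 (47.00%), Positives = 390/583 (66.90%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    print(stats["identity_pct"], stats["positive_pct"])
```

For the AT5G61400.1 hit, 274/583 and 390/583 recompute to the 47.00% and 66.90% printed in its header.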

BLAST of Cp4.1LG01g24300 vs. TAIR10
Match: AT2G02150.1 (AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 298.9 bits (764), Expect = 7.3e-81
Identity = 176/577 (30.50%), Positives = 295/577 (51.13%), Query Frame = 1

Query: 43  KSPKKALELFNAAPEKN-----TRLYSAIIHVLVGSKLFSHARCLLKELIQ--------D 102
           + PK A + F  +  +N        Y  + H+L  ++++  A  +LKE++         D
Sbjct: 120 EDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVLSKADCDVFD 179

Query: 103 LLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVG---V 162
           +L  +R   +VC   F              V+  L  VL  +G+++EA+  + K+    V
Sbjct: 180 VLWSTR---NVCVPGFG-------------VFDALFSVLIDLGMLEEAIQCFSKMKRFRV 239

Query: 163 AVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHE 222
               ++CN LL    K G+ + +   +++M+  G  P V TY I+ID  C++GD+  A  
Sbjct: 240 FPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARG 299

Query: 223 IFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHC 282
           +F+EM+ +G+ P  V Y  +I G     ++++       M+++   P+V TYN L+N  C
Sbjct: 300 LFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFC 359

Query: 283 KVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIA 342
           K   +   L  Y  M G  L P+ V++  L+D  CK G ++ A    V+M +  + P+  
Sbjct: 360 KFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEY 419

Query: 343 VYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKM 402
            Y SLID  CK G++S+A    +E+ +  V  +VVTY+ LI GLC A R++EA+ +  KM
Sbjct: 420 TYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKM 479

Query: 403 MKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNV 462
              G+  N  +YN+LI G  K  NM++ALE+ + +   G++P+++ +   I G C +  +
Sbjct: 480 DTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKI 539

Query: 463 EAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCL 522
           EAA  + +EM    +  + + YT ++D + K G+  E L L ++M +  +     T   L
Sbjct: 540 EAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVL 599

Query: 523 LDGLCKDGRVSDALELFTE-KAEFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAAKLFS 582
           +DGLCK+  VS A++ F     +FG         N  ++TA+I GLC+D Q+  A  LF 
Sbjct: 600 IDGLCKNKLVSKAVDYFNRISNDFGLQ------ANAAIFTAMIDGLCKDNQVEAATTLFE 659

Query: 583 DMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADM 603
            M   GL PD   Y  ++ G F+   +L+   L   M
Sbjct: 660 QMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKM 674

BLAST of Cp4.1LG01g24300 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 285.8 bits (730), Expect = 6.4e-77
Identity = 167/485 (34.43%), Positives = 256/485 (52.78%), Query Frame = 1

Query: 121 YSELIIVLSKMGLVDEALWMYRKV---GVAVARQACNVLLDVLVKTGRFELLWGIYEEMV 180
           Y+ L+  L++ GLVDE   +Y ++    V       N +++   K G  E       ++V
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIV 245

Query: 181 SNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKME 240
             GL PD  TY  LI G C++ DL  A ++F+EM +KG     V YT LI GLC   +++
Sbjct: 246 EAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRID 305

Query: 241 EAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILI 300
           EA  +   M++    P V TY  L+   C      +AL L   M    + P+  T+ +LI
Sbjct: 306 EAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLI 365

Query: 301 DGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVS 360
           D LC     + AR L   M++  + P++  YN+LI+GYCK G I +A+  +  +E  K+S
Sbjct: 366 DSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLS 425

Query: 361 PDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEI 420
           P+  TY+ LI+G C +  + +A  +L KM++  +  + VTYNSLIDG C+ GN + A  +
Sbjct: 426 PNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRL 485

Query: 421 CSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCK 480
            S M + G+ P+  T++ +ID  CK + VE A  ++  +  K ++P+VV YTA+IDG+CK
Sbjct: 486 LSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCK 545

Query: 481 HGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLS 540
            G + EA  +   ML     PNS T + L+ GLC DG++ +A  L  +  + G      +
Sbjct: 546 AGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVST 605

Query: 541 FTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTM 600
                  T LIH L +DG    A   F  M S G +PD   Y   ++ Y +  R+LD   
Sbjct: 606 ------DTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAED 663

Query: 601 LHADM 603
           + A M
Sbjct: 666 MMAKM 663

BLAST of Cp4.1LG01g24300 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 284.3 bits (726), Expect = 1.8e-76
Identity = 149/460 (32.39%), Positives = 254/460 (55.22%), Query Frame = 1

Query: 154 NVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRV 213
           N+L+      G  ++   ++++M + G  P+V+TY  LIDG C+   +    ++   M +
Sbjct: 209 NILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMAL 268

Query: 214 KGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQ 273
           KG+EP ++ Y ++I GLC E +M+E   +   M   G   +  TYNTL+ G+CK  N  Q
Sbjct: 269 KGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQ 328

Query: 274 ALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLID 333
           AL ++  ML   L P  +T+  LI  +CK G++  A      M    + P+   Y +L+D
Sbjct: 329 ALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVD 388

Query: 334 GYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPA 393
           G+ + G ++EA   L E+     SP VVTY+ LI G C  G++E+A  +LE M ++G+  
Sbjct: 389 GFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSP 448

Query: 394 NSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIY 453
           + V+Y++++ G C+  ++++AL +   M+E G++P+ IT+S LI G+C+ R  + A  +Y
Sbjct: 449 DVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLY 508

Query: 454 SEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKD 513
            EM    L PD   YTA+I+ +C  G +++AL+L+N+M++ G+ P+  T S L++GL K 
Sbjct: 509 EEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQ 568

Query: 514 GRVSDA----LELFTEKA-----EFGTTKCKLSFTNHVVYTALIHGLCEDGQIFKAAKLF 573
            R  +A    L+LF E++      + T     S        +LI G C  G + +A ++F
Sbjct: 569 SRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMKGMMTEADQVF 628

Query: 574 SDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
             M     +PD   Y +M+ G+ +   I     L+ +M+K
Sbjct: 629 ESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVK 668

BLAST of Cp4.1LG01g24300 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 283.9 bits (725), Expect = 2.4e-76
Identity = 185/597 (30.99%), Positives = 294/597 (49.25%), Query Frame = 1

Query: 25  VALSSSTPDNLITTVL-------NCKSPKKALELFNAAP-----EKNTRLYSAIIHVLVG 84
           +ALSS      + TV            PK  L  FN        + +T  +  +IH LV 
Sbjct: 57  IALSSELVSRRLKTVHVEEILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVK 116

Query: 85  SKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELII--VLSK 144
           + LF  A  LL    Q LL+++ +P  V  + F+     K S  S     +L+I   +  
Sbjct: 117 ANLFWPASSLL----QTLLLRALKPSDVFNVLFSCYEKCKLSSSSS---FDLLIQHYVRS 176

Query: 145 MGLVDEAL---WMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVIT 204
             ++D  L    M  KV +    +  + LL  LVK   F L   ++ +MVS G+ PDV  
Sbjct: 177 RRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYI 236

Query: 205 YGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMR 264
           Y  +I   C   DL RA E+   M   G +  +V Y +LI GLC + K+ EA  I + + 
Sbjct: 237 YTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLA 296

Query: 265 ELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIK 324
              + P+V TY TL+ G CKV   +  L +   ML     P       L++GL K G I+
Sbjct: 297 GKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIE 356

Query: 325 AARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILI 384
            A NL   +V F V+P++ VYN+LID  CK     EA      + +  + P+ VTYSILI
Sbjct: 357 EALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILI 416

Query: 385 RGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVE 444
              C  G+++ A + L +M+  G+  +   YNSLI+G CK G+++ A    + MI   +E
Sbjct: 417 DMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLE 476

Query: 445 PNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKL 504
           P V+T++ L+ GYC    +  A+ +Y EM  K ++P +  +T ++ G  + G +++A+KL
Sbjct: 477 PTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKL 536

Query: 505 YNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKCKLSFTNHVVYTAL 564
           +N+M +  + PN  T + +++G C++G +S A E   E  E G      S      Y  L
Sbjct: 537 FNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYS------YRPL 596

Query: 565 IHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           IHGLC  GQ  +A      +     + +E+ Y  +L G+ +  ++ +   +  +M++
Sbjct: 597 IHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQ 640

BLAST of Cp4.1LG01g24300 vs. NCBI nr
Match: gi|778703158|ref|XP_011655325.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativus])

HSP 1 Score: 958.4 bits (2476), Expect = 6.2e-276
Identity = 484/612 (79.08%), Positives = 532/612 (86.93%), Query Frame = 1

Query: 1   MLMNQLPLKSVLVHIGRHGSILQAVALSSSTPDNLITTVLNCKSPKKALELFNAAPEKNT 60
           MLM Q PLKSVLV IG +G++LQ V+LSS TPD+LITTVLNC+SP KALE FNAAPEKN 
Sbjct: 12  MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTPDSLITTVLNCRSPWKALEFFNAAPEKNI 71

Query: 61  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120
           +LYSAIIHVLVGSKL SHAR LL +L+Q+L VKS +PYH CQLAF+ LS LK+SKF+PNV
Sbjct: 72  QLYSAIIHVLVGSKLLSHARYLLNDLVQNL-VKSHKPYHACQLAFSELSRLKSSKFTPNV 131

Query: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180
           Y ELIIVL KM LV+EAL MY KVG A+  QACNVLL VLVKTGRFELLW IYEEM+SNG
Sbjct: 132 YGELIIVLCKMELVEEALSMYHKVGAALTIQACNVLLYVLVKTGRFELLWRIYEEMISNG 191

Query: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDG CRQGDLLRA E+FDEMRVKGI PTV+VYTILIRGLCS+NK+EEAE
Sbjct: 192 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVIVYTILIRGLCSDNKIEEAE 251

Query: 241 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGL 300
           S+HR MRE+GV PNVYTYNTLM+G+CK+AN KQALRLY +MLGE LVPD VTFGILIDGL
Sbjct: 252 SMHRAMREVGVYPNVYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 311

Query: 301 CKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360
           CKFG++KAARNL VNM+KFSVTP+IAVYNSLID YCK GD+SEAMA   ELERF+VSPDV
Sbjct: 312 CKFGEMKAARNLFVNMIKFSVTPNIAVYNSLIDAYCKVGDVSEAMALFLELERFEVSPDV 371

Query: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420
            TYSILIRGLCS  R EEA N+ EKM KEGI ANSVTYNSLIDGCCKEG M+KALEICS+
Sbjct: 372 FTYSILIRGLCSVSRTEEAGNIFEKMTKEGILANSVTYNSLIDGCCKEGKMDKALEICSQ 431

Query: 421 MIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 480
           M ENGVEPNVITFS LIDGYCKIRN++AAMGIYSEM IKSLSPDVV YTAMIDGHCK+GS
Sbjct: 432 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 491

Query: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 540
           MKEALKLY+DMLDNG+TPN YT+SCLLDGLCKDG++SDALELFTEK EF T +C      
Sbjct: 492 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGKISDALELFTEKIEFQTPRCNVDAGG 551

Query: 541 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600
            K S TNHV YTALIHGLC+DGQ  KA KLFSDMR YGLQPDEVIYVVML+G FQVK IL
Sbjct: 552 SKPSLTNHVAYTALIHGLCQDGQFSKAVKLFSDMRRYGLQPDEVIYVVMLRGLFQVKYIL 611

Query: 601 DMTMLHADMLKF 606
              MLHADMLKF
Sbjct: 612 --MMLHADMLKF 620

BLAST of Cp4.1LG01g24300 vs. NCBI nr
Match: gi|659127196|ref|XP_008463574.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Cucumis melo])

HSP 1 Score: 946.4 bits (2445), Expect = 2.5e-272
Identity = 478/612 (78.10%), Positives = 527/612 (86.11%), Query Frame = 1

Query: 1   MLMNQLPLKSVLVHIGRHGSILQAVALSSSTPDNLITTVLNCKSPKKALELFNAAPEKNT 60
           MLM Q PLKSVLV IG +G++LQ V+LSS T D+L+TTVLNC+SP+KALE FNAAPEK  
Sbjct: 1   MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTSDSLLTTVLNCRSPRKALEFFNAAPEKTI 60

Query: 61  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120
           +LYSAIIHVLVGS+L SHAR LLK+L+Q+L VKS +PYH CQL F+ LS LK+SKFSPNV
Sbjct: 61  QLYSAIIHVLVGSELLSHARYLLKDLVQNL-VKSHKPYHACQLVFSELSRLKSSKFSPNV 120

Query: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180
           Y ELIIVL KM LV+EAL MY KVG  +  QACNVLL+VLVKTGRFELLW IYEEM+SNG
Sbjct: 121 YGELIIVLCKMELVEEALSMYHKVGATLTIQACNVLLNVLVKTGRFELLWRIYEEMISNG 180

Query: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDG CRQGDLLRA E+FDEMRVKGI PTVVVYTILIRGLCS++KMEEAE
Sbjct: 181 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVVVYTILIRGLCSDSKMEEAE 240

Query: 241 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGL 300
           S+HR MRE+GV PN+YTYNTLM+G+CK+AN KQALRLY +MLGE LVPD VTFGILIDGL
Sbjct: 241 SMHRAMREVGVYPNLYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360
           CKFG++KAARNL VNM+KF VTP+I VYNSLID YCK GD+SEAMAF  ELER+KVSPDV
Sbjct: 301 CKFGEMKAARNLFVNMIKFCVTPNINVYNSLIDAYCKVGDVSEAMAFFLELERYKVSPDV 360

Query: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420
            TYSILIRGLCS  R EEA N+ EKM KEGI ANSVTYNSLIDG CKEG M KALEICS+
Sbjct: 361 FTYSILIRGLCSVTRTEEAGNIFEKMTKEGILANSVTYNSLIDGYCKEGKMEKALEICSQ 420

Query: 421 MIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 480
           M ENGVEPNVITFS LIDGYCKIRN++AAMGIYSEM IKSLSPDVV YTAMIDGHCK+GS
Sbjct: 421 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 480

Query: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 540
           MKEALKLY+DMLDNG+TPN YT+SCLLDGLCKDGR+SDAL LFTEK EF T +C      
Sbjct: 481 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGRISDALRLFTEKIEFQTPRCNVDAGG 540

Query: 541 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600
            K S TNHV YTALIHGLC+DGQ FKA KLFSDMR YGLQPDEVIYVVML+G  QVK IL
Sbjct: 541 SKPSLTNHVAYTALIHGLCQDGQFFKAVKLFSDMRRYGLQPDEVIYVVMLQGLLQVKHIL 600

Query: 601 DMTMLHADMLKF 606
              MLHADMLKF
Sbjct: 601 --MMLHADMLKF 609

BLAST of Cp4.1LG01g24300 vs. NCBI nr
Match: gi|1009132528|ref|XP_015883421.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Ziziphus jujuba])

HSP 1 Score: 717.2 bits (1850), Expect = 2.4e-203
Identity = 355/597 (59.46%), Positives = 453/597 (75.88%), Query Frame = 1

Query: 20  SILQAVALSSSTPDNLITTVLNCKSPKKALELFNAA-----PEKNTRLYSAIIHVLVGSK 79
           ++ + V+ SSS+ ++L  T+LNCK+P++ALE FN A     P KN +LYSAI+H LVG+K
Sbjct: 19  TLSKPVSSSSSSSNDLTNTILNCKTPRQALESFNFAINQIGPRKNPQLYSAIVHFLVGAK 78

Query: 80  LFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNVYSELIIVLSKMGLV 139
           L+  AR LLK+LI +L  K  +P   C L FNALS L++S+F+PNV+  LII LS+MGLV
Sbjct: 79  LYCKARYLLKDLILELQ-KFCKPRRACHLTFNALSRLESSRFTPNVFGSLIIALSEMGLV 138

Query: 140 DEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNGLSPDVITYGILIDG 199
           DE LW+Y K+G   A QACN LL  LV+  RF+ +W +Y EM S G SP+V++YG+LID 
Sbjct: 139 DEGLWVYHKIGALPAIQACNALLGGLVEVARFDSMWELYREMGSRGFSPNVVSYGVLIDC 198

Query: 200 RCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAESIHRLMRELGVLPN 259
            C++GD+L A E+FDEM  KGI PTVV+YT LI GLCS++KM EAES+   MRE GVLPN
Sbjct: 199 CCKKGDVLHARELFDEMGDKGIYPTVVIYTTLIHGLCSKSKMVEAESMFEAMREAGVLPN 258

Query: 260 VYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGLCKFGDIKAARNLSV 319
           +YTYN+L++G+CK+AN+KQAL LY NML + + P+ VTFGIL+DGLCK      ARN   
Sbjct: 259 LYTYNSLIDGYCKLANIKQALALYRNMLDDGVRPNVVTFGILVDGLCKVNIFTTARNFFA 318

Query: 320 NMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDVVTYSILIRGLCSAG 379
           +M KF V P+I VYN LIDG+CKA  + EAM F  E+E+  + PDV TY+ILI+GLC  G
Sbjct: 319 SMAKFGVRPNIFVYNCLIDGHCKAEKLYEAMEFYLEMEKHGIPPDVFTYNILIKGLCVVG 378

Query: 380 RIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSRMIENGVEPNVITFS 439
           R+EEA+ +L+KM +EG+ ANSVTYNSLIDG CKEGN+ KALE+CS+M ENGVEPNVITFS
Sbjct: 379 RVEEANGLLQKMNEEGVIANSVTYNSLIDGYCKEGNLEKALEVCSQMTENGVEPNVITFS 438

Query: 440 MLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGSMKEALKLYNDMLDN 499
            LIDGYCK  N+ AAMG+YSEM IK L PDVVA+TA+IDGHCK+ +MKEAL+L  +ML+ 
Sbjct: 439 TLIDGYCKTGNMNAAMGMYSEMVIKGLLPDVVAFTALIDGHCKNNNMKEALRLQKEMLEV 498

Query: 500 GLTPNSYTLSCLLDGLCKDGRVSDALELFTEK-------AEFGTTKCKLSFTNHVVYTAL 559
           GLTPN  T+SCL+DGL KDGR SDA++LF EK       +E   + C   F +HV+YTA+
Sbjct: 499 GLTPNLLTVSCLIDGLFKDGRTSDAIKLFLEKTRSNPLISEGSKSDCCFCFPDHVLYTAV 558

Query: 560 IHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRILDMTMLHADMLK 605
           I GLC+DGQIFKA K FSDMR YGL+PD + Y+V+LKG FQ K  L++ +LHADM+K
Sbjct: 559 IQGLCKDGQIFKATKFFSDMRCYGLRPDVLTYIVILKGQFQAKHKLNVMLLHADMIK 614

BLAST of Cp4.1LG01g24300 vs. NCBI nr
Match: gi|147817754|emb|CAN66662.1| (hypothetical protein VITISV_031722 [Vitis vinifera])

HSP 1 Score: 691.4 bits (1783), Expect = 1.4e-195
Identity = 346/622 (55.63%), Positives = 458/622 (73.63%), Query Frame = 1

Query: 2   LMNQLPLKSVLVHIGRHGSILQAVALSS-------STPDNLITTVLNCKSPKKALELFNA 61
           ++   P KS  ++  +H S +     SS       S+P +L  ++L C++  +ALELF++
Sbjct: 1   MLKSFPPKSRRIY-AKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHS 60

Query: 62  APE-----KNTRLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALS 121
                   KN +LYSAIIHVL G+KL++ ARCL+++LIQ  L KSRR   +C   FN LS
Sbjct: 61  VSRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQ-CLQKSRRS-RICCSVFNVLS 120

Query: 122 SLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELL 181
            L++SKF+PNV+  LII  S+MGLV+EALW+Y K+ V  A QACN++LD LVK GRF+ +
Sbjct: 121 RLESSKFTPNVFGVLIIAFSEMGLVEEALWVYYKMDVLPAMQACNMVLDGLVKKGRFDTM 180

Query: 182 WGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRG 241
           W +Y +MV+ G SP+V+TYG LIDG CRQGD L+A  +FDEM  K I PTVV+YTILIRG
Sbjct: 181 WKVYGDMVARGASPNVVTYGTLIDGCCRQGDFLKAFRLFDEMIEKKIFPTVVIYTILIRG 240

Query: 242 LCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPD 301
           LC E+++ EAES+ R MR  G+LPN+YTYNT+M+G+CK+A+VK+AL LY  MLG+ L+P+
Sbjct: 241 LCGESRISEAESMFRTMRNSGMLPNLYTYNTMMDGYCKIAHVKKALELYXEMLGDGLLPN 300

Query: 302 NVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLS 361
            VTFGILIDGLCK  ++ +AR   ++M  F V P+I VYN LIDGYCKAG++SEA++  S
Sbjct: 301 VVTFGILIDGLCKTDEMVSARKFLIDMASFGVVPNIFVYNCLIDGYCKAGNLSEALSLHS 360

Query: 362 ELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEG 421
           E+E+ ++ PDV TYSILI+GLC   R+EEAD +L++M K+G   N+VTYN+LIDG CKEG
Sbjct: 361 EIEKHEILPDVFTYSILIKGLCGVDRMEEADGLLQEMKKKGFLPNAVTYNTLIDGYCKEG 420

Query: 422 NMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYT 481
           NM KA+E+CS+M E G+EPN+ITFS LIDGYCK   +EAAMG+Y+EM IK L PDVVAYT
Sbjct: 421 NMEKAIEVCSQMTEKGIEPNIITFSTLIDGYCKAGKMEAAMGLYTEMVIKGLLPDVVAYT 480

Query: 482 AMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEF 541
           A+IDGH K G+ KEA +L+ +M + GL PN +TLSCL+DGLCKDGR+SDA++LF  K   
Sbjct: 481 ALIDGHFKDGNTKEAFRLHKEMQEAGLHPNVFTLSCLIDGLCKDGRISDAIKLFLAKTGT 540

Query: 542 GTTKCK-------LSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVM 601
            TT  K       L   NHV+YTALI GLC DG+IFKA+K FSDMR  GL+PD    +V+
Sbjct: 541 DTTGSKTNELDRSLCSPNHVMYTALIQGLCTDGRIFKASKFFSDMRCSGLRPDVFTCIVI 600

Query: 602 LKGYFQVKRILDMTMLHADMLK 605
           ++G+F+   + D+ ML AD+LK
Sbjct: 601 IQGHFRAMHLRDVMMLQADILK 619

BLAST of Cp4.1LG01g24300 vs. NCBI nr
Match: gi|359491317|ref|XP_003634263.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Vitis vinifera])

HSP 1 Score: 690.3 bits (1780), Expect = 3.2e-195
Identity = 343/622 (55.14%), Positives = 457/622 (73.47%), Query Frame = 1

Query: 2   LMNQLPLKSVLVHIGRHGSILQAVALSS-------STPDNLITTVLNCKSPKKALELFNA 61
           ++   P KS  ++  +H S +     SS       S+P +L  ++L C++  +ALELF++
Sbjct: 1   MLKSFPPKSRRIY-AKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHS 60

Query: 62  APE-----KNTRLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALS 121
                   KN +LYSAIIHVL G+KL++ ARCL+++LIQ L  ++ R   +C   FN LS
Sbjct: 61  VSRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCL--QNSRRSRICCSVFNVLS 120

Query: 122 SLKTSKFSPNVYSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELL 181
            L++SKF+PNV+  LII  S+MGLV+EALW+Y K+ V  A QACN++LD LVK GRF+ +
Sbjct: 121 RLESSKFTPNVFGVLIIAFSEMGLVEEALWVYYKMDVLPAMQACNMVLDGLVKKGRFDTM 180

Query: 182 WGIYEEMVSNGLSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRG 241
           W +Y +MV+ G SP+V+TYG LIDG CRQGD L+A  +FDEM  K I PTVV+YTILIRG
Sbjct: 181 WKVYGDMVARGASPNVVTYGTLIDGCCRQGDFLKAFRLFDEMIEKKIFPTVVIYTILIRG 240

Query: 242 LCSENKMEEAESIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPD 301
           LC E+++ EAES+ R MR  G+LPN+YTYNT+M+G+CK+A+VK+AL LY  MLG+ L+P+
Sbjct: 241 LCGESRISEAESMFRTMRNSGMLPNLYTYNTMMDGYCKIAHVKKALELYQEMLGDGLLPN 300

Query: 302 NVTFGILIDGLCKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLS 361
            VTFGILIDGLCK  ++ +AR   ++M  F V P+I VYN LIDGYCKAG++SEA++  S
Sbjct: 301 VVTFGILIDGLCKTDEMVSARKFLIDMASFGVVPNIFVYNCLIDGYCKAGNLSEALSLHS 360

Query: 362 ELERFKVSPDVVTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEG 421
           E+E+ ++ PDV TYSILI+GLC   R+EEAD +L++M K+G   N+VTYN+LIDG CKEG
Sbjct: 361 EIEKHEILPDVFTYSILIKGLCGVDRMEEADGLLQEMKKKGFLPNAVTYNTLIDGYCKEG 420

Query: 422 NMNKALEICSRMIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYT 481
           NM KA+E+CS+M E G+EPN+ITFS LIDGYCK   +EAAMG+Y+EM IK L PDVVAYT
Sbjct: 421 NMEKAIEVCSQMTEKGIEPNIITFSTLIDGYCKAGKMEAAMGLYTEMVIKGLLPDVVAYT 480

Query: 482 AMIDGHCKHGSMKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEF 541
           A+IDGH K G+ KEA +L+ +M + GL PN +TLSCL+DGLCKDGR+SDA++LF  K   
Sbjct: 481 ALIDGHFKDGNTKEAFRLHKEMQEAGLHPNVFTLSCLIDGLCKDGRISDAIKLFLAKTGT 540

Query: 542 GTTKCK-------LSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVM 601
            TT  K       L   NHV+YTALI GLC DG+IFKA+K FSDMR  GL+PD    +V+
Sbjct: 541 DTTGSKTNELDRSLCSPNHVMYTALIQGLCTDGRIFKASKFFSDMRCSGLRPDVFTCIVI 600

Query: 602 LKGYFQVKRILDMTMLHADMLK 605
           ++G+F+   + D+ ML AD+LK
Sbjct: 601 IQGHFRAMHLRDVMMLQADILK 619

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
PP440_ARATH	4.3e-152	47.00	Pentatricopeptide repeat-containing protein At5g61400 OS=Arabidopsis thaliana GN...
PP143_ARATH	1.3e-79	30.50	Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
PP445_ARATH	1.1e-75	34.43	Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN...
PP407_ARATH	3.3e-75	32.39	Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP437_ARATH	4.3e-75	30.99	Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...

Match Name	E-value	Identity (%)	Description
A0A0A0KS30_CUCSA	4.4e-276	79.08	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G496480 PE=4 SV=1
A5AF05_VITVI	9.9e-196	55.63	Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031722 PE=4 SV=1
V4TQ94_9ROSI	7.4e-191	57.53	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024595mg PE=4 SV=1
A0A061G4F9_THECC	1.6e-190	56.66	Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_...
A0A067K4Z7_JATCU	6.2e-190	55.57	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11657 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT5G61400.1	2.4e-153	47.00	Pentatricopeptide repeat (PPR) superfamily protein
AT2G02150.1	7.3e-81	30.50	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65560.1	6.4e-77	34.43	Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1	1.8e-76	32.39	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1	2.4e-76	30.99	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity (%)	Description
gi|778703158|ref|XP_011655325.1|	6.2e-276	79.08	PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativu...
gi|659127196|ref|XP_008463574.1|	2.5e-272	78.10	PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Cucumis melo]
gi|1009132528|ref|XP_015883421.1|	2.4e-203	59.46	PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Ziziphus jujub...
gi|147817754|emb|CAN66662.1|	1.4e-195	55.63	hypothetical protein VITISV_031722 [Vitis vinifera]
gi|359491317|ref|XP_003634263.1|	3.2e-195	55.14	PREDICTED: pentatricopeptide repeat-containing protein At5g61400 [Vitis vinifera...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g24300.1	Cp4.1LG01g24300.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR
  coord: 153..181	score: 0.015
  coord: 121..143	score:
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2
  coord: 463..512	score: 8.5E-21
  coord: 540..586	score: 8.4E-13
  coord: 394..442	score: 2.2E-19
  coord: 253..302	score: 6.6E-17
  coord: 323..372	score: 6.2E-17
  coord: 185..232	score: 1.6
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756
  coord: 396..430	score: 2.2E-12
  coord: 186..220	score: 1.3E-8
  coord: 327..360	score: 4.6E-9
  coord: 256..289	score: 1.1E-8
  coord: 291..324	score: 0.0014
  coord: 466..499	score: 7.0E-11
  coord: 501..524	score: 2.5E-4
  coord: 431..465	score: 7.3E-7
  coord: 221..255	score: 4.9E-9
  coord: 152..185	score: 2.3E-7
  coord: 361..394	score: 4.1E-10
  coord: 542..576	score: 8.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR
  coord: 540..574	score: 12.057
  coord: 184..218	score: 12.858
  coord: 254..288	score: 12.014
  coord: 359..393	score: 13.307
  coord: 394..428	score: 14.502
  coord: 59..93	score: 6.204
  coord: 499..533	score: 9.953
  coord: 289..323	score: 10.052
  coord: 149..183	score: 10.019
  coord: 324..358	score: 12.386
  coord: 429..463	score: 11.115
  coord: 219..253	score: 12.068
  coord: 117..147	score: 6.401
  coord: 464..498	score: 13.8
  coord: 575..609	score: 6
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10
  coord: 45..59	score: 3.8E-9
  coord: 115..166	score: 3.8E-9
  coord: 325..424	score: 3.8E-9
  coord: 257..290	score: 3.
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED
  coord: 28..89	score: 5.3E-222
  coord: 115..121	score: 5.3E-222
  coord: 152..604	score: 5.3E
None	No IPR available	unknown	SSF81901	HCP-like
  coord: 330..531	score: 4.9
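
The `coord: start..end` entries in the InterPro annotations give the residue range of each domain hit on the protein. A short sketch of how such ranges might be pulled out of the text and converted to hit lengths follows. It assumes the positions are 1-based and inclusive, which this page does not state explicitly; the helper names `parse_coords` and `span_length` are hypothetical.

```python
import re

# Matches entries such as "coord: 463..512" anywhere in a block of text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) residue range found in the text, in order."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a hit, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Two PF13041 (PPR_2) hits copied from the annotation above.
    sample = "coord: 463..512 score: 8.5E-21 coord: 540..586 score: 8.4E-13"
    for start, end in parse_coords(sample):
        print(start, end, span_length(start, end))
```

Under the inclusive assumption, the two sample PPR_2 hits span 50 and 47 residues.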