Cp4.1LG01g08660 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g08660
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein family
Location: Cp4.1LG01 : 4816016 .. 4817962 (-)
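
The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page; `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field above.
length = feature_length(4816016, 4817962)
codons = length // 3

print(length, length % 3, codons)
```

Under that assumption the feature spans 1947 bases, a multiple of three, which would correspond to 649 codons if the whole span is coding.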
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCACTATACCGCCTCCTCCTCCGCTCTCTCCGCCGCACTTCAACCTCGCCATCGCATTCTCGAGCTCTGAGCATTGGTCCTCTCAGTCAACATCTGCAGACCCCGATTCTTCCATCCTCGCAAAGCTCTTCTCTTATTTCGCTTCTCCATGCCCGCTCATTTGCTTTTTCCTCCGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTATCCGCCCCCTCAGCGTGATCCTAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGTCTGAGCCTTCACAATCGTGTTCAATCCCTAATTCGTGCCGGTGATCTTGATGCGGCTTCTGCTGTCGCTCGCCACTCTGTGTTCTCGAACACCCGGCCCACGGTTTTCACTTGTAACGCTATTATTGCTGCCATGTATCGGGCCAAGAGGTATGGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGTGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTAACTTATCGGCATTTGACCAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAATAAAGGGCATGGGGCTGATTCGTTGGTTTATAATAATTTGATTTCCGGGTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAAGGGGAAAGCAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAGATGATTCCAGCCACTTGCAATGTGCTGTTGGAGGTTTTGCTCAAGCATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAATCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGACACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGTAAGTTCTCAGAAGCAGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCGAGGCCTTTTGCGATGGACGTTGCAGGGTATAACAATATTATTGTAAGGTTTTGTGAGCATGGAATGATGGAAGATGCAGAGACTTTCTTTGCTGAGCTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGGACATTGATCGAAGCTTATTTAAAGCTTGAGCAGATTGATGATGTATTGAAAGTTTTCAACAGAATGGTCGATGTAGGTTTGAGAGTCGTAGCTAGCTTCGGAAACAGGGTATTTGGCGAATTGATTAAGAATGGCAAGGCAGTTGAATGTGCTCAGATTTTAACAAAAATGGGAGAGAGGGATCCTAAACCAGATCCCACATGCTATGATGTGGTGATTAGAGGGCTATGTAATGAAGGTGCTCTCGATGCTAGTCGGATGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCTCACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCCGGCCGGAATGAAGAGATTGAAAGACTGTTAATGATGAACAGAGGGGGACATGCCCCTTATCGCCCCACGTCTGGACCCCCAAGAATTTCACAATCGCAGGTCCCTCAATTTAGAGGAAGTTACGGCCCTTCAGCACCTCAAATGACAGGCCCCAACTATTTTCAATCAGGATCAGTTCAAATGACAAGACCACAACAGCCATCATCAGGTCCACCGCCTTCAATGGAAAAACAGCAGCAGCATTCACAACCCCCCCAAATGGCTGGGCAGGCAGTAGCTTGA

mRNA sequence

ATGTCACTATACCGCCTCCTCCTCCGCTCTCTCCGCCGCACTTCAACCTCGCCATCGCATTCTCGAGCTCTGAGCATTGGTCCTCTCAGTCAACATCTGCAGACCCCGATTCTTCCATCCTCGCAAAGCTCTTCTCTTATTTCGCTTCTCCATGCCCGCTCATTTGCTTTTTCCTCCGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTATCCGCCCCCTCAGCGTGATCCTAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGTCTGAGCCTTCACAATCGTGTTCAATCCCTAATTCGTGCCGGTGATCTTGATGCGGCTTCTGCTGTCGCTCGCCACTCTGTGTTCTCGAACACCCGGCCCACGGTTTTCACTTGTAACGCTATTATTGCTGCCATGTATCGGGCCAAGAGGTATGGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGTGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTAACTTATCGGCATTTGACCAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAATAAAGGGCATGGGGCTGATTCGTTGGTTTATAATAATTTGATTTCCGGGTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAAGGGGAAAGCAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAGATGATTCCAGCCACTTGCAATGTGCTGTTGGAGGTTTTGCTCAAGCATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAATCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGACACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGTAAGTTCTCAGAAGCAGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCGAGGCCTTTTGCGATGGACGTTGCAGGGTATAACAATATTATTGTAAGGTTTTGTGAGCATGGAATGATGGAAGATGCAGAGACTTTCTTTGCTGAGCTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGGACATTGATCGAAGCTTATTTAAAGCTTGAGCAGATTGATGATGTATTGAAAGTTTTCAACAGAATGGTCGATGTAGGTTTGAGAGTCGTAGCTAGCTTCGGAAACAGGGTATTTGGCGAATTGATTAAGAATGGCAAGGCAGTTGAATGTGCTCAGATTTTAACAAAAATGGGAGAGAGGGATCCTAAACCAGATCCCACATGCTATGATGTGGTGATTAGAGGGCTATGTAATGAAGGTGCTCTCGATGCTAGTCGGATGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCTCACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCCGGCCGGAATGAAGAGATTGAAAGACTGTTAATGATGAACAGAGGGGGACATGCCCCTTATCGCCCCACGTCTGGACCCCCAAGAATTTCACAATCGCAGGTCCCTCAATTTAGAGGAAGTTACGGCCCTTCAGCACCTCAAATGACAGGCCCCAACTATTTTCAATCAGGATCAGTTCAAATGACAAGACCACAACAGCCATCATCAGGTCCACCGCCTTCAATGGAAAAACAGCAGCAGCATTCACAACCCCCCCAAATGGCTGGGCAGGCAGTAGCTTGA

Coding sequence (CDS)

ATGTCACTATACCGCCTCCTCCTCCGCTCTCTCCGCCGCACTTCAACCTCGCCATCGCATTCTCGAGCTCTGAGCATTGGTCCTCTCAGTCAACATCTGCAGACCCCGATTCTTCCATCCTCGCAAAGCTCTTCTCTTATTTCGCTTCTCCATGCCCGCTCATTTGCTTTTTCCTCCGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTATCCGCCCCCTCAGCGTGATCCTAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGTCTGAGCCTTCACAATCGTGTTCAATCCCTAATTCGTGCCGGTGATCTTGATGCGGCTTCTGCTGTCGCTCGCCACTCTGTGTTCTCGAACACCCGGCCCACGGTTTTCACTTGTAACGCTATTATTGCTGCCATGTATCGGGCCAAGAGGTATGGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGTGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTAACTTATCGGCATTTGACCAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAATAAAGGGCATGGGGCTGATTCGTTGGTTTATAATAATTTGATTTCCGGGTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAAGGGGAAAGCAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAGATGATTCCAGCCACTTGCAATGTGCTGTTGGAGGTTTTGCTCAAGCATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAATCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGACACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGTAAGTTCTCAGAAGCAGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCGAGGCCTTTTGCGATGGACGTTGCAGGGTATAACAATATTATTGTAAGGTTTTGTGAGCATGGAATGATGGAAGATGCAGAGACTTTCTTTGCTGAGCTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGGACATTGATCGAAGCTTATTTAAAGCTTGAGCAGATTGATGATGTATTGAAAGTTTTCAACAGAATGGTCGATGTAGGTTTGAGAGTCGTAGCTAGCTTCGGAAACAGGGTATTTGGCGAATTGATTAAGAATGGCAAGGCAGTTGAATGTGCTCAGATTTTAACAAAAATGGGAGAGAGGGATCCTAAACCAGATCCCACATGCTATGATGTGGTGATTAGAGGGCTATGTAATGAAGGTGCTCTCGATGCTAGTCGGATGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCTCACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCCGGCCGGAATGAAGAGATTGAAAGACTGTTAATGATGAACAGAGGGGGACATGCCCCTTATCGCCCCACGTCTGGACCCCCAAGAATTTCACAATCGCAGGTCCCTCAATTTAGAGGAAGTTACGGCCCTTCAGCACCTCAAATGACAGGCCCCAACTATTTTCAATCAGGATCAGTTCAAATGACAAGACCACAACAGCCATCATCAGGTCCACCGCCTTCAATGGAAAAACAGCAGCAGCATTCACAACCCCCCCAAATGGCTGGGCAGGCAGTAGCTTGA

Protein sequence

MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFGELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIGLTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVPQFRGSYGPSAPQMTGPNYFQSGSVQMTRPQQPSSGPPPSMEKQQQHSQPPQMAGQAVA
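
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First nine and last six codons of the CDS listed above.
print(translate("ATGTCACTATACCGCCTCCTCCTCCGC"))  # MSLYRLLLR
print(translate("GGGCAGGCAGTAGCTTGA"))           # GQAVA*
```

The two fragments reproduce the start (MSLYRLLLR) and end (GQAVA, followed by a stop) of the protein sequence shown above.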
BLAST of Cp4.1LG01g08660 vs. Swiss-Prot
Match: PPR29_ARATH (Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN=GRP23 PE=1 SV=1)

HSP 1 Score: 741.1 bits (1912), Expect = 1.0e-212
Identity = 388/616 (62.99%), Positives = 467/616 (75.81%), Query Frame = 1

Query: 53  RSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPR 112
           R+ AFSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSALVG R
Sbjct: 86  RTMAFSSAEEAAAERRRRKRRLRIEPPLHALRRDPSAPPPKRDPNAPRLPDSTSALVGQR 145

Query: 113 LSLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQF 172
           L+LHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRY ++I+LFQ+
Sbjct: 146 LNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKRYSESISLFQY 205

Query: 173 FFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS 232
           FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRHI+ANAPF+PS+VTYRHLTKGL+ +
Sbjct: 206 FFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTYRHLTKGLVQA 265

Query: 233 GRIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVN 292
           GRIG+A  LLREML+KG  ADS VYNNLI G+L+L + +KA E FDELK +C VYDG+VN
Sbjct: 266 GRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKSKCTVYDGIVN 325

Query: 293 ATFMDWFFNKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLD 352
           ATFM+++F KG  KEAMESY+SLLD++F+M P T NVLLEV LK GKK EAW LF++MLD
Sbjct: 326 ATFMEYWFEKGNDKEAMESYRSLLDKKFRMHPPTGNVLLEVFLKFGKKDEAWALFNEMLD 385

Query: 353 NHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVR 412
           NH PPN  +VNSDT  IMVNECFK+G+FSEA+ TF+KVG++  S+PF MD  GY NI+ R
Sbjct: 386 NHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKPFVMDYLGYCNIVTR 445

Query: 413 FCEHGMMEDAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVV 472
           FCE GM+ +AE FFAE  S+SL  D P+HR +I+AYLK E+IDD +K+ +RMVDV LRVV
Sbjct: 446 FCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAVKMLDRMVDVNLRVV 505

Query: 473 ASFGNRVFGELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLD 532
           A FG RVFGELIKNGK  E A++LTKMGER+PKPDP+ YDVV+RGLC+  ALD ++ ++ 
Sbjct: 506 ADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGLCDGDALDQAKDIVG 565

Query: 533 QVMRYGIGLTPSLQEFVKEVFVKAGRNEEIERLL------MMNRGGH------------- 592
           +++R+ +G+T  L+EF+ EVF KAGR EEIE++L      + N G               
Sbjct: 566 EMIRHNVGVTTVLREFIIEVFEKAGRREEIEKILNSVARPVRNAGQSGNTPPRVPAVFGT 625

Query: 593 ---APYRPTSGPPRISQSQVPQFRGSYGPSAPQMTGPNYFQSGSVQMTRPQQPSSGPPPS 646
              AP +P    P  SQ  V    G    +A Q  G      G+ +    Q PS     +
Sbjct: 626 TPAAPQQPRDRAPWTSQGVVHSNSGWANGTAGQTAG------GAYKANNGQNPSWS--NT 685
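
The percentages in an HSP header are the matched counts divided by the alignment length. The sketch below recomputes them from the header line of the hit above; the regular expression and the `parse_hsp_stats` name are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches fragments such as "Identity = 388/616 (62.99%)".
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Recompute each percentage in an HSP header from its matched/aligned counts."""
    stats = {}
    for label, matched, aligned, _reported in HSP_STATS.findall(line):
        stats[label] = round(100 * int(matched) / int(aligned), 2)
    return stats


line = "Identity = 388/616 (62.99%), Positives = 467/616 (75.81%), Query Frame = 1"
print(parse_hsp_stats(line))  # {'Identity': 62.99, 'Positives': 75.81}
```

The recomputed values agree with the reported ones: 388 of 616 aligned positions are identical and 467 of 616 are scored as positive.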

BLAST of Cp4.1LG01g08660 vs. Swiss-Prot
Match: PP273_ARATH (Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN=EMB1796 PE=2 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 4.5e-91
Identity = 207/555 (37.30%), Positives = 313/555 (56.40%), Query Frame = 1

Query: 23  ALSIGPLSQHLQTPILPSSQSSSLIS--LLHARSFAFSSAEEAAAERRRRKRRLRIEPPL 82
           ++S      HLQT  L  S    ++    L  R  +F++ EEAAAERRRRKRRLR+EPP+
Sbjct: 2   SISKAAFLNHLQT--LSRSYRHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPV 61

Query: 83  HALRRDNY-----PPPQRDPNAPRLPDSTSALVGPRLSLHNRVQSLIRAGDLDAASAVAR 142
           ++  R        P P ++PN P+LP+S SALVG RL LHN +  LIR  DL+ A+   R
Sbjct: 62  NSFNRSQQQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALYTR 121

Query: 143 HSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDE 202
           HSV+SN RPT+FT N ++AA  R  +YG A+     F NQ+ I PN+++YN +  A+ D 
Sbjct: 122 HSVYSNCRPTIFTVNTVLAAQLRQAKYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDV 181

Query: 203 GRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV 262
            + ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++  +M  KG   D +V
Sbjct: 182 RKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVV 241

Query: 263 YNNLISGFLNLENLEKANELFDELKERC--LVYDGVVNATFMDWFFNKGKAKEAMESYKS 322
           Y+ L+ G +   + +   +L+ ELKE+    V DGVV    M  +F K   KEAME Y+ 
Sbjct: 242 YSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEE 301

Query: 323 LL--DRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVN 382
            +  + + +M     N +LE L ++GK  EA  LFD +   H PP   AVN  TFN+MVN
Sbjct: 302 AVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVN 361

Query: 383 ECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSK 442
                GKF EA+E FR++G   K  P   D   +NN++ + C++ ++ +AE  + E+  K
Sbjct: 362 GYCAGGKFEEAMEVFRQMG-DFKCSP---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEK 421

Query: 443 SLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFGELIKNGKAVEC 502
           ++ PD  T+  L++   K  +ID+    +  MV+  LR   +  NR+  +LIK GK  + 
Sbjct: 422 NVKPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDA 481

Query: 503 AQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYG-IGLTPSLQEFVKE 562
                 M  +  K D   Y  ++R L   G LD    ++D+++    + ++  LQEFVKE
Sbjct: 482 KSFFDMMVSK-LKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKE 541

Query: 563 VFVKAGRNEEIERLL 566
              K GR  ++E+L+
Sbjct: 542 ELRKGGREGDLEKLM 548

BLAST of Cp4.1LG01g08660 vs. Swiss-Prot
Match: PP289_ARATH (Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidopsis thaliana GN=At3g60960 PE=2 SV=2)

HSP 1 Score: 232.6 bits (592), Expect = 1.2e-59
Identity = 151/402 (37.56%), Positives = 222/402 (55.22%), Query Frame = 1

Query: 90  PPQRDPNA-PRL-PDSTSALVGPRLSLHNRVQSLIRAGDLDAASAVARHSVFSN---TRP 149
           P  RDP++ P+L P S S +    +SL  RV+++I   +LD AS ++R +V +     R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVYNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +VY+ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNKGKAKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK +EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSK----- 449
             FSE    F +           +    Y  +IV  CEHG + DAE  FAE+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ + ++DD +K  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Cp4.1LG01g08660 vs. Swiss-Prot
Match: PP290_ARATH (Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidopsis thaliana GN=At3g60980 PE=2 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 5.0e-58
Identity = 147/380 (38.68%), Positives = 217/380 (57.11%), Query Frame = 1

Query: 117 RVQSLIRA-GDLDAASAVARHSVFSNTRP--TVFTCNAIIAAMYRAKRYGDAIALFQFFF 176
           RV  LIR  GDLD A+  AR +VF++ +   T   C +II  M R KR  DA  L++FFF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 GRIGEAVDLLR-EMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLV----- 296
           GR+ +A   LR   +N+    D + YNNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNKGKAKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK  EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 416
           LK+G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSKSLS-PDVPTHRTLIEAYLKLEQ 473
           K+     D      II RFCE+ M+ +AE+ F +  +      DV T++T+I+AY+K  +
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Cp4.1LG01g08660 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 1.0e-39
Identity = 112/464 (24.14%), Positives = 211/464 (45.47%), Query Frame = 1

Query: 115 HNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFN 174
           +N ++ ++R G+L+       + V+    P +  C  +I    R  +   A  + +    
Sbjct: 106 NNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEIL-E 165

Query: 175 QSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRI 234
            S  VP+V++YN +I+ +C  G ++  L +    +     SP  VTY  + + L DSG++
Sbjct: 166 GSGAVPDVITYNVMISGYCKAGEINNALSV----LDRMSVSPDVVTYNTILRSLCDSGKL 225

Query: 235 GEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATF 294
            +A+++L  ML +    D + Y  LI        +  A +L DE+++R    D V     
Sbjct: 226 KQAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVL 285

Query: 295 MDWFFNKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHT 354
           ++    +G+  EA++    +     +    T N++L  +   G+  +A  L   ML    
Sbjct: 286 VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGF 345

Query: 355 PPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKV---GTQPKSRPFAMDVAGYNNIIVR 414
            P+       TFNI++N   + G    A++   K+   G QP S         YN ++  
Sbjct: 346 SPSVV-----TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNS-------LSYNPLLHG 405

Query: 415 FCEHGMMEDAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVV 474
           FC+   M+ A  +   + S+   PD+ T+ T++ A  K  +++D +++ N++   G   V
Sbjct: 406 FCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPV 465

Query: 475 ASFGNRVFGELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLD 534
               N V   L K GK  +  ++L +M  +D KPD   Y  ++ GL  EG +D +     
Sbjct: 466 LITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFH 525

Query: 535 QVMRYGIGLTPSLQEFVKEVFVKAGRNEEIER-----LLMMNRG 571
           +  R  +G+ P+   F   + +   ++ + +R     + M+NRG
Sbjct: 526 EFER--MGIRPNAVTF-NSIMLGLCKSRQTDRAIDFLVFMINRG 549

BLAST of Cp4.1LG01g08660 vs. TrEMBL
Match: A0A0A0LTQ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G435500 PE=4 SV=1)

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 570/678 (84.07%), Positives = 597/678 (88.05%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALS-IGPLSQHLQTPILPSSQSSSLISLLHARSFAFSS 60
           MS YR LLRSLRR+STSPSH+ AL+ I PL+QH+     PSSQ+SS ISLL ARSF+FSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSHAPALTTIAPLNQHIP----PSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLV+NNLISGFLNL NLEKANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLEKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GK KEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNSDTFNIMVNECFK GKF+EAVETFRKVGTQPKSRPFAMDVAGYNNII RFCE GMM 
Sbjct: 361 AVNSDTFNIMVNECFKHGKFAEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           DAETFFAELCSKSLSPDVPTHRTLIE+YLK+EQIDD L+VFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
           GELIKNGKA +CAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASR LLDQ+MRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVP--------- 600
           LTP+L+EFVKE FVKAGR+EEIERLL MN+ GHA YRP SGPPRISQSQVP         
Sbjct: 541 LTPTLEEFVKEAFVKAGRHEEIERLLNMNKWGHAAYRPLSGPPRISQSQVPPQMGGPLQG 600

Query: 601 ---------------QFRGSYG------PS-----APQMTGPNYFQSGSVQMTRPQQPSS 643
                          Q RG+Y       PS     +PQ TG NYFQSGSVQMT+ Q  S 
Sbjct: 601 PPQMAEPNWRPSINPQARGTYSSPQMSSPSHFQSGSPQTTGSNYFQSGSVQMTKSQHSSF 660

BLAST of Cp4.1LG01g08660 vs. TrEMBL
Match: Q6E438_CUCME (ACT11D09.4 OS=Cucumis melo GN=ACT11D09.4 PE=4 SV=1)

HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 569/679 (83.80%), Positives = 596/679 (87.78%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALS-IGPLSQHLQTPILPSSQSSSLISLLHARSFAFSS 60
           MS YR LLRSLRR+STSPS++ AL+ I PL+ H+     PSSQ+SS ISLL ARSF+FSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSYAPALTTIAPLNHHIP----PSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLV+NNLISGFLNL NL KANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLVKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GK KEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNSDTFNIMVNECFKLGKF+EAVETFRKVGTQPKSRPFAMDVAGYNNII RFCE GMM 
Sbjct: 361 AVNSDTFNIMVNECFKLGKFTEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           DAETFFAELCSKSLSPDVPTHRTLIE+YLK+EQIDD L+VFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
           GELIKNGKA +CAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD SR LLDQ+MRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVP--------- 600
           LTP+L+EFVK+ FVKAGR+EEIERLL MN+ GHA YRP SGPPRISQSQVP         
Sbjct: 541 LTPTLEEFVKDAFVKAGRHEEIERLLNMNKWGHAAYRPPSGPPRISQSQVPPQMGRPLQG 600

Query: 601 ---------------QFRGSYG------PS-----APQMTGPNYFQSGSVQMTRPQQPSS 644
                          Q RGSY       PS      PQMTG NYFQSGS QMT+PQ  S 
Sbjct: 601 PPQMAEPNWRPSINPQARGSYSSPQMSSPSHFQSGPPQMTGSNYFQSGSAQMTKPQHSSF 660

BLAST of Cp4.1LG01g08660 vs. TrEMBL
Match: W9RZT4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_010965 PE=4 SV=1)

HSP 1 Score: 916.4 bits (2367), Expect = 2.0e-263
Identity = 478/645 (74.11%), Positives = 537/645 (83.26%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60
           MS+YRLLLRSLRR ST+P      S+     H+      ++ ++++  L   RSFAFSSA
Sbjct: 1   MSVYRLLLRSLRRPSTTPQTLTTTSL----LHI------TNTTTTIPDLTFRRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           EEAAAERRRRKRRLRIEPPL ALRRD ++ PP RDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  EEAAAERRRRKRRLRIEPPLQALRRDPHFHPPPRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFF QSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYNDAIALFQFFFQQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PN+VSYNNLINAHCDEGRVDVGL+++RHI+ANAPFSPS VTYRHLTKGLID+GRIGEAVD
Sbjct: 181 PNIVSYNNLINAHCDEGRVDVGLDVFRHIMANAPFSPSPVTYRHLTKGLIDAGRIGEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLVYNNLISGFL+L NLE+ANELF ELKERCLVYDGVV+ATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVYNNLISGFLSLGNLERANELFGELKERCLVYDGVVSATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+G  KEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAW LFDQMLDNHTPPNFQ
Sbjct: 301 NRGMEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWALFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNS++F+IMVNECFKL +  +A+ TFRKVGT+  S+PFAMDVAGYNNII R+CE+ M+ 
Sbjct: 361 AVNSESFSIMVNECFKLERIEDAIVTFRKVGTKVNSKPFAMDVAGYNNIITRYCENRMLS 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           +AE+ FAELCSKSLSPDVPT+RTLIEAYLK EQID+ L++FNRMV+ GLRVVASFGNRVF
Sbjct: 421 EAESMFAELCSKSLSPDVPTYRTLIEAYLKEEQIDNALQMFNRMVEAGLRVVASFGNRVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
            ELIKNGKAV+CAQIL KMGE+DPKPD +CY+VVI+GLCNEGA D S  L+++VMRYGIG
Sbjct: 481 DELIKNGKAVDCAQILKKMGEKDPKPDVSCYEVVIKGLCNEGAFDVSLDLVEEVMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVPQFRGSYGPS 600
           +TP+LQ+FV E F K GR +EIER+L M+R GH P   T  P      Q P  R      
Sbjct: 541 VTPTLQQFVNEAFAKVGRGQEIERVLSMDRWGHTPPSRTERP-----GQQPLGRA----- 600

Query: 601 APQMTGPNYFQSGSVQMTRPQQPSSGPPPSMEKQQQHSQPPQMAG 645
             QM GP++  SG  QM+    PS G P      Q  S  PQM G
Sbjct: 601 --QMAGPSHAPSGPGQMSSWSNPSFGSPQRTGFHQSPSASPQMTG 623

BLAST of Cp4.1LG01g08660 vs. TrEMBL
Match: A0A059ATG5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I02789 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.7e-260
Identity = 466/640 (72.81%), Positives = 527/640 (82.34%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60
           MSLYR+LLRSLR  S+S S S A +   L  H    + P++Q          RSFAFSSA
Sbjct: 1   MSLYRILLRSLRHRSSSSSSSAAAAAADL--HFLPSLAPAAQH---------RSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIA+MYR KRY DAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIASMYRGKRYTDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLIN HCD G VD  LE+YRHI+A+APFSPS+VTYRHLTKGLID+GR+G+AVDL
Sbjct: 181 NVVSYNNLINTHCDMGNVDTALEVYRHILAHAPFSPSSVTYRHLTKGLIDAGRMGDAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREML KGHGADSLVYNNLISGFLNL NL+KANELFDELKERCLVYDGVV+ATFMDWFFN
Sbjct: 241 LREMLTKGHGADSLVYNNLISGFLNLGNLDKANELFDELKERCLVYDGVVSATFMDWFFN 300

Query: 301 KGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           +G+ KEAMESYKSLLDRQF+M+PATCNVL+EVLLKHG+KTEAW +FDQMLDNHTPP FQA
Sbjct: 301 QGREKEAMESYKSLLDRQFRMVPATCNVLIEVLLKHGRKTEAWAMFDQMLDNHTPPTFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMED 420
           VNSDTFNIMVNECFK GKF EA+ TFRKVGT+  SRPF+MDVAGYNNII R+CEHGM+ +
Sbjct: 361 VNSDTFNIMVNECFKNGKFDEAISTFRKVGTKAGSRPFSMDVAGYNNIITRYCEHGMLAE 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFG 480
           A+  F EL SKSL PDV +HRTLI+AYLK  ++DD L +F+RMVD GLRVVASFGNR+  
Sbjct: 421 ADNLFRELMSKSLCPDVTSHRTLIDAYLKENRVDDALSMFSRMVDAGLRVVASFGNRILE 480

Query: 481 ELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIGL 540
             I NGKAV+CAQ LTKMGE+DPKPDPTCY+VV++GLC+EG+ D ++ L+ Q++RYGIG 
Sbjct: 481 VFIGNGKAVDCAQALTKMGEKDPKPDPTCYEVVLKGLCDEGSFDVAQDLVGQMIRYGIGF 540

Query: 541 TPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRP---------TSGPPRISQSQVP- 600
           TPSL+EFV   F KAGR+EEIERLL MNR G+AP  P           GPP+++ +Q P 
Sbjct: 541 TPSLREFVSTAFGKAGRSEEIERLLSMNRWGYAPRYPQQGTNPQPRVQGPPQMTATQGPY 600

Query: 601 QFRGSYGP-SAPQMTGPNYFQSGSVQMTRPQQPSS-GPPP 629
              GS  P  APQ  G       +  +  PQ  ++ G PP
Sbjct: 601 SMNGSLPPQGAPQGFGSLRPSQSNGPLGHPQTAAAPGSPP 629

BLAST of Cp4.1LG01g08660 vs. TrEMBL
Match: M5XCA4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001970mg PE=4 SV=1)

HSP 1 Score: 894.0 bits (2309), Expect = 1.1e-256
Identity = 468/646 (72.45%), Positives = 533/646 (82.51%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60
           M+LYRLLLRSLRR ST PS +++L+   L+     P +P   +++       R+FAFSSA
Sbjct: 1   MTLYRLLLRSLRRPSTPPSLTQSLT--SLALQNPNPSIPFHLTTTA-----TRTFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQS 120
           EEAAAERRRRKRRLRIEPP++ALRRD++PPP RDPNAPRLPD+TSALVG RL+LHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPINALRRDSHPPPPRDPNAPRLPDTTSALVGHRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAI+AAMYRAKRY DA+ALFQFF+NQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIVAAMYRAKRYNDAVALFQFFYNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYN LINAHCD+GRVDVGLE+YRHI+ANAPFSPS VTYRHLTKGL+D+GRIGEAVDL
Sbjct: 181 NVVSYNILINAHCDDGRVDVGLEVYRHILANAPFSPSQVTYRHLTKGLVDAGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNK  GADS VYNNLI+GFL+LEN +KA ELFDELK+RCL YDGVVNATFMDWFFN
Sbjct: 241 LREMLNKNLGADSGVYNNLINGFLHLENFDKAVELFDELKDRCLAYDGVVNATFMDWFFN 300

Query: 301 KGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           KGK KEAMESYKS LDRQF+M  AT NVLLEVLLKHGKK EAW LFDQMLDNHTPP  QA
Sbjct: 301 KGKEKEAMESYKSELDRQFRMTTATGNVLLEVLLKHGKKKEAWALFDQMLDNHTPPTIQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMED 420
           VNS+TFNIMVNECF LGKF EA+ TF+KVGT+  SRPF+MDVAGYNNII R+CE+GM+ +
Sbjct: 361 VNSETFNIMVNECFGLGKFDEALATFKKVGTKVNSRPFSMDVAGYNNIIARYCENGMLSE 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFG 480
           AET FAEL SK+L+PDV THRTLI+AYLK+E+IDD LK+F RM +VGLRVVAS GNRVF 
Sbjct: 421 AETLFAELSSKALTPDVTTHRTLIDAYLKVERIDDALKIFRRMAEVGLRVVASLGNRVFD 480

Query: 481 ELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIGL 540
           ELIKNGKA++CAQIL KMGE+DPKPD + YDVVIRGLCNE A D SR LL++++RYGIG+
Sbjct: 481 ELIKNGKAMDCAQILKKMGEKDPKPDASFYDVVIRGLCNEVAFDPSRDLLEEMVRYGIGV 540

Query: 541 TPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAP--YRPTSGPPRISQSQVPQFRGSYGP 600
            P+LQ+FV EVF KAGR EEI+R+L M++ G+ P   RP    P  S     Q     GP
Sbjct: 541 PPALQQFVNEVFGKAGRGEEIQRVLNMSKWGNTPAQARPRQFQPMRSPQMAGQQEPPSGP 600

Query: 601 SAPQMTGPNYFQSGSVQMTRPQQPSSGPPPSMEKQQQHSQPPQMAG 645
           S  QM G ++  S   QM  P QPSSGP     +    S P QMAG
Sbjct: 601 S--QMAGQHHSSSAPPQMAGPYQPSSGPYQMAGQHHPSSAPSQMAG 637

BLAST of Cp4.1LG01g08660 vs. TAIR10
Match: AT1G10270.1 (AT1G10270.1 glutamine-rich protein 23)

HSP 1 Score: 741.1 bits (1912), Expect = 5.8e-214
Identity = 388/616 (62.99%), Positives = 467/616 (75.81%), Query Frame = 1

Query: 53  RSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPR 112
           R+ AFSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSALVG R
Sbjct: 86  RTMAFSSAEEAAAERRRRKRRLRIEPPLHALRRDPSAPPPKRDPNAPRLPDSTSALVGQR 145

Query: 113 LSLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQF 172
           L+LHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRY ++I+LFQ+
Sbjct: 146 LNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKRYSESISLFQY 205

Query: 173 FFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS 232
           FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRHI+ANAPF+PS+VTYRHLTKGL+ +
Sbjct: 206 FFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTYRHLTKGLVQA 265

Query: 233 GRIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVN 292
           GRIG+A  LLREML+KG  ADS VYNNLI G+L+L + +KA E FDELK +C VYDG+VN
Sbjct: 266 GRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKSKCTVYDGIVN 325

Query: 293 ATFMDWFFNKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLD 352
           ATFM+++F KG  KEAMESY+SLLD++F+M P T NVLLEV LK GKK EAW LF++MLD
Sbjct: 326 ATFMEYWFEKGNDKEAMESYRSLLDKKFRMHPPTGNVLLEVFLKFGKKDEAWALFNEMLD 385

Query: 353 NHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVR 412
           NH PPN  +VNSDT  IMVNECFK+G+FSEA+ TF+KVG++  S+PF MD  GY NI+ R
Sbjct: 386 NHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKPFVMDYLGYCNIVTR 445

Query: 413 FCEHGMMEDAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVV 472
           FCE GM+ +AE FFAE  S+SL  D P+HR +I+AYLK E+IDD +K+ +RMVDV LRVV
Sbjct: 446 FCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAVKMLDRMVDVNLRVV 505

Query: 473 ASFGNRVFGELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLD 532
           A FG RVFGELIKNGK  E A++LTKMGER+PKPDP+ YDVV+RGLC+  ALD ++ ++ 
Sbjct: 506 ADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGLCDGDALDQAKDIVG 565

Query: 533 QVMRYGIGLTPSLQEFVKEVFVKAGRNEEIERLL------MMNRGGH------------- 592
           +++R+ +G+T  L+EF+ EVF KAGR EEIE++L      + N G               
Sbjct: 566 EMIRHNVGVTTVLREFIIEVFEKAGRREEIEKILNSVARPVRNAGQSGNTPPRVPAVFGT 625

Query: 593 ---APYRPTSGPPRISQSQVPQFRGSYGPSAPQMTGPNYFQSGSVQMTRPQQPSSGPPPS 646
              AP +P    P  SQ  V    G    +A Q  G      G+ +    Q PS     +
Sbjct: 626 TPAAPQQPRDRAPWTSQGVVHSNSGWANGTAGQTAG------GAYKANNGQNPSWS--NT 685
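
The percentages in an HSP header follow directly from the residue counts, for example 388/616 = 62.99% identity for the AT1G10270.1 hit above. The following is a minimal Python sketch, not part of the database output, that parses such a header line and recomputes the percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the header layout shown in this report; it accepts both "Positives" and the "Postives" spelling that some report versions emit.

```python
import re

# Assumed header layout, as seen in this report:
#   Identity = 388/616 (62.99%), Positives = 467/616 (75.81%), Query Frame = 1
# "Posi?tives" also matches the misspelled "Postives" variant.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 388/616 (62.99%), Positives = 467/616 (75.81%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported percentages is a cheap sanity check when these headers are scraped from a page rather than read from structured BLAST output.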

BLAST of Cp4.1LG01g08660 vs. TAIR10
Match: AT3G49240.1 (AT3G49240.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 337.0 bits (863), Expect = 2.5e-92
Identity = 207/555 (37.30%), Positives = 313/555 (56.40%), Query Frame = 1

Query: 23  ALSIGPLSQHLQTPILPSSQSSSLIS--LLHARSFAFSSAEEAAAERRRRKRRLRIEPPL 82
           ++S      HLQT  L  S    ++    L  R  +F++ EEAAAERRRRKRRLR+EPP+
Sbjct: 2   SISKAAFLNHLQT--LSRSYRHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPV 61

Query: 83  HALRRDNY-----PPPQRDPNAPRLPDSTSALVGPRLSLHNRVQSLIRAGDLDAASAVAR 142
           ++  R        P P ++PN P+LP+S SALVG RL LHN +  LIR  DL+ A+   R
Sbjct: 62  NSFNRSQQQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALYTR 121

Query: 143 HSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDE 202
           HSV+SN RPT+FT N ++AA  R  +YG A+     F NQ+ I PN+++YN +  A+ D 
Sbjct: 122 HSVYSNCRPTIFTVNTVLAAQLRQAKYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDV 181

Query: 203 GRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV 262
            + ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++  +M  KG   D +V
Sbjct: 182 RKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVV 241

Query: 263 YNNLISGFLNLENLEKANELFDELKERC--LVYDGVVNATFMDWFFNKGKAKEAMESYKS 322
           Y+ L+ G +   + +   +L+ ELKE+    V DGVV    M  +F K   KEAME Y+ 
Sbjct: 242 YSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEE 301

Query: 323 LL--DRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVN 382
            +  + + +M     N +LE L ++GK  EA  LFD +   H PP   AVN  TFN+MVN
Sbjct: 302 AVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVN 361

Query: 383 ECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSK 442
                GKF EA+E FR++G   K  P   D   +NN++ + C++ ++ +AE  + E+  K
Sbjct: 362 GYCAGGKFEEAMEVFRQMG-DFKCSP---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEK 421

Query: 443 SLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFGELIKNGKAVEC 502
           ++ PD  T+  L++   K  +ID+    +  MV+  LR   +  NR+  +LIK GK  + 
Sbjct: 422 NVKPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDA 481

Query: 503 AQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYG-IGLTPSLQEFVKE 562
                 M  +  K D   Y  ++R L   G LD    ++D+++    + ++  LQEFVKE
Sbjct: 482 KSFFDMMVSK-LKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKE 541

Query: 563 VFVKAGRNEEIERLL 566
              K GR  ++E+L+
Sbjct: 542 ELRKGGREGDLEKLM 548

BLAST of Cp4.1LG01g08660 vs. TAIR10
Match: AT3G60960.1 (AT3G60960.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 232.6 bits (592), Expect = 6.7e-61
Identity = 151/402 (37.56%), Positives = 222/402 (55.22%), Query Frame = 1

Query: 90  PPQRDPNA-PRL-PDSTSALVGPRLSLHNRVQSLIRAGDLDAASAVARHSVFSN---TRP 149
           P  RDP++ P+L P S S +    +SL  RV+++I   +LD AS ++R +V +     R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVYNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +VY+ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNKGKAKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK +EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSK----- 449
             FSE    F +           +    Y  +IV  CEHG + DAE  FAE+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ + ++DD +K  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Cp4.1LG01g08660 vs. TAIR10
Match: AT3G60980.1 (AT3G60980.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 227.3 bits (578), Expect = 2.8e-59
Identity = 147/380 (38.68%), Positives = 217/380 (57.11%), Query Frame = 1

Query: 117 RVQSLIRA-GDLDAASAVARHSVFSNTRP--TVFTCNAIIAAMYRAKRYGDAIALFQFFF 176
           RV  LIR  GDLD A+  AR +VF++ +   T   C +II  M R KR  DA  L++FFF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 GRIGEAVDLLR-EMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLV----- 296
           GR+ +A   LR   +N+    D + YNNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNKGKAKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK  EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 416
           LK+G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSKSLS-PDVPTHRTLIEAYLKLEQ 473
           K+     D      II RFCE+ M+ +AE+ F +  +      DV T++T+I+AY+K  +
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Cp4.1LG01g08660 vs. TAIR10
Match: AT5G28340.1 (AT5G28340.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 183.3 bits (464), Expect = 4.6e-46
Identity = 112/322 (34.78%), Positives = 174/322 (54.04%), Query Frame = 1

Query: 162 YGDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTY 221
           Y +AI+LF +FFN+S  +PN++S N +I AHCD+G VD  LE+YRHI+ +   +P   TY
Sbjct: 133 YDEAISLFDYFFNESQTLPNMLSCNLIIKAHCDQGSVDHALELYRHILLDGSLAPGIETY 192

Query: 222 RHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELK- 281
           R LTK L+ + R+ EA D++R M       D  VY+ LI GFL+     +A+++F+ELK 
Sbjct: 193 RILTKALVGAKRLDEACDVVRSMSR----CDFAVYDILIRGFLDKGKFVRASQIFEELKG 252

Query: 282 -------ERCLVYDGVVNATFMDWFFNKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVL 341
                          + N +FMD++F +GK +EAME + +L   +  +   + N +L+ L
Sbjct: 253 PNSKLPWRNYHKAIAIFNVSFMDYWFKQGKDEEAMEIFATLEHAEL-LNTISGNGVLKCL 312

Query: 342 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 401
           ++HG+KTEAW LF  M+        +  +S+T  I+++   K G F E    F +V    
Sbjct: 313 VEHGRKTEAWELFLDMI--------EICDSETVGIIMS---KEGFFGEKTIPFERVRR-- 372

Query: 402 KSRPFAMDVAGYNNIIVRFCEHGMMEDAETFFAELCSK------SLSPDVPTHRTLIEAY 461
                      Y  +I   C+ G M +AE  FA++ +          PDV T R +I  Y
Sbjct: 373 ---------TCYTRMIASLCQQGNMLEAEKLFADMFADVDGDDLLAGPDVSTFRAMINGY 427

Query: 462 LKLEQIDDVLKVFNRMVDVGLR 470
           +K+ ++DD +K  N+M    LR
Sbjct: 433 VKVGRVDDAIKTLNKMKISNLR 427

BLAST of Cp4.1LG01g08660 vs. NCBI nr
Match: gi|778673836|ref|XP_011650072.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10270 [Cucumis sativus])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 570/678 (84.07%), Positives = 597/678 (88.05%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALS-IGPLSQHLQTPILPSSQSSSLISLLHARSFAFSS 60
           MS YR LLRSLRR+STSPSH+ AL+ I PL+QH+     PSSQ+SS ISLL ARSF+FSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSHAPALTTIAPLNQHIP----PSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLV+NNLISGFLNL NLEKANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLEKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GK KEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNSDTFNIMVNECFK GKF+EAVETFRKVGTQPKSRPFAMDVAGYNNII RFCE GMM 
Sbjct: 361 AVNSDTFNIMVNECFKHGKFAEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           DAETFFAELCSKSLSPDVPTHRTLIE+YLK+EQIDD L+VFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
           GELIKNGKA +CAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASR LLDQ+MRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVP--------- 600
           LTP+L+EFVKE FVKAGR+EEIERLL MN+ GHA YRP SGPPRISQSQVP         
Sbjct: 541 LTPTLEEFVKEAFVKAGRHEEIERLLNMNKWGHAAYRPLSGPPRISQSQVPPQMGGPLQG 600

Query: 601 ---------------QFRGSYG------PS-----APQMTGPNYFQSGSVQMTRPQQPSS 643
                          Q RG+Y       PS     +PQ TG NYFQSGSVQMT+ Q  S 
Sbjct: 601 PPQMAEPNWRPSINPQARGTYSSPQMSSPSHFQSGSPQTTGSNYFQSGSVQMTKSQHSSF 660

BLAST of Cp4.1LG01g08660 vs. NCBI nr
Match: gi|46095227|gb|AAS80150.1| (ACT11D09.4 [Cucumis melo])

HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 569/679 (83.80%), Positives = 596/679 (87.78%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALS-IGPLSQHLQTPILPSSQSSSLISLLHARSFAFSS 60
           MS YR LLRSLRR+STSPS++ AL+ I PL+ H+     PSSQ+SS ISLL ARSF+FSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSYAPALTTIAPLNHHIP----PSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLV+NNLISGFLNL NL KANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLVKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GK KEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNSDTFNIMVNECFKLGKF+EAVETFRKVGTQPKSRPFAMDVAGYNNII RFCE GMM 
Sbjct: 361 AVNSDTFNIMVNECFKLGKFTEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           DAETFFAELCSKSLSPDVPTHRTLIE+YLK+EQIDD L+VFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
           GELIKNGKA +CAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD SR LLDQ+MRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVP--------- 600
           LTP+L+EFVK+ FVKAGR+EEIERLL MN+ GHA YRP SGPPRISQSQVP         
Sbjct: 541 LTPTLEEFVKDAFVKAGRHEEIERLLNMNKWGHAAYRPPSGPPRISQSQVPPQMGRPLQG 600

Query: 601 ---------------QFRGSYG------PS-----APQMTGPNYFQSGSVQMTRPQQPSS 644
                          Q RGSY       PS      PQMTG NYFQSGS QMT+PQ  S 
Sbjct: 601 PPQMAEPNWRPSINPQARGSYSSPQMSSPSHFQSGPPQMTGSNYFQSGSAQMTKPQHSSF 660

BLAST of Cp4.1LG01g08660 vs. NCBI nr
Match: gi|659118502|ref|XP_008459155.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10270 [Cucumis melo])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 568/679 (83.65%), Positives = 595/679 (87.63%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALS-IGPLSQHLQTPILPSSQSSSLISLLHARSFAFSS 60
           MS YR LLRSLRR+STSPS++ AL+ I PL+ H+     PSSQ+SS ISLL ARSF+FSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSYAPALTTIAPLNHHIP----PSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLV+NNLISGFLNL NL KANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLVKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GK KEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNSDTFNIMVNECFKLGKF+EAVETFRKVGTQPKSRPFAMDVAGYNNII RFCE GMM 
Sbjct: 361 AVNSDTFNIMVNECFKLGKFTEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           DAETFFAELCSKSLSPDVPTHRTLIE+YLK+EQIDD L+VFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
           GELIKNGKA +CAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD SR LLDQ+MRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVP--------- 600
           LTP+L+EFVK+ FVKAGR+EEIERLL MN+ GHA YRP SGPPRISQSQVP         
Sbjct: 541 LTPTLEEFVKDAFVKAGRHEEIERLLNMNKWGHAAYRPPSGPPRISQSQVPPQMGRPLQG 600

Query: 601 ---------------QFRGSYG------PS-----APQMTGPNYFQSGSVQMTRPQQPSS 644
                          Q RGSY       PS      PQ TG NYFQSGS QMT+PQ  S 
Sbjct: 601 PPQMAEPNWRPSINPQARGSYSSPQMSSPSHFQSGPPQTTGSNYFQSGSAQMTKPQHSSF 660

BLAST of Cp4.1LG01g08660 vs. NCBI nr
Match: gi|703111114|ref|XP_010099778.1| (hypothetical protein L484_010965 [Morus notabilis])

HSP 1 Score: 916.4 bits (2367), Expect = 2.8e-263
Identity = 478/645 (74.11%), Positives = 537/645 (83.26%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60
           MS+YRLLLRSLRR ST+P      S+     H+      ++ ++++  L   RSFAFSSA
Sbjct: 1   MSVYRLLLRSLRRPSTTPQTLTTTSL----LHI------TNTTTTIPDLTFRRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQ 120
           EEAAAERRRRKRRLRIEPPL ALRRD ++ PP RDPNAPRLPDSTSALVGPRL+LHNRVQ
Sbjct: 61  EEAAAERRRRKRRLRIEPPLQALRRDPHFHPPPRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFF QSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYNDAIALFQFFFQQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PN+VSYNNLINAHCDEGRVDVGL+++RHI+ANAPFSPS VTYRHLTKGLID+GRIGEAVD
Sbjct: 181 PNIVSYNNLINAHCDEGRVDVGLDVFRHIMANAPFSPSPVTYRHLTKGLIDAGRIGEAVD 240

Query: 241 LLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLVYNNLISGFL+L NLE+ANELF ELKERCLVYDGVV+ATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVYNNLISGFLSLGNLERANELFGELKERCLVYDGVVSATFMDWFF 300

Query: 301 NKGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+G  KEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAW LFDQMLDNHTPPNFQ
Sbjct: 301 NRGMEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWALFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMME 420
           AVNS++F+IMVNECFKL +  +A+ TFRKVGT+  S+PFAMDVAGYNNII R+CE+ M+ 
Sbjct: 361 AVNSESFSIMVNECFKLERIEDAIVTFRKVGTKVNSKPFAMDVAGYNNIITRYCENRMLS 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVF 480
           +AE+ FAELCSKSLSPDVPT+RTLIEAYLK EQID+ L++FNRMV+ GLRVVASFGNRVF
Sbjct: 421 EAESMFAELCSKSLSPDVPTYRTLIEAYLKEEQIDNALQMFNRMVEAGLRVVASFGNRVF 480

Query: 481 GELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIG 540
            ELIKNGKAV+CAQIL KMGE+DPKPD +CY+VVI+GLCNEGA D S  L+++VMRYGIG
Sbjct: 481 DELIKNGKAVDCAQILKKMGEKDPKPDVSCYEVVIKGLCNEGAFDVSLDLVEEVMRYGIG 540

Query: 541 LTPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVPQFRGSYGPS 600
           +TP+LQ+FV E F K GR +EIER+L M+R GH P   T  P      Q P  R      
Sbjct: 541 VTPTLQQFVNEAFAKVGRGQEIERVLSMDRWGHTPPSRTERP-----GQQPLGRA----- 600

Query: 601 APQMTGPNYFQSGSVQMTRPQQPSSGPPPSMEKQQQHSQPPQMAG 645
             QM GP++  SG  QM+    PS G P      Q  S  PQM G
Sbjct: 601 --QMAGPSHAPSGPGQMSSWSNPSFGSPQRTGFHQSPSASPQMTG 623

BLAST of Cp4.1LG01g08660 vs. NCBI nr
Match: gi|694427483|ref|XP_009341369.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 913.7 bits (2360), Expect = 1.8e-262
Identity = 479/647 (74.03%), Positives = 534/647 (82.53%), Query Frame = 1

Query: 1   MSLYRLLLRSLRRTSTSPSHSRALSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60
           MS YRLLLRSLRR ST  +     S+  L  HL  P    + +  L      R+FAFSSA
Sbjct: 1   MSFYRLLLRSLRRPSTPTTLPLTQSLSSL--HLHNP----NHNIPLHLTTPTRTFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQS 120
           EEAAAERRRRKRRLRIEPPL+ALRRD+ PPP RDPNAPRLPD+TSALVGPRL+LHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLNALRRDSRPPPPRDPNAPRLPDTTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVP 180
           LIRAGDLDAAS VAR SVFSNTRPTVFTCNAI+AAMYRAKRY DAIALF FFFNQSNIVP
Sbjct: 121 LIRAGDLDAASEVARRSVFSNTRPTVFTCNAIVAAMYRAKRYNDAIALFHFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           N+VSYNNLIN HCDEGRVDVGLEIYRHIIAN P+SPS VTYRHLTKGL+D+GRIGE VDL
Sbjct: 181 NIVSYNNLINTHCDEGRVDVGLEIYRHIIANCPYSPSPVTYRHLTKGLVDAGRIGEGVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLI GFL+LEN+EKA ELFDELKERCLVYDGVVN+TFMDW FN
Sbjct: 241 LREMLNKGHGADSLVYNNLIKGFLHLENMEKAVELFDELKERCLVYDGVVNSTFMDWLFN 300

Query: 301 KGKAKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           +GK KEAMESYKSLLDRQF+M+PATCNVLLEVLLKHGKK EAW LFDQMLDNHTPPNFQA
Sbjct: 301 QGKEKEAMESYKSLLDRQFRMVPATCNVLLEVLLKHGKKKEAWDLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMED 420
           VNSDTFNIMVNECFKLGK  EA+ TF+KVGT+  S+PF+MDVAGYNNII R+C++GM+ +
Sbjct: 361 VNSDTFNIMVNECFKLGKCDEAIATFKKVGTKVNSKPFSMDVAGYNNIIARYCDNGMLPE 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFG 480
           AET FAEL SKSL+PDV THRTLI+AYLK+E+IDD LK+F+RMV+VGLRVVAS GNRVF 
Sbjct: 421 AETLFAELSSKSLTPDVTTHRTLIDAYLKVERIDDALKIFSRMVEVGLRVVASLGNRVFD 480

Query: 481 ELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIGL 540
           ELIKNGK V+CAQIL KMGE++PKPD + Y+ VI+GLCNEGALD    LL++++RYGIG+
Sbjct: 481 ELIKNGKVVDCAQILKKMGEKEPKPDASFYEAVIKGLCNEGALDTGCDLLEEMVRYGIGV 540

Query: 541 TPSLQEFVKEVFVKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQVPQFRGSYGPSA 600
            P+LQ+FV EVF +AGR EEI+R+L  NR G+ P  P   PPR  Q + PQ  G   PS 
Sbjct: 541 PPALQQFVNEVFGEAGRGEEIQRVLNTNRRGYQPL-PREAPPRQFQPRSPQMAGQEPPSG 600

Query: 601 P-QMTGPNYFQSGSVQMTRPQQPSSGP--PPSMEKQQQHSQPPQMAG 645
           P QM G     S   QMTR  QP SGP  PPS    Q  S P QM G
Sbjct: 601 PSQMAGQYRPSSALPQMTRQYQPPSGPYQPPS-GTYQPPSGPYQMTG 639

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR29_ARATH  1.0e-212  62.99     Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN...
PP273_ARATH  4.5e-91   37.30     Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN...
PP289_ARATH  1.2e-59   37.56     Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidop...
PP290_ARATH  5.0e-58   38.68     Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidop...
PPR28_ARATH  1.0e-39   24.14     Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0LTQ3_CUCSA  0.0e+00   84.07     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G435500 PE=4 SV=1
Q6E438_CUCME      0.0e+00   83.80     ACT11D09.4 OS=Cucumis melo GN=ACT11D09.4 PE=4 SV=1
W9RZT4_9ROSA      2.0e-263  74.11     Uncharacterized protein OS=Morus notabilis GN=L484_010965 PE=4 SV=1
A0A059ATG5_EUCGR  2.7e-260  72.81     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I02789 PE=4 SV=1
M5XCA4_PRUPE      1.1e-256  72.45     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001970mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT1G10270.1  5.8e-214  62.99     glutamine-rich protein 23
AT3G49240.1  2.5e-92   37.30     Pentatricopeptide repeat (PPR) superfamily protein
AT3G60960.1  6.7e-61   37.56     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G60980.1  2.8e-59   38.68     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G28340.1  4.6e-46   34.78     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity  Description
gi|778673836|ref|XP_011650072.1|  0.0e+00   84.07     PREDICTED: pentatricopeptide repeat-containing protein At1g10270 [Cucumis sativu...
gi|46095227|gb|AAS80150.1|        0.0e+00   83.80     ACT11D09.4 [Cucumis melo]
gi|659118502|ref|XP_008459155.1|  0.0e+00   83.65     PREDICTED: pentatricopeptide repeat-containing protein At1g10270 [Cucumis melo]
gi|703111114|ref|XP_010099778.1|  2.8e-263  74.11     hypothetical protein L484_010965 [Morus notabilis]
gi|694427483|ref|XP_009341369.1|  1.8e-262  74.03     PREDICTED: pentatricopeptide repeat-containing protein At1g10270-like isoform X1...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0051301      cell division
biological_process  GO:0009793      embryo development ending in seed dormancy
biological_process  GO:0044699      single-organism process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005634      nucleus
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
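
For downstream use, the GO assignments above are easier to handle when grouped by category. The sketch below is illustrative only: the function name `group_go_terms` is hypothetical, and it assumes each row has the form "category, accession, term name" separated by whitespace, with the term name allowed to contain spaces. The rows are copied from the table above.

```python
from collections import defaultdict

# Rows copied from the GO assignment table (category, accession, term name).
GO_ROWS = """
biological_process GO:0008150 biological_process
biological_process GO:0051301 cell division
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0044699 single-organism process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005634 nucleus
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two runs of whitespace only, so that
        # multi-word term names stay intact.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name.strip()))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, len(terms))
```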

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g08660.1  Cp4.1LG01g08660.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 219..248  score: 0.0085
    coord: 325..352  score: 0.012
    coord: 365..388  score: 0.01
    coord: 255..282  score: 1.
IPR002885  Pentatricopeptide repeat               PFAM      PF12854           PPR_1
    coord: 177..203  score: 5.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 401..448  score: 9.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 325..357  score: 4.9E-5
    coord: 405..437  score: 7.6E-5
    coord: 219..252  score: 0.0017
    coord: 439..468  score: 2.9E-4
    coord: 255..287  score: 1.9E-4
    coord: 183..210  score: 4.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 217..251  score: 9.712
    coord: 287..321  score: 6.358
    coord: 506..540  score: 9.12
    coord: 436..470  score: 9.986
    coord: 362..392  score: 8.276
    coord: 145..180  score: 7.585
    coord: 401..435  score: 10.03
    coord: 471..505  score: 5.02
    coord: 252..282  score: 9.372
    coord: 181..216  score: 9.35
    coord: 322..356  score: 9
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 182..498  score: 2.5
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 36..60    score: 0.0
    coord: 86..568   score:
None       No IPR available                       PANTHER   PTHR24015:SF393   SUBFAMILY NOT NAMED
    coord: 36..60    score: 0.0
    coord: 86..568   score:
None       No IPR available                       unknown   SSF81901          HCP-like
    coord: 155..389  score: 3.14
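
The `coord` entries in the InterPro table are listed in no particular order. As a hedged illustration, the Python sketch below extracts the ranges for the PROFILE PS51375 hits, sorts them by start position, and computes each span length. The helper names `parse_coords` and `span_lengths` are hypothetical, and treating the coordinates as 1-based inclusive amino-acid positions is an assumption, not something this page states. Scores are left out on purpose because several are truncated in the listing.

```python
import re

# "coord:" ranges copied from the PROFILE PS51375 row of the table above.
PS51375_HITS = """
coord: 217..251 coord: 287..321 coord: 506..540 coord: 436..470
coord: 362..392 coord: 145..180 coord: 401..435 coord: 471..505
coord: 252..282 coord: 181..216 coord: 322..356
"""

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) tuples found in the text, sorted by start."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


def span_lengths(coords):
    """Length of each range, assuming inclusive start and end positions."""
    return [end - start + 1 for start, end in coords]


if __name__ == "__main__":
    hits = parse_coords(PS51375_HITS)
    for (start, end), length in zip(hits, span_lengths(hits)):
        print(start, end, length)
```

Because the pattern only looks for `coord: start..end`, the same function works on the raw page text, where coordinates and scores run together on one line.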

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g08660  Cp4.1LG14g00610  Cucurbita pepo (Zucchini)  cpecpeB233
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG01g08660  Melon (DHL92) v3.6.1        cpemedB482
Cp4.1LG01g08660  Cucumber (Chinese Long) v3  cpecucB0491
Cp4.1LG01g08660  Wax gourd                   cpewgoB0489
Cp4.1LG01g08660  Cucumber (Gy14) v1          cgycpeB0267
Cp4.1LG01g08660  Wild cucumber (PI 183967)   cpecpiB392
Cp4.1LG01g08660  Cucumber (Chinese Long) v2  cpecuB393
Cp4.1LG01g08660  Bottle gourd (USVL1VR-Ls)   cpelsiB370
Cp4.1LG01g08660  Melon (DHL92) v3.5.1        cpemeB409
Cp4.1LG01g08660  Cucumber (Gy14) v2          cgybcpeB369