CmaCh04G000780 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G000780
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein
LocationCma_Chr04 : 385256 .. 387537 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCAAAGAAGCCCTCCAATTCTTGGCTCACCTGCGACGAATCTCCCGCTTTCCCTCCCCTTTCACCTGCAACAAGCTTGTCCACTCTCTCATCAACTCCGGCTGCGGCGAGCTCTCCGCCAAAGTGCTTTTCCACTTTCTCTCCAAAGGGTACACTCCTCATTCATCGTCTTTCAATTCCATCATCTCCTTTTTCTGTAAATTAGGGAACATAAAATATGCCGAACGGATTTTGAATTCAATGCCTAGATTTGGGTGCTCGCCTGATATTGTGTCCTACAATTCTTTGTTACATGGGTATTGTGCGAGTTATAAGATTAGAGAGGCTTGTTTTCTTGTTAATAGAGTTCGTGGGGGTTTGTTGAATCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAGGTTTATATGAAGAATGAGGCGTTTATGTATCTGGGTTTGATGTGGAAATCCTGTTTGCCTAATGTTGTTACTTATGGTACACTCGTTGATATGTTCTGTAAGATGGGGGATATGGAAGTGGGTAACAAAATGTTTTTTGATATGATGAAGGTGGGGGTTGTGCCTAATTTGGTTGCTTTTAGCTCCTTGATTGATGGGTATTGCAAGGCTGGGAGTTTGAATGTTGCATTTGGATACTTAGAGAGAATGCAACAATGCTCGGTTCAGCCGAACGAGTTCACATATTCAACGTTGATTGATGGTTGTTGCAAGCAGGGGATGTTGGAAAGAGCTGACTTTTTGTTTGAAGAGATGTTGAGTGTTGGTATTCTGCCTAATTGTACGGTTCATACTTCGATAATAGACGGACATTTTAAGAAGGGAAATGTAGACAATGCGTTGAAGTATATAAACAGGATGTTCGATCGAGAAATAGAACTCGATCTAACAGCATATACGGTAGTTATCGCGGGCTTCCGTAGAGTTGGTAGGTTGCATAAGGCAATGGAAGCTGCAGAAAATGTGGTGAAGAATGGATTACTTCCTGATAGGATAATACTAACAGCTATTATGGATGTGCATTTCAAAGCTGGAAACTTAAAAGAAGCTTTGAATGCGTACAGAATATTACTCGCTAGGGGTTTTGAGCCTGATACCGTGACTCTATCCGCTCTAATAGATGGTCTATGCAAGAACGGATACTTGCAGGAGGCTAGACGGTACGTGGTCAAAGAAAAGGCCAATGAAATTCTATATACAGTGCTTATAGATGCACTATGTAAGGAGGGTAATTTAGATGAAGCTGAGAGAACCATTAAGGAAATGTGCGAGGCAGGGTTTGTTCCTGATAAATATGTGTATACTTCTTGGATTGCAGAGCTTTGCAAGCAAGGAAATTTGCTCAAGGCTTTCACGGTCAAGAAAAGGATGGTTCAAGAAAACATTGAACCTGATTTATTAACCTACAGTTCCCTGATTGGTGGTTTGGCGGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATGCAGGAATCACTCCAGATTCTGTTGCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAATGTAGTTGCGATTTCAGGTCTACACGATGAAATGAGAACGAGAGGAATTGTTATTGAACATTAATACACGAGATATCGACGCCGGGTCTAATCCTTCAATTTATACCGGAACCTTCCATACGAGAAGCCAAGAGGCATGTGATTGGCATCAGATGATATCGAAAATAGAGAGATTCCAAGTCATATATTCCTATTCCTCCATATATTGCTTTGCTTATTGCAGGTTTCAAATGCCCCCAAAGGTTTTAAATCATGCTGTTCACATGTGGGTTATCGTCATAAGGTGCTGTTTCGATATCTACTATTGACCACATGTAAACAAAATCGAACTCTAGACTTCTACGCAACCAATTTGCAACTCTATGATCTCGTGCTCTCTGTTTGGCAGATTTTGTGCTTCTTCTCCTCTTGCTGAGTAGCTGCAACATGGGATCTGGGAATAGACGTGACATTGCTTGAGATGGCACCTGATGGTTGCTGTAGTGTTGGAGAATTTCAAGTCGTCAGGCTGCTACTGAAGAAAAATACCTATTCTTGCAAGCCTGAGCTTTCTATATCTTTATTGAATCCATTGAAGATTTGTAGTAAACCCAAATAAACACCACACACACATAGGAATTGTTGCTTATGAAGAATCTCTCAATCGAAATAGAATTATCCATTCAAGGTTGATCTTTGTTTTTCAGCATATATACATTTATAATACACACAATTGATTGTTCC

mRNA sequence

ATGGTCAAAGAAGCCCTCCAATTCTTGGCTCACCTGCGACGAATCTCCCGCTTTCCCTCCCCTTTCACCTGCAACAAGCTTGTCCACTCTCTCATCAACTCCGGCTGCGGCGAGCTCTCCGCCAAAGTGCTTTTCCACTTTCTCTCCAAAGGGTACACTCCTCATTCATCGTCTTTCAATTCCATCATCTCCTTTTTCTGTAAATTAGGGAACATAAAATATGCCGAACGGATTTTGAATTCAATGCCTAGATTTGGGTGCTCGCCTGATATTGTGTCCTACAATTCTTTGTTACATGGGTATTGTGCGAGTTATAAGATTAGAGAGGCTTGTTTTCTTGTTAATAGAGTTCGTGGGGGTTTGTTGAATCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAGGTTTATATGAAGAATGAGGCGTTTATGTATCTGGGTTTGATGTGGAAATCCTGTTTGCCTAATGTTGTTACTTATGGTACACTCGTTGATATGTTCTGTAAGATGGGGGATATGGAAGTGGGTAACAAAATGTTTTTTGATATGATGAAGGTGGGGGTTGTGCCTAATTTGGTTGCTTTTAGCTCCTTGATTGATGGGTATTGCAAGGCTGGGAGTTTGAATGTTGCATTTGGATACTTAGAGAGAATGCAACAATGCTCGGTTCAGCCGAACGAGTTCACATATTCAACGTTGATTGATGGTTGTTGCAAGCAGGGGATGTTGGAAAGAGCTGACTTTTTGTTTGAAGAGATGTTGAGTGTTGGTATTCTGCCTAATTGTACGGTTCATACTTCGATAATAGACGGACATTTTAAGAAGGGAAATGTAGACAATGCGTTGAAGTATATAAACAGGATGTTCGATCGAGAAATAGAACTCGATCTAACAGCATATACGGTAGTTATCGCGGGCTTCCGTAGAGTTGGTAGGTTGCATAAGGCAATGGAAGCTGCAGAAAATGTGGTGAAGAATGGATTACTTCCTGATAGGATAATACTAACAGCTATTATGGATGTGCATTTCAAAGCTGGAAACTTAAAAGAAGCTTTGAATGCGTACAGAATATTACTCGCTAGGGGTTTTGAGCCTGATACCGTGACTCTATCCGCTCTAATAGATGGTCTATGCAAGAACGGATACTTGCAGGAGGCTAGACGGTACGTGGTCAAAGAAAAGGCCAATGAAATTCTATATACAGTGCTTATAGATGCACTATGTAAGGAGGGTAATTTAGATGAAGCTGAGAGAACCATTAAGGAAATGTGCGAGGCAGGGTTTGTTCCTGATAAATATGTGTATACTTCTTGGATTGCAGAGCTTTGCAAGCAAGGAAATTTGCTCAAGGCTTTCACGGTCAAGAAAAGGATGGTTCAAGAAAACATTGAACCTGATTTATTAACCTACAGTTCCCTGATTGGTGGTTTGGCGGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATGCAGGAATCACTCCAGATTCTGTTGCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAATGTAGTTGCGATTTCAGATTTTGTGCTTCTTCTCCTCTTGCTGAGTAGCTGCAACATGGGATCTGGGAATAGACGTGACATTGCTTGAGATGGCACCTGATGGTTGCTGTAGTGTTGGAGAATTTCAAGTCGTCAGGCTGCTACTGAAGAAAAATACCTATTCTTGCAAGCCTGAGCTTTCTATATCTTTATTGAATCCATTGAAGATTTGTAGTAAACCCAAATAAACACCACACACACATAGGAATTGTTGCTTATGAAGAATCTCTCAATCGAAATAGAATTATCCATTCAAGGTTGATCTTTGTTTTTCAGCATATATACATTTATAATACACACAATTGATTGTTCC

Coding sequence (CDS)

ATGGTCAAAGAAGCCCTCCAATTCTTGGCTCACCTGCGACGAATCTCCCGCTTTCCCTCCCCTTTCACCTGCAACAAGCTTGTCCACTCTCTCATCAACTCCGGCTGCGGCGAGCTCTCCGCCAAAGTGCTTTTCCACTTTCTCTCCAAAGGGTACACTCCTCATTCATCGTCTTTCAATTCCATCATCTCCTTTTTCTGTAAATTAGGGAACATAAAATATGCCGAACGGATTTTGAATTCAATGCCTAGATTTGGGTGCTCGCCTGATATTGTGTCCTACAATTCTTTGTTACATGGGTATTGTGCGAGTTATAAGATTAGAGAGGCTTGTTTTCTTGTTAATAGAGTTCGTGGGGGTTTGTTGAATCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAGGTTTATATGAAGAATGAGGCGTTTATGTATCTGGGTTTGATGTGGAAATCCTGTTTGCCTAATGTTGTTACTTATGGTACACTCGTTGATATGTTCTGTAAGATGGGGGATATGGAAGTGGGTAACAAAATGTTTTTTGATATGATGAAGGTGGGGGTTGTGCCTAATTTGGTTGCTTTTAGCTCCTTGATTGATGGGTATTGCAAGGCTGGGAGTTTGAATGTTGCATTTGGATACTTAGAGAGAATGCAACAATGCTCGGTTCAGCCGAACGAGTTCACATATTCAACGTTGATTGATGGTTGTTGCAAGCAGGGGATGTTGGAAAGAGCTGACTTTTTGTTTGAAGAGATGTTGAGTGTTGGTATTCTGCCTAATTGTACGGTTCATACTTCGATAATAGACGGACATTTTAAGAAGGGAAATGTAGACAATGCGTTGAAGTATATAAACAGGATGTTCGATCGAGAAATAGAACTCGATCTAACAGCATATACGGTAGTTATCGCGGGCTTCCGTAGAGTTGGTAGGTTGCATAAGGCAATGGAAGCTGCAGAAAATGTGGTGAAGAATGGATTACTTCCTGATAGGATAATACTAACAGCTATTATGGATGTGCATTTCAAAGCTGGAAACTTAAAAGAAGCTTTGAATGCGTACAGAATATTACTCGCTAGGGGTTTTGAGCCTGATACCGTGACTCTATCCGCTCTAATAGATGGTCTATGCAAGAACGGATACTTGCAGGAGGCTAGACGGTACGTGGTCAAAGAAAAGGCCAATGAAATTCTATATACAGTGCTTATAGATGCACTATGTAAGGAGGGTAATTTAGATGAAGCTGAGAGAACCATTAAGGAAATGTGCGAGGCAGGGTTTGTTCCTGATAAATATGTGTATACTTCTTGGATTGCAGAGCTTTGCAAGCAAGGAAATTTGCTCAAGGCTTTCACGGTCAAGAAAAGGATGGTTCAAGAAAACATTGAACCTGATTTATTAACCTACAGTTCCCTGATTGGTGGTTTGGCGGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATGCAGGAATCACTCCAGATTCTGTTGCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAATGTAGTTGCGATTTCAGATTTTGTGCTTCTTCTCCTCTTGCTGAGTAGCTGCAACATGGGATCTGGGAATAGACGTGACATTGCTTGA

Protein sequence

MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFNSIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGGLLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAISDFVLLLLLLSSCNMGSGNRRDIA
BLAST of CmaCh04G000780 vs. Swiss-Prot
Match: PP141_ARATH (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 3.1e-170
Identity = 282/532 (53.01%), Postives = 387/532 (72.74%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MV+EALQFL+ LR+ S  P PFTCNK +H LINS CG LS K L + +S+GYTPH SSFN
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVR-- 120
           S++SF CKLG +K+AE I++SMPRFGC PD++SYNSL+ G+C +  IR A  ++  +R  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 121 -GGLLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDME 180
            G +  PD+V FN LFNGF+K+ M +E F+Y+G+M K C PNVVTY T +D FCK G+++
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQ 180

Query: 181 VGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLI 240
           +  K F  M +  + PN+V F+ LIDGYCKAG L VA    + M++  +  N  TY+ LI
Sbjct: 181 LALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTALI 240

Query: 241 DGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIE 300
           DG CK+G ++RA+ ++  M+   + PN  V+T+IIDG F++G+ DNA+K++ +M ++ + 
Sbjct: 241 DGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGMR 300

Query: 301 LDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNA 360
           LD+TAY V+I+G    G+L +A E  E++ K+ L+PD +I T +M+ +FK+G +K A+N 
Sbjct: 301 LDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVNM 360

Query: 361 YRILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLD 420
           Y  L+ RGFEPD V LS +IDG+ KNG L EA  Y   EKAN+++YTVLIDALCKEG+  
Sbjct: 361 YHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIEKANDVMYTVLIDALCKEGDFI 420

Query: 421 EAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLI 480
           E ER   ++ EAG VPDK++YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI
Sbjct: 421 EVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLLAYTTLI 480

Query: 481 GGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAISDFVL 530
            GLA KGLM+EA+QVFD+MLN+GI+PDS  +D+LIR Y  +GN+ A SD +L
Sbjct: 481 YGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLL 532

BLAST of CmaCh04G000780 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 1.3e-80
Identity = 172/548 (31.39%), Postives = 285/548 (52.01%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           M++EA+Q  + ++R   FP   +CN L+H     G  +   +     +  G  P   ++N
Sbjct: 207 MLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYN 266

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
            +I   CK G+++ A  +   M   G  PD V+YNS++ G+    ++ +       ++  
Sbjct: 267 IMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDM 326

Query: 121 LLNPDLVMFNILFNGFAKV-YMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVG 180
              PD++ +N L N F K   +      Y  +      PNVV+Y TLVD FCK G M+  
Sbjct: 327 CCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQA 386

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
            K + DM +VG+VPN   ++SLID  CK G+L+ AF     M Q  V+ N  TY+ LIDG
Sbjct: 387 IKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDG 446

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            C    ++ A+ LF +M + G++PN   + ++I G  K  N+D AL+ +N +  R I+ D
Sbjct: 447 LCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPD 506

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           L  Y   I G   + ++  A      + + G+  + +I T +MD +FK+GN  E L+   
Sbjct: 507 LLLYGTFIWGLCSLEKIEAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLD 566

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVK------EKANEILYTVLIDALCKE 420
            +     E   VT   LIDGLCKN  + +A  Y  +       +AN  ++T +ID LCK+
Sbjct: 567 EMKELDIEVTVVTFCVLIDGLCKNKLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKD 626

Query: 421 GNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTY 480
             ++ A    ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y
Sbjct: 627 NQVEAATTLFEQMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAY 686

Query: 481 SSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQG---NVVAISDFVLL 539
           +SL+ GL+    + +A+   ++M+  GI PD V    +++ ++  G     V +  +++ 
Sbjct: 687 TSLVWGLSHCNQLQKARSFLEEMIGEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMK 746

BLAST of CmaCh04G000780 vs. Swiss-Prot
Match: PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 6.9e-69
Identity = 155/521 (29.75%), Postives = 272/521 (52.21%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           +V E  Q    +      P+ +T NK+V+     G  E + + +   +  G  P   ++ 
Sbjct: 198 LVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYT 257

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           S+I  +C+  ++  A ++ N MP  GC  + V+Y  L+HG C + +I EA  L  +++  
Sbjct: 258 SLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDD 317

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCL-PNVVTYGTLVDMFCKMGDMEVG 180
              P +  + +L         K+EA   +  M ++ + PN+ TY  L+D  C     E  
Sbjct: 318 ECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKA 377

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
            ++   M++ G++PN++ +++LI+GYCK G +  A   +E M+   + PN  TY+ LI G
Sbjct: 378 RELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKG 437

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            CK  +  +A  +  +ML   +LP+   + S+IDG  + GN D+A + ++ M DR +  D
Sbjct: 438 YCKSNV-HKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPD 497

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
              YT +I    +  R+ +A +  +++ + G+ P+ ++ TA++D + KAG + EA     
Sbjct: 498 QWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLE 557

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEI-------LYTVLIDALCK 420
            +L++   P+++T +ALI GLC +G L+EA   +++EK  +I         T+LI  L K
Sbjct: 558 KMLSKNCLPNSLTFNALIHGLCADGKLKEAT--LLEEKMVKIGLQPTVSTDTILIHRLLK 617

Query: 421 EGNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLT 480
           +G+ D A    ++M  +G  PD + YT++I   C++G LL A  +  +M +  + PDL T
Sbjct: 618 DGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFT 677

Query: 481 YSSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIR 514
           YSSLI G  + G    A  V   M + G  P    +  LI+
Sbjct: 678 YSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIK 715

BLAST of CmaCh04G000780 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 2.1e-65
Identity = 158/529 (29.87%), Postives = 275/529 (51.98%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRF-PSPFTCNKLVHSLINSGCGELSAKVLF-HFLSKGYTPHSSS 60
           ++ +AL  + HL +   F P   + N ++ + I S      A+ +F   L    +P+  +
Sbjct: 149 LIDKALS-IVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFT 208

Query: 61  FNSIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVR 120
           +N +I  FC  GNI  A  + + M   GC P++V+YN+L+ GYC   KI +   L+  + 
Sbjct: 209 YNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMA 268

Query: 121 GGLLNPDLVMFNILFNGFAKV-YMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDME 180
              L P+L+ +N++ NG  +   MK  +F+   +  +    + VTY TL+  +CK G+  
Sbjct: 269 LKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFH 328

Query: 181 VGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLI 240
               M  +M++ G+ P+++ ++SLI   CKAG++N A  +L++M+   + PNE TY+TL+
Sbjct: 329 QALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLV 388

Query: 241 DGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIE 300
           DG  ++G +  A  +  EM   G  P+   + ++I+GH   G +++A+  +  M ++ + 
Sbjct: 389 DGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLS 448

Query: 301 LDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNA 360
            D+ +Y+ V++GF R   + +A+     +V+ G+ PD I  ++++    +    KEA + 
Sbjct: 449 PDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDL 508

Query: 361 YRILLARGFEPDTVTLSALIDGLCKNGYLQEARRY--VVKEKA---NEILYTVLIDALCK 420
           Y  +L  G  PD  T +ALI+  C  G L++A +    + EK    + + Y+VLI+ L K
Sbjct: 509 YEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNK 568

Query: 421 EGNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLT 480
           +    EA+R + ++     VP    Y + I E C                  NIE    +
Sbjct: 569 QSRTREAKRLLLKLFYEESVPSDVTYHTLI-ENCS-----------------NIE--FKS 628

Query: 481 YSSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNV 522
             SLI G   KG+M EA QVF+ ML     PD  AY+I+I G+   G++
Sbjct: 629 VVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDI 656

BLAST of CmaCh04G000780 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 250.4 bits (638), Expect = 4.6e-65
Identity = 148/528 (28.03%), Postives = 261/528 (49.43%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           M++++L+    +      PS +TCN ++ S++ SG        L   L +   P  ++FN
Sbjct: 138 MIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFN 197

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
            +I+  C  G+ + +  ++  M + G +P IV+YN++LH YC   + + A  L++ ++  
Sbjct: 198 ILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSK 257

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCL-PNVVTYGTLVDMFCKMGDMEVG 180
            ++ D+  +N+L +   +     + ++ L  M K  + PN VTY TL++ F   G + + 
Sbjct: 258 GVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIA 317

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
           +++  +M+  G+ PN V F++LIDG+   G+   A      M+   + P+E +Y  L+DG
Sbjct: 318 SQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDG 377

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            CK    + A   +  M   G+      +T +IDG  K G +D A+  +N M    I+ D
Sbjct: 378 LCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPD 437

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           +  Y+ +I GF +VGR   A E    + + GL P+ II + ++    + G LKEA+  Y 
Sbjct: 438 IVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYE 497

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEIL-----YTVLIDALCKEG 420
            ++  G   D  T + L+  LCK G + EA  ++    ++ IL     +  LI+     G
Sbjct: 498 AMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSG 557

Query: 421 NLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYS 480
              +A     EM + G  P  + Y S +  LCK G+L +A    K +       D + Y+
Sbjct: 558 EGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYN 617

Query: 481 SLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVV 523
           +L+  + + G + +A  +F +M+   I PDS  Y  LI G   +G  V
Sbjct: 618 TLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTV 665

BLAST of CmaCh04G000780 vs. TrEMBL
Match: A0A0A0KPG7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G021290 PE=4 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.6e-197
Identity = 336/398 (84.42%), Postives = 368/398 (92.46%), Query Frame = 1

Query: 128 MFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGNKMFFDMM 187
           MFNILFNGFAKVYMKNEAFMY GLMWK CLP++VTYGT VDMFCKMGDM++GN+MF DMM
Sbjct: 1   MFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMGNRMFLDMM 60

Query: 188 KVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGCCKQGMLE 247
           KVG+VPNLV FSSLIDGYCKAGSL+VAF Y ERM++CSV+PNEFTYSTLIDGC K GML 
Sbjct: 61  KVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDGCSKHGMLA 120

Query: 248 RADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDLTAYTVVI 307
           RAD LFE+MLS  ILPNCTV+TSIIDGHFKKGNVD+A+KYIN+MFDR+I+LDLTAYTV+I
Sbjct: 121 RADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVII 180

Query: 308 AGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRILLARGFE 367
           +GF RVGR  K+MEAAE V KNGLLPDRIILTAIMDVHFKAGN+KEALNAY+ILLA+GFE
Sbjct: 181 SGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYKILLAKGFE 240

Query: 368 PDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAERTIKEMC 427
            D VTLSAL+DGL K+GYLQEARRY+VKE ANEILYTV IDALCKEGNLD+AE+ IKEM 
Sbjct: 241 ADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDAEKMIKEMS 300

Query: 428 EAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGLAEKGLMI 487
           EAGFVPDK+VYTSWIAELCKQGNLLKAF VKKRMVQE++EPDLLTYSSLIGGLAEKGLMI
Sbjct: 301 EAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGGLAEKGLMI 360

Query: 488 EAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           EAKQVFDDMLN GITPD V+YDILIRGYHNQGN  AIS
Sbjct: 361 EAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAIS 398

BLAST of CmaCh04G000780 vs. TrEMBL
Match: W9R0T6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005539 PE=4 SV=1)

HSP 1 Score: 696.8 bits (1797), Expect = 2.1e-197
Identity = 324/525 (61.71%), Postives = 411/525 (78.29%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MV+E LQF AHLRR SRFP+PFT NKL+H L ++ CG+LS K+L HFL+K Y PH SSFN
Sbjct: 1   MVRETLQFFAHLRRTSRFPTPFTFNKLLHHLTSANCGDLSLKILSHFLTKRYVPHPSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           S++SF CK G +++A  +++SMP+FG SPD+V+YN L+ G+C +  + EACF+V+++R G
Sbjct: 61  SVLSFLCKSGQLRFARNVVDSMPKFGFSPDVVTYNCLVDGFCKNLDVEEACFVVSKMRMG 120

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGN 180
              PDLV FN LFNGF+K  M+ EAF+Y+GLMWK CLPNVVTY T VDMFCK+G+ ++G 
Sbjct: 121 KCGPDLVTFNTLFNGFSKTRMEREAFVYMGLMWKCCLPNVVTYSTFVDMFCKVGNFDLGY 180

Query: 181 KMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGC 240
           K+F DM+  GV+PN V F++L+DGYCKAG+L++AF     M++ SV PN  TY+ L+DG 
Sbjct: 181 KVFRDMVNAGVLPNSVVFTALLDGYCKAGNLDIAFELFVEMKRSSVSPNVVTYAALVDGF 240

Query: 241 CKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDL 300
           CK+G LERA+ LF +ML  G+ PN  V+TSIIDGHF KGNVD+A+KY+ +M D+ + LD+
Sbjct: 241 CKRGALERAESLFSKMLEDGVEPNSVVYTSIIDGHFVKGNVDDAVKYMTKMCDQGLRLDM 300

Query: 301 TAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRI 360
           TAY VVI GF + GRL KAME   ++ ++GL PD+I+LT +MD HFK+G+LK AL  YR 
Sbjct: 301 TAYEVVIRGFCKNGRLDKAMEVMRSMTESGLFPDKIMLTTVMDAHFKSGDLKRALEVYRE 360

Query: 361 LLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAE 420
           +L RGFEPD VTLS+++DGL K G+LQEAR Y+ +EKANEI YTVLID +CKEG+  E E
Sbjct: 361 ILFRGFEPDIVTLSSIMDGLSKKGHLQEARGYLCREKANEISYTVLIDGMCKEGHFGEVE 420

Query: 421 RTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGL 480
              +EM EAGFVPDKY YTSWIA LCKQG L++AF +K RM QE IEPDLLTYSSLI GL
Sbjct: 421 MVFREMSEAGFVPDKYAYTSWIAGLCKQGKLVEAFVLKNRMAQEGIEPDLLTYSSLIFGL 480

Query: 481 AEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           A KGLM+EAKQVFDDML  GI+PDS  YDILIRGY  +GN VA+S
Sbjct: 481 ANKGLMVEAKQVFDDMLKRGISPDSAVYDILIRGYLKEGNEVAVS 525

BLAST of CmaCh04G000780 vs. TrEMBL
Match: A0A061EFM6_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011056 PE=4 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 5.0e-175
Identity = 300/523 (57.36%), Postives = 374/523 (71.51%), Query Frame = 1

Query: 6   LQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFNSIISF 65
           LQF + L++ S++P PF  NKL+H L  S CG LS K+L  FLSKGYTPH SSFNS ISF
Sbjct: 43  LQFCSQLKKTSKYPDPFFFNKLLHRLTASNCGTLSLKLLSFFLSKGYTPHPSSFNSSISF 102

Query: 66  FCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGGLLNPD 125
            CKLG   YA++++NSMP +GC PDI +YNSL+ GY     + +AC + + +R G   PD
Sbjct: 103 LCKLGRSDYAQKLVNSMPFYGCEPDIATYNSLIDGYFKCGDVVKACLVFDDIRAGKCKPD 162

Query: 126 LVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGNKMFFD 185
           LV FN LFNGF K+    E F+Y+G MWK CLPNV+TY T +DMFCK+GD+++G K+F D
Sbjct: 163 LVTFNALFNGFCKMRRNKEVFVYMGYMWKCCLPNVITYSTWIDMFCKLGDLKMGFKVFRD 222

Query: 186 MMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGCCKQGM 245
           M K GV  N + F+ LIDG CK G   +AF     M+Q  +  N  TY+ LIDG CK+GM
Sbjct: 223 MKKDGVSLNSIVFTCLIDGCCKVGDFELAFELYWEMKQTKLALNVVTYTALIDGLCKKGM 282

Query: 246 LERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDLTAYTV 305
           LERA+ LF  ML   + PN  V+TSIIDGHFKK NV +ALKY+ +M  + I+ D+  Y V
Sbjct: 283 LERAECLFLRMLKDKVQPNSVVYTSIIDGHFKKRNVSDALKYLAKMCVQGIKFDMALYGV 342

Query: 306 VIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRILLARG 365
           +I+G    GR  KA +  EN+VK+GLLPD+++LT +MD HFKAGN+K AL+ Y  LLARG
Sbjct: 343 IISGLSNCGRFDKASKFMENMVKSGLLPDKLLLTTMMDAHFKAGNVKAALDVYGELLARG 402

Query: 366 FEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAERTIKE 425
           F+PD V LS+L+DGLCK G L EA  Y  +EKANEI YTVLID L K+G+  E  R  +E
Sbjct: 403 FDPDVVVLSSLMDGLCKRGCLHEAESYFCREKANEISYTVLIDGLAKKGDFTEVNRVFRE 462

Query: 426 MCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGLAEKGL 485
           M EAGF PDKYVYTSWIA LC+QGNL++AF +K RMVQE  +PDLLTYSSLI GLA KGL
Sbjct: 463 MLEAGFTPDKYVYTSWIAGLCEQGNLIEAFRLKNRMVQEGFQPDLLTYSSLIFGLANKGL 522

Query: 486 MIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAISDFV 529
           MIEAKQ+F DML   ITPD+  YDI+IRGY  Q N  A+S+ +
Sbjct: 523 MIEAKQIFQDMLKRKITPDAAVYDIMIRGYLQQNNEAAVSELL 565

BLAST of CmaCh04G000780 vs. TrEMBL
Match: A0A0D2Q7W0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G171900 PE=4 SV=1)

HSP 1 Score: 619.4 bits (1596), Expect = 4.2e-174
Identity = 296/520 (56.92%), Postives = 375/520 (72.12%), Query Frame = 1

Query: 6   LQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFNSIISF 65
           LQF + L++ S++P P++ NKL+H L  S CG LS K+L  FLSKGYTPH SSFNS ISF
Sbjct: 40  LQFFSDLKKTSKYPDPYSFNKLLHKLTASDCGALSLKLLSFFLSKGYTPHPSSFNSTISF 99

Query: 66  FCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGGLLNPD 125
           FCKLG   YA++++N MP +GC PDI +YNSL+ GY    ++ +AC +VN +R     PD
Sbjct: 100 FCKLGQSSYAQKLVNLMPLYGCEPDIATYNSLIDGYFKCGEVVKACLIVNEIRVDKCKPD 159

Query: 126 LVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGNKMFFD 185
           LV FN+LFNGF K+  K EAF+Y+GLMWK CLPNVVTY T +DMFCK+GD+ +G K+F D
Sbjct: 160 LVTFNVLFNGFCKMRKKKEAFVYMGLMWKCCLPNVVTYSTWIDMFCKVGDLNMGVKVFRD 219

Query: 186 MMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGCCKQGM 245
           M K  V+ N + F+ LIDGYCK G    AF   + M+   +  N  TY+ LIDG CK+GM
Sbjct: 220 MKKDKVLLNSIVFTCLIDGYCKVGDFEAAFELCKEMKLVKLAVNVVTYTALIDGLCKKGM 279

Query: 246 LERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDLTAYTV 305
           LERA+ LF  ML   + PN  V+TSIID HFKK NV +ALKY+ +M+ + +E D+ AY V
Sbjct: 280 LERAECLFFRMLKDKVKPNSVVYTSIIDAHFKKSNVTDALKYLGKMYVQGLEFDMAAYGV 339

Query: 306 VIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRILLARG 365
           +IAG    G   KA    EN+VK+GL PD+++LT IMD HFKAGN+K ALN Y  +LARG
Sbjct: 340 IIAGLCNTGMFDKASIYMENMVKSGLRPDKLMLTTIMDAHFKAGNVKAALNVYGEILARG 399

Query: 366 FEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAERTIKE 425
           F+PD + L++L+DGLCK+G L EA  Y  + KAN+I YTVLI+ L K+G+  E  R  +E
Sbjct: 400 FDPDVIVLTSLMDGLCKHGCLNEAESYFCRGKANKISYTVLINGLAKKGDFTELNRVFRE 459

Query: 426 MCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGLAEKGL 485
           M EAGF  DKYVYTSWIA LC+QGNL++AF VK RMVQE  +PDLLTYSSLI GLA KGL
Sbjct: 460 MLEAGFTADKYVYTSWIAGLCEQGNLIEAFRVKNRMVQEGFQPDLLTYSSLIFGLANKGL 519

Query: 486 MIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           MIEAKQ+F+DML   ITPD+  Y+I+IRGY  Q N  A++
Sbjct: 520 MIEAKQIFEDMLKRQITPDAAVYEIMIRGYLRQDNEAAVT 559

BLAST of CmaCh04G000780 vs. TrEMBL
Match: V4KT02_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10003932mg PE=4 SV=1)

HSP 1 Score: 609.8 bits (1571), Expect = 3.3e-171
Identity = 290/532 (54.51%), Postives = 386/532 (72.56%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MV+EALQF++ LR+ S  P P TCNK +H LINS CG LS K L + LS+GYTPH SSFN
Sbjct: 1   MVREALQFISRLRKSSNLPDPITCNKYIHQLINSNCGVLSLKFLAYLLSRGYTPHRSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVR-- 120
           S+ SF CKLG +K+AE I++SMPRFGC PD+VSYNSL+ G+C + +IR A  ++ R+R  
Sbjct: 61  SVASFVCKLGQVKFAEYIVHSMPRFGCLPDVVSYNSLIDGHCRNGEIRSASLVLKRLRAS 120

Query: 121 -GGLLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDME 180
            G +  PD+V FN LFNGF+K+ M  E F+Y+G+M K C PNVVTY T +D FCK G+++
Sbjct: 121 HGFMCRPDIVSFNSLFNGFSKMKMLKEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQ 180

Query: 181 VGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLI 240
           +  K F  M K  + PN+V F+ LIDGYCKAG L VA    E M++  +  N  TY+ L+
Sbjct: 181 LALKSFNCMKKDALSPNVVTFTCLIDGYCKAGDLEVAVSLYEDMRRVQMSLNVVTYTALL 240

Query: 241 DGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIE 300
           DG CK+G +ERA+ L+  M    + PN  V+T+IIDG+F KG+ DNA+K++ +M ++ + 
Sbjct: 241 DGFCKRGEMERAEGLYSRMHEDKVEPNSLVYTTIIDGYFHKGDADNAMKFLAKMLNQGMR 300

Query: 301 LDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNA 360
           LD+ AY V+I+G    G+L +A E  E++ K GL+PD++ILT +MD +FK+G +K ALN 
Sbjct: 301 LDIAAYGVIISGLCGNGKLKEATEVVEDMEKGGLVPDKMILTTMMDAYFKSGLMKAALNV 360

Query: 361 YRILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLD 420
           YR  + RGFEPD V L+ LIDGL KNG L EA  Y  KEKAN+++YTVLIDALCKEG+  
Sbjct: 361 YREFIERGFEPDVVALTTLIDGLAKNGQLHEAIAYFCKEKANDVMYTVLIDALCKEGDFI 420

Query: 421 EAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLI 480
           E ER   ++ EAG VPDK++YTSWIA LCKQGNL+ AF +K +MVQE ++ DLLTY++LI
Sbjct: 421 EVERFFSKILEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTKMVQEGLKLDLLTYTTLI 480

Query: 481 GGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAISDFVL 530
            GLA KGLM+EA+QVFD+ML +G +PDS  +D+LIR Y  +GN+ A SD  L
Sbjct: 481 NGLASKGLMVEARQVFDEMLRSGTSPDSAVFDLLIRAYEKEGNMTAASDLFL 532

BLAST of CmaCh04G000780 vs. TAIR10
Match: AT2G01740.1 (AT2G01740.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 599.7 bits (1545), Expect = 1.8e-171
Identity = 282/532 (53.01%), Postives = 387/532 (72.74%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MV+EALQFL+ LR+ S  P PFTCNK +H LINS CG LS K L + +S+GYTPH SSFN
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVR-- 120
           S++SF CKLG +K+AE I++SMPRFGC PD++SYNSL+ G+C +  IR A  ++  +R  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 121 -GGLLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDME 180
            G +  PD+V FN LFNGF+K+ M +E F+Y+G+M K C PNVVTY T +D FCK G+++
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQ 180

Query: 181 VGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLI 240
           +  K F  M +  + PN+V F+ LIDGYCKAG L VA    + M++  +  N  TY+ LI
Sbjct: 181 LALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTALI 240

Query: 241 DGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIE 300
           DG CK+G ++RA+ ++  M+   + PN  V+T+IIDG F++G+ DNA+K++ +M ++ + 
Sbjct: 241 DGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGMR 300

Query: 301 LDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNA 360
           LD+TAY V+I+G    G+L +A E  E++ K+ L+PD +I T +M+ +FK+G +K A+N 
Sbjct: 301 LDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVNM 360

Query: 361 YRILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLD 420
           Y  L+ RGFEPD V LS +IDG+ KNG L EA  Y   EKAN+++YTVLIDALCKEG+  
Sbjct: 361 YHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIEKANDVMYTVLIDALCKEGDFI 420

Query: 421 EAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLI 480
           E ER   ++ EAG VPDK++YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI
Sbjct: 421 EVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLLAYTTLI 480

Query: 481 GGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAISDFVL 530
            GLA KGLM+EA+QVFD+MLN+GI+PDS  +D+LIR Y  +GN+ A SD +L
Sbjct: 481 YGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLL 532

BLAST of CmaCh04G000780 vs. TAIR10
Match: AT2G02150.1 (AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 302.0 bits (772), Expect = 7.6e-82
Identity = 172/548 (31.39%), Postives = 285/548 (52.01%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           M++EA+Q  + ++R   FP   +CN L+H     G  +   +     +  G  P   ++N
Sbjct: 207 MLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYN 266

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
            +I   CK G+++ A  +   M   G  PD V+YNS++ G+    ++ +       ++  
Sbjct: 267 IMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDM 326

Query: 121 LLNPDLVMFNILFNGFAKV-YMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVG 180
              PD++ +N L N F K   +      Y  +      PNVV+Y TLVD FCK G M+  
Sbjct: 327 CCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQA 386

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
            K + DM +VG+VPN   ++SLID  CK G+L+ AF     M Q  V+ N  TY+ LIDG
Sbjct: 387 IKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDG 446

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            C    ++ A+ LF +M + G++PN   + ++I G  K  N+D AL+ +N +  R I+ D
Sbjct: 447 LCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPD 506

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           L  Y   I G   + ++  A      + + G+  + +I T +MD +FK+GN  E L+   
Sbjct: 507 LLLYGTFIWGLCSLEKIEAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLD 566

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVK------EKANEILYTVLIDALCKE 420
            +     E   VT   LIDGLCKN  + +A  Y  +       +AN  ++T +ID LCK+
Sbjct: 567 EMKELDIEVTVVTFCVLIDGLCKNKLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKD 626

Query: 421 GNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTY 480
             ++ A    ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y
Sbjct: 627 NQVEAATTLFEQMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAY 686

Query: 481 SSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQG---NVVAISDFVLL 539
           +SL+ GL+    + +A+   ++M+  GI PD V    +++ ++  G     V +  +++ 
Sbjct: 687 TSLVWGLSHCNQLQKARSFLEEMIGEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMK 746

BLAST of CmaCh04G000780 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 263.1 bits (671), Expect = 3.9e-70
Identity = 155/521 (29.75%), Postives = 272/521 (52.21%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           +V E  Q    +      P+ +T NK+V+     G  E + + +   +  G  P   ++ 
Sbjct: 198 LVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYT 257

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           S+I  +C+  ++  A ++ N MP  GC  + V+Y  L+HG C + +I EA  L  +++  
Sbjct: 258 SLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDD 317

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCL-PNVVTYGTLVDMFCKMGDMEVG 180
              P +  + +L         K+EA   +  M ++ + PN+ TY  L+D  C     E  
Sbjct: 318 ECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKA 377

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
            ++   M++ G++PN++ +++LI+GYCK G +  A   +E M+   + PN  TY+ LI G
Sbjct: 378 RELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKG 437

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            CK  +  +A  +  +ML   +LP+   + S+IDG  + GN D+A + ++ M DR +  D
Sbjct: 438 YCKSNV-HKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPD 497

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
              YT +I    +  R+ +A +  +++ + G+ P+ ++ TA++D + KAG + EA     
Sbjct: 498 QWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLE 557

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEI-------LYTVLIDALCK 420
            +L++   P+++T +ALI GLC +G L+EA   +++EK  +I         T+LI  L K
Sbjct: 558 KMLSKNCLPNSLTFNALIHGLCADGKLKEAT--LLEEKMVKIGLQPTVSTDTILIHRLLK 617

Query: 421 EGNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLT 480
           +G+ D A    ++M  +G  PD + YT++I   C++G LL A  +  +M +  + PDL T
Sbjct: 618 DGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFT 677

Query: 481 YSSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIR 514
           YSSLI G  + G    A  V   M + G  P    +  LI+
Sbjct: 678 YSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIK 715

BLAST of CmaCh04G000780 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 251.5 bits (641), Expect = 1.2e-66
Identity = 158/529 (29.87%), Postives = 275/529 (51.98%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRF-PSPFTCNKLVHSLINSGCGELSAKVLF-HFLSKGYTPHSSS 60
           ++ +AL  + HL +   F P   + N ++ + I S      A+ +F   L    +P+  +
Sbjct: 149 LIDKALS-IVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFT 208

Query: 61  FNSIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVR 120
           +N +I  FC  GNI  A  + + M   GC P++V+YN+L+ GYC   KI +   L+  + 
Sbjct: 209 YNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMA 268

Query: 121 GGLLNPDLVMFNILFNGFAKV-YMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDME 180
              L P+L+ +N++ NG  +   MK  +F+   +  +    + VTY TL+  +CK G+  
Sbjct: 269 LKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFH 328

Query: 181 VGNKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLI 240
               M  +M++ G+ P+++ ++SLI   CKAG++N A  +L++M+   + PNE TY+TL+
Sbjct: 329 QALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLV 388

Query: 241 DGCCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIE 300
           DG  ++G +  A  +  EM   G  P+   + ++I+GH   G +++A+  +  M ++ + 
Sbjct: 389 DGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLS 448

Query: 301 LDLTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNA 360
            D+ +Y+ V++GF R   + +A+     +V+ G+ PD I  ++++    +    KEA + 
Sbjct: 449 PDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDL 508

Query: 361 YRILLARGFEPDTVTLSALIDGLCKNGYLQEARRY--VVKEKA---NEILYTVLIDALCK 420
           Y  +L  G  PD  T +ALI+  C  G L++A +    + EK    + + Y+VLI+ L K
Sbjct: 509 YEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNK 568

Query: 421 EGNLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLT 480
           +    EA+R + ++     VP    Y + I E C                  NIE    +
Sbjct: 569 QSRTREAKRLLLKLFYEESVPSDVTYHTLI-ENCS-----------------NIE--FKS 628

Query: 481 YSSLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNV 522
             SLI G   KG+M EA QVF+ ML     PD  AY+I+I G+   G++
Sbjct: 629 VVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDI 656

BLAST of CmaCh04G000780 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 250.4 bits (638), Expect = 2.6e-66
Identity = 148/528 (28.03%), Postives = 261/528 (49.43%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           M++++L+    +      PS +TCN ++ S++ SG        L   L +   P  ++FN
Sbjct: 178 MIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFN 237

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
            +I+  C  G+ + +  ++  M + G +P IV+YN++LH YC   + + A  L++ ++  
Sbjct: 238 ILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSK 297

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCL-PNVVTYGTLVDMFCKMGDMEVG 180
            ++ D+  +N+L +   +     + ++ L  M K  + PN VTY TL++ F   G + + 
Sbjct: 298 GVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIA 357

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
           +++  +M+  G+ PN V F++LIDG+   G+   A      M+   + P+E +Y  L+DG
Sbjct: 358 SQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDG 417

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
            CK    + A   +  M   G+      +T +IDG  K G +D A+  +N M    I+ D
Sbjct: 418 LCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPD 477

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           +  Y+ +I GF +VGR   A E    + + GL P+ II + ++    + G LKEA+  Y 
Sbjct: 478 IVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYE 537

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEIL-----YTVLIDALCKEG 420
            ++  G   D  T + L+  LCK G + EA  ++    ++ IL     +  LI+     G
Sbjct: 538 AMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSG 597

Query: 421 NLDEAERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYS 480
              +A     EM + G  P  + Y S +  LCK G+L +A    K +       D + Y+
Sbjct: 598 EGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYN 657

Query: 481 SLIGGLAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVV 523
           +L+  + + G + +A  +F +M+   I PDS  Y  LI G   +G  V
Sbjct: 658 TLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTV 705

BLAST of CmaCh04G000780 vs. NCBI nr
Match: gi|778698035|ref|XP_011654465.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X1 [Cucumis sativus])

HSP 1 Score: 909.1 bits (2348), Expect = 3.8e-261
Identity = 440/526 (83.65%), Postives = 483/526 (91.83%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MVKEALQ+LAHLRR  RFP+PFTCNKL+HSLINSGCG LSAK+LFHFLSKGYTPH SSFN
Sbjct: 1   MVKEALQYLAHLRRTFRFPTPFTCNKLLHSLINSGCGHLSAKLLFHFLSKGYTPHPSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           SIISFFC+ GN+K+AE I  SM RFGCSPDIVSYNSLL GYC+SY+I++ACFLVNRVRG 
Sbjct: 61  SIISFFCRSGNVKFAEHIFISMSRFGCSPDIVSYNSLLDGYCSSYQIQKACFLVNRVRGC 120

Query: 121 LLN-PDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVG 180
            LN PDLVMFNILFNGFAKVYMKNEAFMY GLMWK CLP++VTYGT VDMFCKMGDM++G
Sbjct: 121 ELNRPDLVMFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMG 180

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
           N+MF DMMKVG+VPNLV FSSLIDGYCKAGSL+VAF Y ERM++CSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
           C K GML RAD LFE+MLS  ILPNCTV+TSIIDGHFKKGNVD+A+KYIN+MFDR+I+LD
Sbjct: 241 CSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLD 300

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           LTAYTV+I+GF RVGR  K+MEAAE V KNGLLPDRIILTAIMDVHFKAGN+KEALNAY+
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEA 420
           ILLA+GFE D VTLSAL+DGL K+GYLQEARRY+VKE ANEILYTV IDALCKEGNLD+A
Sbjct: 361 ILLAKGFEADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDA 420

Query: 421 ERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGG 480
           E+ IKEM EAGFVPDK+VYTSWIAELCKQGNLLKAF VKKRMVQE++EPDLLTYSSLIGG
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGG 480

Query: 481 LAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           LAEKGLMIEAKQVFDDMLN GITPD V+YDILIRGYHNQGN  AIS
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAIS 526

BLAST of CmaCh04G000780 vs. NCBI nr
Match: gi|659102971|ref|XP_008452410.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X4 [Cucumis melo])

HSP 1 Score: 894.0 bits (2309), Expect = 1.3e-256
Identity = 438/526 (83.27%), Postives = 479/526 (91.06%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MVKEALQFLAHLRRISRFPSPFTCNKL+HSLINSGCG LSAK+L H LSKGYTPH SSFN
Sbjct: 1   MVKEALQFLAHLRRISRFPSPFTCNKLLHSLINSGCGHLSAKLLIHLLSKGYTPHPSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           SIISFFC+ GN+K+AE+I  SM RFGCSPDIVSYNSLL GYC+S +I++ACFLVNRVRG 
Sbjct: 61  SIISFFCRSGNVKFAEQIFISMSRFGCSPDIVSYNSLLDGYCSSCQIQKACFLVNRVRGC 120

Query: 121 LLN-PDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVG 180
            LN PDLVMFNILF GFAKVYMKNEAFMYLGLMWK  LP++VTYGT VDMFCKMGDME+G
Sbjct: 121 ELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYYLPSIVTYGTFVDMFCKMGDMEMG 180

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
           N+MF DMMKVG+VPNL+ FSSLIDGYCKAGSL+VAF Y ERM++CSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLIVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
           C K+GML RAD LFE+MLS  ILPNCTV+TSIIDGHFKKGNVD+A+KYIN+MFD++I+LD
Sbjct: 241 CSKRGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDQDIKLD 300

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           LTAYTV+I+GF RVGR  K+MEAAE V K GLLPDRIILTAIMDVHFKAGN+KEALNAY+
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKKGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEA 420
           ILLA+GFE D  TLSAL+DGL K+GYLQ+ARRY VKEKANEILYTV IDALCKEGNLDEA
Sbjct: 361 ILLAKGFEADVATLSALMDGLSKHGYLQKARRYFVKEKANEILYTVFIDALCKEGNLDEA 420

Query: 421 ERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGG 480
           E+ IKEM EAGFVPDK+VYTS IAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLI G
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSLIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLISG 480

Query: 481 LAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           LAEKGLMIEAKQVFDDMLN GITPD VAYDILIRGYHNQGN  AIS
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVAYDILIRGYHNQGNGAAIS 526

BLAST of CmaCh04G000780 vs. NCBI nr
Match: gi|778698039|ref|XP_011654466.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cucumis sativus])

HSP 1 Score: 802.7 bits (2072), Expect = 3.9e-229
Identity = 403/526 (76.62%), Postives = 447/526 (84.98%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MVKEALQ+LAHLRR  RFP+PFTCNKL+HSL                ++ G   H S+  
Sbjct: 1   MVKEALQYLAHLRRTFRFPTPFTCNKLLHSL----------------INSG-CGHLSA-- 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
            ++  F   G              FGCSPDIVSYNSLL GYC+SY+I++ACFLVNRVRG 
Sbjct: 61  KLLFHFLSKG--------------FGCSPDIVSYNSLLDGYCSSYQIQKACFLVNRVRGC 120

Query: 121 LLN-PDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVG 180
            LN PDLVMFNILFNGFAKVYMKNEAFMY GLMWK CLP++VTYGT VDMFCKMGDM++G
Sbjct: 121 ELNRPDLVMFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMG 180

Query: 181 NKMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDG 240
           N+MF DMMKVG+VPNLV FSSLIDGYCKAGSL+VAF Y ERM++CSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELD 300
           C K GML RAD LFE+MLS  ILPNCTV+TSIIDGHFKKGNVD+A+KYIN+MFDR+I+LD
Sbjct: 241 CSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLD 300

Query: 301 LTAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYR 360
           LTAYTV+I+GF RVGR  K+MEAAE V KNGLLPDRIILTAIMDVHFKAGN+KEALNAY+
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEA 420
           ILLA+GFE D VTLSAL+DGL K+GYLQEARRY+VKE ANEILYTV IDALCKEGNLD+A
Sbjct: 361 ILLAKGFEADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDA 420

Query: 421 ERTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGG 480
           E+ IKEM EAGFVPDK+VYTSWIAELCKQGNLLKAF VKKRMVQE++EPDLLTYSSLIGG
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGG 480

Query: 481 LAEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           LAEKGLMIEAKQVFDDMLN GITPD V+YDILIRGYHNQGN  AIS
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAIS 493

BLAST of CmaCh04G000780 vs. NCBI nr
Match: gi|1009154833|ref|XP_015895385.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 [Ziziphus jujuba])

HSP 1 Score: 714.1 bits (1842), Expect = 1.8e-202
Identity = 336/524 (64.12%), Postives = 412/524 (78.63%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFN 60
           MV E LQF A LRR S+FPSPFT NKL+HSL++S CGELS K L  FLSKGY PH SSFN
Sbjct: 1   MVSETLQFFAQLRRASKFPSPFTFNKLLHSLVDSNCGELSLKFLSFFLSKGYVPHPSSFN 60

Query: 61  SIISFFCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGG 120
           S++SF CKLGN  +AE++++SMP FGC PD V+YNSL+ GYC ++ I   C ++ ++R G
Sbjct: 61  SVLSFLCKLGNFCFAEKVVDSMPGFGCVPDAVTYNSLIDGYCKNFDIERGCLVLKKIRVG 120

Query: 121 LLNPDLVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGN 180
             NPD+V FNIL NGF+K+ MK EAF+Y+GLMWK CLPNVVTY T +DMFCKMGD+++G 
Sbjct: 121 HCNPDVVTFNILLNGFSKLKMKKEAFVYMGLMWKCCLPNVVTYSTFIDMFCKMGDLDMGY 180

Query: 181 KMFFDMMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGC 240
           K+F DMMK GV PNL+AF+SLIDGYCKAG++ +AF   E+M+Q S+ PN  TY+ LIDG 
Sbjct: 181 KVFSDMMKNGVFPNLIAFTSLIDGYCKAGNVEMAFELFEKMKQSSLSPNVVTYTALIDGL 240

Query: 241 CKQGMLERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDL 300
           CK GMLE A+ LF +ML  G+ PN  V+TS+IDG+F KGNVDNA+KY+ +M ++ I  DL
Sbjct: 241 CKHGMLEGAESLFSKMLEDGVEPNSAVYTSMIDGNFVKGNVDNAIKYVIKMREQNIRFDL 300

Query: 301 TAYTVVIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRI 360
           T + V+I GF + GRL KAME    VV +GL PD+IILT IMD HFK+ NLK ALN YR 
Sbjct: 301 TTFGVIIWGFCKTGRLDKAMEVMGIVVASGLAPDKIILTTIMDAHFKSRNLKAALNLYRE 360

Query: 361 LLARGFEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAE 420
           LL RGFEPD +TLS L++GLCK+G+LQEARRY  +EKAN+I YTVLID +CKEG  +E E
Sbjct: 361 LLVRGFEPDVITLSTLLNGLCKHGHLQEARRYFCREKANQISYTVLIDGICKEGQFNEVE 420

Query: 421 RTIKEMCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGL 480
             +KEM E GFVPDKYVYTSWIA LCKQGNL++AF +K +MV+E +EPDLLTYSSLI GL
Sbjct: 421 MVLKEMSEVGFVPDKYVYTSWIAGLCKQGNLVEAFRLKNKMVKEGVEPDLLTYSSLISGL 480

Query: 481 AEKGLMIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAI 525
           A KGLMIEAKQVFD+ML  GITPDS  YDIL+RGY  +G+  A+
Sbjct: 481 ASKGLMIEAKQVFDNMLKMGITPDSAVYDILVRGYLKEGDEAAV 524

BLAST of CmaCh04G000780 vs. NCBI nr
Match: gi|645229218|ref|XP_008221361.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 [Prunus mume])

HSP 1 Score: 700.3 bits (1806), Expect = 2.7e-198
Identity = 337/520 (64.81%), Postives = 410/520 (78.85%), Query Frame = 1

Query: 6   LQFLAHLRRISRFPSPFTCNKLVHSLINSGCGELSAKVLFHFLSKGYTPHSSSFNSIISF 65
           L+F A LRR S+FP+PFTCNKL+HSLI+S CGELS K+L H LSKGY PH SSFNS+ISF
Sbjct: 283 LKFFALLRRTSQFPTPFTCNKLLHSLISSNCGELSVKILCHLLSKGYNPHPSSFNSVISF 342

Query: 66  FCKLGNIKYAERILNSMPRFGCSPDIVSYNSLLHGYCASYKIREACFLVNRVRGGLLNPD 125
           FCKLG+I +A  +++SMPR+GC PDIV+YNSL+ GYC    I EAC ++ ++R G   PD
Sbjct: 343 FCKLGHISFARTLVDSMPRYGCLPDIVTYNSLIDGYCKFCDIDEACLMMRKIRIGGCRPD 402

Query: 126 LVMFNILFNGFAKVYMKNEAFMYLGLMWKSCLPNVVTYGTLVDMFCKMGDMEVGNKMFFD 185
           L  FN+LFNGF KV MK EAF+Y+GLMWKSC PNVVTY T +DMFCK GD+ +G ++  D
Sbjct: 403 LGTFNVLFNGFCKVKMKKEAFVYMGLMWKSCSPNVVTYSTFIDMFCKTGDLGLGYRVLGD 462

Query: 186 MMKVGVVPNLVAFSSLIDGYCKAGSLNVAFGYLERMQQCSVQPNEFTYSTLIDGCCKQGM 245
           M+K GV+PNLVAF+SLIDGYCKAG+L VAF  LE+M+Q S+ PN  TY+ LI G C QGM
Sbjct: 463 MVKDGVLPNLVAFTSLIDGYCKAGNLEVAFELLEKMRQSSLLPNVVTYNALIKGLCMQGM 522

Query: 246 LERADFLFEEMLSVGILPNCTVHTSIIDGHFKKGNVDNALKYINRMFDREIELDLTAYTV 305
            ERAD+LF +M   G+ PN  V+TSIIDGH +KGNVD+A+KY++RM D+   LD+ AY V
Sbjct: 523 SERADYLFSKMWEDGVEPNSAVYTSIIDGHLQKGNVDDAMKYMSRMHDQGFNLDVAAYGV 582

Query: 306 VIAGFRRVGRLHKAMEAAENVVKNGLLPDRIILTAIMDVHFKAGNLKEALNAYRILLARG 365
           VI+G  +  RL KA+   E++V +GL+PD+++L  IMD +FKAGNLK AL  YR LL RG
Sbjct: 583 VISGLCKNSRLDKAILFIEDMVSSGLVPDQMLLATIMDAYFKAGNLKAALGVYRELLERG 642

Query: 366 FEPDTVTLSALIDGLCKNGYLQEARRYVVKEKANEILYTVLIDALCKEGNLDEAERTIKE 425
           FEPD VTLSAL+DGLCK+G L+EAR Y  KEKANEI Y+VLI+ +CKEGNL E E+  +E
Sbjct: 643 FEPDGVTLSALMDGLCKHGCLKEARGYFCKEKANEISYSVLINGMCKEGNLSEVEKVFRE 702

Query: 426 MCEAGFVPDKYVYTSWIAELCKQGNLLKAFTVKKRMVQENIEPDLLTYSSLIGGLAEKGL 485
           M EAGF+PDKYVYTSWIA LCKQG+L +AF +K +MV+E I PDLLTYSSLI GLA  GL
Sbjct: 703 MSEAGFIPDKYVYTSWIAGLCKQGSLSEAFRLKNKMVKEGIIPDLLTYSSLIFGLANAGL 762

Query: 486 MIEAKQVFDDMLNAGITPDSVAYDILIRGYHNQGNVVAIS 526
           MIEAKQVFDDML  GITPDS  +DILIRGYH +GN  AIS
Sbjct: 763 MIEAKQVFDDMLKKGITPDSAVFDILIRGYHKEGNDAAIS 802

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP141_ARATH3.1e-17053.01Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN... [more]
PP143_ARATH1.3e-8031.39Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th... [more]
PP445_ARATH6.9e-6929.75Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN... [more]
PP407_ARATH2.1e-6529.87Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PP432_ARATH4.6e-6528.03Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0KPG7_CUCSA1.6e-19784.42Uncharacterized protein OS=Cucumis sativus GN=Csa_5G021290 PE=4 SV=1[more]
W9R0T6_9ROSA2.1e-19761.71Uncharacterized protein OS=Morus notabilis GN=L484_005539 PE=4 SV=1[more]
A0A061EFM6_THECC5.0e-17557.36Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=T... [more]
A0A0D2Q7W0_GOSRA4.2e-17456.92Uncharacterized protein OS=Gossypium raimondii GN=B456_002G171900 PE=4 SV=1[more]
V4KT02_EUTSA3.3e-17154.51Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10003932mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G01740.11.8e-17153.01 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G02150.17.6e-8231.39 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65560.13.9e-7029.75 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G39710.11.2e-6629.87 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G55840.12.6e-6628.03 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778698035|ref|XP_011654465.1|3.8e-26183.65PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X1 [Cuc... [more]
gi|659102971|ref|XP_008452410.1|1.3e-25683.27PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X4 [Cuc... [more]
gi|778698039|ref|XP_011654466.1|3.9e-22976.62PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cuc... [more]
gi|1009154833|ref|XP_015895385.1|1.8e-20264.12PREDICTED: pentatricopeptide repeat-containing protein At2g01740 [Ziziphus jujub... [more]
gi|645229218|ref|XP_008221361.1|2.7e-19864.81PREDICTED: pentatricopeptide repeat-containing protein At2g01740 [Prunus mume][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G000780.1CmaCh04G000780.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 302..321
score: 1.1coord: 268..294
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 155..186
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 333..382
score: 3.0E-9coord: 472..515
score: 1.4E-10coord: 55..102
score: 6.4E-12coord: 193..242
score: 2.7E-16coord: 399..447
score: 1.3
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 58..91
score: 8.5E-8coord: 338..370
score: 4.7E-4coord: 436..469
score: 4.2E-6coord: 196..230
score: 4.1E-7coord: 402..434
score: 2.0E-10coord: 472..505
score: 3.3E-7coord: 161..194
score: 2.7E-7coord: 231..265
score: 1.4E-10coord: 267..299
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 399..433
score: 13.088coord: 229..263
score: 13.581coord: 20..54
score: 7.662coord: 194..228
score: 11.17coord: 434..468
score: 9.986coord: 55..89
score: 11.082coord: 264..298
score: 8.55coord: 369..396
score: 6.412coord: 125..155
score: 6.226coord: 90..124
score: 8.923coord: 504..539
score: 5.59coord: 159..193
score: 11.63coord: 469..503
score: 12.079coord: 334..368
score: 9.175coord: 299..333
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 396..432
score: 2.0E-7coord: 197..364
score: 2.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..520
score: 1.4E

The following gene(s) are paralogous to this gene:

None