CmoCh14G004650 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G004650
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat superfamily protein
Location: Cmo_Chr14 : 2246874 .. 2249016 (+)
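
The location field can be split into chromosome, start, end, and strand, and the span of the feature follows from the coordinates. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of any database tooling, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chrom : start .. end (strand)' string into its parts.

    Hypothetical helper for illustration; the field layout is taken
    from the Location entry of this record.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr14 : 2246874 .. 2249016 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption, the feature spans 2143 bp on the plus strand.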
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTCGTTTTCCCTCCGATCATCCTTGGATCACCAGAGCTGGGAGCTTCAAGAACAGTCAAGAACTGGAAGAAGAAGAACGATTTGCGTTCCTCATCAGAAAATGTCCCAACATGAGAGTGCTCCGGCAGCTTCACGCCCACATTCTCACACGCCCACTTCCTCTTTCCACATTTTCCTTTGCGCTTTCCAAAATCACTGTCTTCTGTGCTCTTTCTCCGCTCGGCAACATCGATTATGCCCGCTTAGTTTTCGCTCAAATTTCTCGTCCCAGCATTTTCTCTTGGAATTCTCTGATCAAGGGCTGTTCCAAGATTCAAAACCCTTCCAAGGAACCGATAGCTTTGTTCCAGAAGCTTACTGAAACAGGGTACCCTGTTCCGAACTCCTTCACTATGGCTTTTGTTCTCAAGGCTTGTGCGATTGTTACAGCGTTTGAAGAGGGCTTACAGGTTCATTCCCGTGTTTTGAAAGATGGGTTTGGTAGTAGTTCGTTTGTTCAAACTTCGTTGGTTAACTTTTATGGGAAATGTGAAGAGATTGGTCTTGCCACTAAGGTGTTCGACGAAATGCCTGACAGAAACTTGGTGGCCTGGACTGCGATGATTAGTGGGCATGTGAGAGTTGGAGCAGTGGATGAAGCTATGGGGTTGTTTAGGGAGATGCAGAAGGCCGGGGTTGAGCCGGATGCGGTGACTCTAGTGAGTGTGGTTTCGGCTTGTGCTGCGGCGGGGGCCTTGGATATTGGCAGCTGGGTGCATGCTTATATTGAGAAACATTCTGTTTTGACCGATCTCGAGCTTGGCACTGCACTTGTAGATATGTATGCTAAATGTGGATGCATTGAGAGGGCGAAGCAGGTCTTTGTTCATATGCCTGTGAGAGATACAAGAGCTTGGAGCTCCATGATTATGGGGTTTGCATATCATGGACTTTCGGAGGATGCCATTGGCGTGTTTCGACAAATGTTGGAAGCTGAGGTAATGCACGAAGATGTACAAATTTCTTAATTGTTAATGAACTTTGAAAAGCTTCAATTACATTCGTGAACTTCCAATTCTACTTCTCATAAGATTAACATTATTGTTTGATCTCTAATGGCACGTTTCTTCCTTTACAAGTTCATTCTTCTGTGTCAGTTCTAGTTTTTGTTTGATAGACATATGCAAGATAAAAACTTGTTATTTCTGTGGTGATTTTCAGGTGATGCCGGACCGTGTAACTTTCATTGGCATTTTATCAGCATGTGCTCACGGTGGAATAGTCTCTGAAGGTCTAAGGTTTTGGTCACTCATGCTTGAATGTGGCATTGAGCCATCAGTTGAGCATTATGGTTGCATAGTTGATTTACTATGCCGAACAGGTCTCGTTGAAGAAGCTTATAGAATCGTTACGACGACAAATATCCCGTCGAATCCTGCAACTTGGCGGAGTTTGCTAAAGGGTTGTAAGAAGAAAAAGCTGTCGAATCTAGGCGAGATCGTCGCAAGGTATCTTCTTCAACTAGAACCCTTAAATGCAGAGAACTATATTGTGATCTCAAATTTATATTCTTCTGTTTCACAATGGGAGAAGATGAGTGAACTAAGAAAGGAGATGAAGGAGAACGACGTAAAGCCAATACCTGGTTGTAGCTCGATCGAAGTCGATGGCGTTGTACATGAGTTTGAGATGGGTGATCAGTCCCATCCAGAGGTGAAAATATTGAGGGAGTTTATGGAAGAGATGGCTAAGCGAGTGTGGGATTCTGGGTATAGACCTAGTGTTTCGGATGTACTTCATAAAGTCATGTATGAAGAGAAAGAAGGGGCTTTAGGTGAGCATAGTGAGAGATTTGCTATTGCATATGGGCTACTAAAAACTAGAGCACCTGTTGTGATTAGGGTAGTGAAGAATCTGAGGGTATGTGGAGATTGCCATGAAGTGATTAAGATAATTAGTAAGATTTATGAAAGGGAAATCATTGTACGAGATCGAGTTCGATTCCATAAGTTCGTCGAG
GGTACTTGTTCTTGTAAGGATTACTGGTGAATAGTATGCTTATATCTTTTCTCATACATTATCTTTTTCCTTCCATTATATTGTATCACTCTCATTGGGAGTCAAAATCTCAACATTATATTTGAAGAGATGTCAAAATCCCG

mRNA sequence

CTCTCGTTTTCCCTCCGATCATCCTTGGATCACCAGAGCTGGGAGCTTCAAGAACAGTCAAGAACTGGAAGAAGAAGAACGATTTGCGTTCCTCATCAGAAAATGTCCCAACATGAGAGTGCTCCGGCAGCTTCACGCCCACATTCTCACACGCCCACTTCCTCTTTCCACATTTTCCTTTGCGCTTTCCAAAATCACTGTCTTCTGTGCTCTTTCTCCGCTCGGCAACATCGATTATGCCCGCTTAGTTTTCGCTCAAATTTCTCGTCCCAGCATTTTCTCTTGGAATTCTCTGATCAAGGGCTGTTCCAAGATTCAAAACCCTTCCAAGGAACCGATAGCTTTGTTCCAGAAGCTTACTGAAACAGGGTACCCTGTTCCGAACTCCTTCACTATGGCTTTTGTTCTCAAGGCTTGTGCGATTGTTACAGCGTTTGAAGAGGGCTTACAGGTTCATTCCCGTGTTTTGAAAGATGGGTTTGGTAGTAGTTCGTTTGTTCAAACTTCGTTGGTTAACTTTTATGGGAAATGTGAAGAGATTGGTCTTGCCACTAAGGTGTTCGACGAAATGCCTGACAGAAACTTGGTGGCCTGGACTGCGATGATTAGTGGGCATGTGAGAGTTGGAGCAGTGGATGAAGCTATGGGGTTGTTTAGGGAGATGCAGAAGGCCGGGGTTGAGCCGGATGCGGTGACTCTAGTGAGTGTGGTTTCGGCTTGTGCTGCGGCGGGGGCCTTGGATATTGGCAGCTGGGTGCATGCTTATATTGAGAAACATTCTGTTTTGACCGATCTCGAGCTTGGCACTGCACTTGTAGATATGTATGCTAAATGTGGATGCATTGAGAGGGCGAAGCAGGTCTTTGTTCATATGCCTGTGAGAGATACAAGAGCTTGGAGCTCCATGATTATGGGGTTTGCATATCATGGACTTTCGGAGGATGCCATTGGCGTGTTTCGACAAATGTTGGAAGCTGAGGTGATGCCGGACCGTGTAACTTTCATTGGCATTTTATCAGCATGTGCTCACGGTGGAATAGTCTCTGAAGGTCTAAGGTTTTGGTCACTCATGCTTGAATGTGGCATTGAGCCATCAGTTGAGCATTATGGTTGCATAGTTGATTTACTATGCCGAACAGGTCTCGTTGAAGAAGCTTATAGAATCGTTACGACGACAAATATCCCGTCGAATCCTGCAACTTGGCGGAGTTTGCTAAAGGGTTGTAAGAAGAAAAAGCTGTCGAATCTAGGCGAGATCGTCGCAAGGTATCTTCTTCAACTAGAACCCTTAAATGCAGAGAACTATATTGTGATCTCAAATTTATATTCTTCTGTTTCACAATGGGAGAAGATGAGTGAACTAAGAAAGGAGATGAAGGAGAACGACGTAAAGCCAATACCTGGTTGTAGCTCGATCGAAGTCGATGGCGTTGTACATGAGTTTGAGATGGGTGATCAGTCCCATCCAGAGGTGAAAATATTGAGGGAGTTTATGGAAGAGATGGCTAAGCGAGTGTGGGATTCTGGGTATAGACCTAGTGTTTCGGATGTACTTCATAAAGTCATGTATGAAGAGAAAGAAGGGGCTTTAGGTGAGCATAGTGAGAGATTTGCTATTGCATATGGGCTACTAAAAACTAGAGCACCTGTTGTGATTAGGGTAGTGAAGAATCTGAGGGTATGTGGAGATTGCCATGAAGTGATTAAGATAATTAGTAAGATTTATGAAAGGGAAATCATTGTACGAGATCGAGTTCGATTCCATAAGTTCGTCGAGGGTACTTGTTCTTGTAAGGATTACTGGTGAATAGTATGCTTATATCTTTTCTCATACATTATCTTTTTCCTTCCATTATATTGTATCACTCTCATTGGGAGTCAAAATCTCAACATTATATTTGAAGAGATGTCAAAATCCCG

Coding sequence (CDS)

ATGAGAGTGCTCCGGCAGCTTCACGCCCACATTCTCACACGCCCACTTCCTCTTTCCACATTTTCCTTTGCGCTTTCCAAAATCACTGTCTTCTGTGCTCTTTCTCCGCTCGGCAACATCGATTATGCCCGCTTAGTTTTCGCTCAAATTTCTCGTCCCAGCATTTTCTCTTGGAATTCTCTGATCAAGGGCTGTTCCAAGATTCAAAACCCTTCCAAGGAACCGATAGCTTTGTTCCAGAAGCTTACTGAAACAGGGTACCCTGTTCCGAACTCCTTCACTATGGCTTTTGTTCTCAAGGCTTGTGCGATTGTTACAGCGTTTGAAGAGGGCTTACAGGTTCATTCCCGTGTTTTGAAAGATGGGTTTGGTAGTAGTTCGTTTGTTCAAACTTCGTTGGTTAACTTTTATGGGAAATGTGAAGAGATTGGTCTTGCCACTAAGGTGTTCGACGAAATGCCTGACAGAAACTTGGTGGCCTGGACTGCGATGATTAGTGGGCATGTGAGAGTTGGAGCAGTGGATGAAGCTATGGGGTTGTTTAGGGAGATGCAGAAGGCCGGGGTTGAGCCGGATGCGGTGACTCTAGTGAGTGTGGTTTCGGCTTGTGCTGCGGCGGGGGCCTTGGATATTGGCAGCTGGGTGCATGCTTATATTGAGAAACATTCTGTTTTGACCGATCTCGAGCTTGGCACTGCACTTGTAGATATGTATGCTAAATGTGGATGCATTGAGAGGGCGAAGCAGGTCTTTGTTCATATGCCTGTGAGAGATACAAGAGCTTGGAGCTCCATGATTATGGGGTTTGCATATCATGGACTTTCGGAGGATGCCATTGGCGTGTTTCGACAAATGTTGGAAGCTGAGGTGATGCCGGACCGTGTAACTTTCATTGGCATTTTATCAGCATGTGCTCACGGTGGAATAGTCTCTGAAGGTCTAAGGTTTTGGTCACTCATGCTTGAATGTGGCATTGAGCCATCAGTTGAGCATTATGGTTGCATAGTTGATTTACTATGCCGAACAGGTCTCGTTGAAGAAGCTTATAGAATCGTTACGACGACAAATATCCCGTCGAATCCTGCAACTTGGCGGAGTTTGCTAAAGGGTTGTAAGAAGAAAAAGCTGTCGAATCTAGGCGAGATCGTCGCAAGGTATCTTCTTCAACTAGAACCCTTAAATGCAGAGAACTATATTGTGATCTCAAATTTATATTCTTCTGTTTCACAATGGGAGAAGATGAGTGAACTAAGAAAGGAGATGAAGGAGAACGACGTAAAGCCAATACCTGGTTGTAGCTCGATCGAAGTCGATGGCGTTGTACATGAGTTTGAGATGGGTGATCAGTCCCATCCAGAGGTGAAAATATTGAGGGAGTTTATGGAAGAGATGGCTAAGCGAGTGTGGGATTCTGGGTATAGACCTAGTGTTTCGGATGTACTTCATAAAGTCATGTATGAAGAGAAAGAAGGGGCTTTAGGTGAGCATAGTGAGAGATTTGCTATTGCATATGGGCTACTAAAAACTAGAGCACCTGTTGTGATTAGGGTAGTGAAGAATCTGAGGGTATGTGGAGATTGCCATGAAGTGATTAAGATAATTAGTAAGATTTATGAAAGGGAAATCATTGTACGAGATCGAGTTCGATTCCATAAGTTCGTCGAGGGTACTTGTTCTTGTAAGGATTACTGGTGA
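
As a quick consistency check between the coding sequence and the BLAST query lines, the first codons of the CDS can be translated with the standard genetic code. This is a minimal sketch: the `translate` function and the compact codon-table construction are illustrative, and only the first 60 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nucleotides of the CDS listed above.
cds_prefix = "ATGAGAGTGCTCCGGCAGCTTCACGCCCACATTCTCACACGCCCACTTCCTCTTTCCACA"
print(translate(cds_prefix))
```

The translated prefix should agree with the start of the query sequence in the alignments below (MRVLRQLHAHILTRPLPLST).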
BLAST of CmoCh14G004650 vs. Swiss-Prot
Match: PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=PCMP-H51 PE=2 SV=2)

HSP 1 Score: 465.7 bits (1197), Expect = 7.2e-130
Identity = 247/581 (42.51%), Positives = 358/581 (61.62%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFS-FALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWN 60
           M  L+QLHA  L    P    + F   KI      S   +++YA  VF  I   S F WN
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLS--SSFSDVNYAFRVFDSIENHSSFMWN 120

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           +LI+ C+   +  +E   L++K+ E G   P+  T  FVLKACA +  F EG QVH +++
Sbjct: 121 TLIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIV 180

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K GFG   +V   L++ YG C  + LA KVFDEMP+R+LV+W +MI   VR G  D A+ 
Sbjct: 181 KHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQ 240

Query: 181 LFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKH---SVLTDLELGTALVD 240
           LFREMQ++  EPD  T+ SV+SACA  G+L +G+W HA++ +     V  D+ +  +L++
Sbjct: 241 LFREMQRS-FEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIE 300

Query: 241 MYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLE--AEVMPDR 300
           MY KCG +  A+QVF  M  RD  +W++MI+GFA HG +E+A+  F +M++    V P+ 
Sbjct: 301 MYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNS 360

Query: 301 VTFIGILSACAHGGIVSEGLRFWSLML-ECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVT 360
           VTF+G+L AC H G V++G +++ +M+ +  IEP++EHYGCIVDL+ R G + EA  +V 
Sbjct: 361 VTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVM 420

Query: 361 TTNIPSNPATWRSLLKGCKKKKLS-NLGEIVARYLLQLEPLNAEN-------YIVISNLY 420
           +  +  +   WRSLL  C KK  S  L E +AR ++  +  N  +       Y+++S +Y
Sbjct: 421 SMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVY 480

Query: 421 SSVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMA 480
           +S S+W  +  +RK M E+ ++  PGCSSIE++G+ HEF  GD SHP+ K + + ++ + 
Sbjct: 481 ASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVID 540

Query: 481 KRVWDSGYRP--SVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRV 540
            R+   GY P  S + ++       KE +L  HSER AIA+GL+       IR+ KNLRV
Sbjct: 541 DRLRSIGYLPDRSQAPLVDATNDGSKEYSLRLHSERLAIAFGLINLPPQTPIRIFKNLRV 600

Query: 541 CGDCHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           C DCHEV K+ISK++  EIIVRDRVRFH F +G+CSC DYW
Sbjct: 601 CNDCHEVTKLISKVFNTEIIVRDRVRFHHFKDGSCSCLDYW 638
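
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper written for this illustration, and the second label is matched loosely by position rather than by name.

```python
import re

# Matches "Identity = a/b (...), <label> = c/d"; the second label name is
# matched loosely so the helper does not depend on its exact spelling.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?,\s*\w+\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) for an HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

header = "Identity = 247/581 (42.51%), Positives = 358/581 (61.62%), Query Frame = 1"
print(hsp_percentages(header))
```

For the first Swiss-Prot hit this reproduces the reported 42.51% identity and 61.62% positives over an alignment of 581 columns.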

BLAST of CmoCh14G004650 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 458.8 bits (1179), Expect = 8.8e-128
Identity = 232/564 (41.13%), Positives = 355/564 (62.94%), Query Frame = 1

Query: 4   LRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRP-SIFSWNSLI 63
           LRQ+HA  +   + +S        I    +L     + YA  VF++I +P ++F WN+LI
Sbjct: 33  LRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLI 92

Query: 64  KGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLKDG 123
           +G ++I N S    +L++++  +G   P++ T  F++KA   +     G  +HS V++ G
Sbjct: 93  RGYAEIGN-SISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSG 152

Query: 124 FGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGLFR 183
           FGS  +VQ SL++ Y  C ++  A KVFD+MP+++LVAW ++I+G    G  +EA+ L+ 
Sbjct: 153 FGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYT 212

Query: 184 EMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAKCG 243
           EM   G++PD  T+VS++SACA  GAL +G  VH Y+ K  +  +L     L+D+YA+CG
Sbjct: 213 EMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCG 272

Query: 244 CIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAE-VMPDRVTFIGIL 303
            +E AK +F  M  +++ +W+S+I+G A +G  ++AI +F+ M   E ++P  +TF+GIL
Sbjct: 273 RVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGIL 332

Query: 304 SACAHGGIVSEGLRFWSLMLE-CGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 363
            AC+H G+V EG  ++  M E   IEP +EH+GC+VDLL R G V++AY  + +  +  N
Sbjct: 333 YACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPN 392

Query: 364 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 423
              WR+LL  C     S+L E     +LQLEP ++ +Y+++SN+Y+S  +W  + ++RK+
Sbjct: 393 VVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQ 452

Query: 424 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 483
           M  + VK +PG S +EV   VHEF MGD+SHP+   +   ++EM  R+   GY P +S+V
Sbjct: 453 MLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNV 512

Query: 484 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 543
              V  EEKE A+  HSE+ AIA+ L+ T     I VVKNLRVC DCH  IK++SK+Y R
Sbjct: 513 YVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNR 572

Query: 544 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EI+VRDR RFH F  G+CSC+DYW
Sbjct: 573 EIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CmoCh14G004650 vs. Swiss-Prot
Match: PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 454.5 bits (1168), Expect = 1.7e-126
Identity = 235/567 (41.45%), Positives = 354/567 (62.43%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPL-GNIDYARLVFAQISRPSIFSWN 60
           +R L Q+ A+ +   +   +F   ++K+  FC  SP   ++ YAR +F  +S P I  +N
Sbjct: 42  LRELMQIQAYAIKSHIEDVSF---VAKLINFCTESPTESSMSYARHLFEAMSEPDIVIFN 101

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           S+ +G S+  NP  E  +LF ++ E G  +P+++T   +LKACA+  A EEG Q+H   +
Sbjct: 102 SMARGYSRFTNPL-EVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQLHCLSM 161

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K G   + +V  +L+N Y +CE++  A  VFD + +  +V + AMI+G+ R    +EA+ 
Sbjct: 162 KLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALS 221

Query: 181 LFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYA 240
           LFREMQ   ++P+ +TL+SV+S+CA  G+LD+G W+H Y +KHS    +++ TAL+DM+A
Sbjct: 222 LFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFA 281

Query: 241 KCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIG 300
           KCG ++ A  +F  M  +DT+AWS+MI+ +A HG +E ++ +F +M    V PD +TF+G
Sbjct: 282 KCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLG 341

Query: 301 ILSACAHGGIVSEGLRFWSLML-ECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIP 360
           +L+AC+H G V EG +++S M+ + GI PS++HYG +VDLL R G +E+AY  +    I 
Sbjct: 342 LLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPIS 401

Query: 361 SNPATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELR 420
             P  WR LL  C      +L E V+  + +L+  +  +Y+++SNLY+   +WE +  LR
Sbjct: 402 PTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLR 461

Query: 421 KEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVS 480
           K MK+     +PGCSSIEV+ VVHEF  GD        L   ++EM K +  SGY P  S
Sbjct: 462 KVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTS 521

Query: 481 DVLHKVMY-EEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKI 540
            V+H  M  +EKE  L  HSE+ AI +GLL T     IRVVKNLRVC DCH   K+IS I
Sbjct: 522 MVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLI 581

Query: 541 YEREIIVRDRVRFHKFVEGTCSCKDYW 565
           + R++++RD  RFH F +G CSC D+W
Sbjct: 582 FGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CmoCh14G004650 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 3.8e-123
Identity = 217/530 (40.94%), Positives = 335/530 (63.21%), Query Frame = 1

Query: 38  GNIDYARLVFAQISRPSIFSWNSLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAF 97
           G I+ A+ +F +I    + SWN++I G ++  N  KE + LF+ + +T    P+  TM  
Sbjct: 214 GYIENAQKLFDEIPVKDVVSWNAMISGYAETGN-YKEALELFKDMMKTNVR-PDESTMVT 273

Query: 98  VLKACAIVTAFEEGLQVHSRVLKDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRN 157
           V+ ACA   + E G QVH  +   GFGS+  +  +L++ Y KC E+  A  +F+ +P ++
Sbjct: 274 VVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKD 333

Query: 158 LVAWTAMISGHVRVGAVDEAMGLFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHA 217
           +++W  +I G+  +    EA+ LF+EM ++G  P+ VT++S++ ACA  GA+DIG W+H 
Sbjct: 334 VISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHV 393

Query: 218 YIEKH--SVLTDLELGTALVDMYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLS 277
           YI+K    V     L T+L+DMYAKCG IE A QVF  +  +   +W++MI GFA HG +
Sbjct: 394 YIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRA 453

Query: 278 EDAIGVFRQMLEAEVMPDRVTFIGILSACAHGGIVSEGLRFWSLMLE-CGIEPSVEHYGC 337
           + +  +F +M +  + PD +TF+G+LSAC+H G++  G   +  M +   + P +EHYGC
Sbjct: 454 DASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGC 513

Query: 338 IVDLLCRTGLVEEAYRIVTTTNIPSNPATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLN 397
           ++DLL  +GL +EA  ++    +  +   W SLLK CK      LGE  A  L+++EP N
Sbjct: 514 MIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPEN 573

Query: 398 AENYIVISNLYSSVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEV 457
             +Y+++SN+Y+S  +W ++++ R  + +  +K +PGCSSIE+D VVHEF +GD+ HP  
Sbjct: 574 PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRN 633

Query: 458 KILREFMEEMAKRVWDSGYRPSVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVV 517
           + +   +EEM   +  +G+ P  S+VL ++  E KEGAL  HSE+ AIA+GL+ T+    
Sbjct: 634 REIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTK 693

Query: 518 IRVVKNLRVCGDCHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           + +VKNLRVC +CHE  K+ISKIY+REII RDR RFH F +G CSC DYW
Sbjct: 694 LTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CmoCh14G004650 vs. Swiss-Prot
Match: PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 429.5 bits (1103), Expect = 5.7e-119
Identity = 233/578 (40.31%), Positives = 351/578 (60.73%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKIT-VFCALSPLGNIDYARLVFAQISRPSIFSWN 60
           +R  ++LHA+ L     L   SF  S +  ++C    + +    R VF  +    I  WN
Sbjct: 318 LRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKQVLS---GRRVFDGMFDRKIGLWN 377

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           ++I G S+ ++  KE + LF  + E+   + NS TMA V+ AC    AF     +H  V+
Sbjct: 378 AMIAGYSQNEH-DKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVV 437

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K G     FVQ +L++ Y +  +I +A ++F +M DR+LV W  MI+G+V     ++A+ 
Sbjct: 438 KRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALL 497

Query: 181 LFREMQ-----------KAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDL 240
           L  +MQ           +  ++P+++TL++++ +CAA  AL  G  +HAY  K+++ TD+
Sbjct: 498 LLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDV 557

Query: 241 ELGTALVDMYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEA 300
            +G+ALVDMYAKCGC++ +++VF  +P ++   W+ +IM +  HG  ++AI + R M+  
Sbjct: 558 AVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQ 617

Query: 301 EVMPDRVTFIGILSACAHGGIVSEGLR-FWSLMLECGIEPSVEHYGCIVDLLCRTGLVEE 360
            V P+ VTFI + +AC+H G+V EGLR F+ +  + G+EPS +HY C+VDLL R G ++E
Sbjct: 618 GVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKE 677

Query: 361 AYRIVTTTNIPSNPA-TWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYS 420
           AY+++       N A  W SLL   +      +GEI A+ L+QLEP  A +Y++++N+YS
Sbjct: 678 AYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYS 737

Query: 421 SVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAK 480
           S   W+K +E+R+ MKE  V+  PGCS IE    VH+F  GD SHP+ + L  ++E + +
Sbjct: 738 SAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWE 797

Query: 481 RVWDSGYRPSVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGD 540
           R+   GY P  S VLH V  +EKE  L  HSE+ AIA+G+L T    +IRV KNLRVC D
Sbjct: 798 RMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCND 857

Query: 541 CHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           CH   K ISKI +REII+RD  RFH+F  GTCSC DYW
Sbjct: 858 CHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

BLAST of CmoCh14G004650 vs. TrEMBL
Match: A0A0A0LF80_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G812780 PE=4 SV=1)

HSP 1 Score: 969.9 bits (2506), Expect = 1.3e-279
Identity = 477/564 (84.57%), Positives = 513/564 (90.96%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQLHAHILTRPLPLS+F+FALSKI  FCALSP GNI+YAR VFAQI  P+IFSWNS
Sbjct: 1   MRVLRQLHAHILTRPLPLSSFAFALSKIVAFCALSPFGNINYARSVFAQIPHPNIFSWNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           LIKG S+I   SKEPI LF+KLTETGYPVPNSFT+AFVLKACAIVTAF EGLQVHS VLK
Sbjct: 61  LIKGYSQIHTLSKEPIFLFKKLTETGYPVPNSFTLAFVLKACAIVTAFGEGLQVHSHVLK 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
           DGFGSS FVQTSLVNFYGKCEEIG A KVF+EMP RNLVAWTAMISGH RVGAVDEAM L
Sbjct: 121 DGFGSSLFVQTSLVNFYGKCEEIGFARKVFEEMPVRNLVAWTAMISGHARVGAVDEAMEL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQKAG++PDA+TLVSVVSACA AGALDIG W+HAYIEK+ VLTDLEL TALVDMYAK
Sbjct: 181 FREMQKAGIQPDAMTLVSVVSACAVAGALDIGCWLHAYIEKYFVLTDLELSTALVDMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAKQVFVHMPV+DT AWSSMIMGFAYHGL++DAI  F+QMLE EV PD VTF+ +
Sbjct: 241 CGCIERAKQVFVHMPVKDTTAWSSMIMGFAYHGLAQDAIDAFQQMLETEVTPDHVTFLAV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAHGG+VS G RFWSLMLE GIEPSVEHYGC VDLLCR+GLVEEAYRI TT  IP N
Sbjct: 301 LSACAHGGLVSRGRRFWSLMLEFGIEPSVEHYGCKVDLLCRSGLVEEAYRITTTMKIPPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
            ATWRSLL GCKKKKL NLGEIVARYLL+LEPLNAEN+I+ISNLYSS+SQWEKMSELRK 
Sbjct: 361 AATWRSLLMGCKKKKLLNLGEIVARYLLELEPLNAENFIMISNLYSSLSQWEKMSELRKV 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MKE  +KP+PGCSSIEVDGVVHEF MGDQSHPEVK+LREFMEEM+ RV DSGYRPS+SDV
Sbjct: 421 MKEKCIKPVPGCSSIEVDGVVHEFVMGDQSHPEVKMLREFMEEMSMRVRDSGYRPSISDV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LHKV+ EEKE AL EHSERFAIAYGLLKTRAP+VIRVVKNLRVC DCHEVIKIISK+YER
Sbjct: 481 LHKVVDEEKECALSEHSERFAIAYGLLKTRAPIVIRVVKNLRVCVDCHEVIKIISKLYER 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFHKF++GTCSCKD+W
Sbjct: 541 EIIVRDRVRFHKFIKGTCSCKDFW 564

BLAST of CmoCh14G004650 vs. TrEMBL
Match: F6HL02_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g08320 PE=4 SV=1)

HSP 1 Score: 783.1 bits (2021), Expect = 2.3e-223
Identity = 379/559 (67.80%), Positives = 459/559 (82.11%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQ+HA +LT  +P+S+ SF L KI  FCALSP G+IDYAR +F+QI RP+IFSWNS
Sbjct: 1   MRVLRQIHARLLTHAMPISSISFGLCKIIGFCALSPYGDIDYARKLFSQIQRPNIFSWNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           +I+GCS+ Q PSKEP+ LF+K+   GYP PN+FTMAFVLKAC+IV+A EEG QVH+ VLK
Sbjct: 61  MIRGCSQSQTPSKEPVILFRKMVRRGYPNPNTFTMAFVLKACSIVSALEEGQQVHANVLK 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            GFGSS FV+T+LVNFY KCE+I LA+KVFDE+ DRNLVAW+ MISG+ R+G V+EA+GL
Sbjct: 121 SGFGSSPFVETALVNFYAKCEDIVLASKVFDEITDRNLVAWSTMISGYARIGLVNEALGL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FR+MQKAGV PD VT+VSV+SACAA+GALD G WVHAYI K  + TDLEL TALV+MYAK
Sbjct: 181 FRDMQKAGVVPDEVTMVSVISACAASGALDTGKWVHAYINKQLIETDLELSTALVNMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAK+VF  MPV+DT+AWSSMI+G A +GL+EDA+  F +M EA+V P+ VTFIG+
Sbjct: 241 CGCIERAKEVFDAMPVKDTKAWSSMIVGLAINGLAEDALEEFFRMEEAKVKPNHVTFIGV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAH G+VSEG R+WS MLE GI PS+E YGC+VDLLCR  LVE+A  +V T  I  N
Sbjct: 301 LSACAHSGLVSEGRRYWSSMLEFGIVPSMELYGCMVDLLCRASLVEDACTLVETMPISPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR+LL GCKK K  +  E+VA+ LL+LEP NAENYI++SNLY+S+SQWEKMS++RK+
Sbjct: 361 PVIWRTLLVGCKKSKNLDKSEVVAQRLLELEPHNAENYILLSNLYASMSQWEKMSQVRKK 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MK   +K +PGCSSIEVDG+VHEF MGD SHPE   +RE + +++KRV   G++P +SDV
Sbjct: 421 MKGMGIKAVPGCSSIEVDGLVHEFVMGDWSHPEAMEVREILRDISKRVHAVGHQPGISDV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH V+ EEKE AL EHSER AIAYGLLKT+ P+ IR+VKNLRVCGDCHEV KIIS  Y R
Sbjct: 481 LHNVVDEEKENALCEHSERLAIAYGLLKTKTPMAIRIVKNLRVCGDCHEVTKIISAEYRR 540

Query: 541 EIIVRDRVRFHKFVEGTCS 560
           EIIVRDRVRFHKFV G+CS
Sbjct: 541 EIIVRDRVRFHKFVNGSCS 559

BLAST of CmoCh14G004650 vs. TrEMBL
Match: W9RU69_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_018009 PE=4 SV=1)

HSP 1 Score: 751.1 bits (1938), Expect = 9.6e-214
Identity = 355/564 (62.94%), Positives = 443/564 (78.55%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQ+HAH+LTR LP+S  SFALSKI  FCALS +G+I YAR VF++I  P+IF WN+
Sbjct: 1   MRVLRQIHAHVLTRFLPISALSFALSKIAAFCALSAVGDIAYARRVFSRIPCPNIFCWNA 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           +I+GCS ++NPSKE I LF+KL   GYP PN+FT++FVLKAC+I++A  EG QVH+RVL+
Sbjct: 61  MIRGCSNVENPSKESIYLFKKLIRKGYPGPNTFTLSFVLKACSILSASHEGWQVHTRVLR 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            GFGSS FVQTSLVN Y KCEE+  A  VFDE+P+RNLVAW+AMI G+ RVG VD + GL
Sbjct: 121 SGFGSSPFVQTSLVNMYAKCEEVWDARLVFDEIPERNLVAWSAMIGGYARVGLVDASFGL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQ AGV PD VT+ S+VSAC  AG+L +G WVH Y EK  +  DLELGTAL++MYAK
Sbjct: 181 FREMQMAGVVPDQVTMASIVSACTCAGSLYLGRWVHVYAEKKKIEIDLELGTALINMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CG IE+AK +F  + V+DT+AW+SMI+G A HGLSE+A+  F  M EA+V PD  TF+G+
Sbjct: 241 CGWIEKAKAIFRKLSVKDTKAWNSMIVGLALHGLSEEALKAFSMMEEAKVKPDSGTFLGV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           L  C    +VSEG RFWS ML  G +PS EHYGC+VDLLCR GLVEEA+ +V    I  N
Sbjct: 301 LFTCGQSSLVSEGRRFWSRMLGFGTKPSTEHYGCMVDLLCRAGLVEEAHTLVQNMAISPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR LL GC K ++   GE++A  LL+LEPLNAENY+++S+LY+SVSQWEKM  +R +
Sbjct: 361 PVIWRKLLMGCNKSRMLERGELIAERLLELEPLNAENYVLLSSLYASVSQWEKMMLVRAK 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MKE  ++PIP CSSIEV+G++HEF MGD+SHPE K LRE + +++ R+   GY+PS+ ++
Sbjct: 421 MKEKRIRPIPACSSIEVNGIIHEFTMGDRSHPEAKELREVLRDISDRIRGVGYKPSIVEI 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH+V+ EEKE A GEHS R AIAYGL KT+AP VIRVV ++R+CGDCHEV KIISKIYER
Sbjct: 481 LHQVINEEKENAHGEHSVRLAIAYGLWKTKAPAVIRVVNSIRICGDCHEVTKIISKIYER 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRV FHKFV G+C+CKD+W
Sbjct: 541 EIIVRDRVWFHKFVNGSCTCKDHW 564

BLAST of CmoCh14G004650 vs. TrEMBL
Match: A0A0D2RW39_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G137700 PE=4 SV=1)

HSP 1 Score: 749.2 bits (1933), Expect = 3.6e-213
Identity = 359/563 (63.77%), Positives = 443/563 (78.69%), Query Frame = 1

Query: 2   RVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNSL 61
           RV+RQ+HAH+LTR LP+S  SF LSKI  FCALS  G+I++AR VFAQ   P+IFSWNSL
Sbjct: 17  RVIRQIHAHVLTRLLPISAVSFLLSKIVGFCALSRHGDINHARKVFAQTPNPNIFSWNSL 76

Query: 62  IKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLKD 121
           I+G   + + SK P+ L+++L   GYP  N+FT+AFVLKAC+ + AF+EG QVH+RV + 
Sbjct: 77  IRGYYLVGSQSKVPLFLYKELVGKGYPSANTFTLAFVLKACSNILAFDEGKQVHARVFRS 136

Query: 122 GFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGLF 181
           GFGS+ FVQT L+NFY KCE+IGLA KVFDE+ +RN++AW+ MISG+  +G V++A G F
Sbjct: 137 GFGSNQFVQTGLLNFYAKCEDIGLAEKVFDEIHERNVIAWSTMISGYAMMGLVNKAFGAF 196

Query: 182 REMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAKC 241
           REMQ + V PD VT+VSV+SACA AGALDIG W+HAYIEKH + TD+ L TALV+MYAKC
Sbjct: 197 REMQTSNVVPDKVTMVSVISACAMAGALDIGRWIHAYIEKHMIETDIMLSTALVNMYAKC 256

Query: 242 GCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGIL 301
           GCIE+A ++F  +PV+D +AWSSMI+G A HGL+E+A+  F +M E++V P  VTFIG+L
Sbjct: 257 GCIEKATEIFKGIPVKDHKAWSSMIVGLAVHGLAEEALEAFSRMEESKVTPSHVTFIGVL 316

Query: 302 SACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSNP 361
           SACAHGG+VSEG R+WS M+E GIEPS+EHYGC+VDLLCR  LV EA   V T     NP
Sbjct: 317 SACAHGGLVSEGRRYWSSMIELGIEPSIEHYGCMVDLLCRASLVGEACSFVQTMPFYPNP 376

Query: 362 ATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKEM 421
             WR+LL GC+K K+ + GE+    LL LEP N ENYI++SN Y+SV+QWEKMS +RK M
Sbjct: 377 VIWRTLLIGCQKNKMLHKGEVAGEQLLVLEPSNPENYILLSNFYASVAQWEKMSHVRKMM 436

Query: 422 KENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDVL 481
           KE  +K +PGC+SIE+DG VHEF MGD  HPE K +R+ +  +A+RV D+GY P VSDVL
Sbjct: 437 KERGMKVVPGCASIEIDGFVHEFVMGDWHHPEAKEIRQALRVIAERVSDAGYEPQVSDVL 496

Query: 482 HKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYERE 541
           H V  EEK   L EHSER AIAYG+LKT+APV IR+VKNLRVC DCHEV KIISKIYERE
Sbjct: 497 HNVGNEEKGIYLCEHSERLAIAYGILKTKAPVPIRIVKNLRVCIDCHEVTKIISKIYERE 556

Query: 542 IIVRDRVRFHKFVEGTCSCKDYW 565
           IIVRDRVRFHKFV+GTCSCKDYW
Sbjct: 557 IIVRDRVRFHKFVDGTCSCKDYW 579

BLAST of CmoCh14G004650 vs. TrEMBL
Match: E6NUE8_JATCU (JMS10C05.1 protein OS=Jatropha curcas GN=JMS10C05.1 PE=4 SV=1)

HSP 1 Score: 738.4 bits (1905), Expect = 6.4e-210
Identity = 352/564 (62.41%), Positives = 444/564 (78.72%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           M++LRQ+HA ILT   P+S+ SF +SKI  F ALSP GN DYAR +F+QI  P IF++NS
Sbjct: 1   MQILRQIHARILTHVPPISSVSFLISKILSFAALSPFGNFDYARKIFSQIPNPGIFAYNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           +I+GC   + PSKEPI LF+ +   GYP PN+FTMAFVLKAC+I+ A EEG Q+H+++L+
Sbjct: 61  VIRGCLYTKIPSKEPIHLFKDMVGKGYPNPNTFTMAFVLKACSIIMALEEGKQIHAQILR 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            GF SS +VQ+SLVNFY KCEEI +A KVFDE+ +RNLV W+AM+SG+ R+G ++EA+ +
Sbjct: 121 SGFSSSPYVQSSLVNFYSKCEEITIARKVFDEITERNLVCWSAMVSGYARLGMINEALIM 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQ  G+EPD V+LV V+SACA  GALDIG WVHAYI+K  +  DLEL TAL++MYAK
Sbjct: 181 FREMQVVGIEPDEVSLVGVLSACAMVGALDIGKWVHAYIKKRMIHVDLELNTALINMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIE+A+++F  M V+D++AWSSMI+G A HGL+EDA+ VF +M EA+  P+ VTFIGI
Sbjct: 241 CGCIEKAREIFDEMRVKDSKAWSSMIVGLAIHGLAEDALNVFSRMEEAQAKPNHVTFIGI 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAHGG+VS+G R+WS MLE GIEPS+EHYGC+VDLLCR GL++EAY        P +
Sbjct: 301 LSACAHGGLVSDGKRYWSSMLELGIEPSMEHYGCMVDLLCRGGLIDEAYDFALIIPTP-D 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR+LL    K ++    E+VA  LL+LEP  AENYI+++NLY+SVSQ EK+S +RK 
Sbjct: 361 PVIWRTLLVAYTKNRMLQKAEMVAGKLLELEPWKAENYIILANLYASVSQLEKVSHVRKM 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MKEN +K +PGC+SIEVDG VH F  GD SHPE + +++ + ++A ++  SGY+P VS V
Sbjct: 421 MKENGIKALPGCTSIEVDGFVHNFVTGDWSHPEAEEIKKTLRDVALKILISGYKPFVSVV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH V  EEKE  L EHSER AIAYGL+KT+AP  IR+VKNLRVCGDCHEV KIISKIY+R
Sbjct: 481 LHLVNDEEKENVLYEHSERLAIAYGLMKTKAPATIRIVKNLRVCGDCHEVTKIISKIYDR 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFHKFV GTCSCKDYW
Sbjct: 541 EIIVRDRVRFHKFVNGTCSCKDYW 563

BLAST of CmoCh14G004650 vs. TAIR10
Match: AT1G59720.1 (AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 465.7 bits (1197), Expect = 4.1e-131
Identity = 247/581 (42.51%), Positives = 358/581 (61.62%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFS-FALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWN 60
           M  L+QLHA  L    P    + F   KI      S   +++YA  VF  I   S F WN
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLS--SSFSDVNYAFRVFDSIENHSSFMWN 120

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           +LI+ C+   +  +E   L++K+ E G   P+  T  FVLKACA +  F EG QVH +++
Sbjct: 121 TLIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIV 180

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K GFG   +V   L++ YG C  + LA KVFDEMP+R+LV+W +MI   VR G  D A+ 
Sbjct: 181 KHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQ 240

Query: 181 LFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKH---SVLTDLELGTALVD 240
           LFREMQ++  EPD  T+ SV+SACA  G+L +G+W HA++ +     V  D+ +  +L++
Sbjct: 241 LFREMQRS-FEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIE 300

Query: 241 MYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLE--AEVMPDR 300
           MY KCG +  A+QVF  M  RD  +W++MI+GFA HG +E+A+  F +M++    V P+ 
Sbjct: 301 MYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNS 360

Query: 301 VTFIGILSACAHGGIVSEGLRFWSLML-ECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVT 360
           VTF+G+L AC H G V++G +++ +M+ +  IEP++EHYGCIVDL+ R G + EA  +V 
Sbjct: 361 VTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVM 420

Query: 361 TTNIPSNPATWRSLLKGCKKKKLS-NLGEIVARYLLQLEPLNAEN-------YIVISNLY 420
           +  +  +   WRSLL  C KK  S  L E +AR ++  +  N  +       Y+++S +Y
Sbjct: 421 SMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVY 480

Query: 421 SSVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMA 480
           +S S+W  +  +RK M E+ ++  PGCSSIE++G+ HEF  GD SHP+ K + + ++ + 
Sbjct: 481 ASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVID 540

Query: 481 KRVWDSGYRP--SVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRV 540
            R+   GY P  S + ++       KE +L  HSER AIA+GL+       IR+ KNLRV
Sbjct: 541 DRLRSIGYLPDRSQAPLVDATNDGSKEYSLRLHSERLAIAFGLINLPPQTPIRIFKNLRV 600

Query: 541 CGDCHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           C DCHEV K+ISK++  EIIVRDRVRFH F +G+CSC DYW
Sbjct: 601 CNDCHEVTKLISKVFNTEIIVRDRVRFHHFKDGSCSCLDYW 638

BLAST of CmoCh14G004650 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 458.8 bits (1179), Expect = 5.0e-129
Identity = 232/564 (41.13%), Positives = 355/564 (62.94%), Query Frame = 1

Query: 4   LRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRP-SIFSWNSLI 63
           LRQ+HA  +   + +S        I    +L     + YA  VF++I +P ++F WN+LI
Sbjct: 33  LRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLI 92

Query: 64  KGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLKDG 123
           +G ++I N S    +L++++  +G   P++ T  F++KA   +     G  +HS V++ G
Sbjct: 93  RGYAEIGN-SISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSG 152

Query: 124 FGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGLFR 183
           FGS  +VQ SL++ Y  C ++  A KVFD+MP+++LVAW ++I+G    G  +EA+ L+ 
Sbjct: 153 FGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYT 212

Query: 184 EMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAKCG 243
           EM   G++PD  T+VS++SACA  GAL +G  VH Y+ K  +  +L     L+D+YA+CG
Sbjct: 213 EMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCG 272

Query: 244 CIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAE-VMPDRVTFIGIL 303
            +E AK +F  M  +++ +W+S+I+G A +G  ++AI +F+ M   E ++P  +TF+GIL
Sbjct: 273 RVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGIL 332

Query: 304 SACAHGGIVSEGLRFWSLMLE-CGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 363
            AC+H G+V EG  ++  M E   IEP +EH+GC+VDLL R G V++AY  + +  +  N
Sbjct: 333 YACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPN 392

Query: 364 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 423
              WR+LL  C     S+L E     +LQLEP ++ +Y+++SN+Y+S  +W  + ++RK+
Sbjct: 393 VVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQ 452

Query: 424 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 483
           M  + VK +PG S +EV   VHEF MGD+SHP+   +   ++EM  R+   GY P +S+V
Sbjct: 453 MLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNV 512

Query: 484 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 543
              V  EEKE A+  HSE+ AIA+ L+ T     I VVKNLRVC DCH  IK++SK+Y R
Sbjct: 513 YVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNR 572

Query: 544 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EI+VRDR RFH F  G+CSC+DYW
Sbjct: 573 EIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CmoCh14G004650 vs. TAIR10
Match: AT2G02980.1 (AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 454.5 bits (1168), Expect = 9.4e-128
Identity = 235/567 (41.45%), Positives = 354/567 (62.43%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPL-GNIDYARLVFAQISRPSIFSWN 60
           +R L Q+ A+ +   +   +F   ++K+  FC  SP   ++ YAR +F  +S P I  +N
Sbjct: 42  LRELMQIQAYAIKSHIEDVSF---VAKLINFCTESPTESSMSYARHLFEAMSEPDIVIFN 101

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           S+ +G S+  NP  E  +LF ++ E G  +P+++T   +LKACA+  A EEG Q+H   +
Sbjct: 102 SMARGYSRFTNPL-EVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQLHCLSM 161

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K G   + +V  +L+N Y +CE++  A  VFD + +  +V + AMI+G+ R    +EA+ 
Sbjct: 162 KLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALS 221

Query: 181 LFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYA 240
           LFREMQ   ++P+ +TL+SV+S+CA  G+LD+G W+H Y +KHS    +++ TAL+DM+A
Sbjct: 222 LFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFA 281

Query: 241 KCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIG 300
           KCG ++ A  +F  M  +DT+AWS+MI+ +A HG +E ++ +F +M    V PD +TF+G
Sbjct: 282 KCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLG 341

Query: 301 ILSACAHGGIVSEGLRFWSLML-ECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIP 360
           +L+AC+H G V EG +++S M+ + GI PS++HYG +VDLL R G +E+AY  +    I 
Sbjct: 342 LLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPIS 401

Query: 361 SNPATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELR 420
             P  WR LL  C      +L E V+  + +L+  +  +Y+++SNLY+   +WE +  LR
Sbjct: 402 PTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLR 461

Query: 421 KEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVS 480
           K MK+     +PGCSSIEV+ VVHEF  GD        L   ++EM K +  SGY P  S
Sbjct: 462 KVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTS 521

Query: 481 DVLHKVMY-EEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKI 540
            V+H  M  +EKE  L  HSE+ AI +GLL T     IRVVKNLRVC DCH   K+IS I
Sbjct: 522 MVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLI 581

Query: 541 YEREIIVRDRVRFHKFVEGTCSCKDYW 565
           + R++++RD  RFH F +G CSC D+W
Sbjct: 582 FGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CmoCh14G004650 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 443.4 bits (1139), Expect = 2.2e-124
Identity = 217/530 (40.94%), Positives = 335/530 (63.21%), Query Frame = 1

Query: 38  GNIDYARLVFAQISRPSIFSWNSLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAF 97
           G I+ A+ +F +I    + SWN++I G ++  N  KE + LF+ + +T    P+  TM  
Sbjct: 214 GYIENAQKLFDEIPVKDVVSWNAMISGYAETGN-YKEALELFKDMMKTNVR-PDESTMVT 273

Query: 98  VLKACAIVTAFEEGLQVHSRVLKDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRN 157
           V+ ACA   + E G QVH  +   GFGS+  +  +L++ Y KC E+  A  +F+ +P ++
Sbjct: 274 VVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKD 333

Query: 158 LVAWTAMISGHVRVGAVDEAMGLFREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHA 217
           +++W  +I G+  +    EA+ LF+EM ++G  P+ VT++S++ ACA  GA+DIG W+H 
Sbjct: 334 VISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHV 393

Query: 218 YIEKH--SVLTDLELGTALVDMYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLS 277
           YI+K    V     L T+L+DMYAKCG IE A QVF  +  +   +W++MI GFA HG +
Sbjct: 394 YIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRA 453

Query: 278 EDAIGVFRQMLEAEVMPDRVTFIGILSACAHGGIVSEGLRFWSLMLE-CGIEPSVEHYGC 337
           + +  +F +M +  + PD +TF+G+LSAC+H G++  G   +  M +   + P +EHYGC
Sbjct: 454 DASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGC 513

Query: 338 IVDLLCRTGLVEEAYRIVTTTNIPSNPATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLN 397
           ++DLL  +GL +EA  ++    +  +   W SLLK CK      LGE  A  L+++EP N
Sbjct: 514 MIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPEN 573

Query: 398 AENYIVISNLYSSVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEV 457
             +Y+++SN+Y+S  +W ++++ R  + +  +K +PGCSSIE+D VVHEF +GD+ HP  
Sbjct: 574 PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRN 633

Query: 458 KILREFMEEMAKRVWDSGYRPSVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVV 517
           + +   +EEM   +  +G+ P  S+VL ++  E KEGAL  HSE+ AIA+GL+ T+    
Sbjct: 634 REIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTK 693

Query: 518 IRVVKNLRVCGDCHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           + +VKNLRVC +CHE  K+ISKIY+REII RDR RFH F +G CSC DYW
Sbjct: 694 LTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CmoCh14G004650 vs. TAIR10
Match: AT3G57430.1 (AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 429.5 bits (1103), Expect = 3.2e-120
Identity = 233/578 (40.31%), Positives = 351/578 (60.73%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKIT-VFCALSPLGNIDYARLVFAQISRPSIFSWN 60
           +R  ++LHA+ L     L   SF  S +  ++C    + +    R VF  +    I  WN
Sbjct: 318 LRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKQVLS---GRRVFDGMFDRKIGLWN 377

Query: 61  SLIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVL 120
           ++I G S+ ++  KE + LF  + E+   + NS TMA V+ AC    AF     +H  V+
Sbjct: 378 AMIAGYSQNEH-DKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVV 437

Query: 121 KDGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMG 180
           K G     FVQ +L++ Y +  +I +A ++F +M DR+LV W  MI+G+V     ++A+ 
Sbjct: 438 KRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALL 497

Query: 181 LFREMQ-----------KAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDL 240
           L  +MQ           +  ++P+++TL++++ +CAA  AL  G  +HAY  K+++ TD+
Sbjct: 498 LLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDV 557

Query: 241 ELGTALVDMYAKCGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEA 300
            +G+ALVDMYAKCGC++ +++VF  +P ++   W+ +IM +  HG  ++AI + R M+  
Sbjct: 558 AVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQ 617

Query: 301 EVMPDRVTFIGILSACAHGGIVSEGLR-FWSLMLECGIEPSVEHYGCIVDLLCRTGLVEE 360
            V P+ VTFI + +AC+H G+V EGLR F+ +  + G+EPS +HY C+VDLL R G ++E
Sbjct: 618 GVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKE 677

Query: 361 AYRIVTTTNIPSNPA-TWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYS 420
           AY+++       N A  W SLL   +      +GEI A+ L+QLEP  A +Y++++N+YS
Sbjct: 678 AYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYS 737

Query: 421 SVSQWEKMSELRKEMKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAK 480
           S   W+K +E+R+ MKE  V+  PGCS IE    VH+F  GD SHP+ + L  ++E + +
Sbjct: 738 SAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWE 797

Query: 481 RVWDSGYRPSVSDVLHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGD 540
           R+   GY P  S VLH V  +EKE  L  HSE+ AIA+G+L T    +IRV KNLRVC D
Sbjct: 798 RMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCND 857

Query: 541 CHEVIKIISKIYEREIIVRDRVRFHKFVEGTCSCKDYW 565
           CH   K ISKI +REII+RD  RFH+F  GTCSC DYW
Sbjct: 858 CHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

BLAST of CmoCh14G004650 vs. NCBI nr
Match: gi|659084927|ref|XP_008443148.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial-like [Cucumis melo])

HSP 1 Score: 976.1 bits (2522), Expect = 2.6e-281
Identity = 479/564 (84.93%), Positives = 512/564 (90.78%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQLHAHILTRPLPLS+F+FALSKI  FCALSP GNIDYAR VF QI  P+IFSWNS
Sbjct: 1   MRVLRQLHAHILTRPLPLSSFAFALSKIVAFCALSPFGNIDYARSVFVQIPHPNIFSWNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           LIKG S+I  PSKEPI LF+KLTETGYPVPNSFT+AFVLKACAIV AF EGLQVHS VLK
Sbjct: 61  LIKGYSQIYTPSKEPIFLFKKLTETGYPVPNSFTLAFVLKACAIVAAFGEGLQVHSHVLK 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
           DGFGSS FVQTSLVNFYGKCEEIG A KVFDEMP RNLVAWTAMISGH RVGAVDEAMGL
Sbjct: 121 DGFGSSLFVQTSLVNFYGKCEEIGFARKVFDEMPVRNLVAWTAMISGHARVGAVDEAMGL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQKAGV+PDA+TLVSVVSACA AGALDIG W+HAYIEK+ VLTDLEL TAL+DMYAK
Sbjct: 181 FREMQKAGVQPDAMTLVSVVSACAVAGALDIGCWLHAYIEKYFVLTDLELSTALLDMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAKQVFVHMPV+DT AWSSMIMG AYHGL EDA+  F+QMLE EVMPD VTF+ +
Sbjct: 241 CGCIERAKQVFVHMPVKDTTAWSSMIMGLAYHGLVEDAVDAFQQMLETEVMPDHVTFLAV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAHGG+VS G RFWSLMLE GIEPSVEHYGC VDLLCR+GLVEEAYRI TT  IP N
Sbjct: 301 LSACAHGGLVSRGRRFWSLMLEFGIEPSVEHYGCKVDLLCRSGLVEEAYRITTTMKIPPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
            ATWRSLL GCKKKKL NLGEI+ARYLL+LEPLNAENYI+ISNLYSS+SQWEKMSELRK 
Sbjct: 361 AATWRSLLMGCKKKKLLNLGEIIARYLLELEPLNAENYIMISNLYSSLSQWEKMSELRKV 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MKE  +KP+PGCSSIEVDGVVHEF MGDQSHPEVK+LREFM+EM+ RV DSGYRPS+SDV
Sbjct: 421 MKEKCIKPVPGCSSIEVDGVVHEFVMGDQSHPEVKVLREFMKEMSMRVRDSGYRPSISDV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LHKV+ EEKE AL EHSERFAIAYGLLKTRAPVVIRVVKNLRVC DCHEVIKIISK+YER
Sbjct: 481 LHKVVDEEKECALSEHSERFAIAYGLLKTRAPVVIRVVKNLRVCVDCHEVIKIISKLYER 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFHKF++GTCSCKD+W
Sbjct: 541 EIIVRDRVRFHKFIKGTCSCKDFW 564

BLAST of CmoCh14G004650 vs. NCBI nr
Match: gi|778684969|ref|XP_004136598.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 969.9 bits (2506), Expect = 1.9e-279
Identity = 477/564 (84.57%), Positives = 513/564 (90.96%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQLHAHILTRPLPLS+F+FALSKI  FCALSP GNI+YAR VFAQI  P+IFSWNS
Sbjct: 1   MRVLRQLHAHILTRPLPLSSFAFALSKIVAFCALSPFGNINYARSVFAQIPHPNIFSWNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           LIKG S+I   SKEPI LF+KLTETGYPVPNSFT+AFVLKACAIVTAF EGLQVHS VLK
Sbjct: 61  LIKGYSQIHTLSKEPIFLFKKLTETGYPVPNSFTLAFVLKACAIVTAFGEGLQVHSHVLK 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
           DGFGSS FVQTSLVNFYGKCEEIG A KVF+EMP RNLVAWTAMISGH RVGAVDEAM L
Sbjct: 121 DGFGSSLFVQTSLVNFYGKCEEIGFARKVFEEMPVRNLVAWTAMISGHARVGAVDEAMEL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQKAG++PDA+TLVSVVSACA AGALDIG W+HAYIEK+ VLTDLEL TALVDMYAK
Sbjct: 181 FREMQKAGIQPDAMTLVSVVSACAVAGALDIGCWLHAYIEKYFVLTDLELSTALVDMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAKQVFVHMPV+DT AWSSMIMGFAYHGL++DAI  F+QMLE EV PD VTF+ +
Sbjct: 241 CGCIERAKQVFVHMPVKDTTAWSSMIMGFAYHGLAQDAIDAFQQMLETEVTPDHVTFLAV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAHGG+VS G RFWSLMLE GIEPSVEHYGC VDLLCR+GLVEEAYRI TT  IP N
Sbjct: 301 LSACAHGGLVSRGRRFWSLMLEFGIEPSVEHYGCKVDLLCRSGLVEEAYRITTTMKIPPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
            ATWRSLL GCKKKKL NLGEIVARYLL+LEPLNAEN+I+ISNLYSS+SQWEKMSELRK 
Sbjct: 361 AATWRSLLMGCKKKKLLNLGEIVARYLLELEPLNAENFIMISNLYSSLSQWEKMSELRKV 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MKE  +KP+PGCSSIEVDGVVHEF MGDQSHPEVK+LREFMEEM+ RV DSGYRPS+SDV
Sbjct: 421 MKEKCIKPVPGCSSIEVDGVVHEFVMGDQSHPEVKMLREFMEEMSMRVRDSGYRPSISDV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LHKV+ EEKE AL EHSERFAIAYGLLKTRAP+VIRVVKNLRVC DCHEVIKIISK+YER
Sbjct: 481 LHKVVDEEKECALSEHSERFAIAYGLLKTRAPIVIRVVKNLRVCVDCHEVIKIISKLYER 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFHKF++GTCSCKD+W
Sbjct: 541 EIIVRDRVRFHKFIKGTCSCKDFW 564

BLAST of CmoCh14G004650 vs. NCBI nr
Match: gi|225441789|ref|XP_002283735.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21065 [Vitis vinifera])

HSP 1 Score: 795.4 bits (2053), Expect = 6.4e-227
Identity = 382/564 (67.73%), Positives = 464/564 (82.27%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQ+HA +LT  +P+S+ SF L KI  FCALSP G+IDYAR +F+QI RP+IFSWNS
Sbjct: 1   MRVLRQIHARLLTHAMPISSISFGLCKIIGFCALSPYGDIDYARKLFSQIQRPNIFSWNS 60

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           +I+GCS+ Q PSKEP+ LF+K+   GYP PN+FTMAFVLKAC+IV+A EEG QVH+ VLK
Sbjct: 61  MIRGCSQSQTPSKEPVILFRKMVRRGYPNPNTFTMAFVLKACSIVSALEEGQQVHANVLK 120

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            GFGSS FV+T+LVNFY KCE+I LA+KVFDE+ DRNLVAW+ MISG+ R+G V+EA+GL
Sbjct: 121 SGFGSSPFVETALVNFYAKCEDIVLASKVFDEITDRNLVAWSTMISGYARIGLVNEALGL 180

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FR+MQKAGV PD VT+VSV+SACAA+GALD G WVHAYI K  + TDLEL TALV+MYAK
Sbjct: 181 FRDMQKAGVVPDEVTMVSVISACAASGALDTGKWVHAYINKQLIETDLELSTALVNMYAK 240

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAK+VF  MPV+DT+AWSSMI+G A +GL+EDA+  F +M EA+V P+ VTFIG+
Sbjct: 241 CGCIERAKEVFDAMPVKDTKAWSSMIVGLAINGLAEDALEEFFRMEEAKVKPNHVTFIGV 300

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAH G+VSEG R+WS MLE GI PS+E YGC+VDLLCR  LVE+A  +V T  I  N
Sbjct: 301 LSACAHSGLVSEGRRYWSSMLEFGIVPSMELYGCMVDLLCRASLVEDACTLVETMPISPN 360

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR+LL GCKK K  +  E+VA+ LL+LEP NAENYI++SNLY+S+SQWEKMS++RK+
Sbjct: 361 PVIWRTLLVGCKKSKNLDKSEVVAQRLLELEPHNAENYILLSNLYASMSQWEKMSQVRKK 420

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MK   +K +PGCSSIEVDG+VHEF MGD SHPE   +RE + +++KRV   G++P +SDV
Sbjct: 421 MKGMGIKAVPGCSSIEVDGLVHEFVMGDWSHPEAMEVREILRDISKRVHAVGHQPGISDV 480

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH V+ EEKE AL EHSER AIAYGLLKT+ P+ IR+VKNLRVCGDCHEV KIIS  Y R
Sbjct: 481 LHNVVDEEKENALCEHSERLAIAYGLLKTKTPMAIRIVKNLRVCGDCHEVTKIISAEYRR 540

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFHKFV G+CSC+D+W
Sbjct: 541 EIIVRDRVRFHKFVNGSCSCRDFW 564

BLAST of CmoCh14G004650 vs. NCBI nr
Match: gi|297739678|emb|CBI29860.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 783.1 bits (2021), Expect = 3.3e-223
Identity = 379/559 (67.80%), Positives = 459/559 (82.11%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MRVLRQ+HA +LT  +P+S+ SF L KI  FCALSP G+IDYAR +F+QI RP+IFSWNS
Sbjct: 70  MRVLRQIHARLLTHAMPISSISFGLCKIIGFCALSPYGDIDYARKLFSQIQRPNIFSWNS 129

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           +I+GCS+ Q PSKEP+ LF+K+   GYP PN+FTMAFVLKAC+IV+A EEG QVH+ VLK
Sbjct: 130 MIRGCSQSQTPSKEPVILFRKMVRRGYPNPNTFTMAFVLKACSIVSALEEGQQVHANVLK 189

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            GFGSS FV+T+LVNFY KCE+I LA+KVFDE+ DRNLVAW+ MISG+ R+G V+EA+GL
Sbjct: 190 SGFGSSPFVETALVNFYAKCEDIVLASKVFDEITDRNLVAWSTMISGYARIGLVNEALGL 249

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FR+MQKAGV PD VT+VSV+SACAA+GALD G WVHAYI K  + TDLEL TALV+MYAK
Sbjct: 250 FRDMQKAGVVPDEVTMVSVISACAASGALDTGKWVHAYINKQLIETDLELSTALVNMYAK 309

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIERAK+VF  MPV+DT+AWSSMI+G A +GL+EDA+  F +M EA+V P+ VTFIG+
Sbjct: 310 CGCIERAKEVFDAMPVKDTKAWSSMIVGLAINGLAEDALEEFFRMEEAKVKPNHVTFIGV 369

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAH G+VSEG R+WS MLE GI PS+E YGC+VDLLCR  LVE+A  +V T  I  N
Sbjct: 370 LSACAHSGLVSEGRRYWSSMLEFGIVPSMELYGCMVDLLCRASLVEDACTLVETMPISPN 429

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR+LL GCKK K  +  E+VA+ LL+LEP NAENYI++SNLY+S+SQWEKMS++RK+
Sbjct: 430 PVIWRTLLVGCKKSKNLDKSEVVAQRLLELEPHNAENYILLSNLYASMSQWEKMSQVRKK 489

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MK   +K +PGCSSIEVDG+VHEF MGD SHPE   +RE + +++KRV   G++P +SDV
Sbjct: 490 MKGMGIKAVPGCSSIEVDGLVHEFVMGDWSHPEAMEVREILRDISKRVHAVGHQPGISDV 549

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH V+ EEKE AL EHSER AIAYGLLKT+ P+ IR+VKNLRVCGDCHEV KIIS  Y R
Sbjct: 550 LHNVVDEEKENALCEHSERLAIAYGLLKTKTPMAIRIVKNLRVCGDCHEVTKIISAEYRR 609

Query: 541 EIIVRDRVRFHKFVEGTCS 560
           EIIVRDRVRFHKFV G+CS
Sbjct: 610 EIIVRDRVRFHKFVNGSCS 628

BLAST of CmoCh14G004650 vs. NCBI nr
Match: gi|720094066|ref|XP_010246245.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Nelumbo nucifera])

HSP 1 Score: 781.9 bits (2018), Expect = 7.3e-223
Identity = 364/564 (64.54%), Positives = 454/564 (80.50%), Query Frame = 1

Query: 1   MRVLRQLHAHILTRPLPLSTFSFALSKITVFCALSPLGNIDYARLVFAQISRPSIFSWNS 60
           MR++ Q+HAH L + LP +T S+AL+KI  FCALSP+G+IDYAR VF++I  PSIFSWN 
Sbjct: 34  MRIVHQIHAHFLVQGLPSTTLSYALNKIVGFCALSPVGDIDYARSVFSRIRNPSIFSWNC 93

Query: 61  LIKGCSKIQNPSKEPIALFQKLTETGYPVPNSFTMAFVLKACAIVTAFEEGLQVHSRVLK 120
           LI+GCS ++ PSKEP  LF++L + GYP PNSFT+AFVLKAC+IV+AF EGLQ+HS VL+
Sbjct: 94  LIRGCSLLEIPSKEPFFLFKRLIQRGYPSPNSFTLAFVLKACSIVSAFSEGLQIHSHVLR 153

Query: 121 DGFGSSSFVQTSLVNFYGKCEEIGLATKVFDEMPDRNLVAWTAMISGHVRVGAVDEAMGL 180
            G GSS F+QT+LVNFY KCEEI  A   FDE+P+RNLVAW+ MISG+ + G V+E++ L
Sbjct: 154 SGLGSSQFIQTALVNFYAKCEEIRFARCAFDEIPERNLVAWSTMISGYTKTGLVNESLSL 213

Query: 181 FREMQKAGVEPDAVTLVSVVSACAAAGALDIGSWVHAYIEKHSVLTDLELGTALVDMYAK 240
           FREMQK  + PD VT+VSV+SACAAAGAL +G WVHA+I+KH +  DLELGTAL +MY K
Sbjct: 214 FREMQKTEISPDKVTMVSVLSACAAAGALGLGRWVHAFIDKHMINVDLELGTALFNMYTK 273

Query: 241 CGCIERAKQVFVHMPVRDTRAWSSMIMGFAYHGLSEDAIGVFRQMLEAEVMPDRVTFIGI 300
           CGCIE+A+++F  MP+RDT+AWSSMIMG A HGL EDA+  F QM+E +V P++ TF+G+
Sbjct: 274 CGCIEKARELFDGMPMRDTKAWSSMIMGLAIHGLKEDALHFFSQMVEMKVKPNKATFVGV 333

Query: 301 LSACAHGGIVSEGLRFWSLMLECGIEPSVEHYGCIVDLLCRTGLVEEAYRIVTTTNIPSN 360
           LSACAHGG+V+EG R+WS ML+ GIEPS+EHYGC+VDLLCR GLVEEA   V    I  N
Sbjct: 334 LSACAHGGLVAEGWRYWSCMLKLGIEPSIEHYGCMVDLLCRVGLVEEACTFVEAMPISPN 393

Query: 361 PATWRSLLKGCKKKKLSNLGEIVARYLLQLEPLNAENYIVISNLYSSVSQWEKMSELRKE 420
           P  WR+LL GC+K  L + GEI+A  LL+LEPLN ENYI++SN+Y+S SQWEK   +RK+
Sbjct: 394 PVIWRTLLVGCRKSGLLDKGEIIAGQLLELEPLNGENYILLSNMYASSSQWEKAKYVRKK 453

Query: 421 MKENDVKPIPGCSSIEVDGVVHEFEMGDQSHPEVKILREFMEEMAKRVWDSGYRPSVSDV 480
           MK+N +K +PGCSSIE+DG +H+F M D SHPE K +   +E+++ R+  +G+ P  SDV
Sbjct: 454 MKDNGLKLVPGCSSIEIDGFIHKFVMADGSHPETKEITRLLEDISDRIRHAGHEPCTSDV 513

Query: 481 LHKVMYEEKEGALGEHSERFAIAYGLLKTRAPVVIRVVKNLRVCGDCHEVIKIISKIYER 540
           LH V  EEKE AL EHSER AIAYGLLKT+APVVIRVVKNLR CGDCHEV K+ISKIYER
Sbjct: 514 LHDVSDEEKENALFEHSERLAIAYGLLKTKAPVVIRVVKNLRFCGDCHEVTKLISKIYER 573

Query: 541 EIIVRDRVRFHKFVEGTCSCKDYW 565
           EIIVRDRVRFH+F++G CSCKD+W
Sbjct: 574 EIIVRDRVRFHRFIDGACSCKDFW 597
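
The Identity and Positives percentages in the HSP headers above are ratios over the alignment length (for example, 247 identical positions out of 581 aligned columns gives 42.51%). The following is a minimal sketch, in Python, of how such a header line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned field names are illustrative assumptions, not part of any BLAST tool, and the regular expression is written only against the header layout shown on this page.

```python
import re

# Matches the statistics line of an HSP header as printed on this page, e.g.
# "Identity = 247/581 (42.51%), Postives = 358/581 (61.62%), Query Frame = 1".
# "Posi?tives" accepts both the "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if no match.

    Hypothetical helper for illustration only.
    """
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = ("Identity = 247/581 (42.51%), Postives = 358/581 (61.62%), "
              "Query Frame = 1")
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported percentages is a cheap consistency check when these headers are extracted from a scraped page rather than from native BLAST output.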

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR85_ARATH  7.2e-130  42.51     Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
PP330_ARATH  8.8e-128  41.13     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP145_ARATH  1.7e-126  41.45     Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
PPR21_ARATH  3.8e-123  40.94     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP285_ARATH  5.7e-119  40.31     Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
Match Name        E-value   Identity  Description
A0A0A0LF80_CUCSA  1.3e-279  84.57     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G812780 PE=4 SV=1
F6HL02_VITVI      2.3e-223  67.80     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g08320 PE=4 SV=...
W9RU69_9ROSA      9.6e-214  62.94     Uncharacterized protein OS=Morus notabilis GN=L484_018009 PE=4 SV=1
A0A0D2RW39_GOSRA  3.6e-213  63.77     Uncharacterized protein OS=Gossypium raimondii GN=B456_006G137700 PE=4 SV=1
E6NUE8_JATCU      6.4e-210  62.41     JMS10C05.1 protein OS=Jatropha curcas GN=JMS10C05.1 PE=4 SV=1
Match Name   E-value   Identity  Description
AT1G59720.1  4.1e-131  42.51     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1  5.0e-129  41.13     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1  9.4e-128  41.45     Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1  2.2e-124  40.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1  3.2e-120  40.31     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|659084927|ref|XP_008443148.1|  2.6e-281  84.93     PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial-...
gi|778684969|ref|XP_004136598.2|  1.9e-279  84.57     PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial-...
gi|225441789|ref|XP_002283735.1|  6.4e-227  67.73     PREDICTED: pentatricopeptide repeat-containing protein At4g21065 [Vitis vinifera...
gi|297739678|emb|CBI29860.3|      3.3e-223  67.80     unnamed protein product [Vitis vinifera]
gi|720094066|ref|XP_010246245.1|  7.3e-223  64.54     PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Nelumbo n...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G004650.1  CmoCh14G004650.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR
    coord: 232..255, score: 0.049
    coord: 261..288, score: 3.5E-5
    coord: 331..351, score:
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2
    coord: 157..204, score: 3.4
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756
    coord: 261..293, score: 3.0E-4
    coord: 295..329, score: 3.2E-4
    coord: 159..193, score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR
    coord: 394..428, score: 8.495
    coord: 328..362, score: 5.897
    coord: 91..125, score: 6.73
    coord: 293..327, score: 9.832
    coord: 54..89, score: 8.923
    coord: 157..191, score: 13.055
    coord: 192..226, score: 6.215
    coord: 227..257, score: 7.059
    coord: 258..292, score: 10.545
    coord: 126..156, score:
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED
    coord: 1..435, score: 2.5E
None       No IPR available          PANTHER   PTHR24015:SF838  SUBFAMILY NOT NAMED
    coord: 1..435, score: 2.5E
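
The PROSITE PS51375 (PPR) profile hits in the InterPro listing are given out of order. Sorting them by start coordinate shows how the repeats tile the protein. The sketch below, in Python, hard-codes the ten coordinate pairs from the listing and merges hits that touch or overlap. The names `PPR_HITS` and `merge_adjacent` are illustrative assumptions, and treating back-to-back hits as one contiguous repeat block is an interpretation for display purposes, not something the listing itself states.

```python
# PROSITE PS51375 (PPR) profile hits copied from the InterPro listing,
# as (start, end) residue coordinates, in the order they are listed.
PPR_HITS = [
    (394, 428), (328, 362), (91, 125), (293, 327), (54, 89),
    (157, 191), (192, 226), (227, 257), (258, 292), (126, 156),
]


def merge_adjacent(hits):
    """Sort hits by start and merge those that touch or overlap.

    Two hits are merged when the next one starts no later than one
    residue after the current block ends. Hypothetical helper.
    """
    blocks = []
    for start, end in sorted(hits):
        if blocks and start <= blocks[-1][1] + 1:
            # Extend the current block to cover this hit.
            blocks[-1] = (blocks[-1][0], max(blocks[-1][1], end))
        else:
            blocks.append((start, end))
    return blocks


if __name__ == "__main__":
    for start, end in sorted(PPR_HITS):
        print(f"PPR hit  {start:>3}..{end:<3}  length {end - start + 1}")
    for start, end in merge_adjacent(PPR_HITS):
        print(f"block    {start:>3}..{end:<3}")
```

Merging on `end + 1` rather than strict overlap is a deliberate choice here, since consecutive profile hits in the listing abut without sharing residues.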