CmoCh04G016970 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G016970
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Pentatricopeptide repeat-containing family protein
Location: Cmo_Chr04: 8603211..8605028 (-)
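As a quick consistency check on the Location field, the feature length can be derived from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual GFF-style convention but is not stated on this page; `feature_length` is a hypothetical helper name.

```python
def feature_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Coordinates copied from the Location field above.
span = feature_length(8603211, 8605028)
print(span)       # feature length in bp
print(span % 3)   # 0 means the span is a whole number of codons
```

Under that assumption the span is a whole number of codons, which fits a gene model whose gene, mRNA, and CDS sequences are listed as the same string.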
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCCTGTCAATCCCACCAAATGGACAGTCTATTCCGGTGGAAATCAAGCATGAGACATTCCATGCCCGTCAAAGTCGACTTCTTCACTTGCTGAGCGACTGTACTGATTTGTCGAGGCTCAAGCAAATACACGCTCAGGCACTTCGCACCTTCTCCACCCACAAATCTTCTCTCTTCCTCTTTAGCCGAATTCTTCACGTTTCTTCATTAACTGATTTTGAGTATGCATTACGCGTTTTTGATCAAATCGATAGCCCTAATTCGTTCATGTGGAACACTCTAATCGGAGCCTGTGCACGGAGCTTGGACCGGAAAGAGCAGGCGATTGAACTCTTCTACAGAATGTTAGAAGAAGGCTCAGTTGAACCAGATAAACATACTTTTCCTTTTCTTCTTAAAGCGTGTGCTTACGTATTCGCTTTATCTGAAGGGAGACAGGCGCATGCTCATATCGTTAAACGTGGGTTAGATTCGGATGTTTATGTTGGCAATAGCTTGATTCATTTATATGCTTCTTGTGGCTGTTTGAGTTTAGCATTGAAGGTGTTCGAGAAAATGCCTCACAGAAGTTTGGTTTCGTGGAATGTGATGATTGATGCGTATGTACAATGTGGGCTTTTTAAAAATGCTCTCAATCTATTCGCTGAAATGCAGAACACTTTTGAGCCCGATGGATATACAATGCAGAGCATAATTAGCGCTTGTGCGGGTATTGGAGCTTTATCTCTGGGGATGTGGTCTCATGCTTATGTGTTGAGGAAGACTGGTGGCGCTATGGTCGGTGAGGTCCTGATCAACTCCTCACTGGTGGATATGTACAGCAAGTGTGGTTCTTTGAGTATGGCTCAGCAGGTCTTCGAGACAATGCCCAAACATGACCTGAATTCATGGAATTCAATGATTTTAGCGTTTGCCATGCATGGACGGGCGGAAGCTGCCTTGGGATGTTTCTCTCGGCTGGTTGAAACGGAGAAGTTCCTGCCCAACTCTGTCACGTTTGTAGGTGTTCTTAGTGCATGTAACCACAGAGGTATGGTTGCTGAAGGCCGGAAATATTTTGATATGATGGTTAATGAATACAAGATTGTACCCCGGTTGGAGCACTATGGATGCCTTGTTGATCTCCTATCACGCTCTGGTTTCATTGATGAAGCTTTGGAGTTGGTGACAAATATTCATATAAAACCAGATGCAGTGATCTGGAGGAGTCTTCTTGATGCTTGTTATAAGCAGAATGCTGGCGTTGAGCTGAGCGAAGAAGTGGCATTGCAGATTCTTCAATCTGAAACAACAACTTCTAGTGGTGTTTATGTGCTGTTGTCAAGAGTCTATGCTTCAGCACACCGGTGGAACGATGTCGGGTTAGTTAGGAAGGCAATGGCCGACAAGGGTGTGGCAAAAGAGCCAGGCTGCAGTTCAATAGAAATAGATGGTGTTAGCCATGAGTTTTTTGCAGGAGACACATCTCACCCCAAGATAAAAGAGATCTATGCTGTTATTGATTTGATCGAAGAAAAACTACAGAAGCATGGTTATTCACCTGACTATTCACAGGCAACCATGGTCGACGACCCCGATACCGTCAAATGGCAGTCGCTTAAGTTGCATAGTGAGAGATTCGCCATTGCTTTTGGGCTACTAAACTTGAAACCTGGGATGCCAATACGCATATTCAAGAATCTTAGAGTATGCAACGACTGCCACCAGGTAACCAAGTTGATTTCTCGAATTTTTAACGTAGAGATTATCATGAGAGATCGTAATAGGTTTCATCATTTTGAGAATGGCATGTGTTCCTGCATGGACTTCTGGTGA

mRNA sequence

ATGCTCCTGTCAATCCCACCAAATGGACAGTCTATTCCGGTGGAAATCAAGCATGAGACATTCCATGCCCGTCAAAGTCGACTTCTTCACTTGCTGAGCGACTGTACTGATTTGTCGAGGCTCAAGCAAATACACGCTCAGGCACTTCGCACCTTCTCCACCCACAAATCTTCTCTCTTCCTCTTTAGCCGAATTCTTCACGTTTCTTCATTAACTGATTTTGAGTATGCATTACGCGTTTTTGATCAAATCGATAGCCCTAATTCGTTCATGTGGAACACTCTAATCGGAGCCTGTGCACGGAGCTTGGACCGGAAAGAGCAGGCGATTGAACTCTTCTACAGAATGTTAGAAGAAGGCTCAGTTGAACCAGATAAACATACTTTTCCTTTTCTTCTTAAAGCGTGTGCTTACGTATTCGCTTTATCTGAAGGGAGACAGGCGCATGCTCATATCGTTAAACGTGGGTTAGATTCGGATGTTTATGTTGGCAATAGCTTGATTCATTTATATGCTTCTTGTGGCTGTTTGAGTTTAGCATTGAAGGTGTTCGAGAAAATGCCTCACAGAAGTTTGGTTTCGTGGAATGTGATGATTGATGCGTATGTACAATGTGGGCTTTTTAAAAATGCTCTCAATCTATTCGCTGAAATGCAGAACACTTTTGAGCCCGATGGATATACAATGCAGAGCATAATTAGCGCTTGTGCGGGTATTGGAGCTTTATCTCTGGGGATGTGGTCTCATGCTTATGTGTTGAGGAAGACTGGTGGCGCTATGGTCGGTGAGGTCCTGATCAACTCCTCACTGGTGGATATGTACAGCAAGTGTGGTTCTTTGAGTATGGCTCAGCAGGTCTTCGAGACAATGCCCAAACATGACCTGAATTCATGGAATTCAATGATTTTAGCGTTTGCCATGCATGGACGGGCGGAAGCTGCCTTGGGATGTTTCTCTCGGCTGGTTGAAACGGAGAAGTTCCTGCCCAACTCTGTCACGTTTGTAGGTGTTCTTAGTGCATGTAACCACAGAGGTATGGTTGCTGAAGGCCGGAAATATTTTGATATGATGGTTAATGAATACAAGATTGTACCCCGGTTGGAGCACTATGGATGCCTTGTTGATCTCCTATCACGCTCTGGTTTCATTGATGAAGCTTTGGAGTTGGTGACAAATATTCATATAAAACCAGATGCAGTGATCTGGAGGAGTCTTCTTGATGCTTGTTATAAGCAGAATGCTGGCGTTGAGCTGAGCGAAGAAGTGGCATTGCAGATTCTTCAATCTGAAACAACAACTTCTAGTGGTGTTTATGTGCTGTTGTCAAGAGTCTATGCTTCAGCACACCGGTGGAACGATGTCGGGTTAGTTAGGAAGGCAATGGCCGACAAGGGTGTGGCAAAAGAGCCAGGCTGCAGTTCAATAGAAATAGATGGTGTTAGCCATGAGTTTTTTGCAGGAGACACATCTCACCCCAAGATAAAAGAGATCTATGCTGTTATTGATTTGATCGAAGAAAAACTACAGAAGCATGGTTATTCACCTGACTATTCACAGGCAACCATGGTCGACGACCCCGATACCGTCAAATGGCAGTCGCTTAAGTTGCATAGTGAGAGATTCGCCATTGCTTTTGGGCTACTAAACTTGAAACCTGGGATGCCAATACGCATATTCAAGAATCTTAGAGTATGCAACGACTGCCACCAGGTAACCAAGTTGATTTCTCGAATTTTTAACGTAGAGATTATCATGAGAGATCGTAATAGGTTTCATCATTTTGAGAATGGCATGTGTTCCTGCATGGACTTCTGGTGA

Coding sequence (CDS)

ATGCTCCTGTCAATCCCACCAAATGGACAGTCTATTCCGGTGGAAATCAAGCATGAGACATTCCATGCCCGTCAAAGTCGACTTCTTCACTTGCTGAGCGACTGTACTGATTTGTCGAGGCTCAAGCAAATACACGCTCAGGCACTTCGCACCTTCTCCACCCACAAATCTTCTCTCTTCCTCTTTAGCCGAATTCTTCACGTTTCTTCATTAACTGATTTTGAGTATGCATTACGCGTTTTTGATCAAATCGATAGCCCTAATTCGTTCATGTGGAACACTCTAATCGGAGCCTGTGCACGGAGCTTGGACCGGAAAGAGCAGGCGATTGAACTCTTCTACAGAATGTTAGAAGAAGGCTCAGTTGAACCAGATAAACATACTTTTCCTTTTCTTCTTAAAGCGTGTGCTTACGTATTCGCTTTATCTGAAGGGAGACAGGCGCATGCTCATATCGTTAAACGTGGGTTAGATTCGGATGTTTATGTTGGCAATAGCTTGATTCATTTATATGCTTCTTGTGGCTGTTTGAGTTTAGCATTGAAGGTGTTCGAGAAAATGCCTCACAGAAGTTTGGTTTCGTGGAATGTGATGATTGATGCGTATGTACAATGTGGGCTTTTTAAAAATGCTCTCAATCTATTCGCTGAAATGCAGAACACTTTTGAGCCCGATGGATATACAATGCAGAGCATAATTAGCGCTTGTGCGGGTATTGGAGCTTTATCTCTGGGGATGTGGTCTCATGCTTATGTGTTGAGGAAGACTGGTGGCGCTATGGTCGGTGAGGTCCTGATCAACTCCTCACTGGTGGATATGTACAGCAAGTGTGGTTCTTTGAGTATGGCTCAGCAGGTCTTCGAGACAATGCCCAAACATGACCTGAATTCATGGAATTCAATGATTTTAGCGTTTGCCATGCATGGACGGGCGGAAGCTGCCTTGGGATGTTTCTCTCGGCTGGTTGAAACGGAGAAGTTCCTGCCCAACTCTGTCACGTTTGTAGGTGTTCTTAGTGCATGTAACCACAGAGGTATGGTTGCTGAAGGCCGGAAATATTTTGATATGATGGTTAATGAATACAAGATTGTACCCCGGTTGGAGCACTATGGATGCCTTGTTGATCTCCTATCACGCTCTGGTTTCATTGATGAAGCTTTGGAGTTGGTGACAAATATTCATATAAAACCAGATGCAGTGATCTGGAGGAGTCTTCTTGATGCTTGTTATAAGCAGAATGCTGGCGTTGAGCTGAGCGAAGAAGTGGCATTGCAGATTCTTCAATCTGAAACAACAACTTCTAGTGGTGTTTATGTGCTGTTGTCAAGAGTCTATGCTTCAGCACACCGGTGGAACGATGTCGGGTTAGTTAGGAAGGCAATGGCCGACAAGGGTGTGGCAAAAGAGCCAGGCTGCAGTTCAATAGAAATAGATGGTGTTAGCCATGAGTTTTTTGCAGGAGACACATCTCACCCCAAGATAAAAGAGATCTATGCTGTTATTGATTTGATCGAAGAAAAACTACAGAAGCATGGTTATTCACCTGACTATTCACAGGCAACCATGGTCGACGACCCCGATACCGTCAAATGGCAGTCGCTTAAGTTGCATAGTGAGAGATTCGCCATTGCTTTTGGGCTACTAAACTTGAAACCTGGGATGCCAATACGCATATTCAAGAATCTTAGAGTATGCAACGACTGCCACCAGGTAACCAAGTTGATTTCTCGAATTTTTAACGTAGAGATTATCATGAGAGATCGTAATAGGTTTCATCATTTTGAGAATGGCATGTGTTCCTGCATGGACTTCTGGTGA
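The coding sequence above can be translated to check it against the query protein used in the BLAST alignments. This is a minimal sketch, assuming the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names and not part of the database, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 and last 12 nucleotides of the CDS listed above.
head = "ATGCTCCTGTCAATCCCACCAAATGGACAG"
tail = "GACTTCTGGTGA"
print(translate(head))
print(translate(tail))
```

Passing the full CDS string to `translate` should give the complete protein followed by `*` for the stop codon.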
BLAST of CmoCh04G016970 vs. Swiss-Prot
Match: PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=PCMP-H51 PE=2 SV=2)

HSP 1 Score: 753.8 bits (1945), Expect = 1.4e-216
Identity = 364/589 (61.80%), Positives = 455/589 (77.25%), Query Frame = 1

Query: 27  RLLHLLSDCTDLSRLKQIHAQALRT-FSTHKSSLFLFSRILHVSS-LTDFEYALRVFDQI 86
           R+  L   C+D+S+LKQ+HA  LRT +    ++LFL+ +IL +SS  +D  YA RVFD I
Sbjct: 50  RIFSLAETCSDMSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSI 109

Query: 87  DSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSE 146
           ++ +SFMWNTLI ACA  + RKE+A  L+ +MLE G   PDKHTFPF+LKACAY+F  SE
Sbjct: 110 ENHSSFMWNTLIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE 169

Query: 147 GRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQ 206
           G+Q H  IVK G   DVYV N LIHLY SCGCL LA KVF++MP RSLVSWN MIDA V+
Sbjct: 170 GKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVR 229

Query: 207 CGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEV 266
            G + +AL LF EMQ +FEPDGYTMQS++SACAG+G+LSLG W+HA++LRK    +  +V
Sbjct: 230 FGEYDSALQLFREMQRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDV 289

Query: 267 LINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVE- 326
           L+ +SL++MY KCGSL MA+QVF+ M K DL SWN+MIL FA HGRAE A+  F R+V+ 
Sbjct: 290 LVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDK 349

Query: 327 TEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFI 386
            E   PNSVTFVG+L ACNHRG V +GR+YFDMMV +Y I P LEHYGC+VDL++R+G+I
Sbjct: 350 RENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYI 409

Query: 387 DEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQIL------QSETTTSSGV 446
            EA+++V ++ +KPDAVIWRSLLDAC K+ A VELSEE+A  I+      +S     SG 
Sbjct: 410 TEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGA 469

Query: 447 YVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEI 506
           YVLLSRVYASA RWNDVG+VRK M++ G+ KEPGCSSIEI+G+SHEFFAGDTSHP+ K+I
Sbjct: 470 YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQI 529

Query: 507 YAVIDLIEEKLQKHGYSPDYSQATMVD-DPDTVKWQSLKLHSERFAIAFGLLNLKPGMPI 566
           Y  + +I+++L+  GY PD SQA +VD   D  K  SL+LHSER AIAFGL+NL P  PI
Sbjct: 530 YQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGSKEYSLRLHSERLAIAFGLINLPPQTPI 589

Query: 567 RIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
           RIFKNLRVCNDCH+VTKLIS++FN EII+RDR RFHHF++G CSC+D+W
Sbjct: 590 RIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRVRFHHFKDGSCSCLDYW 638
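The two summary lines that open each HSP report can be parsed to recover the score, E-value, and percentages. The sketch below assumes the line layout shown in this report; `parse_hsp` is a hypothetical helper, and the pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
COUNT_RE = re.compile(r"(Identity|Pos(?:i)?tives) = (\d+)/(\d+)")

def parse_hsp(score_line, identity_line):
    """Extract bit score, raw score, E-value and recomputed percentages."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("unrecognised score line")
    stats = {
        "bits": float(m.group(1)),
        "raw": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, hits, length in COUNT_RE.findall(identity_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        # Percentage = matching columns / alignment length.
        stats[key] = round(100 * int(hits) / int(length), 2)
    return stats

stats = parse_hsp(
    "HSP 1 Score: 753.8 bits (1945), Expect = 1.4e-216",
    "Identity = 364/589 (61.80%), Positives = 455/589 (77.25%), Query Frame = 1",
)
print(stats)
```

Recomputing the percentages from the counts, rather than reading the printed values, gives a simple cross-check on the report.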

BLAST of CmoCh04G016970 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 2.0e-133
Identity = 247/554 (44.58%), Positives = 351/554 (63.36%), Query Frame = 1

Query: 53  STHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIEL 112
           S H+  +   + I   +S    E A ++FD+I   +   WN +I   A + + KE A+EL
Sbjct: 195 SPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE-ALEL 254

Query: 113 FYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYA 172
           F  M++  +V PD+ T   ++ ACA   ++  GRQ H  I   G  S++ + N+LI LY+
Sbjct: 255 FKDMMKT-NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 314

Query: 173 SCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFE-PDGYTMQS 232
            CG L  A  +FE++P++ ++SWN +I  Y    L+K AL LF EM  + E P+  TM S
Sbjct: 315 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLS 374

Query: 233 IISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMP 292
           I+ ACA +GA+ +G W H Y+ ++  G      L  +SL+DMY+KCG +  A QVF ++ 
Sbjct: 375 ILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSL-RTSLIDMYAKCGDIEAAHQVFNSIL 434

Query: 293 KHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGR 352
              L+SWN+MI  FAMHGRA+A+   FSR+ +     P+ +TFVG+LSAC+H GM+  GR
Sbjct: 435 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIG-IQPDDITFVGLLSACSHSGMLDLGR 494

Query: 353 KYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYK 412
             F  M  +YK+ P+LEHYGC++DLL  SG   EA E++  + ++PD VIW SLL AC K
Sbjct: 495 HIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKAC-K 554

Query: 413 QNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPG 472
            +  VEL E  A  +++ E   + G YVLLS +YASA RWN+V   R  + DKG+ K PG
Sbjct: 555 MHGNVELGESFAENLIKIEPE-NPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPG 614

Query: 473 CSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKW 532
           CSSIEID V HEF  GD  HP+ +EIY +++ +E  L+K G+ PD S+  + +  +  K 
Sbjct: 615 CSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEV-LQEMEEEWKE 674

Query: 533 QSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRF 592
            +L+ HSE+ AIAFGL++ KPG  + I KNLRVC +CH+ TKLIS+I+  EII RDR RF
Sbjct: 675 GALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRF 734

Query: 593 HHFENGMCSCMDFW 606
           HHF +G+CSC D+W
Sbjct: 735 HHFRDGVCSCNDYW 741

BLAST of CmoCh04G016970 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 460.3 bits (1183), Expect = 3.3e-128
Identity = 241/575 (41.91%), Positives = 359/575 (62.43%), Query Frame = 1

Query: 36  TDLSRLKQIHAQALR---TFSTHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSP-NSFM 95
           + +++L+QIHA ++R   + S  +    L   ++ + S     YA +VF +I+ P N F+
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 96  WNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAH 155
           WNTLI   A  +     A  L+  M   G VEPD HT+PFL+KA   +  +  G   H+ 
Sbjct: 88  WNTLIRGYAE-IGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 147

Query: 156 IVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNA 215
           +++ G  S +YV NSL+HLYA+CG ++ A KVF+KMP + LV+WN +I+ + + G  + A
Sbjct: 148 VIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEA 207

Query: 216 LNLFAEMQNT-FEPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSL 275
           L L+ EM +   +PDG+T+ S++SACA IGAL+LG   H Y+++     +   +  ++ L
Sbjct: 208 LALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKV---GLTRNLHSSNVL 267

Query: 276 VDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPN 335
           +D+Y++CG +  A+ +F+ M   +  SW S+I+  A++G  + A+  F  +  TE  LP 
Sbjct: 268 LDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPC 327

Query: 336 SVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELV 395
            +TFVG+L AC+H GMV EG +YF  M  EYKI PR+EH+GC+VDLL+R+G + +A E +
Sbjct: 328 EITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYI 387

Query: 396 TNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHR 455
            ++ ++P+ VIWR+LL AC   +   +L+E   +QILQ E    SG YVLLS +YAS  R
Sbjct: 388 KSMPMQPNVVIWRTLLGAC-TVHGDSDLAEFARIQILQLE-PNHSGDYVLLSNMYASEQR 447

Query: 456 WNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQK 515
           W+DV  +RK M   GV K PG S +E+    HEF  GD SHP+   IYA +  +  +L+ 
Sbjct: 448 WSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRS 507

Query: 516 HGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQ 575
            GY P  S    VD  +  K  ++  HSE+ AIAF L++     PI + KNLRVC DCH 
Sbjct: 508 EGYVPQISN-VYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHL 567

Query: 576 VTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
             KL+S+++N EI++RDR+RFHHF+NG CSC D+W
Sbjct: 568 AIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CmoCh04G016970 vs. Swiss-Prot
Match: PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 449.5 bits (1155), Expect = 5.7e-125
Identity = 235/580 (40.52%), Positives = 357/580 (61.55%), Query Frame = 1

Query: 31  LLSDCTDLSRLKQIHAQALRTFSTHKSSLFLFSRILHVSSLTDFE----YALRVFDQIDS 90
           L+S C  L  L QI A A+++   H   +   +++++  + +  E    YA  +F+ +  
Sbjct: 35  LISKCNSLRELMQIQAYAIKS---HIEDVSFVAKLINFCTESPTESSMSYARHLFEAMSE 94

Query: 91  PNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGR 150
           P+  ++N++    +R  +  E    LF  +LE+G + PD +TFP LLKACA   AL EGR
Sbjct: 95  PDIVIFNSMARGYSRFTNPLE-VFSLFVEILEDG-ILPDNYTFPSLLKACAVAKALEEGR 154

Query: 151 QAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCG 210
           Q H   +K GLD +VYV  +LI++Y  C  +  A  VF+++    +V +N MI  Y +  
Sbjct: 155 QLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRN 214

Query: 211 LFKNALNLFAEMQNTF-EPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVL 270
               AL+LF EMQ  + +P+  T+ S++S+CA +G+L LG W H Y  + +       V 
Sbjct: 215 RPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHS---FCKYVK 274

Query: 271 INSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETE 330
           +N++L+DM++KCGSL  A  +FE M   D  +W++MI+A+A HG+AE ++  F R+  +E
Sbjct: 275 VNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERM-RSE 334

Query: 331 KFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDE 390
              P+ +TF+G+L+AC+H G V EGRKYF  MV+++ IVP ++HYG +VDLLSR+G +++
Sbjct: 335 NVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLED 394

Query: 391 ALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQSETTTSSGVYVLLSRVY 450
           A E +  + I P  ++WR LL AC   N  ++L+E+V+ +I + + +   G YV+LS +Y
Sbjct: 395 AYEFIDKLPISPTPMLWRILLAACSSHN-NLDLAEKVSERIFELDDS-HGGDYVILSNLY 454

Query: 451 ASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIE 510
           A   +W  V  +RK M D+   K PGCSSIE++ V HEFF+GD       +++  +D + 
Sbjct: 455 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 514

Query: 511 EKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVC 570
           ++L+  GY PD S     +  D  K  +L+ HSE+ AI FGLLN  PG  IR+ KNLRVC
Sbjct: 515 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 574

Query: 571 NDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
            DCH   KLIS IF  ++++RD  RFHHFE+G CSC DFW
Sbjct: 575 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CmoCh04G016970 vs. Swiss-Prot
Match: PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 1.8e-123
Identity = 236/591 (39.93%), Positives = 361/591 (61.08%), Query Frame = 1

Query: 21  FHARQSRLLHLLSDCTDLSRLKQIHAQALRT---FSTHKSSLFLFSRILHVSSLTDFEYA 80
           F  ++   L+LL  C ++   KQ+HA+ ++    +S+  S+  + ++  H        YA
Sbjct: 26  FGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYA 85

Query: 81  LRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACA 140
             +F  ID P +F +NT+I      +   E+A+  +  M++ G+ EPD  T+P LLKAC 
Sbjct: 86  ASIFRGIDDPCTFDFNTMIRGYVNVMSF-EEALCFYNEMMQRGN-EPDNFTYPCLLKACT 145

Query: 141 YVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNV 200
            + ++ EG+Q H  + K GL++DV+V NSLI++Y  CG + L+  VFEK+  ++  SW+ 
Sbjct: 146 RLKSIREGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSS 205

Query: 201 MIDAYVQCGLFKNALNLFAEM--QNTFEPDGYTMQSIISACAGIGALSLGMWSHAYVLRK 260
           M+ A    G++   L LF  M  +   + +   M S + ACA  GAL+LGM  H ++LR 
Sbjct: 206 MVSARAGMGMWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRN 265

Query: 261 TGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAAL 320
                   +++ +SLVDMY KCG L  A  +F+ M K +  ++++MI   A+HG  E+AL
Sbjct: 266 ISEL---NIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESAL 325

Query: 321 GCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVD 380
             FS++++ E   P+ V +V VL+AC+H G+V EGR+ F  M+ E K+ P  EHYGCLVD
Sbjct: 326 RMFSKMIK-EGLEPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVD 385

Query: 381 LLSRSGFIDEALELVTNIHIKPDAVIWRSLLDAC-YKQNAGVELSEEVALQILQSETTTS 440
           LL R+G ++EALE + +I I+ + VIWR+ L  C  +QN  +EL + +A Q L   ++ +
Sbjct: 386 LLGRAGLLEEALETIQSIPIEKNDVIWRTFLSQCRVRQN--IELGQ-IAAQELLKLSSHN 445

Query: 441 SGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKI 500
            G Y+L+S +Y+    W+DV   R  +A KG+ + PG S +E+ G +H F + D SHPK 
Sbjct: 446 PGDYLLISNLYSQGQMWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKC 505

Query: 501 KEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGM 560
           KEIY ++  +E +L+  GYSPD +Q  +  D +  K + LK HS++ AIAFGLL   PG 
Sbjct: 506 KEIYKMLHQMEWQLKFEGYSPDLTQILLNVDEEEKK-ERLKGHSQKVAIAFGLLYTPPGS 565

Query: 561 PIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
            I+I +NLR+C+DCH  TK IS I+  EI++RDRNRFH F+ G CSC D+W
Sbjct: 566 IIKIARNLRMCSDCHTYTKKISMIYEREIVVRDRNRFHLFKGGTCSCKDYW 606

BLAST of CmoCh04G016970 vs. TrEMBL
Match: A0A0A0KTV7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604120 PE=4 SV=1)

HSP 1 Score: 1079.7 bits (2791), Expect = 0.0e+00
Identity = 522/605 (86.28%), Positives = 557/605 (92.07%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTFSTHKSSLF 60
           MLL+IP N QS+P+EIK E     QSR LHLL+DCTDLS+LKQIHAQA+R FSTH SSLF
Sbjct: 1   MLLAIPTNSQSLPIEIKGENSKTHQSRFLHLLTDCTDLSKLKQIHAQAIRNFSTHNSSLF 60

Query: 61  LFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEG 120
           L+SRILHVSSL DF+YA RVF+QID+PNSFMWNTLIGACARSLDRKEQAIE+FYRMLEEG
Sbjct: 61  LYSRILHVSSLIDFDYACRVFNQIDNPNSFMWNTLIGACARSLDRKEQAIEIFYRMLEEG 120

Query: 121 SVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLA 180
           SVEPDKHTFPFLLKACAYVFALSEGRQAHA I K GLD DVYVGNSLIHLYASCGCLS+A
Sbjct: 121 SVEPDKHTFPFLLKACAYVFALSEGRQAHAQIFKLGLDLDVYVGNSLIHLYASCGCLSMA 180

Query: 181 LKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIG 240
           LKVFEKMP RSLVSWNVMIDAYVQ GLF+NAL LF EMQN+FEPDGYTMQSI+SACAGIG
Sbjct: 181 LKVFEKMPLRSLVSWNVMIDAYVQSGLFENALKLFVEMQNSFEPDGYTMQSIVSACAGIG 240

Query: 241 ALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNS 300
           ALSLGMW+HAYVLRK  GAM G+VLINSSLVDMYSKCGSL MAQQVFETMPKHDLNSWNS
Sbjct: 241 ALSLGMWAHAYVLRKASGAMAGDVLINSSLVDMYSKCGSLRMAQQVFETMPKHDLNSWNS 300

Query: 301 MILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNE 360
           MILA AMHGR +AAL CFSRLVE EKFLPNSVTFVGVLSACNH GMVA+GRKYFDMMVN+
Sbjct: 301 MILALAMHGRGQAALQCFSRLVEMEKFLPNSVTFVGVLSACNHGGMVADGRKYFDMMVND 360

Query: 361 YKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSE 420
           YKI PRLEHYGCLVDLLSRSGFIDEALELV N+HIKPDAVIWRSLLDACYKQNAGVELSE
Sbjct: 361 YKIEPRLEHYGCLVDLLSRSGFIDEALELVANMHIKPDAVIWRSLLDACYKQNAGVELSE 420

Query: 421 EVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGV 480
           EVA +ILQSE T SSGVYV+LSRVYASA +WNDVG++RK M D GV KEPGCSSIEIDG+
Sbjct: 421 EVAFKILQSEKTISSGVYVMLSRVYASARQWNDVGIIRKVMTDMGVTKEPGCSSIEIDGI 480

Query: 481 SHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSER 540
           SHEFFAGDTSHP+IKEIY VIDLIEEKL++ GYSPD SQATMVD+PD +K QSLKLHSER
Sbjct: 481 SHEFFAGDTSHPRIKEIYGVIDLIEEKLERRGYSPDCSQATMVDEPDNIKQQSLKLHSER 540

Query: 541 FAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCS 600
            AIAFGLLNLKPG P+RIFKNLRVCNDCHQVTKLIS IFNVEIIMRDRNRFHHF++GMCS
Sbjct: 541 LAIAFGLLNLKPGTPVRIFKNLRVCNDCHQVTKLISEIFNVEIIMRDRNRFHHFKHGMCS 600

Query: 601 CMDFW 606
           CMDFW
Sbjct: 601 CMDFW 605

BLAST of CmoCh04G016970 vs. TrEMBL
Match: A0A067H5B3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g038206mg PE=4 SV=1)

HSP 1 Score: 852.8 bits (2202), Expect = 2.5e-244
Identity = 419/615 (68.13%), Positives = 491/615 (79.84%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQ--------SRLLHLLSDCTDLSRLKQIHAQALRTF 60
           M ++I   G   P    H  F+  +        S LL  L++C  +S+LKQIHAQALRT 
Sbjct: 1   MAVAIVQGGPPTPQTHSHSIFNNNRNEGSFNNHSSLLSSLTECKSMSQLKQIHAQALRTA 60

Query: 61  --STHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAI 120
               HK+ L ++SRI+H +S  D +YA RVF QI++PNSF WNTLI ACARS+D K QAI
Sbjct: 61  LPQQHKT-LLIYSRIIHFASFADLDYAFRVFYQIENPNSFTWNTLIRACARSVDAKPQAI 120

Query: 121 ELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHL 180
            LF RM+E+G+V PDKHTFPF LKACAY+FA S+G+QAHAHI KRGL SDVY+ NSLIH 
Sbjct: 121 VLFQRMIEQGNVLPDKHTFPFALKACAYLFAFSQGKQAHAHIFKRGLVSDVYINNSLIHF 180

Query: 181 YASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQ 240
           YASCG L LA KVF+ M  RSLVSWNVMIDA+VQ G F +AL LF  MQ  FEPDGYT Q
Sbjct: 181 YASCGHLDLANKVFDNMLERSLVSWNVMIDAFVQFGEFDSALKLFRRMQILFEPDGYTFQ 240

Query: 241 SIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETM 300
           SI SACAG+  LSLGMW+HAY+LR    ++V +VL+N+SL+DMY KCGSL +A+QVFE+M
Sbjct: 241 SITSACAGLATLSLGMWAHAYILRHCDHSLVTDVLVNNSLIDMYCKCGSLDIARQVFESM 300

Query: 301 PKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEG 360
           PK DL SWNS+IL FA+HGRAEAAL  F RLV  E F PNS+TFVGVLSACNHRGMV+EG
Sbjct: 301 PKRDLTSWNSIILGFALHGRAEAALKYFDRLVVEESFSPNSITFVGVLSACNHRGMVSEG 360

Query: 361 RKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACY 420
           R YFD+M+NEY I P LEHYGCLVDLL+R+G IDEAL LV+N+ +KPDAVIWRSLLDAC 
Sbjct: 361 RDYFDVMINEYNITPVLEHYGCLVDLLARAGNIDEALHLVSNMPMKPDAVIWRSLLDACC 420

Query: 421 KQNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEP 480
           K++A V LSEEVA Q+++SE    SGVYVLLSRVYASA RWNDVGLVRK M DKGV KEP
Sbjct: 421 KKHASVVLSEEVAKQVIESEGGICSGVYVLLSRVYASARRWNDVGLVRKLMTDKGVTKEP 480

Query: 481 GCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVK 540
           GCSSIEIDG++HEFFAGDTSHP+ K+IY  +DLI+EKL+  GY+PDYSQA MVD+ D  K
Sbjct: 481 GCSSIEIDGIAHEFFAGDTSHPQTKQIYGFLDLIDEKLKSRGYTPDYSQAAMVDELDDGK 540

Query: 541 WQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNR 600
             SL+LHSER AIA G+LNLKPGMPIR+FKNLRVC DCH+VTKLISRIFNVEII+RDR R
Sbjct: 541 QSSLRLHSERLAIALGILNLKPGMPIRVFKNLRVCKDCHEVTKLISRIFNVEIIVRDRAR 600

Query: 601 FHHFENGMCSCMDFW 606
           FHHF++G CSCMD+W
Sbjct: 601 FHHFKDGSCSCMDYW 614

BLAST of CmoCh04G016970 vs. TrEMBL
Match: V4VGX7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033899mg PE=4 SV=1)

HSP 1 Score: 851.7 bits (2199), Expect = 5.6e-244
Identity = 415/596 (69.63%), Positives = 486/596 (81.54%), Query Frame = 1

Query: 12  IPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTF--STHKSSLFLFSRILHVS 71
           +P  + +E      S LL  L++C  +S+LKQIHAQALRT     HK+ L ++SRI+H +
Sbjct: 3   LPHLVTNEGSFNNHSSLLSSLTECKSMSQLKQIHAQALRTALPQQHKT-LLIYSRIIHFA 62

Query: 72  SLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTF 131
           S  D +YA RVF QI++PNSF WNTLI ACARS+D K QAI LF RM+E+G+V PDKHTF
Sbjct: 63  SFADLDYAFRVFYQIENPNSFTWNTLIRACARSVDAKPQAIVLFQRMIEQGNVLPDKHTF 122

Query: 132 PFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPH 191
           PF LKACAY+FA S+G+QAHAHI KRGL SDVY+ NSLIH YA+CG L LA KVF+ M  
Sbjct: 123 PFALKACAYLFAFSQGKQAHAHIFKRGLASDVYINNSLIHFYATCGHLDLANKVFDNMLE 182

Query: 192 RSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSLGMWSH 251
           RSLVSWNVMIDA+VQ G F +AL LF  MQ  FEPDGYT QSI SACAG+  LSLG W+H
Sbjct: 183 RSLVSWNVMIDAFVQFGEFDSALKLFRRMQILFEPDGYTFQSITSACAGLATLSLGTWAH 242

Query: 252 AYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHG 311
           AY+LR    ++V +VL+N+SL+DMY KCGSL +A+QVFE+MPK DL SWNS+IL FA+HG
Sbjct: 243 AYILRHCDHSLVTDVLVNNSLIDMYCKCGSLDIARQVFESMPKRDLTSWNSIILGFALHG 302

Query: 312 RAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEH 371
           RAEAAL  F RLVE E F PNS+TFVGVLSACNH GMV+EGR YFD+M+NEY I P LEH
Sbjct: 303 RAEAALKYFDRLVEEESFSPNSITFVGVLSACNHMGMVSEGRDYFDVMINEYNITPVLEH 362

Query: 372 YGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQS 431
           YGCLVDLL+R+G IDEAL LV+N+ +KPDAVIWRSLLDAC K++A V LSEEVA QI++S
Sbjct: 363 YGCLVDLLARAGNIDEALHLVSNMPMKPDAVIWRSLLDACCKKHASVVLSEEVAKQIIES 422

Query: 432 ETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDT 491
           E    SGVYVLLSRVYASA RWNDVGLVRK M DKGV KEPGCSSIEIDG++HEFFAGDT
Sbjct: 423 EGGICSGVYVLLSRVYASARRWNDVGLVRKLMTDKGVTKEPGCSSIEIDGIAHEFFAGDT 482

Query: 492 SHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLN 551
           SHP+ K+IY V+DLI+EKL+  GY+PDYSQA MVD+ D  K  SL+LHSER AIA G+LN
Sbjct: 483 SHPQTKQIYGVLDLIDEKLKSRGYTPDYSQAAMVDELDDGKQSSLRLHSERLAIALGILN 542

Query: 552 LKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
           LKPGMPIR+FKNLRVC DCH+VTKLISRIFNVEII+RDR RFHHF++G CSCMD+W
Sbjct: 543 LKPGMPIRVFKNLRVCKDCHEVTKLISRIFNVEIIVRDRARFHHFKDGSCSCMDYW 597

BLAST of CmoCh04G016970 vs. TrEMBL
Match: W9R9J8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021064 PE=4 SV=1)

HSP 1 Score: 832.8 bits (2150), Expect = 2.7e-238
Identity = 402/586 (68.60%), Positives = 481/586 (82.08%), Query Frame = 1

Query: 23  ARQSRLLHLLSDCTDLSRLKQIHAQALRTFSTHKS--SLFLFSRILHVSSLTDFEYALRV 82
           A Q+RLL  L++C D+S+LKQIHAQ LRT S   +  +LFL+SRILH SSL D +YA RV
Sbjct: 26  AHQARLLRFLNECKDMSQLKQIHAQTLRTTSNTNNPHTLFLYSRILHFSSLADADYAFRV 85

Query: 83  FDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVF 142
           FDQI++PNSFMWNTLI ACARS DRKEQAI L+ RMLEEG V PDK+TFPF+L+ACAY+F
Sbjct: 86  FDQIETPNSFMWNTLIRACARSDDRKEQAIVLYCRMLEEGIVLPDKYTFPFVLRACAYLF 145

Query: 143 ALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMID 202
            LSEG Q HAH++K G  SDVY+ NSLIH YASCG L LA KVF+KMP RSLVSWN MID
Sbjct: 146 DLSEGEQTHAHVLKLGFCSDVYICNSLIHFYASCGHLDLAQKVFDKMPERSLVSWNAMID 205

Query: 203 AYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAM 262
           A+VQ G F+ AL LF+EMQN F+PDGYT+QSII+ACAG+G L+LGMW+HAY+LR    A+
Sbjct: 206 AFVQFGEFETALKLFSEMQNVFKPDGYTLQSIINACAGLGGLALGMWAHAYILRMLDTAV 265

Query: 263 VGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSR 322
             +VL+ SSL+DMY KCG L +A+QVFE MPK D+  WNSMIL FAMHG AEAAL CFS 
Sbjct: 266 ASDVLVCSSLMDMYCKCGCLELARQVFERMPKRDITMWNSMILGFAMHGLAEAALECFSC 325

Query: 323 LVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRS 382
           LV TE   PNS+TFVGVLSACNHRGMV+EG  YF+ M+ +YKI PRLEHYGCLVDLL+R+
Sbjct: 326 LVRTESCAPNSITFVGVLSACNHRGMVSEGLNYFEKMIKKYKIEPRLEHYGCLVDLLARA 385

Query: 383 GFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQSE-TTTSSGVYV 442
           GFI++AL  VTN+ +KPDAVIWRS+LDAC KQ+A VELSEEVA Q+L+SE    SSGVYV
Sbjct: 386 GFINKALNFVTNMPMKPDAVIWRSILDACSKQDASVELSEEVARQVLESEGDGASSGVYV 445

Query: 443 LLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEIYA 502
           L+SRVYASA RW+DVGLVRK M D GV KEPGCS IE++G++HEFFAGDTSHP+ + IY 
Sbjct: 446 LMSRVYASASRWDDVGLVRKLMEDDGVTKEPGCSIIEVEGITHEFFAGDTSHPRSRGIYQ 505

Query: 503 VIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGMPIRIF 562
            +++I+++L+  GYSPDYSQA +VD+    K  SL LHSER A+AFGLL++K G PIRIF
Sbjct: 506 FLNVIKDRLKLMGYSPDYSQAPLVDEQGDTKQHSLGLHSERIALAFGLLSMKSGTPIRIF 565

Query: 563 KNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
           KNLRVCNDCH+V KLIS  F+VEI+MRDR RFHHF++G CSCM++W
Sbjct: 566 KNLRVCNDCHEVFKLISTAFSVEIVMRDRTRFHHFKHGTCSCMEYW 611

BLAST of CmoCh04G016970 vs. TrEMBL
Match: F6HPG3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01440 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 6.6e-237
Identity = 399/601 (66.39%), Positives = 483/601 (80.37%), Query Frame = 1

Query: 6   PPNGQSIPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTFSTHK-SSLFLFSR 65
           PP+   +P  I +        RLL  L+ CT +S+LKQ+HAQ +RT S+H  ++ FL+SR
Sbjct: 9   PPS--HLPHAISNSDSFTHHRRLLLFLNSCTCISQLKQLHAQTIRTTSSHHPNTFFLYSR 68

Query: 66  ILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEP 125
           ILH SSL D  YA RVF QI++PNSFMWN LI ACARS DRK+ AI L++RMLE+GSV  
Sbjct: 69  ILHFSSLHDLRYAFRVFHQIENPNSFMWNALIRACARSTDRKQHAIALYHRMLEQGSVMQ 128

Query: 126 DKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVF 185
           DKHTFPF+LKACAY+FALSEG Q HA I+K G DSDVY+ NSL+H YA+C  L  A  VF
Sbjct: 129 DKHTFPFVLKACAYLFALSEGEQIHAQILKLGFDSDVYINNSLVHFYATCDRLDFAKGVF 188

Query: 186 EKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSL 245
           ++M  RSLVSWNV+IDA+V+ G F  ALNLF EMQ  FEPDGYT+QSI +ACAG+G+LSL
Sbjct: 189 DRMSERSLVSWNVVIDAFVRFGEFDAALNLFGEMQKFFEPDGYTIQSIANACAGMGSLSL 248

Query: 246 GMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILA 305
           GMW+H ++L+K     V +VL+N+SLVDMY KCGSL +A Q+F  MPK D+ SWNSMIL 
Sbjct: 249 GMWAHVFLLKKFDADRVNDVLLNTSLVDMYCKCGSLELALQLFHRMPKRDVTSWNSMILG 308

Query: 306 FAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIV 365
           F+ HG   AAL  F  +V TEK +PN++TFVGVLSACNH G+V+EGR+YFD+MV EYKI 
Sbjct: 309 FSTHGEVAAALEYFGCMVRTEKLMPNAITFVGVLSACNHGGLVSEGRRYFDVMVTEYKIK 368

Query: 366 PRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVAL 425
           P LEHYGCLVDLL+R+G IDEAL++V+N+ ++PD VIWRSLLDAC KQNAGVELSEE+A 
Sbjct: 369 PELEHYGCLVDLLARAGLIDEALDVVSNMPMRPDLVIWRSLLDACCKQNAGVELSEEMAR 428

Query: 426 QILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEF 485
           ++L++E    SGVYVLLSRVYASA RWNDVG+VRK M DKGV KEPGCSSIEIDGV+HEF
Sbjct: 429 RVLEAEGGVCSGVYVLLSRVYASASRWNDVGMVRKLMTDKGVVKEPGCSSIEIDGVAHEF 488

Query: 486 FAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIA 545
           FAGDTSHP+ +EIY+ +D+IEE++++ GYSPD SQA MVD+    K  SL+LHSER AIA
Sbjct: 489 FAGDTSHPQTEEIYSALDVIEERVERVGYSPDSSQAPMVDETIDGKQYSLRLHSERLAIA 548

Query: 546 FGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDF 605
           FGLL  KPGMPIRIFKNLRVCN+CHQVTKLISR+FN EII+RDR RFHHF++G CSCMD+
Sbjct: 549 FGLLKTKPGMPIRIFKNLRVCNNCHQVTKLISRVFNREIIVRDRIRFHHFKDGACSCMDY 607

BLAST of CmoCh04G016970 vs. TAIR10
Match: AT1G59720.1 (AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 753.8 bits (1945), Expect = 8.0e-218
Identity = 364/589 (61.80%), Positives = 455/589 (77.25%), Query Frame = 1

Query: 27  RLLHLLSDCTDLSRLKQIHAQALRT-FSTHKSSLFLFSRILHVSS-LTDFEYALRVFDQI 86
           R+  L   C+D+S+LKQ+HA  LRT +    ++LFL+ +IL +SS  +D  YA RVFD I
Sbjct: 50  RIFSLAETCSDMSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSI 109

Query: 87  DSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSE 146
           ++ +SFMWNTLI ACA  + RKE+A  L+ +MLE G   PDKHTFPF+LKACAY+F  SE
Sbjct: 110 ENHSSFMWNTLIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE 169

Query: 147 GRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQ 206
           G+Q H  IVK G   DVYV N LIHLY SCGCL LA KVF++MP RSLVSWN MIDA V+
Sbjct: 170 GKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVR 229

Query: 207 CGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEV 266
            G + +AL LF EMQ +FEPDGYTMQS++SACAG+G+LSLG W+HA++LRK    +  +V
Sbjct: 230 FGEYDSALQLFREMQRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDV 289

Query: 267 LINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVE- 326
           L+ +SL++MY KCGSL MA+QVF+ M K DL SWN+MIL FA HGRAE A+  F R+V+ 
Sbjct: 290 LVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDK 349

Query: 327 TEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFI 386
            E   PNSVTFVG+L ACNHRG V +GR+YFDMMV +Y I P LEHYGC+VDL++R+G+I
Sbjct: 350 RENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYI 409

Query: 387 DEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQIL------QSETTTSSGV 446
            EA+++V ++ +KPDAVIWRSLLDAC K+ A VELSEE+A  I+      +S     SG 
Sbjct: 410 TEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGA 469

Query: 447 YVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEI 506
           YVLLSRVYASA RWNDVG+VRK M++ G+ KEPGCSSIEI+G+SHEFFAGDTSHP+ K+I
Sbjct: 470 YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQI 529

Query: 507 YAVIDLIEEKLQKHGYSPDYSQATMVD-DPDTVKWQSLKLHSERFAIAFGLLNLKPGMPI 566
           Y  + +I+++L+  GY PD SQA +VD   D  K  SL+LHSER AIAFGL+NL P  PI
Sbjct: 530 YQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGSKEYSLRLHSERLAIAFGLINLPPQTPI 589

Query: 567 RIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
           RIFKNLRVCNDCH+VTKLIS++FN EII+RDR RFHHF++G CSC+D+W
Sbjct: 590 RIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRVRFHHFKDGSCSCLDYW 638
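
The percentages in each HSP header follow directly from the match counts and the alignment length. The sketch below is a minimal illustration, not part of the database output: it parses one header line and recomputes both percentages. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced only for this example, and the pattern assumes the header layout shown in this record.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP header.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


# Header of the AT1G59720.1 hit from this record.
header = "Identity = 364/589 (61.80%), Positives = 455/589 (77.25%), Query Frame = 1"
stats = parse_hsp_stats(header)

# Recompute the percentages from the raw counts and compare with the report.
recomputed_identity = percent(stats["identities"], stats["length"])
recomputed_positives = percent(stats["positives"], stats["positives_length"])
print(recomputed_identity, recomputed_positives)
```

Both counts are divided by the alignment length (589 columns here, including gap columns), not by the query length of 606 residues, which is why a full-length hit can still report a denominator shorter or longer than the protein itself.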

BLAST of CmoCh04G016970 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 477.6 bits (1228), Expect = 1.1e-134
Identity = 247/554 (44.58%), Positives = 351/554 (63.36%), Query Frame = 1

Query: 53  STHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIEL 112
           S H+  +   + I   +S    E A ++FD+I   +   WN +I   A + + KE A+EL
Sbjct: 195 SPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE-ALEL 254

Query: 113 FYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYA 172
           F  M++  +V PD+ T   ++ ACA   ++  GRQ H  I   G  S++ + N+LI LY+
Sbjct: 255 FKDMMKT-NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 314

Query: 173 SCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFE-PDGYTMQS 232
            CG L  A  +FE++P++ ++SWN +I  Y    L+K AL LF EM  + E P+  TM S
Sbjct: 315 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLS 374

Query: 233 IISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMP 292
           I+ ACA +GA+ +G W H Y+ ++  G      L  +SL+DMY+KCG +  A QVF ++ 
Sbjct: 375 ILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSL-RTSLIDMYAKCGDIEAAHQVFNSIL 434

Query: 293 KHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGR 352
              L+SWN+MI  FAMHGRA+A+   FSR+ +     P+ +TFVG+LSAC+H GM+  GR
Sbjct: 435 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIG-IQPDDITFVGLLSACSHSGMLDLGR 494

Query: 353 KYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYK 412
             F  M  +YK+ P+LEHYGC++DLL  SG   EA E++  + ++PD VIW SLL AC K
Sbjct: 495 HIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKAC-K 554

Query: 413 QNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPG 472
            +  VEL E  A  +++ E   + G YVLLS +YASA RWN+V   R  + DKG+ K PG
Sbjct: 555 MHGNVELGESFAENLIKIEPE-NPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPG 614

Query: 473 CSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKW 532
           CSSIEID V HEF  GD  HP+ +EIY +++ +E  L+K G+ PD S+  + +  +  K 
Sbjct: 615 CSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEV-LQEMEEEWKE 674

Query: 533 QSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRF 592
            +L+ HSE+ AIAFGL++ KPG  + I KNLRVC +CH+ TKLIS+I+  EII RDR RF
Sbjct: 675 GALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRF 734

Query: 593 HHFENGMCSCMDFW 606
           HHF +G+CSC D+W
Sbjct: 735 HHFRDGVCSCNDYW 741

BLAST of CmoCh04G016970 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 460.3 bits (1183), Expect = 1.8e-129
Identity = 241/575 (41.91%), Positives = 359/575 (62.43%), Query Frame = 1

Query: 36  TDLSRLKQIHAQALR---TFSTHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSP-NSFM 95
           + +++L+QIHA ++R   + S  +    L   ++ + S     YA +VF +I+ P N F+
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 96  WNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAH 155
           WNTLI   A  +     A  L+  M   G VEPD HT+PFL+KA   +  +  G   H+ 
Sbjct: 88  WNTLIRGYAE-IGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 147

Query: 156 IVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNA 215
           +++ G  S +YV NSL+HLYA+CG ++ A KVF+KMP + LV+WN +I+ + + G  + A
Sbjct: 148 VIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEA 207

Query: 216 LNLFAEMQNT-FEPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSL 275
           L L+ EM +   +PDG+T+ S++SACA IGAL+LG   H Y+++     +   +  ++ L
Sbjct: 208 LALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKV---GLTRNLHSSNVL 267

Query: 276 VDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPN 335
           +D+Y++CG +  A+ +F+ M   +  SW S+I+  A++G  + A+  F  +  TE  LP 
Sbjct: 268 LDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPC 327

Query: 336 SVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELV 395
            +TFVG+L AC+H GMV EG +YF  M  EYKI PR+EH+GC+VDLL+R+G + +A E +
Sbjct: 328 EITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYI 387

Query: 396 TNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHR 455
            ++ ++P+ VIWR+LL AC   +   +L+E   +QILQ E    SG YVLLS +YAS  R
Sbjct: 388 KSMPMQPNVVIWRTLLGAC-TVHGDSDLAEFARIQILQLE-PNHSGDYVLLSNMYASEQR 447

Query: 456 WNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQK 515
           W+DV  +RK M   GV K PG S +E+    HEF  GD SHP+   IYA +  +  +L+ 
Sbjct: 448 WSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRS 507

Query: 516 HGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQ 575
            GY P  S    VD  +  K  ++  HSE+ AIAF L++     PI + KNLRVC DCH 
Sbjct: 508 EGYVPQISN-VYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHL 567

Query: 576 VTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
             KL+S+++N EI++RDR+RFHHF+NG CSC D+W
Sbjct: 568 AIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CmoCh04G016970 vs. TAIR10
Match: AT2G02980.1 (AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 449.5 bits (1155), Expect = 3.2e-126
Identity = 235/580 (40.52%), Positives = 357/580 (61.55%), Query Frame = 1

Query: 31  LLSDCTDLSRLKQIHAQALRTFSTHKSSLFLFSRILHVSSLTDFE----YALRVFDQIDS 90
           L+S C  L  L QI A A+++   H   +   +++++  + +  E    YA  +F+ +  
Sbjct: 35  LISKCNSLRELMQIQAYAIKS---HIEDVSFVAKLINFCTESPTESSMSYARHLFEAMSE 94

Query: 91  PNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGR 150
           P+  ++N++    +R  +  E    LF  +LE+G + PD +TFP LLKACA   AL EGR
Sbjct: 95  PDIVIFNSMARGYSRFTNPLE-VFSLFVEILEDG-ILPDNYTFPSLLKACAVAKALEEGR 154

Query: 151 QAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCG 210
           Q H   +K GLD +VYV  +LI++Y  C  +  A  VF+++    +V +N MI  Y +  
Sbjct: 155 QLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRN 214

Query: 211 LFKNALNLFAEMQNTF-EPDGYTMQSIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVL 270
               AL+LF EMQ  + +P+  T+ S++S+CA +G+L LG W H Y  + +       V 
Sbjct: 215 RPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHS---FCKYVK 274

Query: 271 INSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETE 330
           +N++L+DM++KCGSL  A  +FE M   D  +W++MI+A+A HG+AE ++  F R+  +E
Sbjct: 275 VNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERM-RSE 334

Query: 331 KFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDE 390
              P+ +TF+G+L+AC+H G V EGRKYF  MV+++ IVP ++HYG +VDLLSR+G +++
Sbjct: 335 NVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLED 394

Query: 391 ALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQSETTTSSGVYVLLSRVY 450
           A E +  + I P  ++WR LL AC   N  ++L+E+V+ +I + + +   G YV+LS +Y
Sbjct: 395 AYEFIDKLPISPTPMLWRILLAACSSHN-NLDLAEKVSERIFELDDS-HGGDYVILSNLY 454

Query: 451 ASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIE 510
           A   +W  V  +RK M D+   K PGCSSIE++ V HEFF+GD       +++  +D + 
Sbjct: 455 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 514

Query: 511 EKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVC 570
           ++L+  GY PD S     +  D  K  +L+ HSE+ AI FGLLN  PG  IR+ KNLRVC
Sbjct: 515 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 574

Query: 571 NDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
            DCH   KLIS IF  ++++RD  RFHHFE+G CSC DFW
Sbjct: 575 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CmoCh04G016970 vs. TAIR10
Match: AT1G31920.1 (AT1G31920.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 444.5 bits (1142), Expect = 1.0e-124
Identity = 236/591 (39.93%), Positives = 361/591 (61.08%), Query Frame = 1

Query: 21  FHARQSRLLHLLSDCTDLSRLKQIHAQALRT---FSTHKSSLFLFSRILHVSSLTDFEYA 80
           F  ++   L+LL  C ++   KQ+HA+ ++    +S+  S+  + ++  H        YA
Sbjct: 26  FGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYA 85

Query: 81  LRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTFPFLLKACA 140
             +F  ID P +F +NT+I      +   E+A+  +  M++ G+ EPD  T+P LLKAC 
Sbjct: 86  ASIFRGIDDPCTFDFNTMIRGYVNVMSF-EEALCFYNEMMQRGN-EPDNFTYPCLLKACT 145

Query: 141 YVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPHRSLVSWNV 200
            + ++ EG+Q H  + K GL++DV+V NSLI++Y  CG + L+  VFEK+  ++  SW+ 
Sbjct: 146 RLKSIREGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSS 205

Query: 201 MIDAYVQCGLFKNALNLFAEM--QNTFEPDGYTMQSIISACAGIGALSLGMWSHAYVLRK 260
           M+ A    G++   L LF  M  +   + +   M S + ACA  GAL+LGM  H ++LR 
Sbjct: 206 MVSARAGMGMWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRN 265

Query: 261 TGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHGRAEAAL 320
                   +++ +SLVDMY KCG L  A  +F+ M K +  ++++MI   A+HG  E+AL
Sbjct: 266 ISEL---NIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESAL 325

Query: 321 GCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEHYGCLVD 380
             FS++++ E   P+ V +V VL+AC+H G+V EGR+ F  M+ E K+ P  EHYGCLVD
Sbjct: 326 RMFSKMIK-EGLEPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVD 385

Query: 381 LLSRSGFIDEALELVTNIHIKPDAVIWRSLLDAC-YKQNAGVELSEEVALQILQSETTTS 440
           LL R+G ++EALE + +I I+ + VIWR+ L  C  +QN  +EL + +A Q L   ++ +
Sbjct: 386 LLGRAGLLEEALETIQSIPIEKNDVIWRTFLSQCRVRQN--IELGQ-IAAQELLKLSSHN 445

Query: 441 SGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDTSHPKI 500
            G Y+L+S +Y+    W+DV   R  +A KG+ + PG S +E+ G +H F + D SHPK 
Sbjct: 446 PGDYLLISNLYSQGQMWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKC 505

Query: 501 KEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLNLKPGM 560
           KEIY ++  +E +L+  GYSPD +Q  +  D +  K + LK HS++ AIAFGLL   PG 
Sbjct: 506 KEIYKMLHQMEWQLKFEGYSPDLTQILLNVDEEEKK-ERLKGHSQKVAIAFGLLYTPPGS 565

Query: 561 PIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
            I+I +NLR+C+DCH  TK IS I+  EI++RDRNRFH F+ G CSC D+W
Sbjct: 566 IIKIARNLRMCSDCHTYTKKISMIYEREIVVRDRNRFHLFKGGTCSCKDYW 606

BLAST of CmoCh04G016970 vs. NCBI nr
Match: gi|659090955|ref|XP_008446291.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial [Cucumis melo])

HSP 1 Score: 1102.0 bits (2849), Expect = 0.0e+00
Identity = 534/605 (88.26%), Positives = 564/605 (93.22%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTFSTHKSSLF 60
           MLL+IPPN QS P+EIK ETF+  QSRLLHLL+DCTDLS+LKQIHAQA+R FSTHKSSLF
Sbjct: 7   MLLAIPPNSQSFPIEIKRETFNTHQSRLLHLLTDCTDLSKLKQIHAQAIRNFSTHKSSLF 66

Query: 61  LFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEG 120
           L+SRILHVSSL DF+YA RVF+QI +PNSFMWNTLIGACARSLDRKEQAIE+FYRMLEEG
Sbjct: 67  LYSRILHVSSLIDFDYACRVFNQIGNPNSFMWNTLIGACARSLDRKEQAIEIFYRMLEEG 126

Query: 121 SVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLA 180
           SVEPDKHTFPFLLKACAYVFALSEGRQAHAHI K GLD DVYVGNSLIHLYASCGCLS+A
Sbjct: 127 SVEPDKHTFPFLLKACAYVFALSEGRQAHAHIFKLGLDLDVYVGNSLIHLYASCGCLSMA 186

Query: 181 LKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIG 240
           LKVFEKMP RSLVSWNVMIDAYVQCGLF+NAL LF EMQN+FEPDGYTMQSIISACAGIG
Sbjct: 187 LKVFEKMPLRSLVSWNVMIDAYVQCGLFENALKLFFEMQNSFEPDGYTMQSIISACAGIG 246

Query: 241 ALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNS 300
           ALSLGMW+HAYVLRK GGAM G+VLINSSLVDMYSKCGSL MAQQVFETMPKHDLNSWNS
Sbjct: 247 ALSLGMWAHAYVLRKAGGAMAGDVLINSSLVDMYSKCGSLRMAQQVFETMPKHDLNSWNS 306

Query: 301 MILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNE 360
           MILA AMHG  EAAL CFSRLVE E FLPNSVTFVGVLSACNHRGMVA+GRKYFDMMVNE
Sbjct: 307 MILALAMHGLGEAALQCFSRLVEMEIFLPNSVTFVGVLSACNHRGMVADGRKYFDMMVNE 366

Query: 361 YKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSE 420
           YKI PRLEHYGCLVDLLSRSGFIDEALELV N+HIKPDAVIWRSLLDACYKQNAGVELSE
Sbjct: 367 YKIEPRLEHYGCLVDLLSRSGFIDEALELVANMHIKPDAVIWRSLLDACYKQNAGVELSE 426

Query: 421 EVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGV 480
           EVA +ILQSE T SSGVYVLLSRVYASA +WNDVG++RK M D GV KEPGCSSIEIDG+
Sbjct: 427 EVAFKILQSEKTVSSGVYVLLSRVYASARQWNDVGIIRKVMTDMGVTKEPGCSSIEIDGI 486

Query: 481 SHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSER 540
           SHEFFAGDTSHP+IKEIY VIDLIEEKL+KHGYSPD SQATMVD+PD +K QSLKLHSER
Sbjct: 487 SHEFFAGDTSHPRIKEIYGVIDLIEEKLEKHGYSPDCSQATMVDEPDYIKQQSLKLHSER 546

Query: 541 FAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCS 600
            AIAFGLLNLKPG P+RIFKNLRVCNDCHQVTKLIS IFNVEIIMRDRNRFHHF++GMCS
Sbjct: 547 LAIAFGLLNLKPGTPVRIFKNLRVCNDCHQVTKLISEIFNVEIIMRDRNRFHHFKHGMCS 606

Query: 601 CMDFW 606
           CMDFW
Sbjct: 607 CMDFW 611

BLAST of CmoCh04G016970 vs. NCBI nr
Match: gi|449435366|ref|XP_004135466.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial [Cucumis sativus])

HSP 1 Score: 1079.7 bits (2791), Expect = 0.0e+00
Identity = 522/605 (86.28%), Positives = 557/605 (92.07%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTFSTHKSSLF 60
           MLL+IP N QS+P+EIK E     QSR LHLL+DCTDLS+LKQIHAQA+R FSTH SSLF
Sbjct: 1   MLLAIPTNSQSLPIEIKGENSKTHQSRFLHLLTDCTDLSKLKQIHAQAIRNFSTHNSSLF 60

Query: 61  LFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEG 120
           L+SRILHVSSL DF+YA RVF+QID+PNSFMWNTLIGACARSLDRKEQAIE+FYRMLEEG
Sbjct: 61  LYSRILHVSSLIDFDYACRVFNQIDNPNSFMWNTLIGACARSLDRKEQAIEIFYRMLEEG 120

Query: 121 SVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLA 180
           SVEPDKHTFPFLLKACAYVFALSEGRQAHA I K GLD DVYVGNSLIHLYASCGCLS+A
Sbjct: 121 SVEPDKHTFPFLLKACAYVFALSEGRQAHAQIFKLGLDLDVYVGNSLIHLYASCGCLSMA 180

Query: 181 LKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIG 240
           LKVFEKMP RSLVSWNVMIDAYVQ GLF+NAL LF EMQN+FEPDGYTMQSI+SACAGIG
Sbjct: 181 LKVFEKMPLRSLVSWNVMIDAYVQSGLFENALKLFVEMQNSFEPDGYTMQSIVSACAGIG 240

Query: 241 ALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNS 300
           ALSLGMW+HAYVLRK  GAM G+VLINSSLVDMYSKCGSL MAQQVFETMPKHDLNSWNS
Sbjct: 241 ALSLGMWAHAYVLRKASGAMAGDVLINSSLVDMYSKCGSLRMAQQVFETMPKHDLNSWNS 300

Query: 301 MILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNE 360
           MILA AMHGR +AAL CFSRLVE EKFLPNSVTFVGVLSACNH GMVA+GRKYFDMMVN+
Sbjct: 301 MILALAMHGRGQAALQCFSRLVEMEKFLPNSVTFVGVLSACNHGGMVADGRKYFDMMVND 360

Query: 361 YKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSE 420
           YKI PRLEHYGCLVDLLSRSGFIDEALELV N+HIKPDAVIWRSLLDACYKQNAGVELSE
Sbjct: 361 YKIEPRLEHYGCLVDLLSRSGFIDEALELVANMHIKPDAVIWRSLLDACYKQNAGVELSE 420

Query: 421 EVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGV 480
           EVA +ILQSE T SSGVYV+LSRVYASA +WNDVG++RK M D GV KEPGCSSIEIDG+
Sbjct: 421 EVAFKILQSEKTISSGVYVMLSRVYASARQWNDVGIIRKVMTDMGVTKEPGCSSIEIDGI 480

Query: 481 SHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSER 540
           SHEFFAGDTSHP+IKEIY VIDLIEEKL++ GYSPD SQATMVD+PD +K QSLKLHSER
Sbjct: 481 SHEFFAGDTSHPRIKEIYGVIDLIEEKLERRGYSPDCSQATMVDEPDNIKQQSLKLHSER 540

Query: 541 FAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCS 600
            AIAFGLLNLKPG P+RIFKNLRVCNDCHQVTKLIS IFNVEIIMRDRNRFHHF++GMCS
Sbjct: 541 LAIAFGLLNLKPGTPVRIFKNLRVCNDCHQVTKLISEIFNVEIIMRDRNRFHHFKHGMCS 600

Query: 601 CMDFW 606
           CMDFW
Sbjct: 601 CMDFW 605

BLAST of CmoCh04G016970 vs. NCBI nr
Match: gi|641864025|gb|KDO82711.1| (hypothetical protein CISIN_1g038206mg [Citrus sinensis])

HSP 1 Score: 852.8 bits (2202), Expect = 3.6e-244
Identity = 419/615 (68.13%), Positives = 491/615 (79.84%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQ--------SRLLHLLSDCTDLSRLKQIHAQALRTF 60
           M ++I   G   P    H  F+  +        S LL  L++C  +S+LKQIHAQALRT 
Sbjct: 1   MAVAIVQGGPPTPQTHSHSIFNNNRNEGSFNNHSSLLSSLTECKSMSQLKQIHAQALRTA 60

Query: 61  --STHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAI 120
               HK+ L ++SRI+H +S  D +YA RVF QI++PNSF WNTLI ACARS+D K QAI
Sbjct: 61  LPQQHKT-LLIYSRIIHFASFADLDYAFRVFYQIENPNSFTWNTLIRACARSVDAKPQAI 120

Query: 121 ELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHL 180
            LF RM+E+G+V PDKHTFPF LKACAY+FA S+G+QAHAHI KRGL SDVY+ NSLIH 
Sbjct: 121 VLFQRMIEQGNVLPDKHTFPFALKACAYLFAFSQGKQAHAHIFKRGLVSDVYINNSLIHF 180

Query: 181 YASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQ 240
           YASCG L LA KVF+ M  RSLVSWNVMIDA+VQ G F +AL LF  MQ  FEPDGYT Q
Sbjct: 181 YASCGHLDLANKVFDNMLERSLVSWNVMIDAFVQFGEFDSALKLFRRMQILFEPDGYTFQ 240

Query: 241 SIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETM 300
           SI SACAG+  LSLGMW+HAY+LR    ++V +VL+N+SL+DMY KCGSL +A+QVFE+M
Sbjct: 241 SITSACAGLATLSLGMWAHAYILRHCDHSLVTDVLVNNSLIDMYCKCGSLDIARQVFESM 300

Query: 301 PKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEG 360
           PK DL SWNS+IL FA+HGRAEAAL  F RLV  E F PNS+TFVGVLSACNHRGMV+EG
Sbjct: 301 PKRDLTSWNSIILGFALHGRAEAALKYFDRLVVEESFSPNSITFVGVLSACNHRGMVSEG 360

Query: 361 RKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACY 420
           R YFD+M+NEY I P LEHYGCLVDLL+R+G IDEAL LV+N+ +KPDAVIWRSLLDAC 
Sbjct: 361 RDYFDVMINEYNITPVLEHYGCLVDLLARAGNIDEALHLVSNMPMKPDAVIWRSLLDACC 420

Query: 421 KQNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEP 480
           K++A V LSEEVA Q+++SE    SGVYVLLSRVYASA RWNDVGLVRK M DKGV KEP
Sbjct: 421 KKHASVVLSEEVAKQVIESEGGICSGVYVLLSRVYASARRWNDVGLVRKLMTDKGVTKEP 480

Query: 481 GCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVK 540
           GCSSIEIDG++HEFFAGDTSHP+ K+IY  +DLI+EKL+  GY+PDYSQA MVD+ D  K
Sbjct: 481 GCSSIEIDGIAHEFFAGDTSHPQTKQIYGFLDLIDEKLKSRGYTPDYSQAAMVDELDDGK 540

Query: 541 WQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNR 600
             SL+LHSER AIA G+LNLKPGMPIR+FKNLRVC DCH+VTKLISRIFNVEII+RDR R
Sbjct: 541 QSSLRLHSERLAIALGILNLKPGMPIRVFKNLRVCKDCHEVTKLISRIFNVEIIVRDRAR 600

Query: 601 FHHFENGMCSCMDFW 606
           FHHF++G CSCMD+W
Sbjct: 601 FHHFKDGSCSCMDYW 614

BLAST of CmoCh04G016970 vs. NCBI nr
Match: gi|568859476|ref|XP_006483265.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial [Citrus sinensis])

HSP 1 Score: 852.0 bits (2200), Expect = 6.1e-244
Identity = 419/615 (68.13%), Positives = 491/615 (79.84%), Query Frame = 1

Query: 1   MLLSIPPNGQSIPVEIKHETFHARQ--------SRLLHLLSDCTDLSRLKQIHAQALRTF 60
           M ++I   G   P    H  F+  +        S LL  L++C  +S+LKQIHAQALRT 
Sbjct: 1   MAVAIVQGGPPTPQTHSHSIFNNNRNEGSFNNHSSLLSSLTECKSMSQLKQIHAQALRTA 60

Query: 61  --STHKSSLFLFSRILHVSSLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAI 120
               HK+ L ++SRI+H +S  D +YA RVF QI++PNSF WNTLI ACARS+D K QAI
Sbjct: 61  LPQQHKT-LLIYSRIIHFASFADLDYAFRVFYQIENPNSFTWNTLIRACARSVDAKPQAI 120

Query: 121 ELFYRMLEEGSVEPDKHTFPFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHL 180
            LF RM+E+G+V PDKHTFPF LKACAY+FA S+G+QAHAHI KRGL SDVY+ NSLIH 
Sbjct: 121 VLFQRMIEQGNVLPDKHTFPFALKACAYLFAFSQGKQAHAHIFKRGLVSDVYINNSLIHF 180

Query: 181 YASCGCLSLALKVFEKMPHRSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQ 240
           YA+CG L LA KVF+ M  RSLVSWNVMIDA+VQ G F +AL LF  MQ  FEPDGYT Q
Sbjct: 181 YATCGHLDLANKVFDNMLERSLVSWNVMIDAFVQFGEFDSALKLFRRMQILFEPDGYTFQ 240

Query: 241 SIISACAGIGALSLGMWSHAYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETM 300
           SI SACAG+  LSLG W+HAY+LR    ++V +VL+N+SL+DMY KCGSL +A+QVFE+M
Sbjct: 241 SITSACAGLATLSLGTWAHAYILRHCDHSLVTDVLVNNSLIDMYCKCGSLDIARQVFESM 300

Query: 301 PKHDLNSWNSMILAFAMHGRAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEG 360
           PK DL SWNS+IL FA+HGRAEAAL  F RLVE E F PNS+TFVGVLSACNH GMV+EG
Sbjct: 301 PKRDLTSWNSIILGFALHGRAEAALKYFDRLVEEESFSPNSITFVGVLSACNHMGMVSEG 360

Query: 361 RKYFDMMVNEYKIVPRLEHYGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACY 420
           R YFD+M+NEY I P LEHYGCLVDLL+R+G IDEAL LV+N+ +KPDAVIWRSLLDAC 
Sbjct: 361 RDYFDVMINEYNITPVLEHYGCLVDLLARAGNIDEALHLVSNMPMKPDAVIWRSLLDACC 420

Query: 421 KQNAGVELSEEVALQILQSETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEP 480
           K++A V LSEEVA QI++SE    SGVYVLLSRVYASA RWNDVGLVRK M DKGV KEP
Sbjct: 421 KKHASVVLSEEVAKQIIESEGGICSGVYVLLSRVYASARRWNDVGLVRKLMTDKGVTKEP 480

Query: 481 GCSSIEIDGVSHEFFAGDTSHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVK 540
           GCSSIEIDG++HEFFAGDTSHP+ K+IY V+DLI+EKL+  GY+PDYSQA MVD+ D  K
Sbjct: 481 GCSSIEIDGIAHEFFAGDTSHPQTKQIYGVLDLIDEKLKSRGYTPDYSQAAMVDELDDGK 540

Query: 541 WQSLKLHSERFAIAFGLLNLKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNR 600
             SL+LHSER AIA G+LNLKPGMPIR+FKNLRVC DCH+VTKLISRIFNVEII+RDR R
Sbjct: 541 QSSLRLHSERLAIALGILNLKPGMPIRVFKNLRVCKDCHEVTKLISRIFNVEIIVRDRAR 600

Query: 601 FHHFENGMCSCMDFW 606
           FHHF++G CSCMD+W
Sbjct: 601 FHHFKDGSCSCMDYW 614

BLAST of CmoCh04G016970 vs. NCBI nr
Match: gi|567892067|ref|XP_006438554.1| (hypothetical protein CICLE_v10033899mg [Citrus clementina])

HSP 1 Score: 851.7 bits (2199), Expect = 8.0e-244
Identity = 415/596 (69.63%), Positives = 486/596 (81.54%), Query Frame = 1

Query: 12  IPVEIKHETFHARQSRLLHLLSDCTDLSRLKQIHAQALRTF--STHKSSLFLFSRILHVS 71
           +P  + +E      S LL  L++C  +S+LKQIHAQALRT     HK+ L ++SRI+H +
Sbjct: 3   LPHLVTNEGSFNNHSSLLSSLTECKSMSQLKQIHAQALRTALPQQHKT-LLIYSRIIHFA 62

Query: 72  SLTDFEYALRVFDQIDSPNSFMWNTLIGACARSLDRKEQAIELFYRMLEEGSVEPDKHTF 131
           S  D +YA RVF QI++PNSF WNTLI ACARS+D K QAI LF RM+E+G+V PDKHTF
Sbjct: 63  SFADLDYAFRVFYQIENPNSFTWNTLIRACARSVDAKPQAIVLFQRMIEQGNVLPDKHTF 122

Query: 132 PFLLKACAYVFALSEGRQAHAHIVKRGLDSDVYVGNSLIHLYASCGCLSLALKVFEKMPH 191
           PF LKACAY+FA S+G+QAHAHI KRGL SDVY+ NSLIH YA+CG L LA KVF+ M  
Sbjct: 123 PFALKACAYLFAFSQGKQAHAHIFKRGLASDVYINNSLIHFYATCGHLDLANKVFDNMLE 182

Query: 192 RSLVSWNVMIDAYVQCGLFKNALNLFAEMQNTFEPDGYTMQSIISACAGIGALSLGMWSH 251
           RSLVSWNVMIDA+VQ G F +AL LF  MQ  FEPDGYT QSI SACAG+  LSLG W+H
Sbjct: 183 RSLVSWNVMIDAFVQFGEFDSALKLFRRMQILFEPDGYTFQSITSACAGLATLSLGTWAH 242

Query: 252 AYVLRKTGGAMVGEVLINSSLVDMYSKCGSLSMAQQVFETMPKHDLNSWNSMILAFAMHG 311
           AY+LR    ++V +VL+N+SL+DMY KCGSL +A+QVFE+MPK DL SWNS+IL FA+HG
Sbjct: 243 AYILRHCDHSLVTDVLVNNSLIDMYCKCGSLDIARQVFESMPKRDLTSWNSIILGFALHG 302

Query: 312 RAEAALGCFSRLVETEKFLPNSVTFVGVLSACNHRGMVAEGRKYFDMMVNEYKIVPRLEH 371
           RAEAAL  F RLVE E F PNS+TFVGVLSACNH GMV+EGR YFD+M+NEY I P LEH
Sbjct: 303 RAEAALKYFDRLVEEESFSPNSITFVGVLSACNHMGMVSEGRDYFDVMINEYNITPVLEH 362

Query: 372 YGCLVDLLSRSGFIDEALELVTNIHIKPDAVIWRSLLDACYKQNAGVELSEEVALQILQS 431
           YGCLVDLL+R+G IDEAL LV+N+ +KPDAVIWRSLLDAC K++A V LSEEVA QI++S
Sbjct: 363 YGCLVDLLARAGNIDEALHLVSNMPMKPDAVIWRSLLDACCKKHASVVLSEEVAKQIIES 422

Query: 432 ETTTSSGVYVLLSRVYASAHRWNDVGLVRKAMADKGVAKEPGCSSIEIDGVSHEFFAGDT 491
           E    SGVYVLLSRVYASA RWNDVGLVRK M DKGV KEPGCSSIEIDG++HEFFAGDT
Sbjct: 423 EGGICSGVYVLLSRVYASARRWNDVGLVRKLMTDKGVTKEPGCSSIEIDGIAHEFFAGDT 482

Query: 492 SHPKIKEIYAVIDLIEEKLQKHGYSPDYSQATMVDDPDTVKWQSLKLHSERFAIAFGLLN 551
           SHP+ K+IY V+DLI+EKL+  GY+PDYSQA MVD+ D  K  SL+LHSER AIA G+LN
Sbjct: 483 SHPQTKQIYGVLDLIDEKLKSRGYTPDYSQAAMVDELDDGKQSSLRLHSERLAIALGILN 542

Query: 552 LKPGMPIRIFKNLRVCNDCHQVTKLISRIFNVEIIMRDRNRFHHFENGMCSCMDFW 606
           LKPGMPIR+FKNLRVC DCH+VTKLISRIFNVEII+RDR RFHHF++G CSCMD+W
Sbjct: 543 LKPGMPIRVFKNLRVCKDCHEVTKLISRIFNVEIIVRDRARFHHFKDGSCSCMDYW 597

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR85_ARATH  1.4e-216  61.80     Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
PPR21_ARATH  2.0e-133  44.58     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP330_ARATH  3.3e-128  41.91     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP145_ARATH  5.7e-125  40.52     Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
PPR68_ARATH  1.8e-123  39.93     Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity  Description
A0A0A0KTV7_CUCSA  0.0e+00   86.28     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604120 PE=4 SV=1
A0A067H5B3_CITSI  2.5e-244  68.13     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g038206mg PE=4 SV=1
V4VGX7_9ROSI      5.6e-244  69.63     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033899mg PE=4 SV=1
W9R9J8_9ROSA      2.7e-238  68.60     Uncharacterized protein OS=Morus notabilis GN=L484_021064 PE=4 SV=1
F6HPG3_VITVI      6.6e-237  66.39     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01440 PE=4 SV=...
Match Name   E-value   Identity  Description
AT1G59720.1  8.0e-218  61.80     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1  1.1e-134  44.58     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1  1.8e-129  41.91     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1  3.2e-126  40.52     Pentatricopeptide repeat (PPR) superfamily protein
AT1G31920.1  1.0e-124  39.93     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|659090955|ref|XP_008446291.1|  0.0e+00   88.26     PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial ...
gi|449435366|ref|XP_004135466.1|  0.0e+00   86.28     PREDICTED: pentatricopeptide repeat-containing protein At1g59720, mitochondrial ...
gi|641864025|gb|KDO82711.1|       3.6e-244  68.13     hypothetical protein CISIN_1g038206mg [Citrus sinensis]
gi|568859476|ref|XP_006483265.1|  6.1e-244  68.13     PREDICTED: pentatricopeptide repeat-containing protein At1g59720, chloroplastic/...
gi|567892067|ref|XP_006438554.1|  8.0e-244  69.63     hypothetical protein CICLE_v10033899mg [Citrus clementina]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016556 mRNA modification
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
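
Each GO assignment row above has three fields: the ontology category, the accession, and a free-text term name that may itself contain spaces. As a minimal sketch of how such rows could be read programmatically, the example below groups them by category. The function name `parse_go_rows` and the assumption that fields are separated by single spaces are specific to this illustration, not something the database defines.

```python
import re
from collections import defaultdict

# Category, then a GO accession (GO: plus seven digits), then the term name.
GO_ROW = re.compile(
    r"^(biological_process|cellular_component|molecular_function)"
    r"\s+(GO:\d{7})\s+(.+)$"
)

GO_TABLE = """\
biological_process GO:0016556 mRNA modification
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
"""


def parse_go_rows(text):
    """Group (accession, term name) pairs by GO category; skip other lines."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        m = GO_ROW.match(line.strip())
        if m:
            category, accession, name = m.groups()
            grouped[category].append((accession, name))
    return dict(grouped)


go_terms = parse_go_rows(GO_TABLE)
for category, terms in go_terms.items():
    print(category, [accession for accession, _ in terms])
```

Anchoring the pattern on the accession format keeps multi-word term names such as "nucleic acid phosphodiester bond hydrolysis" intact instead of splitting them on whitespace.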

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G016970.1  CmoCh04G016970.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 165..190  score: 7.3E-4
    coord: 268..293  score: 0.045
    coord: 369..389  score: 0.38
    coord: 332..359  score: 0.96
    coord: 297..322  score: 0.
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 87..137  score: 2.4E-7
    coord: 191..237  score: 2.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 193..220  score: 3.7E-5
    coord: 91..125  score: 0
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 330..360  score: 7.344
    coord: 294..329  score: 8.528
    coord: 160..190  score: 8.298
    coord: 398..433  score: 5.711
    coord: 366..396  score: 6.138
    coord: 434..468  score: 5.919
    coord: 263..293  score: 7.18
    coord: 191..221  score: 9.997
    coord: 125..159  score: 5.788
    coord: 88..123  score: 10
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 72..124  score: 9.0E-8
    coord: 190..358  score: 9.
IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 292..328  score: 9.38E-5
    coord: 86..124  score: 9.38E-5
    coord: 181..230  score: 9.3
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 1..475  score: 3.6E
None  No IPR available  PANTHER  PTHR24015:SF20  SUBFAMILY NOT NAMED
    coord: 1..475  score: 3.6E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmoCh04G016970  Wax gourd                     cmowgoB0841
CmoCh04G016970  Wax gourd                     cmowgoB0886
CmoCh04G016970  Cucurbita moschata (Rifu)     cmocmoB124
CmoCh04G016970  Cucurbita moschata (Rifu)     cmocmoB268
CmoCh04G016970  Cucurbita moschata (Rifu)     cmocmoB340
CmoCh04G016970  Cucumber (Gy14) v1            cgycmoB0443
CmoCh04G016970  Cucumber (Gy14) v1            cgycmoB0591
CmoCh04G016970  Cucurbita maxima (Rimu)       cmacmoB314
CmoCh04G016970  Cucurbita maxima (Rimu)       cmacmoB423
CmoCh04G016970  Wild cucumber (PI 183967)     cmocpiB687
CmoCh04G016970  Wild cucumber (PI 183967)     cmocpiB689
CmoCh04G016970  Wild cucumber (PI 183967)     cmocpiB738
CmoCh04G016970  Cucumber (Chinese Long) v2    cmocuB673
CmoCh04G016970  Cucumber (Chinese Long) v2    cmocuB676
CmoCh04G016970  Cucumber (Chinese Long) v2    cmocuB730
CmoCh04G016970  Melon (DHL92) v3.5.1          cmomeB630
CmoCh04G016970  Melon (DHL92) v3.5.1          cmomeB645
CmoCh04G016970  Melon (DHL92) v3.5.1          cmomeB659
CmoCh04G016970  Melon (DHL92) v3.5.1          cmomeB666
CmoCh04G016970  Watermelon (Charleston Gray)  cmowcgB609
CmoCh04G016970  Watermelon (Charleston Gray)  cmowcgB669
CmoCh04G016970  Watermelon (97103) v1         cmowmB647
CmoCh04G016970  Watermelon (97103) v1         cmowmB713
CmoCh04G016970  Watermelon (97103) v1         cmowmB741
CmoCh04G016970  Cucurbita pepo (Zucchini)     cmocpeB647
CmoCh04G016970  Bottle gourd (USVL1VR-Ls)     cmolsiB642
CmoCh04G016970  Bottle gourd (USVL1VR-Ls)     cmolsiB644
CmoCh04G016970  Bottle gourd (USVL1VR-Ls)     cmolsiB687
CmoCh04G016970  Cucumber (Gy14) v2            cgybcmoB119
CmoCh04G016970  Cucumber (Gy14) v2            cgybcmoB644
CmoCh04G016970  Melon (DHL92) v3.6.1          cmomedB718
CmoCh04G016970  Melon (DHL92) v3.6.1          cmomedB753
CmoCh04G016970  Silver-seed gourd             carcmoB0142
CmoCh04G016970  Cucumber (Chinese Long) v3    cmocucB0798
CmoCh04G016970  Cucumber (Chinese Long) v3    cmocucB0807
CmoCh04G016970  Cucumber (Chinese Long) v3    cmocucB0847
CmoCh04G016970  Cucumber (Chinese Long) v3    cmocucB0864
CmoCh04G016970  Watermelon (97103) v2         cmowmbB688
CmoCh04G016970  Watermelon (97103) v2         cmowmbB773
CmoCh04G016970  Watermelon (97103) v2         cmowmbB734
CmoCh04G016970  Watermelon (97103) v2         cmowmbB746