Cp4.1LG09g04680 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g04680
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG09 : 3077575 .. 3078958 (-)
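
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = m.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location("Cp4.1LG09 : 3077575 .. 3078958 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, strand, span)
```

Under that assumption the feature spans 1384 bp on the minus strand of Cp4.1LG09.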
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCTCGCAGCTACTTCCAGTCTCCTCTCTTCCATCCATCGCCATAGCCAAAGCTCACAATCGAACTCCATTAAAGCCAAGCTCCATTTCCATCTCCTCATCTCCTCTTCCATCTCACACTCTTTCAGAAAGAACGCACATCCCCTCCCATGTCTACAGGCATCCAGCCGCCGTGCTTCTCGAGCTCTGCACTTCCATGAAAGAGCTCCACCAAATCCTCCCACTGGTCATAAAAAATGGCCTCTACAACGAGCATTTATTTCAAACCAAGCTCGTCAGCTTGTTTTCCAAGTATGGCAGCATCAACGAGGCCGCTCGTGTTTTCGAGCCGATTGAGGAGAAGATCGACGTTCTCTACCACACGATGCTAAAAGGATATGCGAAGAATTCGTCGTTGGAGACTGCTCTTGCTTTTCTTTGTCGAATGAGGTACGATGATGTTAAGCCTGTTGTGTATAATTTTACTTATTTGCTTAAGGTTTGTGGCGATAATGCGGATTTGAAGAGGGGTAGGGAGATTCATGGGAATCTGATTAAGAATTCGTTTGGGGCGAATGTATTTGCAATGACTGGTGTTGTGAACATGTATGCGAAGTGTAGGCAGATTGACGATGCATACAAGATGTTCGACAGAATGCCTGTGAGAGATTTGGTGTCTTGGAATACGATTATTACAGGGTTTTCCCAAAATGGGTTTGCAAATAAGGCCCTGGAGTTGGTTTTGAGTATGCAAGATGAAGGCCAAAGGCCTGATTCGATTACATTGGTTACTGTTCTTCCTGCTGCTGCTGATATTGGATCGTTAATGGTGGGGAAATCTATTCATGGGTATGCCATTAGAGCTGGATTCTCAAAGCTTGTTAATATTTCAACTGCTTTGGTTGATATGTATTCAAAATGTGGATCAGTTGATACAGCTAGATTGATTTTTGATGGAATGGAACAGAAGACTGTTGTGTCATGGAATTCCATGATGGCTGGATATGTGCAGAGTGGTGAACCAGAGATGGCTATTGCAATCTTTGAGAAGATGTTGGATGAAGGAATAGAGCCTACCAATGTAACCATTATGGAAGCGTTACATGCCTGCGCCGATTTGGGTGATTTCGAAATGGGGAAGTTCGTTCATAAGTTCGTCGATAAGTTAAATCTTGGTTCGGACGTATCCATTATGAACTCATTGATATCTATGTATTCGAAGTGTAATCGAGTCGACATTGCTTCTGATATCTTCAAGAACTTACATCGAAAAACTCTCGTCTCATGGAATGCAATGATTTTGGGTTATACTCAAAATGGAAGAGTGAGTGAAGCTTTGAATTGTTTTTGTGAGATGCAATCTTTAGGTATAAAATCTGATTCATTTACAGTGGTTAGTGTGA

mRNA sequence

ATGAGCTCGCAGCTACTTCCAGTCTCCTCTCTTCCATCCATCGCCATAGCCAAAGCTCACAATCGAACTCCATTAAAGCCAAGCTCCATTTCCATCTCCTCATCTCCTCTTCCATCTCACACTCTTTCAGAAAGAACGCACATCCCCTCCCATGTCTACAGGCATCCAGCCGCCGTGCTTCTCGAGCTCTGCACTTCCATGAAAGAGCTCCACCAAATCCTCCCACTGGTCATAAAAAATGGCCTCTACAACGAGCATTTATTTCAAACCAAGCTCGTCAGCTTGTTTTCCAAGTATGGCAGCATCAACGAGGCCGCTCGTGTTTTCGAGCCGATTGAGGAGAAGATCGACGTTCTCTACCACACGATGCTAAAAGGATATGCGAAGAATTCGTCGTTGGAGACTGCTCTTGCTTTTCTTTGTCGAATGAGGTACGATGATGTTAAGCCTGTTGTGTATAATTTTACTTATTTGCTTAAGGTTTGTGGCGATAATGCGGATTTGAAGAGGGGTAGGGAGATTCATGGGAATCTGATTAAGAATTCGTTTGGGGCGAATGTATTTGCAATGACTGGTGTTGTGAACATGTATGCGAAGTGTAGGCAGATTGACGATGCATACAAGATGTTCGACAGAATGCCTGTGAGAGATTTGGTGTCTTGGAATACGATTATTACAGGGTTTTCCCAAAATGGGTTTGCAAATAAGGCCCTGGAGTTGGTTTTGAGTATGCAAGATGAAGGCCAAAGGCCTGATTCGATTACATTGGTTACTGTTCTTCCTGCTGCTGCTGATATTGGATCGTTAATGGTGGGGAAATCTATTCATGGGTATGCCATTAGAGCTGGATTCTCAAAGCTTGTTAATATTTCAACTGCTTTGGTTGATATGTATTCAAAATGTGGATCAGTTGATACAGCTAGATTGATTTTTGATGGAATGGAACAGAAGACTGTTGTGTCATGGAATTCCATGATGGCTGGATATGTGCAGAGTGGTGAACCAGAGATGGCTATTGCAATCTTTGAGAAGATGTTGGATGAAGGAATAGAGCCTACCAATGTAACCATTATGGAAGCGTTACATGCCTGCGCCGATTTGGGTGATTTCGAAATGGGGAAGTTCGTTCATAAGTTCGTCGATAAGTTAAATCTTGGTTCGGACGTATCCATTATGAACTCATTGATATCTATGTATTCGAAGTGTAATCGAGTCGACATTGCTTCTGATATCTTCAAGAACTTACATCGAAAAACTCTCGTCTCATGGAATGCAATGATTTTGGGTTATACTCAAAATGGAAGATGGTTAGTGTGA
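
The gene and mRNA sequences differ only near their 3' ends, where the gene sequence carries an intron. The sketch below recovers that intron by trimming the shared prefix and suffix of the two sequences. It assumes exactly one intron, and the two constants are short excerpts copied from the ends of the sequences above rather than the full records; `find_single_insertion` is a hypothetical helper name.

```python
def find_single_insertion(genomic, spliced):
    """Return the stretch of `genomic` that is absent from `spliced`.

    Assumes `genomic` equals `spliced` with one contiguous insertion (one intron).
    """
    # Length of the shared prefix.
    p = 0
    while p < len(spliced) and genomic[p] == spliced[p]:
        p += 1
    # Length of the shared suffix, not allowed to overlap the prefix.
    s = 0
    while s < len(spliced) - p and genomic[-1 - s] == spliced[-1 - s]:
        s += 1
    return genomic[p:len(genomic) - s]

# Excerpts from the 3' ends of the gene and mRNA sequences listed above.
GENE_TAIL = ("CAAAATGGAAGAGTGAGTGAAGCTTTGAATTGTTTTTGTGAG"
             "ATGCAATCTTTAGGTATAAAATCTGATTCATTTACAGTGGTTAGTGTGA")
MRNA_TAIL = "CAAAATGGAAGATGGTTAGTGTGA"

intron = find_single_insertion(GENE_TAIL, MRNA_TAIL)
print(len(intron), intron[:2], intron[-2:])
```

On these excerpts the recovered segment is 67 bp long and begins with GT and ends with AG, the canonical splice-site dinucleotides. Prefix and suffix trimming can place the boundary ambiguously when the flanking bases repeat, so treat the result as an approximation for other genes.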

Coding sequence (CDS)

ATGAGCTCGCAGCTACTTCCAGTCTCCTCTCTTCCATCCATCGCCATAGCCAAAGCTCACAATCGAACTCCATTAAAGCCAAGCTCCATTTCCATCTCCTCATCTCCTCTTCCATCTCACACTCTTTCAGAAAGAACGCACATCCCCTCCCATGTCTACAGGCATCCAGCCGCCGTGCTTCTCGAGCTCTGCACTTCCATGAAAGAGCTCCACCAAATCCTCCCACTGGTCATAAAAAATGGCCTCTACAACGAGCATTTATTTCAAACCAAGCTCGTCAGCTTGTTTTCCAAGTATGGCAGCATCAACGAGGCCGCTCGTGTTTTCGAGCCGATTGAGGAGAAGATCGACGTTCTCTACCACACGATGCTAAAAGGATATGCGAAGAATTCGTCGTTGGAGACTGCTCTTGCTTTTCTTTGTCGAATGAGGTACGATGATGTTAAGCCTGTTGTGTATAATTTTACTTATTTGCTTAAGGTTTGTGGCGATAATGCGGATTTGAAGAGGGGTAGGGAGATTCATGGGAATCTGATTAAGAATTCGTTTGGGGCGAATGTATTTGCAATGACTGGTGTTGTGAACATGTATGCGAAGTGTAGGCAGATTGACGATGCATACAAGATGTTCGACAGAATGCCTGTGAGAGATTTGGTGTCTTGGAATACGATTATTACAGGGTTTTCCCAAAATGGGTTTGCAAATAAGGCCCTGGAGTTGGTTTTGAGTATGCAAGATGAAGGCCAAAGGCCTGATTCGATTACATTGGTTACTGTTCTTCCTGCTGCTGCTGATATTGGATCGTTAATGGTGGGGAAATCTATTCATGGGTATGCCATTAGAGCTGGATTCTCAAAGCTTGTTAATATTTCAACTGCTTTGGTTGATATGTATTCAAAATGTGGATCAGTTGATACAGCTAGATTGATTTTTGATGGAATGGAACAGAAGACTGTTGTGTCATGGAATTCCATGATGGCTGGATATGTGCAGAGTGGTGAACCAGAGATGGCTATTGCAATCTTTGAGAAGATGTTGGATGAAGGAATAGAGCCTACCAATGTAACCATTATGGAAGCGTTACATGCCTGCGCCGATTTGGGTGATTTCGAAATGGGGAAGTTCGTTCATAAGTTCGTCGATAAGTTAAATCTTGGTTCGGACGTATCCATTATGAACTCATTGATATCTATGTATTCGAAGTGTAATCGAGTCGACATTGCTTCTGATATCTTCAAGAACTTACATCGAAAAACTCTCGTCTCATGGAATGCAATGATTTTGGGTTATACTCAAAATGGAAGATGGTTAGTGTGA

Protein sequence

MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMILGYTQNGRWLV
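
The protein sequence can be checked against the coding sequence by translating it. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `CDS_START` and `CDS_END` are short in-frame excerpts from the two ends of the CDS above, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

CDS_START = "ATGAGCTCGCAGCTACTTCCAGTC"   # first 8 codons of the CDS
CDS_END = "AATGGAAGATGGTTAGTGTGA"        # last 7 codons, including the stop

print(translate(CDS_START))
print(translate(CDS_END))
```

The first excerpt translates to MSSQLLPV and the last to NGRWLV followed by a stop, matching the two ends of the protein sequence listed above.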
BLAST of Cp4.1LG09g04680 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 2.3e-155
Identity = 269/435 (61.84%), Positives = 339/435 (77.93%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSSQL+  S++P I    + +R                 H LSER +IP++VY HPAA+L
Sbjct: 1   MSSQLVQFSTVPQIPNPPSRHR-----------------HFLSERNYIPANVYEHPAALL 60

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LE C+S+KEL QILPLV KNGLY EH FQTKLVSLF +YGS++EAARVFEPI+ K++VLY
Sbjct: 61  LERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLY 120

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKG+AK S L+ AL F  RMRYDDV+PVVYNFTYLLKVCGD A+L+ G+EIHG L+K
Sbjct: 121 HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 180

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           + F  ++FAMTG+ NMYAKCRQ+++A K+FDRMP RDLVSWNTI+ G+SQNG A  ALE+
Sbjct: 181 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 240

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           V SM +E  +P  IT+V+VLPA + +  + VGK IHGYA+R+GF  LVNISTALVDMY+K
Sbjct: 241 VKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK 300

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           CGS++TAR +FDGM ++ VVSWNSM+  YVQ+  P+ A+ IF+KMLDEG++PT+V++M A
Sbjct: 301 CGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGA 360

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E G+F+HK   +L L  +VS++NSLISMY KC  VD A+ +F  L  +TL
Sbjct: 361 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 418

Query: 421 VSWNAMILGYTQNGR 436
           VSWNAMILG+ QNGR
Sbjct: 421 VSWNAMILGFAQNGR 418
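
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below parses one summary line and recomputes both values. The regular expression is an assumption about the line layout shown on this page, and it tolerates spelling variants of the second label; `hsp_percentages` is a hypothetical helper name.

```python
import re

# Matches "Identity = a/b ... Positives = c/d"; tolerant of label misspellings.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 269/435 (61.84%), Positives = 339/435 (77.93%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the PPR32_ARATH hit this reproduces the reported 61.84% identity and 77.93% positives.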

BLAST of Cp4.1LG09g04680 vs. Swiss-Prot
Match: PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H73 PE=3 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 2.6e-66
Identity = 123/344 (35.76%), Positives = 202/344 (58.72%), Query Frame = 1

Query: 93  VSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYAKNSSLETALAFLCRMRY-DDVKPV 152
           +++F ++G++ +A  VF  + E+    ++ ++ GYAK    + A+    RM +   VKP 
Sbjct: 136 LAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPD 195

Query: 153 VYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFD 212
           VY F  +L+ CG   DL RG+E+H ++++  +  ++  +  ++ MY KC  +  A  +FD
Sbjct: 196 VYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFD 255

Query: 213 RMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMV 272
           RMP RD++SWN +I+G+ +NG  ++ LEL  +M+     PD +TL +V+ A   +G   +
Sbjct: 256 RMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRL 315

Query: 273 GKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQ 332
           G+ IH Y I  GF+  +++  +L  MY   GS   A  +F  ME+K +VSW +M++GY  
Sbjct: 316 GRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEY 375

Query: 333 SGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSI 392
           +  P+ AI  +  M  + ++P  +T+   L ACA LGD + G  +HK   K  L S V +
Sbjct: 376 NFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIV 435

Query: 393 MNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMILGYTQNGR 436
            N+LI+MYSKC  +D A DIF N+ RK ++SW ++I G   N R
Sbjct: 436 ANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNR 479

BLAST of Cp4.1LG09g04680 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 2.7e-63
Identity = 123/380 (32.37%), Positives = 218/380 (57.37%), Query Frame = 1

Query: 58  AVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKID 117
           A L++  T   +L QI   ++  GL       TKL+   S +G I  A +VF+ +     
Sbjct: 25  ASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQI 84

Query: 118 VLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGN 177
             ++ +++GY++N+  + AL     M+   V P  + F +LLK C   + L+ GR +H  
Sbjct: 85  FPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQ 144

Query: 178 LIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPV--RDLVSWNTIITGFSQNGFAN 237
           + +  F A+VF   G++ +YAKCR++  A  +F+ +P+  R +VSW  I++ ++QNG   
Sbjct: 145 VFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPM 204

Query: 238 KALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALV 297
           +ALE+   M+    +PD + LV+VL A   +  L  G+SIH   ++ G     ++  +L 
Sbjct: 205 EALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLN 264

Query: 298 DMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNV 357
            MY+KCG V TA+++FD M+   ++ WN+M++GY ++G    AI +F +M+++ + P  +
Sbjct: 265 TMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTI 324

Query: 358 TIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNL 417
           +I  A+ ACA +G  E  + ++++V + +   DV I ++LI M++KC  V+ A  +F   
Sbjct: 325 SITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRT 384

Query: 418 HRKTLVSWNAMILGYTQNGR 436
             + +V W+AMI+GY  +GR
Sbjct: 385 LDRDVVVWSAMIVGYGLHGR 404

BLAST of Cp4.1LG09g04680 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 9.4e-61
Identity = 119/377 (31.56%), Positives = 211/377 (55.97%), Query Frame = 1

Query: 60  LLELCT---SMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKI 119
           +L+LC    S+K+  ++   +  NG   +    +KL  +++  G + EA+RVF+ ++ + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 120 DVLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHG 179
            + ++ ++   AK+     ++    +M    V+   Y F+ + K       +  G ++HG
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHG 219

Query: 180 NLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANK 239
            ++K+ FG        +V  Y K +++D A K+FD M  RD++SWN+II G+  NG A K
Sbjct: 220 FILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEK 279

Query: 240 ALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVD 299
            L + + M   G   D  T+V+V    AD   + +G+++H   ++A FS+       L+D
Sbjct: 280 GLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLD 339

Query: 300 MYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVT 359
           MYSKCG +D+A+ +F  M  ++VVS+ SM+AGY + G    A+ +FE+M +EGI P   T
Sbjct: 340 MYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYT 399

Query: 360 IMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLH 419
           +   L+ CA     + GK VH+++ + +LG D+ + N+L+ MY+KC  +  A  +F  + 
Sbjct: 400 VTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMR 459

Query: 420 RKTLVSWNAMILGYTQN 434
            K ++SWN +I GY++N
Sbjct: 460 VKDIISWNTIIGGYSKN 476

BLAST of Cp4.1LG09g04680 vs. Swiss-Prot
Match: PP214_ARATH (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana GN=PCMP-E82 PE=3 SV=2)

HSP 1 Score: 233.8 bits (595), Expect = 3.6e-60
Identity = 122/390 (31.28%), Positives = 221/390 (56.67%), Query Frame = 1

Query: 56  PAAVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFS---KYGSINEAARVFEPI 115
           P    LE C S+ EL+Q+  L+IK+ +    +  ++L+   +   +  +++ A  VFE I
Sbjct: 8   PILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESI 67

Query: 116 EEKIDVLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGR 175
           +     ++++M++GY+ + + + AL F   M      P  + F Y+LK C    D++ G 
Sbjct: 68  DCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGS 127

Query: 176 EIHGNLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNG 235
            +HG ++K  F  N++  T +++MY  C +++   ++F+ +P  ++V+W ++I+GF  N 
Sbjct: 128 CVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNN 187

Query: 236 FANKALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGF-----SKL 295
             + A+E    MQ  G + +   +V +L A      ++ GK  HG+    GF     SK+
Sbjct: 188 RFSDAIEAFREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKV 247

Query: 296 ---VNISTALVDMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEK 355
              V ++T+L+DMY+KCG + TAR +FDGM ++T+VSWNS++ GY Q+G+ E A+ +F  
Sbjct: 248 GFNVILATSLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLD 307

Query: 356 MLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNR 415
           MLD GI P  VT +  + A    G  ++G+ +H +V K     D +I+ +L++MY+K   
Sbjct: 308 MLDLGIAPDKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGD 367

Query: 416 VDIASDIFKNLHRKTLVSWNAMILGYTQNG 435
            + A   F++L +K  ++W  +I+G   +G
Sbjct: 368 AESAKKAFEDLEKKDTIAWTVVIIGLASHG 397

BLAST of Cp4.1LG09g04680 vs. TrEMBL
Match: A0A0A0KPP6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G583260 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 7.7e-219
Identity = 379/427 (88.76%), Positives = 401/427 (93.91%), Query Frame = 1

Query: 9   SSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMK 68
           SS+P IAI KAHN+TPLK SSI+I SSPLP HTLSER HIPSHVY+HPAAVLLELCTSMK
Sbjct: 4   SSVPPIAIVKAHNQTPLKSSSITIPSSPLPFHTLSERAHIPSHVYKHPAAVLLELCTSMK 63

Query: 69  ELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYA 128
           ELHQI+PLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPI++K+D LYHTMLKGYA
Sbjct: 64  ELHQIIPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIDDKLDALYHTMLKGYA 123

Query: 129 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVF 188
           KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRG+EIHG LI NSF ANVF
Sbjct: 124 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGKEIHGQLITNSFAANVF 183

Query: 189 AMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEG 248
           AMTGVVNMYAKCRQIDDAYKMFDRMP RDLVSWNTII GFSQNGFA KALELVL MQDEG
Sbjct: 184 AMTGVVNMYAKCRQIDDAYKMFDRMPERDLVSWNTIIAGFSQNGFAKKALELVLRMQDEG 243

Query: 249 QRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTAR 308
           QRPDSITLVTVLPAAAD+G LMVGKSIHGYAIRAGF+KLVNISTAL DMYSKCGSV+TAR
Sbjct: 244 QRPDSITLVTVLPAAADVGLLMVGKSIHGYAIRAGFAKLVNISTALADMYSKCGSVETAR 303

Query: 309 LIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLG 368
           LIFDGM+QKTVVSWNSMM GYVQ+GEPE AIA+FEKML+EGI+PT VTIMEALHACADLG
Sbjct: 304 LIFDGMDQKTVVSWNSMMDGYVQNGEPEKAIAVFEKMLEEGIDPTGVTIMEALHACADLG 363

Query: 369 DFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMIL 428
           D E GKFVHKFVD+LNLGSD+S+MNSLISMYSKC RVDIASDIF NL+ +T VSWNAMIL
Sbjct: 364 DLERGKFVHKFVDQLNLGSDISVMNSLISMYSKCKRVDIASDIFNNLNGRTHVSWNAMIL 423

Query: 429 GYTQNGR 436
           GY QNGR
Sbjct: 424 GYAQNGR 430

BLAST of Cp4.1LG09g04680 vs. TrEMBL
Match: B9GXA8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s19010g PE=4 SV=2)

HSP 1 Score: 634.0 bits (1634), Expect = 1.3e-178
Identity = 306/434 (70.51%), Positives = 361/434 (83.18%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSS LLP ++ P              P  I   +SPL  HTLS+RTHIPSH+Y+HPAA+L
Sbjct: 1   MSSHLLPFTATP--------------PPQIPSKASPLAQHTLSQRTHIPSHIYKHPAAIL 60

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LELCTS KE+HQILP +IKNGLYNE LFQTKL+SLF KYG++ EA+RVFEPIE+K D LY
Sbjct: 61  LELCTSSKEVHQILPQIIKNGLYNETLFQTKLISLFCKYGNLTEASRVFEPIEDKFDALY 120

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKGYAK+SSL++AL+F  RM++D V+PVVYNFTYLLK+CGDN+DLKRG+EIHG++I 
Sbjct: 121 HTMLKGYAKSSSLDSALSFFSRMKHDSVRPVVYNFTYLLKLCGDNSDLKRGKEIHGSVIT 180

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           + F  N+FAMTGVVNMYAKCRQI+DAY MFDRMP RDLV WNT+I+G++QNGFA  AL L
Sbjct: 181 SGFSWNLFAMTGVVNMYAKCRQINDAYNMFDRMPERDLVCWNTMISGYAQNGFAKVALML 240

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           VL M +EG RPDSIT+V++LPA AD   L +G ++HGY +RAGF  LVN+STALVDMYSK
Sbjct: 241 VLRMSEEGHRPDSITIVSILPAVADTRLLRIGMAVHGYVLRAGFESLVNVSTALVDMYSK 300

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           CGSV  AR+IFDGM+ +TVVSWNSM+ GYVQSG+ E A+ IF+KMLDEG++PTNVT+M A
Sbjct: 301 CGSVSIARVIFDGMDHRTVVSWNSMIDGYVQSGDAEGAMLIFQKMLDEGVQPTNVTVMGA 360

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E GKFVHK VD+L L SDVS+MNSLISMYSKC RVDIA+DIFKNL  KTL
Sbjct: 361 LHACADLGDLERGKFVHKLVDQLKLDSDVSVMNSLISMYSKCKRVDIAADIFKNLRNKTL 420

Query: 421 VSWNAMILGYTQNG 435
           VSWNAMILGY QNG
Sbjct: 421 VSWNAMILGYAQNG 420

BLAST of Cp4.1LG09g04680 vs. TrEMBL
Match: F6I6N4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g02100 PE=4 SV=1)

HSP 1 Score: 632.1 bits (1629), Expect = 5.0e-178
Identity = 304/415 (73.25%), Positives = 358/415 (86.27%), Query Frame = 1

Query: 21  NRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMKELHQILPLVIKN 80
           N  PL P      +S L   T S RT+IPSHVY+HP+A+LLELCTSMKELHQ +PL+IKN
Sbjct: 51  NTLPLPPPPPPSPTSNL-HRTPSSRTYIPSHVYKHPSAILLELCTSMKELHQFIPLIIKN 110

Query: 81  GLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYAKNSSLETALAFL 140
           GLY+EHLFQTKLVSLF K+GS++EAARVF+PIE+KID LYHTMLKGYA+NSSL+ A++F 
Sbjct: 111 GLYSEHLFQTKLVSLFCKFGSLHEAARVFQPIEDKIDELYHTMLKGYARNSSLDDAVSFF 170

Query: 141 CRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVFAMTGVVNMYAKC 200
           CRMRYD V+PVVYNFTYLLKVCGDNADL++G+EIH  LI N F +NVFAMTGVVNMYAKC
Sbjct: 171 CRMRYDGVRPVVYNFTYLLKVCGDNADLRKGKEIHCQLIVNGFASNVFAMTGVVNMYAKC 230

Query: 201 RQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEGQRPDSITLVTVL 260
           R +++AYKMFDRMP RDLV WNTII+G++QNGF   ALELVL MQ+EG+RPDSIT+V++L
Sbjct: 231 RLVEEAYKMFDRMPERDLVCWNTIISGYAQNGFGKTALELVLRMQEEGKRPDSITIVSIL 290

Query: 261 PAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTARLIFDGMEQKTVV 320
           PA AD+GSL +G+SIHGY++RAGF   VN+STALVDMYSKCGSV TARLIFD M  KTVV
Sbjct: 291 PAVADVGSLRIGRSIHGYSMRAGFESFVNVSTALVDMYSKCGSVGTARLIFDRMTGKTVV 350

Query: 321 SWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFV 380
           SWNSM+ GYVQ+G+P  A+ IF+KM+DE +E TNVT+M ALHACADLGD E G+FVHK +
Sbjct: 351 SWNSMIDGYVQNGDPGAAMEIFQKMMDEQVEMTNVTVMGALHACADLGDVEQGRFVHKLL 410

Query: 381 DKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMILGYTQNGR 436
           D+L LGSDVS+MNSLISMYSKC RVDIA++IF+NL  KTLVSWNAMILGY QNGR
Sbjct: 411 DQLELGSDVSVMNSLISMYSKCKRVDIAAEIFENLQHKTLVSWNAMILGYAQNGR 464

BLAST of Cp4.1LG09g04680 vs. TrEMBL
Match: M5X677_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001444mg PE=4 SV=1)

HSP 1 Score: 628.2 bits (1619), Expect = 7.3e-177
Identity = 315/438 (71.92%), Positives = 366/438 (83.56%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSIS--ISSSPLPS-HTLSERTHIPSHVYRHPA 60
           MSSQLL  ++L  + I  +    PL PS     IS+    + HTLS+RTHIPSHVY HPA
Sbjct: 1   MSSQLLHYTAL--LPITNSITPPPLTPSRARPPISAPQFQAFHTLSQRTHIPSHVYTHPA 60

Query: 61  AVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKID 120
           A+LLELCTS+KEL+QI+PL+IKNGLYNEHLFQTKLVSLF  YGS +EA RVFE +E+K++
Sbjct: 61  AILLELCTSIKELNQIIPLIIKNGLYNEHLFQTKLVSLFCNYGSPSEAFRVFETVEDKLE 120

Query: 121 VLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGN 180
           V YHT+LKGYAKNSSL  A++F CRM+ D V+PVVYNFTYLLKVCGDNADL+RG+EIH +
Sbjct: 121 VFYHTLLKGYAKNSSLGDAMSFFCRMKSDGVRPVVYNFTYLLKVCGDNADLRRGKEIHAH 180

Query: 181 LIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKA 240
           LI + F  N+FAMT VVNMYAKCRQI++AYKMFDRMP RDLVSWNTII G++QNG A  A
Sbjct: 181 LISSGFATNLFAMTAVVNMYAKCRQINEAYKMFDRMPERDLVSWNTIIAGYAQNGLAKIA 240

Query: 241 LELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDM 300
           LELV+ MQ+EGQ+PDSITLVT+LPA AD GSL++GKSIH Y +RA F  LVNISTAL+DM
Sbjct: 241 LELVIRMQEEGQKPDSITLVTLLPAVADYGSLIIGKSIHAYVLRASFESLVNISTALLDM 300

Query: 301 YSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTI 360
           YSKCGSV TARLIF+ M+QKT VSWNSM+ GYVQ+ + E A+ IF+KMLDEG +PTNVTI
Sbjct: 301 YSKCGSVGTARLIFNRMKQKTAVSWNSMIDGYVQNEDAEEAMEIFQKMLDEGFQPTNVTI 360

Query: 361 MEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHR 420
           MEALHACADLGD E GKFVHK VD+L LGSDVS+MNSL+SMYSKC RVDIA+ IFKNL  
Sbjct: 361 MEALHACADLGDLERGKFVHKLVDQLKLGSDVSVMNSLMSMYSKCKRVDIAAKIFKNLLG 420

Query: 421 KTLVSWNAMILGYTQNGR 436
           KTLVSWN MILGY QNGR
Sbjct: 421 KTLVSWNTMILGYAQNGR 436

BLAST of Cp4.1LG09g04680 vs. TrEMBL
Match: A0A067L904_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24583 PE=4 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 1.1e-169
Identity = 295/434 (67.97%), Positives = 361/434 (83.18%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSSQLLP +S P+   +  +++T         S+SP    TLS+R HIP+++Y+HPAA+L
Sbjct: 1   MSSQLLPFASTPTPPSSPLYSKT-------RSSASP----TLSKRIHIPAYIYKHPAAIL 60

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LELCTS+KELHQILPL+IKNG YNE LFQTKLVSLF KYGS+ EAA VFE +++K++ LY
Sbjct: 61  LELCTSIKELHQILPLIIKNGFYNEELFQTKLVSLFCKYGSLTEAACVFEHVDDKLEALY 120

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKGYAKNSSL+ AL+F CRM++D+V+PVVYNFTYLLK+CGDN+DL+RG+EIHG LI 
Sbjct: 121 HTMLKGYAKNSSLDAALSFFCRMKHDNVEPVVYNFTYLLKLCGDNSDLRRGKEIHGQLIT 180

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           +    N FAMTGVVN+YAKCR+ DDAYKMFDRM  RDLV WNTII+G++QNG    AL+L
Sbjct: 181 SGLSWNQFAMTGVVNLYAKCRKTDDAYKMFDRMSERDLVCWNTIISGYAQNGLPEVALQL 240

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           V  + +EG RPDSIT+V++LPA A+I SL +GK+IHGY IRAGF  LVN STALVDMYSK
Sbjct: 241 VPRIFEEGHRPDSITIVSILPAVANIKSLRIGKAIHGYVIRAGFESLVNTSTALVDMYSK 300

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           C SV TAR+IFDGM ++TVV+WNSM+ G VQSG+P+ A+A+F+KMLDEG +P++VT+ME 
Sbjct: 301 CESVGTARVIFDGMNRRTVVTWNSMIDGCVQSGDPQEAMALFQKMLDEGFQPSDVTLMEV 360

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E GKFVHK VD+L L S+VS+MNSLISMYS+C RVDIA++IFKNL  KTL
Sbjct: 361 LHACADLGDLEQGKFVHKLVDELKLDSNVSVMNSLISMYSRCKRVDIAANIFKNLQNKTL 420

Query: 421 VSWNAMILGYTQNG 435
           VSWNAMILGY QNG
Sbjct: 421 VSWNAMILGYAQNG 423

BLAST of Cp4.1LG09g04680 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 550.1 bits (1416), Expect = 1.3e-156
Identity = 269/435 (61.84%), Positives = 339/435 (77.93%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSSQL+  S++P I    + +R                 H LSER +IP++VY HPAA+L
Sbjct: 1   MSSQLVQFSTVPQIPNPPSRHR-----------------HFLSERNYIPANVYEHPAALL 60

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LE C+S+KEL QILPLV KNGLY EH FQTKLVSLF +YGS++EAARVFEPI+ K++VLY
Sbjct: 61  LERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLY 120

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKG+AK S L+ AL F  RMRYDDV+PVVYNFTYLLKVCGD A+L+ G+EIHG L+K
Sbjct: 121 HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 180

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           + F  ++FAMTG+ NMYAKCRQ+++A K+FDRMP RDLVSWNTI+ G+SQNG A  ALE+
Sbjct: 181 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 240

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           V SM +E  +P  IT+V+VLPA + +  + VGK IHGYA+R+GF  LVNISTALVDMY+K
Sbjct: 241 VKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK 300

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           CGS++TAR +FDGM ++ VVSWNSM+  YVQ+  P+ A+ IF+KMLDEG++PT+V++M A
Sbjct: 301 CGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGA 360

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E G+F+HK   +L L  +VS++NSLISMY KC  VD A+ +F  L  +TL
Sbjct: 361 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 418

Query: 421 VSWNAMILGYTQNGR 436
           VSWNAMILG+ QNGR
Sbjct: 421 VSWNAMILGFAQNGR 418

BLAST of Cp4.1LG09g04680 vs. TAIR10
Match: AT1G15510.1 (AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 254.2 bits (648), Expect = 1.4e-67
Identity = 123/344 (35.76%), Positives = 202/344 (58.72%), Query Frame = 1

Query: 93  VSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYAKNSSLETALAFLCRMRY-DDVKPV 152
           +++F ++G++ +A  VF  + E+    ++ ++ GYAK    + A+    RM +   VKP 
Sbjct: 136 LAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPD 195

Query: 153 VYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFD 212
           VY F  +L+ CG   DL RG+E+H ++++  +  ++  +  ++ MY KC  +  A  +FD
Sbjct: 196 VYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFD 255

Query: 213 RMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMV 272
           RMP RD++SWN +I+G+ +NG  ++ LEL  +M+     PD +TL +V+ A   +G   +
Sbjct: 256 RMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRL 315

Query: 273 GKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQ 332
           G+ IH Y I  GF+  +++  +L  MY   GS   A  +F  ME+K +VSW +M++GY  
Sbjct: 316 GRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEY 375

Query: 333 SGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSI 392
           +  P+ AI  +  M  + ++P  +T+   L ACA LGD + G  +HK   K  L S V +
Sbjct: 376 NFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIV 435

Query: 393 MNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMILGYTQNGR 436
            N+LI+MYSKC  +D A DIF N+ RK ++SW ++I G   N R
Sbjct: 436 ANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNR 479

BLAST of Cp4.1LG09g04680 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 244.2 bits (622), Expect = 1.5e-64
Identity = 123/380 (32.37%), Positives = 218/380 (57.37%), Query Frame = 1

Query: 58  AVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKID 117
           A L++  T   +L QI   ++  GL       TKL+   S +G I  A +VF+ +     
Sbjct: 25  ASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQI 84

Query: 118 VLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGN 177
             ++ +++GY++N+  + AL     M+   V P  + F +LLK C   + L+ GR +H  
Sbjct: 85  FPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQ 144

Query: 178 LIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPV--RDLVSWNTIITGFSQNGFAN 237
           + +  F A+VF   G++ +YAKCR++  A  +F+ +P+  R +VSW  I++ ++QNG   
Sbjct: 145 VFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPM 204

Query: 238 KALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALV 297
           +ALE+   M+    +PD + LV+VL A   +  L  G+SIH   ++ G     ++  +L 
Sbjct: 205 EALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLN 264

Query: 298 DMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNV 357
            MY+KCG V TA+++FD M+   ++ WN+M++GY ++G    AI +F +M+++ + P  +
Sbjct: 265 TMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTI 324

Query: 358 TIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNL 417
           +I  A+ ACA +G  E  + ++++V + +   DV I ++LI M++KC  V+ A  +F   
Sbjct: 325 SITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRT 384

Query: 418 HRKTLVSWNAMILGYTQNGR 436
             + +V W+AMI+GY  +GR
Sbjct: 385 LDRDVVVWSAMIVGYGLHGR 404

BLAST of Cp4.1LG09g04680 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 235.7 bits (600), Expect = 5.3e-62
Identity = 119/377 (31.56%), Positives = 211/377 (55.97%), Query Frame = 1

Query: 60  LLELCT---SMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKI 119
           +L+LC    S+K+  ++   +  NG   +    +KL  +++  G + EA+RVF+ ++ + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 120 DVLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHG 179
            + ++ ++   AK+     ++    +M    V+   Y F+ + K       +  G ++HG
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHG 219

Query: 180 NLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANK 239
            ++K+ FG        +V  Y K +++D A K+FD M  RD++SWN+II G+  NG A K
Sbjct: 220 FILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEK 279

Query: 240 ALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVD 299
            L + + M   G   D  T+V+V    AD   + +G+++H   ++A FS+       L+D
Sbjct: 280 GLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLD 339

Query: 300 MYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVT 359
           MYSKCG +D+A+ +F  M  ++VVS+ SM+AGY + G    A+ +FE+M +EGI P   T
Sbjct: 340 MYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYT 399

Query: 360 IMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLH 419
           +   L+ CA     + GK VH+++ + +LG D+ + N+L+ MY+KC  +  A  +F  + 
Sbjct: 400 VTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMR 459

Query: 420 RKTLVSWNAMILGYTQN 434
            K ++SWN +I GY++N
Sbjct: 460 VKDIISWNTIIGGYSKN 476

BLAST of Cp4.1LG09g04680 vs. TAIR10
Match: AT3G05240.1 (AT3G05240.1 mitochondrial editing factor 19)

HSP 1 Score: 233.8 bits (595), Expect = 2.0e-61
Identity = 122/390 (31.28%), Positives = 221/390 (56.67%), Query Frame = 1

Query: 56  PAAVLLELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFS---KYGSINEAARVFEPI 115
           P    LE C S+ EL+Q+  L+IK+ +    +  ++L+   +   +  +++ A  VFE I
Sbjct: 8   PILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESI 67

Query: 116 EEKIDVLYHTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGR 175
           +     ++++M++GY+ + + + AL F   M      P  + F Y+LK C    D++ G 
Sbjct: 68  DCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGS 127

Query: 176 EIHGNLIKNSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNG 235
            +HG ++K  F  N++  T +++MY  C +++   ++F+ +P  ++V+W ++I+GF  N 
Sbjct: 128 CVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNN 187

Query: 236 FANKALELVLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGF-----SKL 295
             + A+E    MQ  G + +   +V +L A      ++ GK  HG+    GF     SK+
Sbjct: 188 RFSDAIEAFREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKV 247

Query: 296 ---VNISTALVDMYSKCGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEK 355
              V ++T+L+DMY+KCG + TAR +FDGM ++T+VSWNS++ GY Q+G+ E A+ +F  
Sbjct: 248 GFNVILATSLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLD 307

Query: 356 MLDEGIEPTNVTIMEALHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNR 415
           MLD GI P  VT +  + A    G  ++G+ +H +V K     D +I+ +L++MY+K   
Sbjct: 308 MLDLGIAPDKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGD 367

Query: 416 VDIASDIFKNLHRKTLVSWNAMILGYTQNG 435
            + A   F++L +K  ++W  +I+G   +G
Sbjct: 368 AESAKKAFEDLEKKDTIAWTVVIIGLASHG 397

BLAST of Cp4.1LG09g04680 vs. NCBI nr
Match: gi|659090279|ref|XP_008445930.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X1 [Cucumis melo])

HSP 1 Score: 774.6 bits (1999), Expect = 9.0e-221
Identity = 382/427 (89.46%), Positives = 404/427 (94.61%), Query Frame = 1

Query: 9   SSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMK 68
           SS+P IAI KAHNRTPLK SSI+I SSPLP HTLSERTHIPSHVY+HPAAVLLELCTSMK
Sbjct: 4   SSVPPIAIVKAHNRTPLKSSSITIPSSPLPFHTLSERTHIPSHVYKHPAAVLLELCTSMK 63

Query: 69  ELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYA 128
           ELHQI+PLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPI++K+D LYHTMLKGYA
Sbjct: 64  ELHQIIPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIDDKLDALYHTMLKGYA 123

Query: 129 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVF 188
           KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRG+EIHG LI NSFGANVF
Sbjct: 124 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGKEIHGQLITNSFGANVF 183

Query: 189 AMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEG 248
           AMTGVVNMYAKCRQIDDAYKMFDRMP RDLVSWNTII GFSQNGFA KALELVL MQDEG
Sbjct: 184 AMTGVVNMYAKCRQIDDAYKMFDRMPERDLVSWNTIIAGFSQNGFAKKALELVLRMQDEG 243

Query: 249 QRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTAR 308
           QRPDSITLVTVLPAAAD+GSLMVGKSIHGYAIRAGF+KLVNISTAL DMYSKCGSV+TAR
Sbjct: 244 QRPDSITLVTVLPAAADVGSLMVGKSIHGYAIRAGFAKLVNISTALADMYSKCGSVETAR 303

Query: 309 LIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLG 368
           LIFDGM+QKTVVSWNSMM GYVQ+GEPE AIA+FEKML+EGI+PT+VTIMEALHACADLG
Sbjct: 304 LIFDGMDQKTVVSWNSMMDGYVQNGEPEKAIAVFEKMLEEGIDPTSVTIMEALHACADLG 363

Query: 369 DFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMIL 428
           D E GKFVHKFVD+LNLGSD+S+MNSLISMYSKC R DIASDIF NL+ +T VSWNAMIL
Sbjct: 364 DLERGKFVHKFVDQLNLGSDISVMNSLISMYSKCKRADIASDIFNNLNGRTHVSWNAMIL 423

Query: 429 GYTQNGR 436
           GY QNGR
Sbjct: 424 GYAQNGR 430

BLAST of Cp4.1LG09g04680 vs. NCBI nr
Match: gi|659090281|ref|XP_008445931.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X2 [Cucumis melo])

HSP 1 Score: 774.6 bits (1999), Expect = 9.0e-221
Identity = 382/427 (89.46%), Positives = 404/427 (94.61%), Query Frame = 1

Query: 9   SSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMK 68
           SS+P IAI KAHNRTPLK SSI+I SSPLP HTLSERTHIPSHVY+HPAAVLLELCTSMK
Sbjct: 4   SSVPPIAIVKAHNRTPLKSSSITIPSSPLPFHTLSERTHIPSHVYKHPAAVLLELCTSMK 63

Query: 69  ELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYA 128
           ELHQI+PLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPI++K+D LYHTMLKGYA
Sbjct: 64  ELHQIIPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIDDKLDALYHTMLKGYA 123

Query: 129 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVF 188
           KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRG+EIHG LI NSFGANVF
Sbjct: 124 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGKEIHGQLITNSFGANVF 183

Query: 189 AMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEG 248
           AMTGVVNMYAKCRQIDDAYKMFDRMP RDLVSWNTII GFSQNGFA KALELVL MQDEG
Sbjct: 184 AMTGVVNMYAKCRQIDDAYKMFDRMPERDLVSWNTIIAGFSQNGFAKKALELVLRMQDEG 243

Query: 249 QRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTAR 308
           QRPDSITLVTVLPAAAD+GSLMVGKSIHGYAIRAGF+KLVNISTAL DMYSKCGSV+TAR
Sbjct: 244 QRPDSITLVTVLPAAADVGSLMVGKSIHGYAIRAGFAKLVNISTALADMYSKCGSVETAR 303

Query: 309 LIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLG 368
           LIFDGM+QKTVVSWNSMM GYVQ+GEPE AIA+FEKML+EGI+PT+VTIMEALHACADLG
Sbjct: 304 LIFDGMDQKTVVSWNSMMDGYVQNGEPEKAIAVFEKMLEEGIDPTSVTIMEALHACADLG 363

Query: 369 DFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMIL 428
           D E GKFVHKFVD+LNLGSD+S+MNSLISMYSKC R DIASDIF NL+ +T VSWNAMIL
Sbjct: 364 DLERGKFVHKFVDQLNLGSDISVMNSLISMYSKCKRADIASDIFNNLNGRTHVSWNAMIL 423

Query: 429 GYTQNGR 436
           GY QNGR
Sbjct: 424 GYAQNGR 430

BLAST of Cp4.1LG09g04680 vs. NCBI nr
Match: gi|778704342|ref|XP_004147126.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X1 [Cucumis sativus])

HSP 1 Score: 767.7 bits (1981), Expect = 1.1e-218
Identity = 379/427 (88.76%), Positives = 401/427 (93.91%), Query Frame = 1

Query: 9   SSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVLLELCTSMK 68
           SS+P IAI KAHN+TPLK SSI+I SSPLP HTLSER HIPSHVY+HPAAVLLELCTSMK
Sbjct: 4   SSVPPIAIVKAHNQTPLKSSSITIPSSPLPFHTLSERAHIPSHVYKHPAAVLLELCTSMK 63

Query: 69  ELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLYHTMLKGYA 128
           ELHQI+PLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPI++K+D LYHTMLKGYA
Sbjct: 64  ELHQIIPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIDDKLDALYHTMLKGYA 123

Query: 129 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIKNSFGANVF 188
           KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRG+EIHG LI NSF ANVF
Sbjct: 124 KNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGKEIHGQLITNSFAANVF 183

Query: 189 AMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALELVLSMQDEG 248
           AMTGVVNMYAKCRQIDDAYKMFDRMP RDLVSWNTII GFSQNGFA KALELVL MQDEG
Sbjct: 184 AMTGVVNMYAKCRQIDDAYKMFDRMPERDLVSWNTIIAGFSQNGFAKKALELVLRMQDEG 243

Query: 249 QRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSKCGSVDTAR 308
           QRPDSITLVTVLPAAAD+G LMVGKSIHGYAIRAGF+KLVNISTAL DMYSKCGSV+TAR
Sbjct: 244 QRPDSITLVTVLPAAADVGLLMVGKSIHGYAIRAGFAKLVNISTALADMYSKCGSVETAR 303

Query: 309 LIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEALHACADLG 368
           LIFDGM+QKTVVSWNSMM GYVQ+GEPE AIA+FEKML+EGI+PT VTIMEALHACADLG
Sbjct: 304 LIFDGMDQKTVVSWNSMMDGYVQNGEPEKAIAVFEKMLEEGIDPTGVTIMEALHACADLG 363

Query: 369 DFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTLVSWNAMIL 428
           D E GKFVHKFVD+LNLGSD+S+MNSLISMYSKC RVDIASDIF NL+ +T VSWNAMIL
Sbjct: 364 DLERGKFVHKFVDQLNLGSDISVMNSLISMYSKCKRVDIASDIFNNLNGRTHVSWNAMIL 423

Query: 429 GYTQNGR 436
           GY QNGR
Sbjct: 424 GYAQNGR 430

BLAST of Cp4.1LG09g04680 vs. NCBI nr
Match: gi|566163472|ref|XP_002303960.2| (hypothetical protein POPTR_0003s19010g [Populus trichocarpa])

HSP 1 Score: 634.0 bits (1634), Expect = 1.9e-178
Identity = 306/434 (70.51%), Positives = 361/434 (83.18%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSS LLP ++ P              P  I   +SPL  HTLS+RTHIPSH+Y+HPAA+L
Sbjct: 1   MSSHLLPFTATP--------------PPQIPSKASPLAQHTLSQRTHIPSHIYKHPAAIL 60

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LELCTS KE+HQILP +IKNGLYNE LFQTKL+SLF KYG++ EA+RVFEPIE+K D LY
Sbjct: 61  LELCTSSKEVHQILPQIIKNGLYNETLFQTKLISLFCKYGNLTEASRVFEPIEDKFDALY 120

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKGYAK+SSL++AL+F  RM++D V+PVVYNFTYLLK+CGDN+DLKRG+EIHG++I 
Sbjct: 121 HTMLKGYAKSSSLDSALSFFSRMKHDSVRPVVYNFTYLLKLCGDNSDLKRGKEIHGSVIT 180

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           + F  N+FAMTGVVNMYAKCRQI+DAY MFDRMP RDLV WNT+I+G++QNGFA  AL L
Sbjct: 181 SGFSWNLFAMTGVVNMYAKCRQINDAYNMFDRMPERDLVCWNTMISGYAQNGFAKVALML 240

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           VL M +EG RPDSIT+V++LPA AD   L +G ++HGY +RAGF  LVN+STALVDMYSK
Sbjct: 241 VLRMSEEGHRPDSITIVSILPAVADTRLLRIGMAVHGYVLRAGFESLVNVSTALVDMYSK 300

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           CGSV  AR+IFDGM+ +TVVSWNSM+ GYVQSG+ E A+ IF+KMLDEG++PTNVT+M A
Sbjct: 301 CGSVSIARVIFDGMDHRTVVSWNSMIDGYVQSGDAEGAMLIFQKMLDEGVQPTNVTVMGA 360

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E GKFVHK VD+L L SDVS+MNSLISMYSKC RVDIA+DIFKNL  KTL
Sbjct: 361 LHACADLGDLERGKFVHKLVDQLKLDSDVSVMNSLISMYSKCKRVDIAADIFKNLRNKTL 420

Query: 421 VSWNAMILGYTQNG 435
           VSWNAMILGY QNG
Sbjct: 421 VSWNAMILGYAQNG 420

BLAST of Cp4.1LG09g04680 vs. NCBI nr
Match: gi|743817748|ref|XP_011020490.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g11290 [Populus euphratica])

HSP 1 Score: 634.0 bits (1634), Expect = 1.9e-178
Identity = 306/434 (70.51%), Positives = 360/434 (82.95%), Query Frame = 1

Query: 1   MSSQLLPVSSLPSIAIAKAHNRTPLKPSSISISSSPLPSHTLSERTHIPSHVYRHPAAVL 60
           MSS LLP ++ P              P  I   +SPL  HTLS+RTHIPSH+Y+HPAA+L
Sbjct: 42  MSSHLLPFTATP--------------PPQIPSKASPLAQHTLSQRTHIPSHIYKHPAAIL 101

Query: 61  LELCTSMKELHQILPLVIKNGLYNEHLFQTKLVSLFSKYGSINEAARVFEPIEEKIDVLY 120
           LELCTS KELHQILP +IKNG+YNE LFQTKL+SLF KYG++ EA+RVFEPIE+K D LY
Sbjct: 102 LELCTSSKELHQILPQIIKNGIYNETLFQTKLISLFCKYGNLTEASRVFEPIEDKFDALY 161

Query: 121 HTMLKGYAKNSSLETALAFLCRMRYDDVKPVVYNFTYLLKVCGDNADLKRGREIHGNLIK 180
           HTMLKGYAK+SSL+ AL+F  RM++D V+PVVYNFTYLLK+CGDN+DLKRG+EIHG++I 
Sbjct: 162 HTMLKGYAKSSSLDCALSFFSRMKHDSVRPVVYNFTYLLKLCGDNSDLKRGKEIHGSVIT 221

Query: 181 NSFGANVFAMTGVVNMYAKCRQIDDAYKMFDRMPVRDLVSWNTIITGFSQNGFANKALEL 240
           + F  N+FAMTGVVNMYAKCRQI+DAY MFDRMP RDLV WNTII+G++QNGFA  AL L
Sbjct: 222 SGFSWNLFAMTGVVNMYAKCRQINDAYNMFDRMPERDLVCWNTIISGYAQNGFAKAALML 281

Query: 241 VLSMQDEGQRPDSITLVTVLPAAADIGSLMVGKSIHGYAIRAGFSKLVNISTALVDMYSK 300
           VL M +EG RPDSIT+V++LPA AD   L +G ++HGY +RAGF  LVN+STALVDMYSK
Sbjct: 282 VLRMSEEGHRPDSITIVSILPAVADTRLLRIGMAVHGYVLRAGFESLVNVSTALVDMYSK 341

Query: 301 CGSVDTARLIFDGMEQKTVVSWNSMMAGYVQSGEPEMAIAIFEKMLDEGIEPTNVTIMEA 360
           CGSV  AR+IFDGM+ +TVVSWNSM+ GYVQSG+ E A+ IF+KMLDEG++PTNVT+M A
Sbjct: 342 CGSVSIARVIFDGMDHRTVVSWNSMIDGYVQSGDAEGAMLIFQKMLDEGVQPTNVTVMGA 401

Query: 361 LHACADLGDFEMGKFVHKFVDKLNLGSDVSIMNSLISMYSKCNRVDIASDIFKNLHRKTL 420
           LHACADLGD E GKFVHK VD+L L SD+S+MNSLISMYSKC RVDIA+DIFKNL  KTL
Sbjct: 402 LHACADLGDLERGKFVHKLVDQLKLDSDLSVMNSLISMYSKCKRVDIAADIFKNLRNKTL 461

Query: 421 VSWNAMILGYTQNG 435
           VSWNAMILGY QNG
Sbjct: 462 VSWNAMILGYAQNG 461

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
PPR32_ARATH   2.3e-155   61.84     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PPR45_ARATH   2.6e-66    35.76     Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
PP224_ARATH   2.7e-63    32.37     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP320_ARATH   9.4e-61    31.56     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP214_ARATH   3.6e-60    31.28     Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th...
Match Name         E-value    Identity  Description
A0A0A0KPP6_CUCSA   7.7e-219   88.76     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G583260 PE=4 SV=1
B9GXA8_POPTR       1.3e-178   70.51     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s19010g PE=4 SV=2
F6I6N4_VITVI       5.0e-178   73.25     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g02100 PE=4 SV=...
M5X677_PRUPE       7.3e-177   71.92     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001444mg PE=4 SV=1
A0A067L904_JATCU   1.1e-169   67.97     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24583 PE=4 SV=1
Match Name    E-value    Identity  Description
AT1G11290.1   1.3e-156   61.84     Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1   1.4e-67    35.76     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1   1.5e-64    32.37     mitochondrial editing factor 22
AT4G18750.1   5.3e-62    31.56     Pentatricopeptide repeat (PPR) superfamily protein
AT3G05240.1   2.0e-61    31.28     mitochondrial editing factor 19
Match Name                         E-value    Identity  Description
gi|659090279|ref|XP_008445930.1|   9.0e-221   89.46     PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X1 [Cuc...
gi|659090281|ref|XP_008445931.1|   9.0e-221   89.46     PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X2 [Cuc...
gi|778704342|ref|XP_004147126.2|   1.1e-218   88.76     PREDICTED: pentatricopeptide repeat-containing protein At1g11290 isoform X1 [Cuc...
gi|566163472|ref|XP_002303960.2|   1.9e-178   70.51     hypothetical protein POPTR_0003s19010g [Populus trichocarpa]
gi|743817748|ref|XP_011020490.1|   1.9e-178   70.51     PREDICTED: pentatricopeptide repeat-containing protein At1g11290 [Populus euphra...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
Vocabulary: INTERPRO
Term         Definition
IPR011990    TPR-like_helical_dom_sf
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0016556       mRNA modification
biological_process   GO:0090305       nucleic acid phosphodiester bond hydrolysis
biological_process   GO:0006464       cellular protein modification process
biological_process   GO:0015031       protein transport
biological_process   GO:0008150       biological_process
cellular_component   GO:0009507       chloroplast
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0004519       endonuclease activity
molecular_function   GO:0005515       protein binding
molecular_function   GO:0008270       zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG09g04680.1   Cp4.1LG09g04680.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
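IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 292..317, score: 0.054
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 191..216, score: 0.0051
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 421..436, score: 0.061
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 392..418, score: 0
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2                coord: 318..357, score: 2.4E-10
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2                coord: 217..261, score: 2.
IPR002885  Pentatricopeptide repeat               PFAM      PF13812           PPR_3                coord: 119..165, score: 4.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 320..353, score: 1.5E-9
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 219..252, score: 5.1E-6
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 118..147, score: 2.6E-4
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 191..218, score: 0.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 217..251, score: 11.257
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 388..422, score: 8.89
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 116..150, score: 9.01
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 85..115, score: 5.24
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 318..352, score: 13.197
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 287..317, score: 6.511
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 186..216, score: 8.254
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 151..185, score: 5
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10                       coord: 316..436, score: 2.4E-4
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10                       coord: 196..217, score: 2.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 389..435, score: 1.5E-195
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 22..353, score: 1.5E
None       No IPR available                       PANTHER   PTHR24015:SF605   SUBFAMILY NOT NAMED  coord: 22..353, score: 1.5E-195
None       No IPR available                       PANTHER   PTHR24015:SF605   SUBFAMILY NOT NAMED  coord: 389..435, score: 1.5E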
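
The `coord` values in the InterPro annotations can be turned into hit lengths and spacing with simple arithmetic. The following rough Python sketch does this for the PS51375 (PPR profile) hits. It assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, which this record does not state explicitly. The helper names `parse_coord` and `summarize` are hypothetical.

```python
# PS51375 hit coordinates, copied from the InterPro annotation rows above.
PS51375_COORDS = [
    "217..251", "388..422", "116..150", "85..115",
    "318..352", "287..317", "186..216", "151..185",
]

def parse_coord(coord):
    """Split 'start..end' into two integers (assumed 1-based, inclusive)."""
    start, end = coord.split("..")
    return int(start), int(end)

def summarize(coords):
    """Return (start, end, length, gap_before) tuples sorted by start.

    gap_before is the number of residues between this hit and the previous
    one; it is None for the first hit and 0 when hits are directly adjacent.
    """
    rows = []
    prev_end = None
    for start, end in sorted(parse_coord(c) for c in coords):
        gap = None if prev_end is None else start - prev_end - 1
        rows.append((start, end, end - start + 1, gap))
        prev_end = end
    return rows

for start, end, length, gap in summarize(PS51375_COORDS):
    print(f"{start:>3}..{end:<3}  length={length}  gap_before={gap}")
```

With the coordinates listed in this record, every hit is 31 or 35 residues long and most hits directly follow the previous one, as expected for tandem repeats. A canonical PPR motif is about 35 residues.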

The following gene(s) are paralogous to this gene:

None