Cp4.1LG10g12020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g12020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG10 : 8306859 .. 8311332 (+)
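
The location gives a start and an end position on Cp4.1LG10. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention, so this is an assumption), the genomic span of the feature could be computed as below. The function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_span = span_length(8306859, 8311332)
print(gene_span)
```

Under that assumption the feature spans 4474 bp; a 0-based, half-open convention would give 4473 instead.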
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCGAAGCTCGCAAGCTTTTCGACGAAACTCCAACTAAAAATTCTATCACTTGGTCATCCCTGGTATCTGGATATTGCAGAAATGGGTGTGAAGTTGAAGGCTTGAGGCTGTTCAGCCAAATGTGGAGTGAGGGACAGAAGCCAAGTCAATATACATTGGGCAGTGTTTTACGAGCATGTTCGACTTTGGGTTTACTCCATAGTGGCAAAATGATTCATGGCTATGTAACAAAAATACAATTAGAAGCAAATATCTTCGTTGCTACCGGTCTCGTCGACATGTATTCCAAGTGTAAGTGTCTCCTGGAGGCTGAATACCTCTTTGTATCATTGTCTGATAGGAAAAACTATGTTCTATCGACCGCTATGCTCACCGGTTATGCTCAAAATGGCGAGAGTTTGAAGGCAATGCAGTGTTTTAAGGAGATGAGAATACAGGGAATGGAGTCTAACCATTTCACATTTCCCAGCATACTGACAGCATGTACAGCAATTTCAGCTTATGCTTTTGGTCAGCAGGTACATGGATGCATTATTTTGAGTGGTTTTGGTGCTAATGTTTATGTTCAAAGTGCATTAGTTGATATGTATGCGAAATGTGGCGACTTGAATAGTGCGAGGATGCTACTAAATATCATGGAAATCGATGATGTTGTATGCTGGAACTCGATGATTGTCGGGTGTGTGACACACGGACATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATTGTAATCGACGATTTCACGTACCCGTCTGTTTTGAAATCTCTGGGTACTTGCAGGGACCTGAAAAATGGAGAATCTGTTCATTCTCTGATAATGAAAACTGGTTTTGATGCCTGCAAAACAGTGAGCAATGCGCTTGTTGATATGTATGCTAAACAGGGAAACTTAAATTGTGCATTAGAGGTTTTCAATAAGATATCTGATAAAGATGTCATTTCTTGGACCTCCTTGGTCACGGGATATGTTCATAATGGCTTCCATGAAAAGGCTCTCAAGTTATTCTGTGACATGAGAATTGCAGGTGTTGATCTAGACCAATTCGTAATTGCCTGTGTCTTTAGTGCTTGTGCTGAACTAACGATTATCGAGTTTGGTCGACAGGTTCACGGAAACTTCATAAAATCAAGCGTTGGTTCACTGTTATCTGCTGAGAACTCTCTGATAACAATGTATGCCAAATGTGGATGCTTAGAAGATGCAACTCGAGTCTTTGACTCGATGGAAACTCGAAATGTCATATCATGGACTGCTATAATAGTTGGTTATGCACAGAATGGGAGAGGGAAGGACTCGCTTCGTTTTTACGACCGGATGATAATCGATGGCGTAAAGCCTGACCCTGTTACTTTCATTGGTTTGTTGTTTGCTTGCAGCCATGCAGGTCTAGTGGAAACTGGTCGATCTTACTTCGAATCAATGGAAAAGGTTTATGGAATAAAGCCGGGTTCTGATCATTATGCTTGCATGATTGATCTACTGGGACGTGCTGGAAAGCTTAACGAGGCAGAGGAGTTATTGAATCGAATGGACGTTGAGCCCGATGCAACCGTATGGAAGTCGTTACTTTCGGCATGTCGGGTTCATGGGAACTTAGAACTTGGAGAAAGGGCGGGGAAAAACCTCATTAAGTTGGAGCCTTTGAATTCTCTGCCATATGTTCTATTGTCCAATATGTTCTCTGTTGCTGGTAGATGGGAAGATGCAACATATATACGTAATTCAATGAAAAGAATGGGTATTAACAAGGAGCCTGGATATAGTTGGATTGAAATGAAGAGCCAAGTGCATTCATTTATATCAGAAGATAGAAGTCACCCTATGGCTGCTGAAATATATTCTAAGATTGATGAAATGATGATCTTAATAAAGGAAGCTGGGTATGTTCCCGATATGAACTTCGCGTTACGTGACATGGACGAAGAGGCTAAGGAACGTAGTTTAACATATCATAGCGAAAAGTTGGCTGTTGCGTT
TGGACTCCTTGCAGTCCCGAATGGAGCGCCGATTCGAATTTTCAAAAATCTTAGGGTATGTGGGGACTGTCACTCAGCCATGAAATATATATCTAGCGTTTTTAAGCGGCATGTTATTTTGAGAGACTTGAATTGTTTCCATCACTTCAAAGAGGGAAAATGTTCTTGTGGAGACTTCTGGTAAGGGAGGTGTTTCACCACTTTTTAAGGAGGATCGGAAACCAGCTAAGGCACCGTAGCATCCCCTAAAGGCATAACTGACAGGTTCTATTCTATCTGCCCATCTGAGTCATCCTGTGTCGTTGATCCACCTTGGAGAATGAAACTTAAATATATATCTAGCGTTTTTAAGCGGCATGTTCTTTTGAGAGGCTTGAATTGTTTCCATCACTTCAAAGAGGCAAAATGTTCTTGTGGAGACTTTTGGTAAGGGAGGTGTATCGGAACCAGCTAAGGGACCGTAGCGTCCCCTAAAAGCGTAACTGATAGGTTCTGTTCTATCTGCCCATCCGAGTCATCCCGTGTCGTTGTTGACCCACCTTAGAGAATGAAACTTAAATATATATCTAGCGTTTTTAAGCGGCCTGTTCTTTTGAGAGACTTGAATTGTTTCCATCACTTCAAAGAGGGAAAATGTTCTTGTGGAGACTTTTGGTAAGGGAGGTGTATCGGAAACAGCTAAGGGACCGTAGCGTCCCCTAAAGGCGTAACTGACAGGTTCTGTTCTATCTGCCCATCCGAGTCATCCCGTGTCGTTGTTGACCCACCTTAGAGAATGAAACTTAAATATATATCTAGCGTTTTTAAGCGGCATGTTCTTTTGAGAGACTTGAATTGTTTCCATCACTTCAAATAGGGAAAGTGTTCTTGTGGAGACTTCTGGTAAGGGAGGTGTATCGGAAGCCAGCTAAGACACTGTAGCGTCCCCTAAAGATGTAACTGACAGGTTCTGTTCTATTTGCCCATCCGAGTGATCCCGTGTCTTTGTTGACCTACCTTGGAGAATGAAACTACTATTTCTTGATGATACCTGAGTTTTCCTATTAGCACAACTCAAGCAAGCTAAGAATGGTGGTAGTCATTTCCTGATCCTGATAAGAAGTAACTCCTAAATCCTTCACTTTGAAGCTTAATCCTACCATCTCTTTGTCCAACAACTCCGTTCAGGCGAAACCGCGGTGGTCCGTTCAGACAGGATTCTTCACTAGGCTTGGCAATTCTTCACTAGGCTTGGCTGGCTGCTCCGGGAGAAGTCAAAAAGTGATGTCGTGAGTATCTTCTTAGGCTTCTCGAACGCTCGAGACAACCTTGCTATTATCGACGAGCTTCTTCTATACTCGAACCTTGTGGAAACTGATAAGAGTTCTCGACGAACTCAAAGAGGTCCATCTTCTATTCCTTTACTGCATTCTTCTTTGTTCATGGATTCTTAGATTATTTGAAGAACTTGTCAAAACTAATACATGTATTGGCAGTTCATTTGATTCTGGAGTCTTTTGAAAAAATCATTCATAAGCTGAATGAAAAAGTATAATTAATCATTGTGTTGAAAACTTTATAGGCATTGTTGATGTGTGATTTCGGGTTAAGGATCACTGTTAAGATCGTGATGAGCTTCCATGATGACATATTGGCAGGGAAGCGTAAGTCGGGTCGTGAAATAAAGGTAATGAGTTCACGAATTGTTTCACTGCTTTGCTTTGTTTTGGCATTCGGTTGAGCATTCTTTGTAAAGGTATGGAAACCTCTTTTTAGTAGACGCGTTTTAAAACTTTGAGGGAAAACCCAAAAGGACAATATCTGCTAGCGGTGGGCTTGAGCTGTTACATGTATGGGTTGAGAGACTCATTTTGTATTACGGAATATGATGGAGTGTAGCATTATGAACCTTAACTATATTTCGTCTTCATTTTTGTAATTTTGTAACGGAGATTTCTTTATTAGGTTTTGGCAAGCCTATTTCTGGCTTCCCCCATCTTGAAATTTCTTTGTTTCTTATG
TTTCGAGAACAGGAGATGTGAGATCTCCACATCGGTTAGGGAGGAGAACCAAGGTGTAGAAACCTCCTTTTCCTAGCAGCCCATTTTAAAATCGTGAGTCAGACGACAATACGTAATTGGCCAAAGCGAACAATATCTGCTATCAGTGGACTTGGGCTGTTACAAATGGTATAAGAGCTAGACATCGGGCGATGTGCCCGCAAGGAGGTTGAACCTCGAAGTGGGGTGGATTGGGAGTCCCACATCGATTGGAATGAGTGCCAACTACAACGTTGGACCCTGAAGGCAGTGAATTGTGAGATCCCACGAAGCATTCTTTATGAGAGCGTGGAAACCTTTCCCTACTAGATGCGTTATGAAAATTTTGAATAGTTCTTCATTTGTCATCTGATTTGTGCTATTCGGAATTGTCTTTTTGAACTCTATAAATGGAAGGTTTGCTATCATCGAAATATATGGTTTGATATGAGTTTG

mRNA sequence

ATGGTCGAAGCTCGCAAGCTTTTCGACGAAACTCCAACTAAAAATTCTATCACTTGGTCATCCCTGGTATCTGGATATTGCAGAAATGGGTGTGAAGTTGAAGGCTTGAGGCTGTTCAGCCAAATGTGGAGTGAGGGACAGAAGCCAAGTCAATATACATTGGGCAGTGTTTTACGAGCATGTTCGACTTTGGGTTTACTCCATAGTGGCAAAATGATTCATGGCTATGTAACAAAAATACAATTAGAAGCAAATATCTTCGTTGCTACCGGTCTCGTCGACATGTATTCCAAGTGTAAGTGTCTCCTGGAGGCTGAATACCTCTTTGTATCATTGTCTGATAGGAAAAACTATGTTCTATCGACCGCTATGCTCACCGGTTATGCTCAAAATGGCGAGAGTTTGAAGGCAATGCAGTGTTTTAAGGAGATGAGAATACAGGGAATGGAGGACCTGAAAAATGGAGAATCTGTTCATTCTCTGATAATGAAAACTGGTTTTGATGCCTGCAAAACAGTGAGCAATGCGCTTGTTGATATGTATGCTAAACAGGGAAACTTAAATTGTGCATTAGAGGTTTTCAATAAGATATCTGATAAAGATGTCATTTCTTGGACCTCCTTGGTCACGGGATATGTTCATAATGGCTTCCATGAAAAGGCTCTCAAGTTATTCTGTGACATGAGAATTGCAGGTGTTGATCTAGACCAATTCGTAATTGCCTGTGTCTTTAGTGCTTGTGCTGAACTAACGATTATCGAGTTTGGTCGACAGGCATTGTTGATGTGTGATTTCGGGTTAAGGATCACTGTTAAGATCGTGATGAGCTTCCATGATGACATATTGGCAGGGAAGCGTAAGTCGGGTCGTGAAATAAAGGAGATGTGAGATCTCCACATCGGTTAGGGAGGAGAACCAAGGTGTAGAAACCTCCTTTTCCTAGCAGCCCATTTTAAAATCGTGAGTCAGACGACAATACGTAATTGGCCAAAGCGAACAATATCTGCTATCAGTGGACTTGGGCTGTTACAAATGGTATAAGAGCTAGACATCGGGCGATGTGCCCGCAAGGAGGTTGAACCTCGAAGTGGGGTGGATTGGGAGTCCCACATCGATTGGAATGAGTGCCAACTACAACGTTGGACCCTGAAGGCAGTGAATTGTGAGATCCCACGAAGCATTCTTTATGAGAGCGTGGAAACCTTTCCCTACTAGATGCGTTATGAAAATTTTGAATAGTTCTTCATTTGTCATCTGATTTGTGCTATTCGGAATTGTCTTTTTGAACTCTATAAATGGAAGGTTTGCTATCATCGAAATATATGGTTTGATATGAGTTTG

Coding sequence (CDS)

ATGGTCGAAGCTCGCAAGCTTTTCGACGAAACTCCAACTAAAAATTCTATCACTTGGTCATCCCTGGTATCTGGATATTGCAGAAATGGGTGTGAAGTTGAAGGCTTGAGGCTGTTCAGCCAAATGTGGAGTGAGGGACAGAAGCCAAGTCAATATACATTGGGCAGTGTTTTACGAGCATGTTCGACTTTGGGTTTACTCCATAGTGGCAAAATGATTCATGGCTATGTAACAAAAATACAATTAGAAGCAAATATCTTCGTTGCTACCGGTCTCGTCGACATGTATTCCAAGTGTAAGTGTCTCCTGGAGGCTGAATACCTCTTTGTATCATTGTCTGATAGGAAAAACTATGTTCTATCGACCGCTATGCTCACCGGTTATGCTCAAAATGGCGAGAGTTTGAAGGCAATGCAGTGTTTTAAGGAGATGAGAATACAGGGAATGGAGGACCTGAAAAATGGAGAATCTGTTCATTCTCTGATAATGAAAACTGGTTTTGATGCCTGCAAAACAGTGAGCAATGCGCTTGTTGATATGTATGCTAAACAGGGAAACTTAAATTGTGCATTAGAGGTTTTCAATAAGATATCTGATAAAGATGTCATTTCTTGGACCTCCTTGGTCACGGGATATGTTCATAATGGCTTCCATGAAAAGGCTCTCAAGTTATTCTGTGACATGAGAATTGCAGGTGTTGATCTAGACCAATTCGTAATTGCCTGTGTCTTTAGTGCTTGTGCTGAACTAACGATTATCGAGTTTGGTCGACAGGCATTGTTGATGTGTGATTTCGGGTTAAGGATCACTGTTAAGATCGTGATGAGCTTCCATGATGACATATTGGCAGGGAAGCGTAAGTCGGGTCGTGAAATAAAGGAGATGTGA

Protein sequence

MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLSTAMLTGYAQNGESLKAMQCFKEMRIQGMEDLKNGESVHSLIMKTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIAGVDLDQFVIACVFSACAELTIIEFGRQALLMCDFGLRITVKIVMSFHDDILAGKRKSGREIKEM
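
The protein sequence corresponds to the coding sequence read codon by codon. As a rough sketch, assuming the standard nuclear genetic code and reading frame 1 (neither is stated on the page), the translation could be reproduced as follows. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGTCGAAGCTCGCAAG"))
```

Applied to the first six codons this yields MVEARK, the start of the listed protein; the final codons GAGATGTGA give EM followed by the stop.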
BLAST of Cp4.1LG10g12020 vs. Swiss-Prot
Match: PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 8.9e-47
Identity = 99/271 (36.53%), Positives = 159/271 (58.67%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           EA K+FD +  +NSITWS++V+GY +NG  +E ++LFS+M+S G KPS+YT+  VL ACS
Sbjct: 274 EACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACS 333

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
            +  L  GK +H ++ K+  E ++F  T LVDMY+K  CL +A   F  L +R +  L T
Sbjct: 334 DICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQER-DVALWT 393

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQG-----------------MEDLKNGESVHSLIMKT 182
           ++++GY QN ++ +A+  ++ M+  G                 +  L+ G+ VH   +K 
Sbjct: 394 SLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKH 453

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
           GF     + +AL  MY+K G+L     VF +  +KDV+SW ++++G  HNG  ++AL+LF
Sbjct: 454 GFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELF 513

Query: 243 CDMRIAGVDLDQFVIACVFSACAELTIIEFG 257
            +M   G++ D      + SAC+    +E G
Sbjct: 514 EEMLAEGMEPDDVTFVNIISACSHKGFVERG 543
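
The percentages in the HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the figures from the header above (the helper name `percent` is hypothetical):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


identity = percent(99, 271)    # identical residues / alignment length
positives = percent(159, 271)  # identical or conservatively substituted residues
print(identity, positives)
```

This reproduces the reported 36.53% identity and 58.67% positives for this HSP.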

BLAST of Cp4.1LG10g12020 vs. Swiss-Prot
Match: PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 6.6e-42
Identity = 99/275 (36.00%), Positives = 156/275 (56.73%), Query Frame = 1

Query: 4   ARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACST 63
           AR +FD    ++ I+W+S+++G  +NG EVE + LF Q+   G KP QYT+ SVL+A S+
Sbjct: 369 ARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASS 428

Query: 64  LGL-LHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNY--VL 123
           L   L   K +H +  KI   ++ FV+T L+D YS+ +C+ EAE LF    +R N+  V 
Sbjct: 429 LPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILF----ERHNFDLVA 488

Query: 124 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 183
             AM+ GY Q+ +  K ++ F  M  QG                    +  G+ VH+  +
Sbjct: 489 WNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAI 548

Query: 184 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 243
           K+G+D    VS+ ++DMY K G+++ A   F+ I   D ++WT++++G + NG  E+A  
Sbjct: 549 KSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFH 608

Query: 244 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFGRQ 259
           +F  MR+ GV  D+F IA +  A + LT +E GRQ
Sbjct: 609 VFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQ 639

BLAST of Cp4.1LG10g12020 vs. Swiss-Prot
Match: PP108_ARATH (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 5.2e-39
Identity = 87/273 (31.87%), Positives = 152/273 (55.68%), Query Frame = 1

Query: 14  KNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKMI 73
           K+S++W++++ G  +NG   E +  F +M  +G K  QY  GSVL AC  LG ++ GK I
Sbjct: 233 KDSVSWAAMIKGLAQNGLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQI 292

Query: 74  HGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLSTAMLTGYAQNGE 133
           H  + +   + +I+V + L+DMY KCKCL  A+ +F  +  +KN V  TAM+ GY Q G 
Sbjct: 293 HACIIRTNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMK-QKNVVSWTAMVVGYGQTGR 352

Query: 134 SLKAMQCFKEMRIQGME-----------------DLKNGESVHSLIMKTGFDACKTVSNA 193
           + +A++ F +M+  G++                  L+ G   H   + +G     TVSN+
Sbjct: 353 AEEAVKIFLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNS 412

Query: 194 LVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIAGVDLD 253
           LV +Y K G+++ +  +FN+++ +D +SWT++V+ Y   G   + ++LF  M   G+  D
Sbjct: 413 LVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPD 472

Query: 254 QFVIACVFSACAELTIIEFGRQ--ALLMCDFGL 268
              +  V SAC+   ++E G++   L+  ++G+
Sbjct: 473 GVTLTGVISACSRAGLVEKGQRYFKLMTSEYGI 504

BLAST of Cp4.1LG10g12020 vs. Swiss-Prot
Match: PP220_ARATH (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 5.2e-39
Identity = 93/273 (34.07%), Positives = 142/273 (52.01%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           +AR LF E  + + + W+ ++SG+ + GCE   +  F  M     K ++ TLGSVL A  
Sbjct: 279 DARLLFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIG 338

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
            +  L  G ++H    K+ L +NI+V + LV MYSKC+ +  A  +F +L + KN V   
Sbjct: 339 IVANLDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEAL-EEKNDVFWN 398

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQG-----------------MEDLKNGESVHSLIMKT 182
           AM+ GYA NGES K M+ F +M+  G                   DL+ G   HS+I+K 
Sbjct: 399 AMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKK 458

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
                  V NALVDMYAK G L  A ++F ++ D+D ++W +++  YV +    +A  LF
Sbjct: 459 KLAKNLFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLF 518

Query: 243 CDMRIAGVDLDQFVIACVFSACAELTIIEFGRQ 259
             M + G+  D   +A    AC  +  +  G+Q
Sbjct: 519 KRMNLCGIVSDGACLASTLKACTHVHGLYQGKQ 550

BLAST of Cp4.1LG10g12020 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 6.8e-39
Identity = 92/267 (34.46%), Positives = 142/267 (53.18%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           EARK+FD  P ++ ++W+++V+GY +NG     L +   M  E  KPS  T+ SVL A S
Sbjct: 188 EARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVS 247

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
            L L+  GK IHGY  +   ++ + ++T LVDMY+KC  L  A  LF  + +R N V   
Sbjct: 248 ALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLER-NVVSWN 307

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQGME-----------------DLKNGESVHSLIMKT 182
           +M+  Y QN    +AM  F++M  +G++                 DL+ G  +H L ++ 
Sbjct: 308 SMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVEL 367

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
           G D   +V N+L+ MY K   ++ A  +F K+  + ++SW +++ G+  NG    AL  F
Sbjct: 368 GLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYF 427

Query: 243 CDMRIAGVDLDQFVIACVFSACAELTI 253
             MR   V  D F    V +A AEL+I
Sbjct: 428 SQMRSRTVKPDTFTYVSVITAIAELSI 453

BLAST of Cp4.1LG10g12020 vs. TrEMBL
Match: A0A0A0L1C4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G554180 PE=4 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 5.8e-85
Identity = 169/273 (61.90%), Positives = 193/273 (70.70%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEARKLF+ETP KNSITWSSLVSGYC+NGCEVEGLR FSQMWS+GQKPSQYTLGSVLRA
Sbjct: 84  LVEARKLFNETPIKNSITWSSLVSGYCKNGCEVEGLRQFSQMWSDGQKPSQYTLGSVLRA 143

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTL LLH+GKMIH Y  KIQLEANIFVATGLVDMYSKCKCLLEAEYLF SL DRKNYV 
Sbjct: 144 CSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 203

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            TAMLTGYAQNGESLKA+QCFKEMR QGME                     G  VH  I+
Sbjct: 204 WTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 263

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
            +GF     V +ALVDMYAK G+L  A  + + +   DV+ W S++ G V +G+ E+AL 
Sbjct: 264 WSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALV 323

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFG 257
           LF  M    + +D F    V  + A    ++ G
Sbjct: 324 LFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIG 356

BLAST of Cp4.1LG10g12020 vs. TrEMBL
Match: M5X7G8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001951mg PE=4 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 9.9e-69
Identity = 139/265 (52.45%), Positives = 175/265 (66.04%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           EA++LFD TP+K  ITWSSL+SGYCRN CE E   LF QM  EG +PSQYTLGSVLR CS
Sbjct: 13  EAKQLFDATPSKTPITWSSLISGYCRNECESEAFVLFWQMQLEGHRPSQYTLGSVLRLCS 72

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
           TL LL SG+++HGYV K Q + N FV TGLVDMY+KCK + EAEYLF +L DRKN+VL T
Sbjct: 73  TLVLLQSGELVHGYVIKTQFDTNAFVVTGLVDMYAKCKRISEAEYLFETLPDRKNHVLWT 132

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQGMED---------------LKN--GESVHSLIMKT 182
            MLTGY+QNG+  KAM+CF++MR +G+E                L N  G  VH  I+++
Sbjct: 133 VMLTGYSQNGDGFKAMKCFRDMRAEGVESNQFTFPSILTASALILANSFGAQVHGCIVQS 192

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
           GF A   V +ALVDMY K G+ N A +    +   DV+SW S++ G V  GF E+AL LF
Sbjct: 193 GFGANVFVQSALVDMYVKCGDHNSAKKALKSMEVDDVVSWNSMIVGCVRQGFTEEALSLF 252

Query: 243 CDMRIAGVDLDQFVIACVFSACAEL 251
            +MR   + +D F    V ++ A L
Sbjct: 253 KEMRSRELKIDHFTYPSVLNSLAAL 277

BLAST of Cp4.1LG10g12020 vs. TrEMBL
Match: G7L1H0_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_7g076020 PE=4 SV=2)

HSP 1 Score: 257.3 bits (656), Expect = 2.3e-65
Identity = 132/265 (49.81%), Positives = 175/265 (66.04%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEAR+LFD    K+SITWSS++SGYC+ GC+VE   LF  M  EG K SQ+TLGSVLR 
Sbjct: 83  LVEARELFDGCSCKSSITWSSIISGYCKFGCKVEAFDLFRSMRLEGWKASQFTLGSVLRV 142

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLS-DRKNYV 120
           CS+LGL+ +G+MIHG+V K   E N+FV TGLVDMY+KCKC+ EAE+LF  L  DRKN+V
Sbjct: 143 CSSLGLIQTGEMIHGFVVKNGFEGNVFVVTGLVDMYAKCKCVSEAEFLFKGLEFDRKNHV 202

Query: 121 LSTAMLTGYAQNGESLKAMQCFKEMRIQGMEDLKN-----------------GESVHSLI 180
           L TAM+TGYAQNG+  KA++ F+ M  QG+E  +                  GE VH  I
Sbjct: 203 LWTAMVTGYAQNGDGYKAVEFFRYMHAQGVECNQYTFPTILTACSSVLARCFGEQVHGFI 262

Query: 181 MKTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKAL 240
           +K+GF +   V +ALVDMYAK G+L  A  +   + D DV+SW SL+ G+V +G  E+AL
Sbjct: 263 VKSGFGSNVYVQSALVDMYAKCGDLKNAKNMLETMEDDDVVSWNSLMVGFVRHGLEEEAL 322

Query: 241 KLFCDMRIAGVDLDQFVIACVFSAC 248
           +LF +M    + +D +    V + C
Sbjct: 323 RLFKNMHGRNMKIDDYTFPSVLNCC 347

BLAST of Cp4.1LG10g12020 vs. TrEMBL
Match: A0A061DTD9_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_005409 PE=4 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 3.3e-64
Identity = 124/267 (46.44%), Positives = 172/267 (64.42%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           + EA +LF E P K+SITW+SL+SGYCR G E+E   LF  M  EGQ+P+QYT+GS+LR 
Sbjct: 83  LTEAIELFKEIPMKSSITWNSLISGYCRGGMEIEAFDLFWGMQFEGQRPNQYTMGSILRL 142

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTLGLL  GK +HGYV K Q E+N +V TGLVDMY+KC C+LEAE LF  + D++N+V+
Sbjct: 143 CSTLGLLQRGKQVHGYVIKTQFESNDYVVTGLVDMYAKCNCILEAECLFKMMPDKRNHVM 202

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMEDLKN-----------------GESVHSLIM 180
            TA++ GY+QNGE+ KA++CF++M ++G+E  +                  G  VH  I 
Sbjct: 203 WTAIVAGYSQNGEAFKAIECFRDMLVEGVESNQFTFPSVLIACAAVKAGNVGAQVHGCIF 262

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
           ++GF+    V +ALVDMYAK  +L+ A+ V   +   DV+SW S++ G V  GF E+AL 
Sbjct: 263 RSGFETNVYVQSALVDMYAKCRDLDNAMRVLENMEVDDVVSWNSMIVGCVRQGFEEEALS 322

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAEL 251
           LF  M    + +D F    V +  A +
Sbjct: 323 LFRKMHARDMKMDSFTYPSVLNCFASM 349

BLAST of Cp4.1LG10g12020 vs. TrEMBL
Match: A0A0D2QPM6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G096200 PE=4 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.2e-63
Identity = 124/265 (46.79%), Positives = 173/265 (65.28%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           + EA +LF ETP K+SITW+ L+SGYC +G E E   LFS+M  EGQ+P+QYT+GS+LR 
Sbjct: 83  LTEAIQLFKETPIKSSITWNLLISGYCLHGMETEAFHLFSRMQFEGQRPNQYTMGSILRL 142

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTLGLL  GK +HGYV K Q E+N +V TGLVDMY+KC C+LEAEYLF  + +++N+V+
Sbjct: 143 CSTLGLLQRGKQVHGYVIKTQFESNDYVVTGLVDMYAKCNCILEAEYLFKMMPNKRNHVM 202

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQG-----------------MEDLKNGESVHSLIM 180
            TAM+ GY+QNGE+ KA++C+++M ++G                 ++    G  VHS I+
Sbjct: 203 WTAMVAGYSQNGEAFKAIECYRDMVVEGVASNQFTFPSVLTACAAVQARNFGTQVHSFIV 262

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
           ++GF+A   V +AL+DMYAK  +L+ AL V   +   DV+SW S++ G V  G  E+AL 
Sbjct: 263 RSGFEANVFVQSALIDMYAKCRDLDSALIVLENMEVDDVVSWNSMLVGCVRQGCEEEALS 322

Query: 241 LFCDMRIAGVDLDQFVIACVFSACA 249
           LF  M    + L  F    V +  A
Sbjct: 323 LFRKMHARDMKLGNFTYPSVLNCFA 347

BLAST of Cp4.1LG10g12020 vs. TAIR10
Match: AT3G61170.1 (AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 203.8 bits (517), Expect = 1.5e-52
Identity = 104/263 (39.54%), Positives = 157/263 (59.70%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           +A KLF   P KN+I+W++L+SGYC++G +VE   LF +M S+G KP++YTLGSVLR C+
Sbjct: 77  DAEKLFRSNPVKNTISWNALISGYCKSGSKVEAFNLFWEMQSDGIKPNEYTLGSVLRMCT 136

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
           +L LL  G+ IHG+  K   + ++ V  GL+ MY++CK + EAEYLF ++   KN V  T
Sbjct: 137 SLVLLLRGEQIHGHTIKTGFDLDVNVVNGLLAMYAQCKRISEAEYLFETMEGEKNNVTWT 196

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIMKT 182
           +MLTGY+QNG + KA++CF+++R +G +                   + G  VH  I+K+
Sbjct: 197 SMLTGYSQNGFAFKAIECFRDLRREGNQSNQYTFPSVLTACASVSACRVGVQVHCCIVKS 256

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
           GF     V +AL+DMYAK   +  A  +   +   DV+SW S++ G V  G   +AL +F
Sbjct: 257 GFKTNIYVQSALIDMYAKCREMESARALLEGMEVDDVVSWNSMIVGCVRQGLIGEALSMF 316

Query: 243 CDMRIAGVDLDQFVIACVFSACA 249
             M    + +D F I  + +  A
Sbjct: 317 GRMHERDMKIDDFTIPSILNCFA 339

BLAST of Cp4.1LG10g12020 vs. TAIR10
Match: AT2G33680.1 (AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 188.7 bits (478), Expect = 5.0e-48
Identity = 99/271 (36.53%), Positives = 159/271 (58.67%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           EA K+FD +  +NSITWS++V+GY +NG  +E ++LFS+M+S G KPS+YT+  VL ACS
Sbjct: 274 EACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACS 333

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
            +  L  GK +H ++ K+  E ++F  T LVDMY+K  CL +A   F  L +R +  L T
Sbjct: 334 DICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQER-DVALWT 393

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQG-----------------MEDLKNGESVHSLIMKT 182
           ++++GY QN ++ +A+  ++ M+  G                 +  L+ G+ VH   +K 
Sbjct: 394 SLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKH 453

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
           GF     + +AL  MY+K G+L     VF +  +KDV+SW ++++G  HNG  ++AL+LF
Sbjct: 454 GFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELF 513

Query: 243 CDMRIAGVDLDQFVIACVFSACAELTIIEFG 257
            +M   G++ D      + SAC+    +E G
Sbjct: 514 EEMLAEGMEPDDVTFVNIISACSHKGFVERG 543

BLAST of Cp4.1LG10g12020 vs. TAIR10
Match: AT4G33170.1 (AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 172.6 bits (436), Expect = 3.7e-43
Identity = 99/275 (36.00%), Positives = 156/275 (56.73%), Query Frame = 1

Query: 4   ARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACST 63
           AR +FD    ++ I+W+S+++G  +NG EVE + LF Q+   G KP QYT+ SVL+A S+
Sbjct: 369 ARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASS 428

Query: 64  LGL-LHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNY--VL 123
           L   L   K +H +  KI   ++ FV+T L+D YS+ +C+ EAE LF    +R N+  V 
Sbjct: 429 LPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILF----ERHNFDLVA 488

Query: 124 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 183
             AM+ GY Q+ +  K ++ F  M  QG                    +  G+ VH+  +
Sbjct: 489 WNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAI 548

Query: 184 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 243
           K+G+D    VS+ ++DMY K G+++ A   F+ I   D ++WT++++G + NG  E+A  
Sbjct: 549 KSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFH 608

Query: 244 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFGRQ 259
           +F  MR+ GV  D+F IA +  A + LT +E GRQ
Sbjct: 609 VFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQ 639

BLAST of Cp4.1LG10g12020 vs. TAIR10
Match: AT3G09040.1 (AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 162.9 bits (411), Expect = 3.0e-40
Identity = 93/273 (34.07%), Positives = 142/273 (52.01%), Query Frame = 1

Query: 3   EARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACS 62
           +AR LF E  + + + W+ ++SG+ + GCE   +  F  M     K ++ TLGSVL A  
Sbjct: 279 DARLLFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIG 338

Query: 63  TLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLST 122
            +  L  G ++H    K+ L +NI+V + LV MYSKC+ +  A  +F +L + KN V   
Sbjct: 339 IVANLDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEAL-EEKNDVFWN 398

Query: 123 AMLTGYAQNGESLKAMQCFKEMRIQG-----------------MEDLKNGESVHSLIMKT 182
           AM+ GYA NGES K M+ F +M+  G                   DL+ G   HS+I+K 
Sbjct: 399 AMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKK 458

Query: 183 GFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLF 242
                  V NALVDMYAK G L  A ++F ++ D+D ++W +++  YV +    +A  LF
Sbjct: 459 KLAKNLFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLF 518

Query: 243 CDMRIAGVDLDQFVIACVFSACAELTIIEFGRQ 259
             M + G+  D   +A    AC  +  +  G+Q
Sbjct: 519 KRMNLCGIVSDGACLASTLKACTHVHGLYQGKQ 550

BLAST of Cp4.1LG10g12020 vs. TAIR10
Match: AT1G68930.1 (AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 162.9 bits (411), Expect = 3.0e-40
Identity = 87/273 (31.87%), Positives = 152/273 (55.68%), Query Frame = 1

Query: 14  KNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKMI 73
           K+S++W++++ G  +NG   E +  F +M  +G K  QY  GSVL AC  LG ++ GK I
Sbjct: 233 KDSVSWAAMIKGLAQNGLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQI 292

Query: 74  HGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVLSTAMLTGYAQNGE 133
           H  + +   + +I+V + L+DMY KCKCL  A+ +F  +  +KN V  TAM+ GY Q G 
Sbjct: 293 HACIIRTNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMK-QKNVVSWTAMVVGYGQTGR 352

Query: 134 SLKAMQCFKEMRIQGME-----------------DLKNGESVHSLIMKTGFDACKTVSNA 193
           + +A++ F +M+  G++                  L+ G   H   + +G     TVSN+
Sbjct: 353 AEEAVKIFLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNS 412

Query: 194 LVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIAGVDLD 253
           LV +Y K G+++ +  +FN+++ +D +SWT++V+ Y   G   + ++LF  M   G+  D
Sbjct: 413 LVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPD 472

Query: 254 QFVIACVFSACAELTIIEFGRQ--ALLMCDFGL 268
              +  V SAC+   ++E G++   L+  ++G+
Sbjct: 473 GVTLTGVISACSRAGLVEKGQRYFKLMTSEYGI 504

BLAST of Cp4.1LG10g12020 vs. NCBI nr
Match: gi|659083162|ref|XP_008442216.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X2 [Cucumis melo])

HSP 1 Score: 324.7 bits (831), Expect = 1.7e-85
Identity = 169/274 (61.68%), Positives = 193/274 (70.44%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEAR+LF ETP KNSITWS+LVSGYC+NGCEVEGLRLFSQMWS+GQKPSQYTLGSVLRA
Sbjct: 22  LVEARRLFSETPIKNSITWSTLVSGYCKNGCEVEGLRLFSQMWSDGQKPSQYTLGSVLRA 81

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTL LLHSGKMIH Y  KIQLE NIFVATGLVDMYSKCKCLLEAEYLF SL DRKNYV 
Sbjct: 82  CSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 141

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            TAMLTGYAQNGESLKA+QCFKEMRIQGME                     G  VH  I+
Sbjct: 142 WTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 201

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
            +GF     V +ALVDMYAK G+L  A  + N +   DV+ W S++ G V +G+ E+AL 
Sbjct: 202 WSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEIDDVVCWNSMIVGCVTHGYMEEALV 261

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFGR 258
           LF  M    + +D F       + A    ++ G+
Sbjct: 262 LFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQ 295

BLAST of Cp4.1LG10g12020 vs. NCBI nr
Match: gi|659083154|ref|XP_008442211.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 324.7 bits (831), Expect = 1.7e-85
Identity = 169/274 (61.68%), Positives = 193/274 (70.44%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEAR+LF ETP KNSITWS+LVSGYC+NGCEVEGLRLFSQMWS+GQKPSQYTLGSVLRA
Sbjct: 81  LVEARRLFSETPIKNSITWSTLVSGYCKNGCEVEGLRLFSQMWSDGQKPSQYTLGSVLRA 140

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTL LLHSGKMIH Y  KIQLE NIFVATGLVDMYSKCKCLLEAEYLF SL DRKNYV 
Sbjct: 141 CSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 200

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            TAMLTGYAQNGESLKA+QCFKEMRIQGME                     G  VH  I+
Sbjct: 201 WTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 260

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
            +GF     V +ALVDMYAK G+L  A  + N +   DV+ W S++ G V +G+ E+AL 
Sbjct: 261 WSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEIDDVVCWNSMIVGCVTHGYMEEALV 320

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFGR 258
           LF  M    + +D F       + A    ++ G+
Sbjct: 321 LFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQ 354

BLAST of Cp4.1LG10g12020 vs. NCBI nr
Match: gi|778695095|ref|XP_011653924.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Cucumis sativus])

HSP 1 Score: 322.4 bits (825), Expect = 8.3e-85
Identity = 169/273 (61.90%), Positives = 193/273 (70.70%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEARKLF+ETP KNSITWSSLVSGYC+NGCEVEGLR FSQMWS+GQKPSQYTLGSVLRA
Sbjct: 84  LVEARKLFNETPIKNSITWSSLVSGYCKNGCEVEGLRQFSQMWSDGQKPSQYTLGSVLRA 143

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTL LLH+GKMIH Y  KIQLEANIFVATGLVDMYSKCKCLLEAEYLF SL DRKNYV 
Sbjct: 144 CSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 203

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            TAMLTGYAQNGESLKA+QCFKEMR QGME                     G  VH  I+
Sbjct: 204 WTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 263

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
            +GF     V +ALVDMYAK G+L  A  + + +   DV+ W S++ G V +G+ E+AL 
Sbjct: 264 WSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALV 323

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFG 257
           LF  M    + +D F    V  + A    ++ G
Sbjct: 324 LFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIG 356

BLAST of Cp4.1LG10g12020 vs. NCBI nr
Match: gi|700199701|gb|KGN54859.1| (hypothetical protein Csa_4G554180 [Cucumis sativus])

HSP 1 Score: 322.4 bits (825), Expect = 8.3e-85
Identity = 169/273 (61.90%), Positives = 193/273 (70.70%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           +VEARKLF+ETP KNSITWSSLVSGYC+NGCEVEGLR FSQMWS+GQKPSQYTLGSVLRA
Sbjct: 84  LVEARKLFNETPIKNSITWSSLVSGYCKNGCEVEGLRQFSQMWSDGQKPSQYTLGSVLRA 143

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTL LLH+GKMIH Y  KIQLEANIFVATGLVDMYSKCKCLLEAEYLF SL DRKNYV 
Sbjct: 144 CSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 203

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            TAMLTGYAQNGESLKA+QCFKEMR QGME                     G  VH  I+
Sbjct: 204 WTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 263

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
            +GF     V +ALVDMYAK G+L  A  + + +   DV+ W S++ G V +G+ E+AL 
Sbjct: 264 WSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALV 323

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAELTIIEFG 257
           LF  M    + +D F    V  + A    ++ G
Sbjct: 324 LFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIG 356
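
The Identity and Positives percentages in the HSP header above are the reported match counts divided by the number of aligned columns (273 here: four rows of 60 columns plus a final row of 33). The sketch below reproduces that arithmetic and recounts the identities in the last alignment row. The function names `percent` and `count_identities` are illustrative and are not part of any BLAST tool; positives are not recounted, because that would need the substitution matrix behind the `+` marks.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / alignment_length, 2)


def count_identities(query: str, subject: str) -> int:
    """Count aligned columns where both rows carry the same residue (gaps never count)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Whole-HSP counts as reported for the Cucumis sativus hit (KGN54859.1).
identity_pct = percent(169, 273)
positives_pct = percent(193, 273)

# Last alignment row of the same HSP (33 columns), copied from the record.
query_block = "LFCDMRIAGVDLDQFVIACVFSACAELTIIEFG"
sbjct_block = "LFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIG"
block_identities = count_identities(query_block, sbjct_block)

print(identity_pct, positives_pct, block_identities)
```

The two percentages agree with the 61.90% and 70.70% shown in the header, and the recount of the last row matches the eight letters on its midline.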

BLAST of Cp4.1LG10g12020 vs. NCBI nr
Match: gi|694425837|ref|XP_009340625.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 273.5 bits (698), Expect = 4.4e-70
Identity = 138/267 (51.69%), Positives = 177/267 (66.29%), Query Frame = 1

Query: 1   MVEARKLFDETPTKNSITWSSLVSGYCRNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA 60
           + EA++LFD TP+KNSITWSSL+SGYCRN CE E   LF +M  EG KPSQYTLGSVLR 
Sbjct: 95  LAEAKQLFDATPSKNSITWSSLISGYCRNECESEAFELFWRMQFEGHKPSQYTLGSVLRL 154

Query: 61  CSTLGLLHSGKMIHGYVTKIQLEANIFVATGLVDMYSKCKCLLEAEYLFVSLSDRKNYVL 120
           CSTLGLL  G ++HGY+TK Q + NIFV TGLVDMY+KCK + EAEYLFV+L DRKN+VL
Sbjct: 155 CSTLGLLQRGALVHGYMTKTQFDNNIFVVTGLVDMYAKCKRISEAEYLFVTLPDRKNHVL 214

Query: 121 STAMLTGYAQNGESLKAMQCFKEMRIQGMED-----------------LKNGESVHSLIM 180
            T MLTGY+QNG+  KAM+ F++MR +G+E                     G  VH  ++
Sbjct: 215 WTVMLTGYSQNGDGFKAMKFFRDMRAEGVESNHFTFPSVLTASASVLAHSFGAQVHGCVV 274

Query: 181 KTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALK 240
           ++G  A   V ++LVDMY K G+LN A +    +   DV+SW S++ G V  GF E+AL 
Sbjct: 275 QSGLGANVFVQSSLVDMYVKCGDLNSAKKALRSMEVDDVVSWNSMIVGCVRQGFAEEALS 334

Query: 241 LFCDMRIAGVDLDQFVIACVFSACAEL 251
           LF DMR   + +D F    V ++ A +
Sbjct: 335 LFKDMRSREMKIDHFTYPSVLNSFAAM 361

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
PP181_ARATH  8.9e-47  36.53         Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN...
PP347_ARATH  6.6e-42  36.00         Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana GN...
PP108_ARATH  5.2e-39  31.87         Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th...
PP220_ARATH  5.2e-39  34.07         Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
PPR32_ARATH  6.8e-39  34.46         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Match Name        E-value  Identity (%)  Description
A0A0A0L1C4_CUCSA  5.8e-85  61.90         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G554180 PE=4 SV=1
M5X7G8_PRUPE      9.9e-69  52.45         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001951mg PE=4 SV=1
G7L1H0_MEDTR      2.3e-65  49.81         Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_7g076020 PE...
A0A061DTD9_THECC  3.3e-64  46.44         Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0054...
A0A0D2QPM6_GOSRA  1.2e-63  46.79         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G096200 PE=4 SV=1
Match Name   E-value  Identity (%)  Description
AT3G61170.1  1.5e-52  39.54         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G33680.1  5.0e-48  36.53         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33170.1  3.7e-43  36.00         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09040.1  3.0e-40  34.07         Pentatricopeptide repeat (PPR) superfamily protein
AT1G68930.1  3.0e-40  31.87         pentatricopeptide (PPR) repeat-containing protein
Match Name                        E-value  Identity (%)  Description
gi|659083162|ref|XP_008442216.1|  1.7e-85  61.68         PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitoc...
gi|659083154|ref|XP_008442211.1|  1.7e-85  61.68         PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitoc...
gi|778695095|ref|XP_011653924.1|  8.3e-85  61.90         PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial ...
gi|700199701|gb|KGN54859.1|       8.3e-85  61.90         hypothetical protein Csa_4G554180 [Cucumis sativus]
gi|694425837|ref|XP_009340625.1|  4.4e-70  51.69         PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial-...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g12020.1  Cp4.1LG10g12020.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description                                 Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                                                coord: 203..233, score: 1.4E-7; coord: 122..148, score: 3.5E-5; coord: 175..201, score: 0.
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                                              coord: 14..61, score: 7.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756                                          coord: 122..149, score: 1.0E-4; coord: 17..50, score: 1.2E-7; coord: 175..202, score: 0.0016; coord: 203..236, score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                                                coord: 170..200, score: 6.939; coord: 85..115, score: 5.492; coord: 117..151, score: 8.703; coord: 201..235, score: 10.786; coord: 15..49, score: 12
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED                                   coord: 3..258, score: 3.8E
None       No IPR available          PANTHER   PTHR24015:SF894  PENTATRICOPEPTIDE (PPR) REPEAT-CONTAINING PROTEIN  coord: 3..258, score: 3.8E

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                    Block
Cp4.1LG10g12020  Lsi01G019470    Bottle gourd (USVL1VR-Ls)   cpelsiB055
Cp4.1LG10g12020  MELO3C009001.2  Melon (DHL92) v3.6.1        cpemedB095
Cp4.1LG10g12020  CsaV3_4G031220  Cucumber (Chinese Long) v3  cpecucB0076
Cp4.1LG10g12020  Bhi03G002219    Wax gourd                   cpewgoB0104
Cp4.1LG10g12020  CsGy4G019480    Cucumber (Gy14) v2          cgybcpeB464
Cp4.1LG10g12020  Carg25730       Silver-seed gourd           carcpeB1196
The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG10g12020  Cp4.1LG19g00490  Cucurbita pepo (Zucchini)  cpecpeB079