Cp4.1LG01g16670 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g16670
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing family protein
Location: Cp4.1LG01 : 10362629 .. 10367304 (-)
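
The location string above can be split into its parts and used to derive the span of the feature. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive (as in GFF-style annotation), and the function name `parse_location` is purely illustrative.

```python
import re

# Pattern for strings such as "Cp4.1LG01 : 10362629 .. 10367304 (-)".
LOCATION = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Return (sequence id, start, end, strand, length) for a location string.

    Assumption: coordinates are 1-based and inclusive, so the length is
    end - start + 1.
    """
    match = LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seq_id, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return seq_id, start, end, strand, end - start + 1


print(parse_location("Cp4.1LG01 : 10362629 .. 10367304 (-)"))
# ('Cp4.1LG01', 10362629, 10367304, '-', 4676)
```

The `(-)` strand means the gene is read from the reverse complement of the reference sequence over this interval.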
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTTGCAGCGCTTCTTTAGCCTTCCTCGAATCTTCTCCCGTCTCGCAGGACCTTTACTTGAAATGGGGCACCATGTTTCTTATTCCACATATGCTTCCCAGCCCCCACTCTCCGACCTCGATCAAACCATGGCTTCAGTACAGTTTATTTCCTTACATTCCTGCTTTTATTTCATGGGACTTTGCAGGAACATTGATACTCTCATCAAGTTCCATGGGTTGCTCATAGTTCACGGCCTTGTCGGTAATCTTCTTTGTGATACTAAGTTAGTCGGTGTTTATGGCGCACTTGGGGATGTGGGTTCTGCTCGCATGGTGTTCGATCAAATGCGCGACCCAGACTTTTATGCGTGGAAGGTAATGATCAGGTGGTATTTCTTGAATGACATGTTTGCGAGTATTATTCCGTTCTATAATCGCATGCGAATGTCATTTAAGGAATGTGATAACATTATTTTTTCAATTATTTTGAAAGCGTGTAGCGAATTGCGTGAAATTGATGAAGGGAGGAAGGTCCATTGCCAGATTGTGAAGGTGGGGGGTCCGGATAGTTTTGTATTGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGATTGAGTGGTCAAGCGCTGTGTTTGAAGGAATTATCGATAAGAATGTGGTTTCTTGGACTACAATGATTGCGGGATATGTACAAAATGATTGTGCAGAAGAGGGTCTGGTTTTATTCAATCGGATGAGAGAATCATTGGTCGAAAGCAACCAATTTACTTTAGGGAGCATAATAACTGCTTGTACAAGATTAAGAGCTTTGCATCAGGGGAAATGGGTACATGGCTATGCCATTAAGAACGTTATTATTGAACTTAACTCTTTTTTGGCGACAGCTTTCTTGGACATGTATGTTAAATGTGGGCAAACAAGAGATGCTCACATGATATTTGACGAGCTACCTAGTATTGATCTCGTTTCATGGACTGCAATGATCGTTGGATATTCCCAAGCTGGCCAACCCAACGAGGCATTGAGGCTTTTCACTGATAAAATAAGGTCTGGTCTCTTACCTAATTCGGTCACTGCTGCAAGTATTCTTTCGGCATGTTCGGTGTCTGGTAATTTAAGTATCGGAATGTTAGTTCATGGACTTGGGATTAAACTTGGGCTGGAAGAGTGTGCAGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATATGATTGACGATGCTTATGTTGTATTTCTTGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCAATGATATCTGGGTATGCTCAGAGTGGATCTGCATATGATGCCCTCCGTCTCTTTAATCAAATGAGATCGGACTCCCTTGCACCTGACGCAATAACTCTGGTGAGCGCCCTTTCAGCATCTGCCATCCTAGGTGCTGTACAGGTTGGTTCATCACTTCATGCTTACTCGATTAAAGAAGGCTTGTTTTCATCAAATCTTTACATTGGCACTGCACTTTTGAACTTGTATGCCAAATGTGGCGATGCTAAATCAGCACGTATGGTGTTCGACAGTATGGGAGATAAGAATATTATCACATGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGGAAGTGGATCCCTTGCCATTTTCTCCGACATGTTGAAGGAGGATTTAAAACCTAATGACGTAATTTTCACGACGGTATTATCTGCTTGTAGCTATTCCGGGATGGTTGAAGAGGGATGGAGATATTTCAAATCTATGAGTCAGGATTATAACTATGCGCCTTCCATGAAACACTATGCCTGTATGGTTGATCTTTTATCCCGATCTGGTAGACTGGAGGAAGCATTGGACTTTATTAAGAAAATGCCAGTTCAACCAGATATTAGTTTGTATGGAGCTTTTCTTCATGGATGTGGATTATACTCGAGGTTTGATCTTGGAGAAGTCATAGTCAGAGAAATGCTAGAGCTTCATCCAAATGAAGCTAGCCTTTATGTGCTTTTATCTAACCTGTATGC
TTCAGATGGGAGATGGGGCCAAGTTAATGAGGTGAGAGATTTGATGCTACGGAGAGGATTGAAAAAGGTTCCAGGGTATAGCCTAGTAGAAACTAATGCAGGTGTGCCATTTCATTAGTTGAAGTGACTTGTGTGGTTCCTATCTTCATGTTTTCGTTGAGATAATCAAGATACTTTACCCAGTGAGGCTCTATTATATGGATCTATGACATTTTCTTAATGTCCTTGTCCTCTCCTTCACTGGAGTTTCGACAAGCCTTGTCGAATGGGGAAACGTTGGATACTATCAATCTTCTATCCTTGCGTGGAGAGTTTTGGTGGATGGAAAGAAGGATGTTCGTATCTAGAATCCTGTCCTTTTGAATGGTTCTCATGTAAATCCTTTTTTTCTTTTGAATGGCTCTCATGATTCTCATGCACTCTAATAAAATCACGGTGGTGACTTTGTGATGTTTTGTCGTGAATGGATGTCTTGGAGGGCAAGGAGTTGCATACTCAATAATTGTCTAGGATTGTAGCAGTGAAGTACACAATTATCTTCTCATTGACAAGATTCTTCTGCAAACACAACTTACACTTAAGCTGATTGTACACATCTTGCAAAGGACTTGAGTTAAAGGGAATTCGATTCATTACATCGAGCTAACTAGAGCGGCATCCACCAGGACTAATTGGCCAGACATACTACAGTGACATCGTCAATTCTAAAAGTGAGATTTCATTTTCGTAATTTTTCCTTTTCGAATATTAAGCATGTTCTGATGAGCTGAATAAAATTTCAAAACTCATACTAGGCTTGGCCGCTTGTCCAATTTTCAGTAGAAAGATCTTATTTCCTGCTGACTTCTGATTGGATATTTCTTGTTCTAATTGGTAGCGTTGAAGTGCCTGTTTGGGCATTATTTAAATATTTCTCTGAATTTTCAATATGGAGATTTTTTAAAATTCATCGTCATAGCTTAGAGATTTGAGCTTTTGGAATTAGTAGTAGTTTAACAAACGTGGCACCATCTGGTACATTCTTAGCATTCTATTGTATGATCGAGGATTGTTGGGAGGGAGTCCCACATTGACTAATTTAGGGAATGATCATGGGTTTATAAGTAAGGAATACATCTCCATTGGTATGAGGCCTTTCAGAGAAGCCCAAAGCAAAGCTATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTGTAAAGAGTCGTTATTCCTAACACTCTTCTGTCTCTAAATCACCAACTTATTATGTATCTACTAATCTGGGATTGCTGAATAAAGTTTTCCTTTGCCAATTTTGATGTGCTTGATCGGGCCAGTGACATTTTTACTTGTATATCTTCATTATGAGCTCATACAGGCCGACTTTTGACCTCTTTTAGACTTTGGCAACCTTTCTCTTTGCAGGCAAAGTGTTTTGCTTGATTCTTTTGCTTCTTGATCAGAACGTGAAATTGACTGCAATTTACTAAAAGCCTTGCACTGTGGAGGACTAATGTTTTACAACTCTGAGTGGCAGAGACTTCCTTTTACCTCTCTAACAGTATTGCAAACAGTGAAATAATGCTTACAAAACAACAACTTGAGTTCCTTTGTGGGATCAATTCATCAGGTACTATTGACTCTGTTAACATCTCTAACGGAAAAGACCAATCGGTTAAGGTTCTTAATAGATTTAGCTCAATTTATAAGGATCATTGAAACTTTTTGTAGTTTGAATTTGAAAATCGTATCCCAGTTCTTAAAATATATATACTTGTGCTTGATGCAAACATTAGTGACAACTTTTCTTCTATTGCTGACAGTGGTCGAGCCAAGGAAGATATGAAACCTATCCTTAAGTATTCATATAATTTCTTGCAGAGTGGACCAGAGAACTCCGAAAGATCATACGGAAGGCAACGGTGATGAGCAACCTTCAATCATTCAAGTATTTTACTATTTTTAAACTTGATTTGTATTGAAGTGCAAAATTCCTGTATGACCACTTTATGAACT
ATGATAATTTATTGATTGATGCTATGCCATCTCCTTTTCAGCTCAGAATACAAATGATTCAAGTGTAGGAAGCTGTCCATCATCTCAAGTTTCAAGCATTCTGTACACTAGGAATAAAGCTCAAGACGTGGCTCATTTGGCTCAGGAAACAGATCTCCACATTCTTCTGATTTAAGGTTAGCACTCTACCTAGATACCATGGAAGAGAAGATATACTTTCTGAACTTACACTTGGAGAACCATACTCCATCAATTGTAGTTATAAAACTACTACGAAAGCTGAAGAACAATGCCGTGTGGAGGTAGGAGGAGGCCTACAGGCCGTGAGCCGCCTATCCCGAAAGAAAATGGGATAATAGAAAATACGGGATTCTTCATTACACTTGGATTGAAGCTAACAACAATCATAATTCAGGACTTCTAGGTAAACCCTTTTAGAGGAAATTGAAAGGTTGTGTATGTTGGGAATCGCGAACCTACATAATGACAAAGCATAAACTCTCATAGTTTTATCTTTAGTTTTTCCAAAATACTTCGTACCAATGAAGATGTATTTCCTTACTTATAAACTTATGATCAACCCCTTAATTAGCCGATGTGGGACTTCTCTCCCAACAATCTTTAACCGTGTAAAGTACAAAGAAATTCACACTCTGTCTGGGTTTCTTTGCAGTAC

mRNA sequence

ATGTTGCAGCGCTTCTTTAGCCTTCCTCGAATCTTCTCCCGTCTCGCAGGACCTTTACTTGAAATGGGGCACCATGTTTCTTATTCCACATATGCTTCCCAGCCCCCACTCTCCGACCTCGATCAAACCATGGCTTCAGTACAGTTTATTTCCTTACATTCCTGCTTTTATTTCATGGGACTTTGCAGGAACATTGATACTCTCATCAAGTTCCATGGGTTGCTCATAGTTCACGGCCTTGTCGGTAATCTTCTTTGTGATACTAAGTTAGTCGGTGTTTATGGCGCACTTGGGGATGTGGGTTCTGCTCGCATGGTGTTCGATCAAATGCGCGACCCAGACTTTTATGCGTGGAAGGTAATGATCAGGTGGTATTTCTTGAATGACATGTTTGCGAGTATTATTCCGTTCTATAATCGCATGCGAATGTCATTTAAGGAATGTGATAACATTATTTTTTCAATTATTTTGAAAGCGTGTAGCGAATTGCGTGAAATTGATGAAGGGAGGAAGGTCCATTGCCAGATTGTGAAGGTGGGGGGTCCGGATAGTTTTGTATTGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGATTGAGTGGTCAAGCGCTGTGTTTGAAGGAATTATCGATAAGAATGTGGTTTCTTGGACTACAATGATTGCGGGATATGTACAAAATGATTGTGCAGAAGAGGGTCTGGTTTTATTCAATCGGATGAGAGAATCATTGGTCGAAAGCAACCAATTTACTTTAGGGAGCATAATAACTGCTTGTACAAGATTAAGAGCTTTGCATCAGGGGAAATGGGTACATGGCTATGCCATTAAGAACGTTATTATTGAACTTAACTCTTTTTTGGCGACAGCTTTCTTGGACATGTATGTTAAATGTGGGCAAACAAGAGATGCTCACATGATATTTGACGAGCTACCTAGTATTGATCTCGTTTCATGGACTGCAATGATCGTTGGATATTCCCAAGCTGGCCAACCCAACGAGGCATTGAGGCTTTTCACTGATAAAATAAGGTCTGGTCTCTTACCTAATTCGGTCACTGCTGCAAGTATTCTTTCGGCATGTTCGGTGTCTGGTAATTTAAGTATCGGAATGTTAGTTCATGGACTTGGGATTAAACTTGGGCTGGAAGAGTGTGCAGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATATGATTGACGATGCTTATGTTGTATTTCTTGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCAATGATATCTGGGTATGCTCAGAGTGGATCTGCATATGATGCCCTCCGTCTCTTTAATCAAATGAGATCGGACTCCCTTGCACCTGACGCAATAACTCTGGTGAGCGCCCTTTCAGCATCTGCCATCCTAGAGACTTCCTTTTACCTCTCTAACAGTATTGCAAACAGTGAAATAATGCTTACAAAACAACAACTTGAGTTCCTTTGTGGGATCAATTCATCAGCTCAGAATACAAATGATTCAAGTGTAGGAAGCTGTCCATCATCTCAAGTTTCAAGCATTCTGTACACTAGGAATAAAGCTCAAGACGTGGCTCATTTGGCTCAGGAAACAGATCTCCACATTCTTCTGATTTAAGGTTAGCACTCTACCTAGATACCATGGAAGAGAAGATATACTTTCTGAACTTACACTTGGAGAACCATACTCCATCAATTGTAGTTATAAAACTACTACGAAAGCTGAAGAACAATGCCGTGTGGAGGTAGGAGGAGGCCTACAGGCCGTGAGCCGCCTATCCCGAAAGAAAATGGGATAATAGAAAATACGGGATTCTTCATTACACTTGGATTGAAGCTAACAACAATCATAATTCAGGACTTCTAGGTAAACCCTTTTAGAGGAAATTGAAAGGTTGTGTATGTTGGGAATCGCGAACCTACATAATGACAAAGCATAAACTCTCATAGTTTTATCTTTAGTTTTTCCAAAATACTTCGTACCAATGAAGATGT
ATTTCCTTACTTATAAACTTATGATCAACCCCTTAATTAGCCGATGTGGGACTTCTCTCCCAACAATCTTTAACCGTGTAAAGTACAAAGAAATTCACACTCTGTCTGGGTTTCTTTGCAGTAC

Coding sequence (CDS)

ATGTTGCAGCGCTTCTTTAGCCTTCCTCGAATCTTCTCCCGTCTCGCAGGACCTTTACTTGAAATGGGGCACCATGTTTCTTATTCCACATATGCTTCCCAGCCCCCACTCTCCGACCTCGATCAAACCATGGCTTCAGTACAGTTTATTTCCTTACATTCCTGCTTTTATTTCATGGGACTTTGCAGGAACATTGATACTCTCATCAAGTTCCATGGGTTGCTCATAGTTCACGGCCTTGTCGGTAATCTTCTTTGTGATACTAAGTTAGTCGGTGTTTATGGCGCACTTGGGGATGTGGGTTCTGCTCGCATGGTGTTCGATCAAATGCGCGACCCAGACTTTTATGCGTGGAAGGTAATGATCAGGTGGTATTTCTTGAATGACATGTTTGCGAGTATTATTCCGTTCTATAATCGCATGCGAATGTCATTTAAGGAATGTGATAACATTATTTTTTCAATTATTTTGAAAGCGTGTAGCGAATTGCGTGAAATTGATGAAGGGAGGAAGGTCCATTGCCAGATTGTGAAGGTGGGGGGTCCGGATAGTTTTGTATTGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGATTGAGTGGTCAAGCGCTGTGTTTGAAGGAATTATCGATAAGAATGTGGTTTCTTGGACTACAATGATTGCGGGATATGTACAAAATGATTGTGCAGAAGAGGGTCTGGTTTTATTCAATCGGATGAGAGAATCATTGGTCGAAAGCAACCAATTTACTTTAGGGAGCATAATAACTGCTTGTACAAGATTAAGAGCTTTGCATCAGGGGAAATGGGTACATGGCTATGCCATTAAGAACGTTATTATTGAACTTAACTCTTTTTTGGCGACAGCTTTCTTGGACATGTATGTTAAATGTGGGCAAACAAGAGATGCTCACATGATATTTGACGAGCTACCTAGTATTGATCTCGTTTCATGGACTGCAATGATCGTTGGATATTCCCAAGCTGGCCAACCCAACGAGGCATTGAGGCTTTTCACTGATAAAATAAGGTCTGGTCTCTTACCTAATTCGGTCACTGCTGCAAGTATTCTTTCGGCATGTTCGGTGTCTGGTAATTTAAGTATCGGAATGTTAGTTCATGGACTTGGGATTAAACTTGGGCTGGAAGAGTGTGCAGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATATGATTGACGATGCTTATGTTGTATTTCTTGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCAATGATATCTGGGTATGCTCAGAGTGGATCTGCATATGATGCCCTCCGTCTCTTTAATCAAATGAGATCGGACTCCCTTGCACCTGACGCAATAACTCTGGTGAGCGCCCTTTCAGCATCTGCCATCCTAGAGACTTCCTTTTACCTCTCTAACAGTATTGCAAACAGTGAAATAATGCTTACAAAACAACAACTTGAGTTCCTTTGTGGGATCAATTCATCAGCTCAGAATACAAATGATTCAAGTGTAGGAAGCTGTCCATCATCTCAAGTTTCAAGCATTCTGTACACTAGGAATAAAGCTCAAGACGTGGCTCATTTGGCTCAGGAAACAGATCTCCACATTCTTCTGATTTAA

Protein sequence

MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILETSFYLSNSIANSEIMLTKQQLEFLCGINSSAQNTNDSSVGSCPSSQVSSILYTRNKAQDVAHLAQETDLHILLI
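
The protein sequence corresponds to the coding sequence read codon by codon. As a small sketch (assuming the standard nuclear genetic code, which is an assumption rather than something the record states), the first and last six codons of the CDS can be translated and compared with the two ends of the protein above. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, laid out in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTTGCAGCGCTTCTTT"))  # first six codons of the CDS: MLQRFF
print(translate("CACATTCTTCTGATTTAA"))  # last six codons of the CDS:  HILLI*
```

Both fragments agree with the start (`MLQRFF`) and end (`HILLI`) of the listed protein, with the final `TAA` acting as the stop codon.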
BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match: PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 2.3e-117
Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1

Query: 45  ASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSAR 104
           +S+ + +   CF  +  C NID+L + HG+L  +GL+G++   TKLV +YG  G    AR
Sbjct: 37  SSLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDAR 96

Query: 105 MVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELR 164
           +VFDQ+ +PDFY WKVM+R Y LN     ++  Y+ +       D+I+FS  LKAC+EL+
Sbjct: 97  LVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQ 156

Query: 165 EIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAG 224
           ++D G+K+HCQ+VKV   D+ VLTGL+DMY KCG+I+ +  VF  I  +NVV WT+MIAG
Sbjct: 157 DLDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAG 216

Query: 225 YVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIEL 284
           YV+ND  EEGLVLFNRMRE+ V  N++T G++I ACT+L ALHQGKW HG  +K+  IEL
Sbjct: 217 YVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSG-IEL 276

Query: 285 NSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKI 344
           +S L T+ LDMYVKCG   +A  +F+E   +DLV WTAMIVGY+  G  NEAL LF    
Sbjct: 277 SSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMK 336

Query: 345 RSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDD 404
              + PN VT AS+LS C +  NL +G  VHGL IK+G+ +  V NAL+ MYAKC+   D
Sbjct: 337 GVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRD 396

Query: 405 AYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAI 464
           A  VF    EKD++ WNS+ISG++Q+GS ++AL LF++M S+S+ P+ +T+ S  SA A 
Sbjct: 397 AKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACAS 456

Query: 465 L 466
           L
Sbjct: 457 L 456
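
The percentages in an HSP summary line are the identical (or positive-scoring) positions divided by the alignment length. A minimal sketch that recomputes them from the summary line of the hit above follows. The function name is hypothetical, and the pattern accepts either spelling of the positives label, since some exports print it as "Postives".

```python
import re

# Matches the identity/positives part of an HSP summary line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(stats_line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length, positive, pos_length = map(int, match.groups())
    return (
        round(100 * identical / length, 2),
        round(100 * positive / pos_length, 2),
    )


line = "Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1"
print(recompute_percentages(line))  # (50.36, 68.65)
```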

BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 246.5 bits (628), Expect = 6.6e-64
Identity = 143/434 (32.95%), Positives = 242/434 (55.76%), Query Frame = 1

Query: 68  LIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFL 127
           L + H  L+V GL  +    TKL+    + GD+  AR VFD +  P  + W  +IR Y  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 128 NDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG-GPDSFV 187
           N+ F   +  Y+ M+++    D+  F  +LKACS L  +  GR VH Q+ ++G   D FV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 188 LTGLIDMYGKCGQIEWSSAVFEG--IIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRES 247
             GLI +Y KC ++  +  VFEG  + ++ +VSWT +++ Y QN    E L +F++MR+ 
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 248 LVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRD 307
            V+ +   L S++ A T L+ L QG+ +H   +K + +E+   L  +   MY KCGQ   
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVK-MGLEIEPDLLISLNTMYAKCGQVAT 276

Query: 308 AHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSV 367
           A ++FD++ S +L+ W AMI GY++ G   EA+ +F + I   + P++++  S +SAC+ 
Sbjct: 277 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 336

Query: 368 SGNLSIGMLVHG-LGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSM 427
            G+L     ++  +G     ++  + +ALIDM+AKC  ++ A +VF   L++DV+ W++M
Sbjct: 337 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 396

Query: 428 ISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSA---SAILETSFYLSNSIANSEI 487
           I GY   G A +A+ L+  M    + P+ +T +  L A   S ++   ++  N +A+ +I
Sbjct: 397 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 456

Query: 488 MLTKQQLEFLCGIN 495
               QQ  + C I+
Sbjct: 457 --NPQQQHYACVID 467

BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match: PP319_ARATH (Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN=PCMP-A2 PE=2 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 1.1e-63
Identity = 146/397 (36.78%), Positives = 221/397 (55.67%), Query Frame = 1

Query: 72  HGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMF 131
           HG ++  G VGNL+ ++ LV  Y   G++ SA   FD M + D  +W  +I         
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266

Query: 132 ASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK-VGGPDSFVLTGL 191
              I  +  M   +   +      ILKACSE + +  GR+VH  +VK +   D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326

Query: 192 IDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQ 251
           +DMY KCG+I     VF+G+ ++N V+WT++IA + +    EE + LF  M+   + +N 
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386

Query: 252 FTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFD 311
            T+ SI+ AC  + AL  GK +H   IKN I E N ++ +  + +Y KCG++RDA  +  
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSI-EKNVYIGSTLVWLYCKCGESRDAFNVLQ 446

Query: 312 ELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSI 371
           +LPS D+VSWTAMI G S  G  +EAL    + I+ G+ PN  T +S L AC+ S +L I
Sbjct: 447 QLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLI 506

Query: 372 GMLVHGLGIK-LGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQ 431
           G  +H +  K   L    V +ALI MYAKC  + +A+ VF  + EK++++W +MI GYA+
Sbjct: 507 GRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYAR 566

Query: 432 SGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILE 467
           +G   +AL+L  +M ++    D     + LS    +E
Sbjct: 567 NGFCREALKLMYRMEAEGFEVDDYIFATILSTCGDIE 601

BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 2.1e-62
Identity = 139/420 (33.10%), Positives = 227/420 (54.05%), Query Frame = 1

Query: 53  HSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRD 112
           H     +  C ++  L +   L+  +GL       TKLV ++   G V  A  VF+ +  
Sbjct: 38  HPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDS 97

Query: 113 PDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKV 172
                +  M++ +         + F+ RMR    E     F+ +LK C +  E+  G+++
Sbjct: 98  KLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEI 157

Query: 173 HCQIVKVG-GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCA 232
           H  +VK G   D F +TGL +MY KC Q+  +  VF+ + ++++VSW T++AGY QN  A
Sbjct: 158 HGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMA 217

Query: 233 EEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATA 292
              L +   M E  ++ +  T+ S++ A + LR +  GK +HGYA+++    L + ++TA
Sbjct: 218 RMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN-ISTA 277

Query: 293 FLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPN 352
            +DMY KCG    A  +FD +   ++VSW +MI  Y Q   P EA+ +F   +  G+ P 
Sbjct: 278 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 337

Query: 353 SVTAASILSACSVSGNLSIGMLVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVVFL 412
            V+    L AC+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F 
Sbjct: 338 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 397

Query: 413 GVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILETSFY 471
            +  + +++WN+MI G+AQ+G   DAL  F+QMRS ++ PD  T VS ++A A L  + +
Sbjct: 398 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match: PP214_ARATH (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana GN=PCMP-E82 PE=3 SV=2)

HSP 1 Score: 233.4 bits (594), Expect = 5.8e-60
Identity = 139/413 (33.66%), Positives = 215/413 (52.06%), Query Frame = 1

Query: 62  CRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGD---VGSARMVFDQMRDPDFYAW 121
           CR++  L + HGL+I   ++ N++  ++L+       +   +  AR VF+ +  P  Y W
Sbjct: 16  CRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESIDCPSVYIW 75

Query: 122 KVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK 181
             MIR Y  +      + FY  M       D   F  +LKACS LR+I  G  VH  +VK
Sbjct: 76  NSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVVK 135

Query: 182 VGGP-DSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVL 241
            G   + +V T L+ MY  CG++ +   VFE I   NVV+W ++I+G+V N+   + +  
Sbjct: 136 TGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNNRFSDAIEA 195

Query: 242 FNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYA-------IKNVIIELNSFLAT 301
           F  M+ + V++N+  +  ++ AC R + +  GKW HG+             +  N  LAT
Sbjct: 196 FREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKVGFNVILAT 255

Query: 302 AFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLP 361
           + +DMY KCG  R A  +FD +P   LVSW ++I GYSQ G   EAL +F D +  G+ P
Sbjct: 256 SLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLDMLDLGIAP 315

Query: 362 NSVTAASILSACSVSGNLSIGMLVHGLGIKLG-LEECAVKNALIDMYAKCHMIDDAYVVF 421
           + VT  S++ A  + G   +G  +H    K G +++ A+  AL++MYAK    + A   F
Sbjct: 316 DKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGDAESAKKAF 375

Query: 422 LGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLA-PDAITLVSALSA 462
             + +KD I W  +I G A  G   +AL +F +M+    A PD IT +  L A
Sbjct: 376 EDLEKKDTIAWTVVIIGLASHGHGNEALSIFQRMQEKGNATPDGITYLGVLYA 428

BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match: A0A0A0KLZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 1.5e-224
Identity = 393/465 (84.52%), Positives = 420/465 (90.32%), Query Frame = 1

Query: 1   MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
           MLQRF   PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL  C Y MG
Sbjct: 13  MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72

Query: 61  LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
           L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132

Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
           MIRWYFLND+F  +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192

Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
           GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252

Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
           MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312

Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
           QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372

Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
           ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432

Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
           NSMISGYAQ+GSAYDALRLFNQMR   LAPD ITLVS LSA A L
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 473

BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match: A0A061EAZ8_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE=4 SV=1)

HSP 1 Score: 597.4 bits (1539), Expect = 1.7e-167
Identity = 287/438 (65.53%), Positives = 353/438 (80.59%), Query Frame = 1

Query: 30  TYASQPPLS--DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCD 89
           +Y +  PL    +D+T+AS+  ISL+ CF  +GLCRNID+L K H L +++G+ G+LLCD
Sbjct: 30  SYTTDHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDLLCD 89

Query: 90  TKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKE 149
           TKLV +YG  G +G AR++FDQ+ DPDFY+WKVMIRWYFLND+   II FY RMRMS + 
Sbjct: 90  TKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMSVRM 149

Query: 150 CDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVF 209
           CDN++FS++LKACSE+R+IDEGRKVHCQIVK G PDSFV TGL+DMY KCG+IE S  VF
Sbjct: 150 CDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSRKVF 209

Query: 210 EGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALH 269
             IID+NVVSWT+MIAGYVQNDCAE+ LVLFNRMRE++VE N+FTLGS++TAC +L ALH
Sbjct: 210 SEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALH 269

Query: 270 QGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGY 329
           QGKWVHGY IKN  IELNS+  T  LDMYVKCG  RDA  +FDEL S+DLVSWTAMIVGY
Sbjct: 270 QGKWVHGYVIKNG-IELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGY 329

Query: 330 SQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECA 389
           SQ+G P+EAL+LF DK   G+LPN+VT AS+LSAC+   NLS G LVH LGI+LGL++  
Sbjct: 330 SQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKDST 389

Query: 390 VKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDS 449
           V NAL+DMYAKC MI DA  +F  V +K++I WNS+ISGY+Q+GSAY+A  LF+QMRS S
Sbjct: 390 VINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKS 449

Query: 450 LAPDAITLVSALSASAIL 466
           ++PDA+T+VS  SA A L
Sbjct: 450 VSPDAVTVVSIFSACASL 466

BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match: A5AY98_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.4e-161
Identity = 276/426 (64.79%), Positives = 343/426 (80.52%), Query Frame = 1

Query: 39  DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALG 98
           ++D+T+AS+Q IS + CF  +G+C+ + +L K H LL+VHGL  +LLC+TKLV +YG+ G
Sbjct: 26  EIDRTIASIQSISSNPCFSLLGICKTVSSLRKIHALLVVHGLSEDLLCETKLVSLYGSFG 85

Query: 99  DVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYN-RMRMSFKECDNIIFSIIL 158
            V  AR++FD++R+PD Y+WKVMIRWYFLND ++ I+ FYN R+R    E DN++FSI+L
Sbjct: 86  HVECARLMFDRIRNPDLYSWKVMIRWYFLNDSYSEIVQFYNTRLRKCLNEYDNVVFSIVL 145

Query: 159 KACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVS 218
           KACSELRE DEGRK+HCQIVKVG PDSFVLTGL+DMY KC ++E S  VF+ I+D+NVV 
Sbjct: 146 KACSELRETDEGRKLHCQIVKVGSPDSFVLTGLVDMYAKCREVEDSRRVFDEILDRNVVC 205

Query: 219 WTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAI 278
           WT+MI GYVQNDC +EGLVLFNRMRE LVE NQ+TLGS++TACT+L ALHQGKWVHGY I
Sbjct: 206 WTSMIVGYVQNDCLKEGLVLFNRMREGLVEGNQYTLGSLVTACTKLGALHQGKWVHGYVI 265

Query: 279 KNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEAL 338
           K+   +LNSFL T  LD+Y KCG  RDA  +FDEL +IDLVSWTAMIVGY+Q G P EAL
Sbjct: 266 KSGF-DLNSFLVTPLLDLYFKCGDIRDAFSVFDELSTIDLVSWTAMIVGYAQRGYPREAL 325

Query: 339 RLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYA 398
           +LFTD+    LLPN+VT +S+LSAC+ +G+L++G  VH LGIKLG E+   +NAL+DMYA
Sbjct: 326 KLFTDERWKDLLPNTVTTSSVLSACAQTGSLNMGRSVHCLGIKLGSEDATFENALVDMYA 385

Query: 399 KCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVS 458
           KCHMI DA  VF  V +KDVI WNS+ISGY Q+G AY+AL LF+QMRSDS+ PDAITLVS
Sbjct: 386 KCHMIGDARYVFETVFDKDVIAWNSIISGYTQNGYAYEALELFDQMRSDSVYPDAITLVS 445

Query: 459 ALSASA 464
            LSA A
Sbjct: 446 VLSACA 450

BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match: A0A0D2RH88_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G178400 PE=4 SV=1)

HSP 1 Score: 562.8 bits (1449), Expect = 4.6e-157
Identity = 283/488 (57.99%), Positives = 364/488 (74.59%), Query Frame = 1

Query: 27  SYSTYASQP-PLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLL 86
           S S +  QP     LD ++AS+  +S + CF  +  C+NID L + H L I++G+ G+LL
Sbjct: 44  SLSYFTDQPFEYPSLDPSLASLHSVSSNPCFALLSFCKNIDCLKEVHALFIINGIKGDLL 103

Query: 87  CDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSF 146
           CDTKLV +YG+ G VG A  VFD++ +PDFY+WKVMIRWYFLND++  II FY RMRMS 
Sbjct: 104 CDTKLVSLYGSFGHVGYAGSVFDRIPEPDFYSWKVMIRWYFLNDLYTEIIGFYGRMRMSV 163

Query: 147 KECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSA 206
           +  DN++FS++LKACSEL++I+EGRKVHC +VKVG PDSFV TGL+DMY KC QI+ +  
Sbjct: 164 RGFDNVVFSVVLKACSELQDINEGRKVHCDVVKVGNPDSFVQTGLVDMYAKCRQIKCARK 223

Query: 207 VFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRA 266
           VF  I  +NVVSWT+M+AGYVQN+C++E LVLFNRMRE++VESNQFTLGS++TAC +L A
Sbjct: 224 VFGEIFYRNVVSWTSMLAGYVQNNCSKEALVLFNRMREAMVESNQFTLGSLVTACGKLGA 283

Query: 267 LHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIV 326
           LHQGKWVHGY IK   IELNS+L TA LDMYVKCG  RDA   FD LPS+DLVSWTAMIV
Sbjct: 284 LHQGKWVHGYIIKTG-IELNSYLVTAILDMYVKCGSLRDARSAFDALPSVDLVSWTAMIV 343

Query: 327 GYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEE 386
           GYSQ+G P+EAL+LF DK R G+LPN+VT AS+LSAC+   NLS G LVH LGI+LGL +
Sbjct: 344 GYSQSGFPDEALKLFVDKRRFGILPNAVTIASLLSACAQLSNLSAGRLVHSLGIQLGLID 403

Query: 387 CAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRS 446
             V NAL+DMYAKC +I  A  +F  V +K++I WNS++SGY+Q+G AYDAL LF+QMRS
Sbjct: 404 PTVINALVDMYAKCGVIRAASYIFETVSDKNLIAWNSILSGYSQNGLAYDALELFHQMRS 463

Query: 447 DSLAPDAITLVSALSASAILET--------SFYLSNSIANSEIMLTKQQLEFL--CGINS 504
           +S++PDA+TLVS  SA A +          ++ + N + +S + +    L F   CG + 
Sbjct: 464 NSVSPDAVTLVSIFSACASVGAFQVGSSLHAYTMKNGLLSSSVYVGTALLNFYAKCGDSK 523

BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match: A0A067L9V5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 7.9e-157
Identity = 280/487 (57.49%), Positives = 359/487 (73.72%), Query Frame = 1

Query: 27  SYSTYASQPPLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLC 86
           SY T+         D  +AS+ +I  H CF  +G C+NI +L K HGLLIV GL G+LLC
Sbjct: 32  SYLTHQLPLDPPQFDHNIASIHYIFSHPCFNLLGFCKNIYSLKKVHGLLIVDGLDGDLLC 91

Query: 87  DTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFK 146
           +TKLV +YG+ GD+ +AR+VFD++ +PD Y+WKVM+RWYFL+D++  I   Y+RM++  K
Sbjct: 92  NTKLVSLYGSFGDIDAARVVFDRIPNPDLYSWKVMLRWYFLSDLYWEIFGLYSRMKICVK 151

Query: 147 ECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAV 206
           E DN++FSI+LKACSELR IDEGRK+HCQ+VKVG PDSFVLTGL+DMY KCG+IE S  V
Sbjct: 152 EYDNVMFSIVLKACSELRCIDEGRKIHCQVVKVGDPDSFVLTGLVDMYAKCGEIESSRHV 211

Query: 207 FEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRAL 266
           F+  +D+NVVSWT+MIAGYVQNDC  EGL LFNRMRE  V  NQFTLGS++TACT+L AL
Sbjct: 212 FDENLDRNVVSWTSMIAGYVQNDCPAEGLTLFNRMREGFVGGNQFTLGSLVTACTKLGAL 271

Query: 267 HQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVG 326
           HQGKWVHG+AIK+  +ELNS+L TA LDMYVKCG  +DA  +FDEL S+DLVSWTAMIVG
Sbjct: 272 HQGKWVHGFAIKSG-VELNSYLVTALLDMYVKCGVIKDARSVFDELSSVDLVSWTAMIVG 331

Query: 327 YSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEEC 386
           Y+Q+G  +EAL+LF D+ +   LPN VT  ++LSAC+  GNL++G  VHGLGIKLG  + 
Sbjct: 332 YTQSGLFHEALKLFMDE-KFDALPNDVTIVTVLSACAQLGNLNLGRSVHGLGIKLGFRQS 391

Query: 387 AVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSD 446
            V NAL+DMYAKCHM  DA  +F  +  KDV+ WNS+ISGY Q+GSAY+AL LF+QMR +
Sbjct: 392 TVANALVDMYAKCHMNRDASFIFERISHKDVVAWNSIISGYYQNGSAYEALELFHQMRME 451

Query: 447 SLAPDAITLVSALSASAILET--------SFYLSNSIANSEIMLTKQQLEFL--CGINSS 504
            + PDA+TLVS  SA A+L          ++ +   + +S + +    L F   CG  +S
Sbjct: 452 LVLPDAVTLVSVFSACALLGALRAGSSLHAYSIKEGLLSSNVYVGTALLTFYAKCGDANS 511

BLAST of Cp4.1LG01g16670 vs. TAIR10
Match: AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 424.1 bits (1089), Expect = 1.3e-118
Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1

Query: 45  ASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSAR 104
           +S+ + +   CF  +  C NID+L + HG+L  +GL+G++   TKLV +YG  G    AR
Sbjct: 37  SSLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDAR 96

Query: 105 MVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELR 164
           +VFDQ+ +PDFY WKVM+R Y LN     ++  Y+ +       D+I+FS  LKAC+EL+
Sbjct: 97  LVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQ 156

Query: 165 EIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAG 224
           ++D G+K+HCQ+VKV   D+ VLTGL+DMY KCG+I+ +  VF  I  +NVV WT+MIAG
Sbjct: 157 DLDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAG 216

Query: 225 YVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIEL 284
           YV+ND  EEGLVLFNRMRE+ V  N++T G++I ACT+L ALHQGKW HG  +K+  IEL
Sbjct: 217 YVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSG-IEL 276

Query: 285 NSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKI 344
           +S L T+ LDMYVKCG   +A  +F+E   +DLV WTAMIVGY+  G  NEAL LF    
Sbjct: 277 SSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMK 336

Query: 345 RSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDD 404
              + PN VT AS+LS C +  NL +G  VHGL IK+G+ +  V NAL+ MYAKC+   D
Sbjct: 337 GVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRD 396

Query: 405 AYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAI 464
           A  VF    EKD++ WNS+ISG++Q+GS ++AL LF++M S+S+ P+ +T+ S  SA A 
Sbjct: 397 AKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACAS 456

Query: 465 L 466
           L
Sbjct: 457 L 456
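
The same 424.1-bit alignment to At2g03380 is reported with Expect = 2.3e-117 against Swiss-Prot and Expect = 1.3e-118 against TAIR10. Under the Karlin-Altschul relation E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space, an identical score gives a smaller E-value in a smaller database. The sketch below back-calculates the search space implied by each pair of numbers. The function name is illustrative, and because the reported bit score is rounded to one decimal, the results should be read as order-of-magnitude estimates only.

```python
def implied_search_space(bit_score, e_value):
    """Effective search space m*n implied by E = m*n * 2**(-bit_score)."""
    return e_value * 2.0 ** bit_score


swissprot = implied_search_space(424.1, 2.3e-117)
tair10 = implied_search_space(424.1, 1.3e-118)

print(f"Swiss-Prot search space: {swissprot:.2e}")
print(f"TAIR10 search space:     {tair10:.2e}")
print(f"ratio: {swissprot / tair10:.1f}")
```

The ratio of the two E-values (about 17.7) is therefore an estimate of how much larger the Swiss-Prot search space was than the TAIR10 one for this query.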

BLAST of Cp4.1LG01g16670 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 246.5 bits (628), Expect = 3.7e-65
Identity = 143/434 (32.95%), Positives = 242/434 (55.76%), Query Frame = 1

Query: 68  LIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFL 127
           L + H  L+V GL  +    TKL+    + GD+  AR VFD +  P  + W  +IR Y  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 128 NDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG-GPDSFV 187
           N+ F   +  Y+ M+++    D+  F  +LKACS L  +  GR VH Q+ ++G   D FV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 188 LTGLIDMYGKCGQIEWSSAVFEG--IIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRES 247
             GLI +Y KC ++  +  VFEG  + ++ +VSWT +++ Y QN    E L +F++MR+ 
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 248 LVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRD 307
            V+ +   L S++ A T L+ L QG+ +H   +K + +E+   L  +   MY KCGQ   
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVK-MGLEIEPDLLISLNTMYAKCGQVAT 276

Query: 308 AHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSV 367
           A ++FD++ S +L+ W AMI GY++ G   EA+ +F + I   + P++++  S +SAC+ 
Sbjct: 277 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 336

Query: 368 SGNLSIGMLVHG-LGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSM 427
            G+L     ++  +G     ++  + +ALIDM+AKC  ++ A +VF   L++DV+ W++M
Sbjct: 337 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 396

Query: 428 ISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSA---SAILETSFYLSNSIANSEI 487
           I GY   G A +A+ L+  M    + P+ +T +  L A   S ++   ++  N +A+ +I
Sbjct: 397 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 456

Query: 488 MLTKQQLEFLCGIN 495
               QQ  + C I+
Sbjct: 457 --NPQQQHYACVID 467
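
The percentages in each HSP summary follow directly from the counts over the alignment length (for example, 143 identical positions out of 434 aligned columns). The sketch below shows one way to pull those numbers out of the two summary lines and recompute them. The function name `parse_hsp` and the returned field names are illustrative assumptions, not part of any BLAST tool; the pattern is written only for the summary layout used in this report.

```python
import re

# Matches the two-line HSP summary as laid out in this report.
# "Posi?tives" accepts the usual spelling as well as the "Postives"
# spelling that some report exports use.
HSP_RE = re.compile(
    r"Score: (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<alen>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/(?P=alen) \([\d.]+%\)"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP (hypothetical helper)."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    alen = int(m["alen"])  # alignment length, gaps included
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        "identity_pct": round(100 * int(m["ident"]) / alen, 2),
        "positives_pct": round(100 * int(m["pos"]) / alen, 2),
    }


# Summary lines for the AT3G12770.1 hit, copied from the report above.
summary = """HSP 1 Score: 246.5 bits (628), Expect = 3.7e-65
Identity = 143/434 (32.95%), Positives = 242/434 (55.76%), Query Frame = 1"""

hsp = parse_hsp(summary)
print(hsp)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a quick consistency check when reports are copied between tools.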

BLAST of Cp4.1LG01g16670 vs. TAIR10
Match: AT4G18520.1 (AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 245.7 bits (626), Expect = 6.3e-65
Identity = 146/397 (36.78%), Positives = 221/397 (55.67%), Query Frame = 1

Query: 72  HGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMF 131
           HG ++  G VGNL+ ++ LV  Y   G++ SA   FD M + D  +W  +I         
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266

Query: 132 ASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK-VGGPDSFVLTGL 191
              I  +  M   +   +      ILKACSE + +  GR+VH  +VK +   D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326

Query: 192 IDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQ 251
           +DMY KCG+I     VF+G+ ++N V+WT++IA + +    EE + LF  M+   + +N 
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386

Query: 252 FTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFD 311
            T+ SI+ AC  + AL  GK +H   IKN I E N ++ +  + +Y KCG++RDA  +  
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSI-EKNVYIGSTLVWLYCKCGESRDAFNVLQ 446

Query: 312 ELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSI 371
           +LPS D+VSWTAMI G S  G  +EAL    + I+ G+ PN  T +S L AC+ S +L I
Sbjct: 447 QLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLI 506

Query: 372 GMLVHGLGIK-LGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQ 431
           G  +H +  K   L    V +ALI MYAKC  + +A+ VF  + EK++++W +MI GYA+
Sbjct: 507 GRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYAR 566

Query: 432 SGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILE 467
           +G   +AL+L  +M ++    D     + LS    +E
Sbjct: 567 NGFCREALKLMYRMEAEGFEVDDYIFATILSTCGDIE 601

BLAST of Cp4.1LG01g16670 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 241.5 bits (615), Expect = 1.2e-63
Identity = 139/420 (33.10%), Positives = 227/420 (54.05%), Query Frame = 1

Query: 53  HSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRD 112
           H     +  C ++  L +   L+  +GL       TKLV ++   G V  A  VF+ +  
Sbjct: 38  HPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDS 97

Query: 113 PDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKV 172
                +  M++ +         + F+ RMR    E     F+ +LK C +  E+  G+++
Sbjct: 98  KLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEI 157

Query: 173 HCQIVKVG-GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCA 232
           H  +VK G   D F +TGL +MY KC Q+  +  VF+ + ++++VSW T++AGY QN  A
Sbjct: 158 HGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMA 217

Query: 233 EEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATA 292
              L +   M E  ++ +  T+ S++ A + LR +  GK +HGYA+++    L + ++TA
Sbjct: 218 RMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN-ISTA 277

Query: 293 FLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPN 352
            +DMY KCG    A  +FD +   ++VSW +MI  Y Q   P EA+ +F   +  G+ P 
Sbjct: 278 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 337

Query: 353 SVTAASILSACSVSGNLSIGMLVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVVFL 412
            V+    L AC+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F 
Sbjct: 338 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 397

Query: 413 GVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILETSFY 471
            +  + +++WN+MI G+AQ+G   DAL  F+QMRS ++ PD  T VS ++A A L  + +
Sbjct: 398 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

BLAST of Cp4.1LG01g16670 vs. TAIR10
Match: AT3G05240.1 (AT3G05240.1 mitochondrial editing factor 19)

HSP 1 Score: 233.4 bits (594), Expect = 3.3e-61
Identity = 139/413 (33.66%), Positives = 215/413 (52.06%), Query Frame = 1

Query: 62  CRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGD---VGSARMVFDQMRDPDFYAW 121
           CR++  L + HGL+I   ++ N++  ++L+       +   +  AR VF+ +  P  Y W
Sbjct: 16  CRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESIDCPSVYIW 75

Query: 122 KVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK 181
             MIR Y  +      + FY  M       D   F  +LKACS LR+I  G  VH  +VK
Sbjct: 76  NSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVVK 135

Query: 182 VGGP-DSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVL 241
            G   + +V T L+ MY  CG++ +   VFE I   NVV+W ++I+G+V N+   + +  
Sbjct: 136 TGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNNRFSDAIEA 195

Query: 242 FNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYA-------IKNVIIELNSFLAT 301
           F  M+ + V++N+  +  ++ AC R + +  GKW HG+             +  N  LAT
Sbjct: 196 FREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKVGFNVILAT 255

Query: 302 AFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLP 361
           + +DMY KCG  R A  +FD +P   LVSW ++I GYSQ G   EAL +F D +  G+ P
Sbjct: 256 SLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLDMLDLGIAP 315

Query: 362 NSVTAASILSACSVSGNLSIGMLVHGLGIKLG-LEECAVKNALIDMYAKCHMIDDAYVVF 421
           + VT  S++ A  + G   +G  +H    K G +++ A+  AL++MYAK    + A   F
Sbjct: 316 DKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGDAESAKKAF 375

Query: 422 LGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLA-PDAITLVSALSA 462
             + +KD I W  +I G A  G   +AL +F +M+    A PD IT +  L A
Sbjct: 376 EDLEKKDTIAWTVVIIGLASHGHGNEALSIFQRMQEKGNATPDGITYLGVLYA 428

BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match: gi|659114052|ref|XP_008456885.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucumis melo])

HSP 1 Score: 814.7 bits (2103), Expect = 9.7e-233
Identity = 413/513 (80.51%), Positives = 446/513 (86.94%), Query Frame = 1

Query: 1   MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
           MLQRFFSLPR FS L G LL+MGH +SYSTYAS PPLSDL QTM SVQFISLHSC Y MG
Sbjct: 1   MLQRFFSLPRAFSHLTGSLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLHSCCYLMG 60

Query: 61  LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
           L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM DPDFYAWKV
Sbjct: 61  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPDPDFYAWKV 120

Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
           MIRWYFLND+F  +IPFYN MRMSF+ECDNIIFSIILKACSELREIDEGRKVHCQIVKVG
Sbjct: 121 MIRWYFLNDLFVDVIPFYNCMRMSFRECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180

Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
           GPDSFV+TGLIDMYGKC Q+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 181 GPDSFVMTGLIDMYGKCRQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 240

Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
           MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I+E +SFLAT FLDMYVKCG
Sbjct: 241 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IVEFSSFLATTFLDMYVKCG 300

Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
           QTRDA MIFDELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 301 QTRDARMIFDELPTIDLVSWTAMIVGYTQASQPNDGLRLFADEIRSDLLPNSVTAASVLS 360

Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
           ACSVSGNL++GM VHGLGIKLGLEECAVKNALIDMYAKCH I DAYV+F GVLEKDVITW
Sbjct: 361 ACSVSGNLNLGMSVHGLGIKLGLEECAVKNALIDMYAKCHKIGDAYVIFHGVLEKDVITW 420

Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILET--------SFYLS 480
           NSMISGYAQ+GSAYDALRLFNQMRS SLAPDAITLVS LSASA L          ++ + 
Sbjct: 421 NSMISGYAQNGSAYDALRLFNQMRSYSLAPDAITLVSTLSASATLGAVQVGSSLHAYSVK 480

Query: 481 NSIANSEIMLTKQQLEFL--CGINSSAQNTNDS 504
             + +S + +    L F   CG   SA+   DS
Sbjct: 481 EGLFSSNLYIGTALLNFYAKCGDAKSARTVFDS 512

BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match: gi|700195421|gb|KGN50598.1| (hypothetical protein Csa_5G189930 [Cucumis sativus])

HSP 1 Score: 786.9 bits (2031), Expect = 2.2e-224
Identity = 393/465 (84.52%), Positives = 420/465 (90.32%), Query Frame = 1

Query: 1   MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
           MLQRF   PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL  C Y MG
Sbjct: 13  MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72

Query: 61  LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
           L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132

Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
           MIRWYFLND+F  +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192

Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
           GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252

Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
           MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312

Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
           QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372

Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
           ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432

Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
           NSMISGYAQ+GSAYDALRLFNQMR   LAPD ITLVS LSA A L
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 473

BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match: gi|778708407|ref|XP_011656184.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 782.3 bits (2019), Expect = 5.3e-223
Identity = 390/465 (83.87%), Positives = 420/465 (90.32%), Query Frame = 1

Query: 1   MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
           MLQRF   PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL  C Y MG
Sbjct: 13  MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72

Query: 61  LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
           L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132

Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
           MIRWYFLND+F  +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192

Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
           GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252

Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
           MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312

Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
           QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372

Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
           ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432

Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
           NSMISGYAQ+GSAYDALRLFNQMR   LAPD ITLVS  + SA++
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSWFTWSAMI 473

BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match: gi|590699556|ref|XP_007045956.1| (Pentatricopeptide repeat superfamily protein [Theobroma cacao])

HSP 1 Score: 597.4 bits (1539), Expect = 2.4e-167
Identity = 287/438 (65.53%), Positives = 353/438 (80.59%), Query Frame = 1

Query: 30  TYASQPPLS--DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCD 89
           +Y +  PL    +D+T+AS+  ISL+ CF  +GLCRNID+L K H L +++G+ G+LLCD
Sbjct: 30  SYTTDHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDLLCD 89

Query: 90  TKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKE 149
           TKLV +YG  G +G AR++FDQ+ DPDFY+WKVMIRWYFLND+   II FY RMRMS + 
Sbjct: 90  TKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMSVRM 149

Query: 150 CDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVF 209
           CDN++FS++LKACSE+R+IDEGRKVHCQIVK G PDSFV TGL+DMY KCG+IE S  VF
Sbjct: 150 CDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSRKVF 209

Query: 210 EGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALH 269
             IID+NVVSWT+MIAGYVQNDCAE+ LVLFNRMRE++VE N+FTLGS++TAC +L ALH
Sbjct: 210 SEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALH 269

Query: 270 QGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGY 329
           QGKWVHGY IKN  IELNS+  T  LDMYVKCG  RDA  +FDEL S+DLVSWTAMIVGY
Sbjct: 270 QGKWVHGYVIKNG-IELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGY 329

Query: 330 SQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECA 389
           SQ+G P+EAL+LF DK   G+LPN+VT AS+LSAC+   NLS G LVH LGI+LGL++  
Sbjct: 330 SQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKDST 389

Query: 390 VKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDS 449
           V NAL+DMYAKC MI DA  +F  V +K++I WNS+ISGY+Q+GSAY+A  LF+QMRS S
Sbjct: 390 VINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKS 449

Query: 450 LAPDAITLVSALSASAIL 466
           ++PDA+T+VS  SA A L
Sbjct: 450 VSPDAVTVVSIFSACASL 466

BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match: gi|645229621|ref|XP_008221546.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Prunus mume])

HSP 1 Score: 583.9 bits (1504), Expect = 2.8e-163
Identity = 276/441 (62.59%), Positives = 356/441 (80.73%), Query Frame = 1

Query: 25  HVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNL 84
           H  Y   +S+PP  DL +T+AS + +  + CF  + LCRNID+L K H LL++HGL  +L
Sbjct: 28  HAIYQLPSSEPP--DLSETLASTRSVFSNPCFNLLVLCRNIDSLKKVHSLLVLHGLSDDL 87

Query: 85  LCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMS 144
           LC TKL+ +YG+ G V  AR++FDQM  PDFY+WKVM+RWYF+++++A ++ FY  MR+ 
Sbjct: 88  LCRTKLISLYGSFGYVKCARLLFDQMPSPDFYSWKVMLRWYFMHNLYAEVMGFYTHMRIC 147

Query: 145 FKECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSS 204
            +E DN++FSI+LKACSELR+ +EGRKVHCQ+VKV  PDSFVLTGL+D+Y KCG IE S 
Sbjct: 148 VREHDNVVFSIVLKACSELRDFNEGRKVHCQVVKVASPDSFVLTGLVDVYAKCGWIECSR 207

Query: 205 AVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLR 264
           AVF+GI+D+NVV WT+MI GYVQNDC ++GLVLFNRMRE L++ NQFTLGS++TACT+LR
Sbjct: 208 AVFDGIVDRNVVCWTSMIVGYVQNDCPQDGLVLFNRMREELIKGNQFTLGSVLTACTKLR 267

Query: 265 ALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMI 324
           ALHQGKW+HG+ IK   IE++SFL T+ LDMYVKCG  R A  IFDELP+IDLVSWTAMI
Sbjct: 268 ALHQGKWIHGHLIKTG-IEVSSFLVTSLLDMYVKCGDIRYARSIFDELPAIDLVSWTAMI 327

Query: 325 VGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLE 384
           VGY+Q+G P+EAL+LFTD+   GLLPNS+T AS+LS+C+ S NL++G  +HGLGIKLGLE
Sbjct: 328 VGYTQSGCPDEALKLFTDEKWVGLLPNSITTASVLSSCAQSYNLNLGRSIHGLGIKLGLE 387

Query: 385 ECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMR 444
           +  V+NAL+DMYAKCHMI DA  +F  +L+K+VI WNS+ISGY+Q+GSA +AL+LF+QMR
Sbjct: 388 DSTVRNALVDMYAKCHMIGDARYIFETILDKNVIAWNSIISGYSQNGSACEALQLFHQMR 447

Query: 445 SDSLAPDAITLVSALSASAIL 466
           S+S + DA TL S LSA   L
Sbjct: 448 SESFSHDAFTLASVLSACTTL 465

The following BLAST results are available for this feature:
Match Name                        E-value    Identity (%)  Description
PP146_ARATH                       2.3e-117   50.36         Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
PP224_ARATH                       6.6e-64    32.95         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP319_ARATH                       1.1e-63    36.78         Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN...
PPR32_ARATH                       2.1e-62    33.10         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP214_ARATH                       5.8e-60    33.66         Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th...

Match Name                        E-value    Identity (%)  Description
A0A0A0KLZ1_CUCSA                  1.5e-224   84.52         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1
A0A061EAZ8_THECC                  1.7e-167   65.53         Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE...
A5AY98_VITVI                      1.4e-161   64.79         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=...
A0A0D2RH88_GOSRA                  4.6e-157   57.99         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G178400 PE=4 SV=1
A0A067L9V5_JATCU                  7.9e-157   57.49         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1

Match Name                        E-value    Identity (%)  Description
AT2G03380.1                       1.3e-118   50.36         Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1                       3.7e-65    32.95         mitochondrial editing factor 22
AT4G18520.1                       6.3e-65    36.78         Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1                       1.2e-63    33.10         Pentatricopeptide repeat (PPR) superfamily protein
AT3G05240.1                       3.3e-61    33.66         mitochondrial editing factor 19

Match Name                        E-value    Identity (%)  Description
gi|659114052|ref|XP_008456885.1|  9.7e-233   80.51         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial ...
gi|700195421|gb|KGN50598.1|       2.2e-224   84.52         hypothetical protein Csa_5G189930 [Cucumis sativus]
gi|778708407|ref|XP_011656184.1|  5.3e-223   83.87         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-...
gi|590699556|ref|XP_007045956.1|  2.4e-167   65.53         Pentatricopeptide repeat superfamily protein [Theobroma cacao]
gi|645229621|ref|XP_008221546.1|  2.8e-163   62.59         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g16670.1  Cp4.1LG01g16670.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term     Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535         PPR
    coord: 390..413  score: 0.52
    coord: 290..313  score:
IPR002885  Pentatricopeptide repeat  PFAM      PF13041         PPR_2
    coord: 213..260  score: 9.6E-9
    coord: 415..455  score: 3.7E-10
    coord: 316..362  score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756       TIGR00756
    coord: 318..351  score: 6.1E-5
    coord: 216..244  score: 2.6E-5
    coord: 418..452  score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375         PPR
    coord: 316..350  score: 11.696
    coord: 285..315  score: 6.708
    coord: 83..117   score: 6.27
    coord: 149..183  score: 7.344
    coord: 416..450  score: 12.912
    coord: 214..248  score: 9.624
    coord: 385..415  score: 5
None       No IPR available          PANTHER   PTHR24015       FAMILY NOT NAMED
    coord: 501..508  score: 1.5E-194
    coord: 65..351   score: 1.5E-194
    coord: 10..38    score: 1.5E-194
    coord: 386..465  score: 1.5E
None       No IPR available          PANTHER   PTHR24015:SF51  SUBFAMILY NOT NAMED
    coord: 386..465  score: 1.5E-194
    coord: 65..351   score: 1.5E-194
    coord: 501..508  score: 1.5E-194
    coord: 10..38    score: 1.5E
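
The `coord: start..end` / `score:` pairs in the Alignment column can be turned into ordered repeat positions on the protein. The sketch below does this for the PS51375 (PROFILE) hits whose scores are complete in the table above; the entry whose score is cut off is left out rather than guessed. The helper name `parse_alignments` and the dictionary keys are illustrative assumptions. Most of the resulting spans are 35 residues long, consistent with the PPR motif being a degenerate repeat of roughly 35 amino acids.

```python
import re

# One "coord: start..end score: value" pair; the score may use E-notation.
ALIGN_RE = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([0-9.]+(?:[Ee][+-]?\d+)?)"
)


def parse_alignments(text):
    """Return hits sorted by start position, with inclusive span lengths (hypothetical helper)."""
    hits = []
    for start, end, score in ALIGN_RE.findall(text):
        start, end = int(start), int(end)
        hits.append({
            "start": start,
            "end": end,
            "length": end - start + 1,  # coordinates are inclusive
            "score": float(score),
        })
    return sorted(hits, key=lambda h: h["start"])


# PS51375 entries with complete scores, in the order they appear in the table.
ps51375 = (
    "coord: 316..350 score: 11.696coord: 285..315 score: 6.708"
    "coord: 83..117 score: 6.27coord: 149..183 score: 7.344"
    "coord: 416..450 score: 12.912coord: 214..248 score: 9.624"
)

repeats = parse_alignments(ps51375)
for r in repeats:
    print(r["start"], r["end"], r["length"], r["score"])
```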

The following gene(s) are paralogous to this gene:

None