Cp4.1LG14g06660 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG14g06660
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG14 : 275564 .. 280533 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AATATATAAATTTACAAGCAATGTTGTGTAATGGCCTTCCACCTATTAACATAGCTCCTTTAAGAAATGCTTCTTTTTCTCCTCTTGCTCTGTTTAGTCTGTTTCCTCTGTTTGTTCAGCCCTGCATCCACCTCTGAACTCTGTTTCCACTCCAGCTGCGATGGCGATTTTCAGACCATTCAATTCCCTTTCAGAATTGAAAACCAACAGCCCAAATCATGTGGGTATCCAGGCTTTGATTTGACCTGCCCCTCCACAGGTCAACCACTTCTCCATCTTCCTTCTTCAGGGGATTTCACTGTTCAGTACATAGATTACGAGAACCAAGAAATTTTGGTCAATGATCCAAACAAATGTCTTCCCCGAAAGATTCTGTCTCTCGAGCTTTCTGGGTCGCCGTTTCATGGCACAAATAGTGAAGATTTCACCTTCTTTAATTGTTCGTGGAGTGATCCAATCCCATCTGAGTTTAACTTGAATCCAATATATTGCCTCAGTGGGTTGTCGTATGCAGTTTTCGCATCACCTTCTTCATTTGTAAATGAGATTTTGTCGTCGAGTTGTGTTGCGATGAAGACTGTATCGGTGCCGTATTCATGGTCGTTTTCGACGGATTTGACGAATGATCTTCGATTGGGGTGGAAAAAGCCTAATTGTAGAAGGTGCGAGTCACACGGTGGAATATGTGGCCTCAAACCCAATTCTACCGATCAAATCCAATGCAAACACAGTGCTCAACCTCGACATGGTATGCTTCCACTTCTCCCTCTACCTCCGTTTAGTGCTTTCAACATTTTACAAAAGAACCCACCTTGTAAATAACATCACAAAATTTCAAACACCTTACAAGTATGGATGTTCATATAAGAGTGTGTGAAAGAAAAAGCTACCATTTTTCATATTTATTTTTGTAGTTCCAATAACTAATTTTTATTTATTTATTGACTAGTAAATTGGATAAGAGTAAACTTTTAATAATAATAGATTTATTTGCATCAATTGGGTAGGTAAGCTCACATTTTTTTAATTTTTTTTTTTACAAAAATTAGTATAAAAGTATTTTTAAAAAATAATTATTGAGGGTGGTTGATCCACCTTATAATAAAATAGGTAATAGACTTTTGGTTAGTATGAGCGCACGTCTTCATAGGATTTTTTTGCCCTAATCTTAGGGTCACCATATTGTTATATAGTAATTTTCTTTTAGTTAGGGGTGAACTATGTAAATTATTAATTGTAATATAAGATGTATATATATATATATTATTTTTTTAAAAAAATTATTATTAGGGATCCCGAGAGGTGCTCGCTATGCTGTATCCATAGGAGTGGGAGTCCCAGCGACCATGTGCATTTTGGGCTTCTTATGTTGCTTTTGTGCTCGAGTCAGATCCTACTCGAGAGGCCGAAACTCCAGCATTGAGGCCCATTGGGTCATCTCTTCTCGGCCCACTTTAATGGGCCTAGATGAGCCCACTATCGATTCCTATCCAAAAATTGAATTGGGTGAAAGCCTGCGTCTACCAAAGCCCAATGACAATATTTGCGCAATTTGCCTGTCGGAATATCGGCCCAAAGAGACTGTGAAGAGTATACCCCAATGTCAACATTTCTTCCATCAAGATTGTATTGATGAATGGCTGCGATTGAATCCCTCCTGCCCTGTTTGTAGAATGCCTCCCGTCAAATCTCCGCCGTCCGATTCTTCCCTCTAACACCGTCCGTTGTAAATACTTCAGATTTTCTTTTTTCTTTTTTTTTCTTTTTCTTTCAAGGCTAAAACTTAGCTTTTTCTAAAGACCTACAAAAATGATGTAATATGGGTCTTTGGAATTTTTTGTATTCTTANTTTTTTTGCTTTAATATTTGAATTCTGGACTAAAAATTAAAAATTTAGAAATTTATGAAATTTATATATCTATTAAATATAAAATTGAGAGATATAATTAAAATGAATAATTTTTGGACCATTGAAATTTGAAAATCTTATCCATTTGAAACGGCAGTTGCAAATCCAATCTAAATAGCCCAACCAAGGCGCGAAAAAGAGCAACCGGTAATACAGTTAGAAGGTGAAGATGCAGAGTCTGCCTCCTCCCACTACCCTCAAAATTCCCTCCCTTTCTTCAAATCCCTCCCCTTCTCTCCAATTCCCCACATTTTCAACGAATCGTTTGATCCGCCAAATCAACGATGGCCGCCTTCGTACAGCGATCTCAACTCTCGAACACATGGTGCAGCACGGAACCCATCCTGATCTTCAAACCTATTCACTTTTCCTCAAAAGATGTATTAGAACTCGTAGTTTTGATCTTGGTAGGCTCGTTCATGAAAACCTCACTCAGTCGGACCTCCAGCTCGACTCTGTGACTCTCAATTCTTTGATTAGCTTGTACTCCAAGAGCGGGCAGTGGGAGAAAGCAAAATCCATTTTTGAGCGCATGGGAAATAGTAGGGATTTGATTTCGTGGAGTGCGATGGTCTCTTGCTTTGCCAATAACAAGATGGGGTTTGAGGCGCTTCATACGTTTCTTGATATGATCCAAAATGGTTATTACCCAAATGAGTATTGCTTTTCCGCCGCAATTCGCGCGTGTTCCACTGTTGAATTTGCATCGGTGGGTGACTCTATTTTTGGATATGTTATTAAAACTGGATATTTTGCTTCAGATGTATGCGTTGGGTGTGGCTTGATTGATATGTTTGTGAAGGGCCGCGGCGATTTGGTTTCTGCCTTTGAGGTGTTTGAGAAAATGCCGGAAAGAAATGCAGTTACTTGGACACTGATGATTACTAGATTTATGCAATTTGGGTACGCAGGGGAAGCCATTGATGTATTTTTGGATATGATATTAAGTGGATACGAACCTGATAGATTCACATTAAGTGCTGTGATATCAGCTAGTGCAAAGCTAGAATTGTTATCGTTAGGGCAGCAGTTGCACTCTCAAGCCATAAAACATGGGTTGACTCTGGATCGTTGTGTTGGTTGTTGTCTAATAAATATGTATGCTAAATGCTCTGTGGATGGATCCATGAGTGAATCCAGAAAGATTTTTGATCAGATTCTGGATCACAACGTCATCTCTTGGACTGCAATGATCACAGGATATGTTCAAAAAGGGGGATATGATAAAGAAGCTCTTGACCTTTTTCGTGGGATGATCTTGACTCACGTTCTACCCAACCATTTTACGTTTTCCAGCACTCTCAAGGCCTGTGCAAATCTAGCCGATCTACGGATTGGCGAACAGGTTTTTACTCATGCAGTAAAGCTCGGTTTCTCAATAGTTAATTGTGTTGCAAACTCACTTATTAGCATGTATGCACGATCTGGCAAAATTGATGATGCAAGGAAAGCGTTCGATATTCTGTTTGAGAAGAATTTGATTTCTTATAATACGGTAATTGATGCATATTCTAAGAACTTAAATTCTGAAGAAGCTTTTGAACTTTTCAATGAGATTGAGGATCAAGGGATGGGGGCTAGTGCTTTCACATTTGCTAGCCTTCTGAGTGGAGCAGCCAGCATTGGTACAATAGGGAAGGGCGAGCAAATTCATGCTCGCGTGATAAAGTCAGGTTTGAAGTCAAATCAATCCATATGCAATGCCTTAATCTCTATGTATTCCAAGTGTGGAGACATTGATTCTGCTTTCCAAGTTTTTGAAGACATGGACGACAGAAATGTCATCTCTTGGACTTCAATTATCACAGGGTTTGCAAAACATGGGTTTGCAACAAAAGCCTTGGAGCTGTTCCACAAGATGCTCGAGGCTGGTATTAGACCAAATGAGGTCTCCTACATTGCTGTTTTATCTGCTTGTAGTCACGTGGGTCTTGTTAATGAGGGTTGGAAACACTTCAAATCAATGTACGCAGAGCATGGAGTCACTCCGAGAATGGAACATTATGCTTGTATGGTTGACATACTGGGTCGTTCAGGATCTCTCTCTGAAGCCATTCAGTTTATCAACTCAATGCCTTTCAAAGCCGATGCACTTGTGTGGCGAACATTTCTCGGAGCATGTCGAGTTCACGACAACCTAGAATTGGGGAAACATGCTGCAAAAATGATCATCGAACAAGAGCCACAGGATCCTGCTGCATATATCTTGCTATCAAATTTGTATGCATCCACCTCGCAATGGGAAGAAGTTGCAAGTATTAGAAAGGTAATGAAACAAAAAAACTTGATCAAAGAAGCAGGCTGCAGCTGGGTAGAGATTGAAAATAAAGTACACAAGTTTTATGTGGGTGATACATCACATTCAAAAGCTGAGGAAATATATGATGAACTTGAACACTTGTCTTTAAAAATAAAGAAATTGGGATATGTCCCCAACATGGATTTTGTGCTTCATGATGTGGAGGAAGAGAAAAAGGAGAAATATTTGTTTCAGCACAGTGAAAGAATAGCAGTAGCCTTTGGTCTTATCAGCATATCTAAGTCGAAGCCCATCAGAGTTTTCAAGAATCTACGAATTTGTGGGGACTGTCACTCTGCAATCAAATACATTTCATTGGCCACAGGCAGAGAGATCATCGTTAGAGATGCAAACAGGTTTCATCATATTAAGGATGGAAGATGCTCCTGCAATGAGTATTGGTGATGATGTTAAAACTTAAAACTGAAGCTGATTTCACCATCATTTGGACAATGACTGATGCTGCAGTAACGAAACCTGGTGATTAAAGGGTGCTCGAAAACCGACGGTCGAAATCGGGTATGTCTTCTTCATCCAAAACTGGTGATTAAAGGGTGTTCCATATTCATTTTAATTTAGATAACCCCTCGTAGACATGGCTTGATTTACTTGATTTTCTACTTTCAATTCATTGTCTTTATCCTTTTTCTGATGCATAAAGATGAGTCCATATTAGAAACGTTCATAAATTAAGGGATAAGAATGGTGAATGATTCTGAGAAGAGACCATTGGTTCTGTTGCAACTACATTCTGAATACACC

mRNA sequence

AATATATAAATTTACAAGCAATGTTGTGTAATGGCCTTCCACCTATTAACATAGCTCCTTTAAGAAATGCTTCTTTTTCTCCTCTTGCTCTGTTTAGTCTGTTTCCTCTCCCTGCATCCACCTCTGAACTCTGTTTCCACTCCAGCTGCGATGGCGATTTTCAGACCATTCAATTCCCTTTCAGAATTGAAAACCAACAGCCCAAATCATGTGGGTATCCAGGCTTTGATTTGACCTGCCCCTCCACAGGTCAACCACTTCTCCATCTTCCTTCTTCAGGGGATTTCACTGTTCAGTACATAGATTACGAGAACCAAGAAATTTTGGTCAATGATCCAAACAAATGTCTTCCCCGAAAGATTCTGTCTCTCGAGCTTTCTGGGTCGCCGTTTCATGGCACAAATAGTGAAGATTTCACCTTCTTTAATTGTTCGTGGAGTGATCCAATCCCATCTGAGTTTAACTTGAATCCAATATATTGCCTCAGTGGGTTGTCGTATGCAGTTTTCGCATCACCTTCTTCATTTGTAAATGAGATTTTGTCGTCGAGTTGTGTTGCGATGAAGACTGTATCGGTGCCGTATTCATGGTCGTTTTCGACGGATTTGACGAATGATCTTCGATTGGGGTGGAAAAAGCCTAATTGTAGAAGGTGCGAGTCACACGGTGGAATATGTGGCCTCAAACCCAATTCTACCGATCAAATCCAATGCAAACACAGTGCTCAACCTCGACATGGGATCCCGAGAGGTGCTCGCTATGCTGTATCCATAGGAGTGGGAGTCCCAGCGACCATGTGCATTTTGGGCTTCTTATGTTGCTTTTGTGCTCGAGTCAGATCCTACTCGAGAGGCCGAAACTCCAGCATTGAGGCCCATTGGGTCATCTCTTCTCGGCCCACTTTAATGGGCCTAGATGAGCCCACTATCGATTCCTATCCAAAAATTGAATTGGGTGAAAGCCTGCGTCTACCAAAGCCCAATGACAATATTTGCGCAATTTGCCTGTCGGAATATCGGCCCAAAGAGACTGTGAAGAGTATACCCCAATGTCAACATTTCTTCCATCAAGATTGTATTGATGAATGGCTGCGATTGAATCCCTCCTGCCCTAATGCCTCCCGTCAAATCTCCGCCGTCCGATTCTTCCCTCTAACACCGTCCGTTAGTCTGCCTCCTCCCACTACCCTCAAAATTCCCTCCCTTTCTTCAAATCCCTCCCCTTCTCTCCAATTCCCCACATTTTCAACGAATCGTTTGATCCGCCAAATCAACGATGGCCGCCTTCGTACAGCGATCTCAACTCTCGAACACATGGTGCAGCACGGAACCCATCCTGATCTTCAAACCTATTCACTTTTCCTCAAAAGATGTATTAGAACTCGTAGTTTTGATCTTGGTAGGCTCGTTCATGAAAACCTCACTCAGTCGGACCTCCAGCTCGACTCTGTGACTCTCAATTCTTTGATTAGCTTGTACTCCAAGAGCGGGCAGTGGGAGAAAGCAAAATCCATTTTTGAGCGCATGGGAAATAGTAGGGATTTGATTTCGTGGAGTGCGATGGTCTCTTGCTTTGCCAATAACAAGATGGGGTTTGAGGCGCTTCATACGTTTCTTGATATGATCCAAAATGGTTATTACCCAAATGAGTATTGCTTTTCCGCCGCAATTCGCGCGTGTTCCACTGTTGAATTTGCATCGGTGGGTGACTCTATTTTTGGATATGTTATTAAAACTGGATATTTTGCTTCAGATGTATGCGTTGGGTGTGGCTTGATTGATATGTTTGTGAAGGGCCGCGGCGATTTGGTTTCTGCCTTTGAGGTGTTTGAGAAAATGCCGGAAAGAAATGCAGTTACTTGGACACTGATGATTACTAGATTTATGCAATTTGGGTACGCAGGGGAAGCCATTGATGTATTTTTGGATATGATATTAAGTGGATACGAACCTGATAGATTCACATTAAGTGCTGTGATATCAGCTAGTGCAAAGCTAGAATTGTTATCGTTAGGGCAGCAGTTGCACTCTCAAGCCATAAAACATGGGTTGACTCTGGATCGTTGTGTTGGTTGTTGTCTAATAAATATGTATGCTAAATGCTCTGTGGATGGATCCATGAGTGAATCCAGAAAGATTTTTGATCAGATTCTGGATCACAACGTCATCTCTTGGACTGCAATGATCACAGGATATGTTCAAAAAGGGGGATATGATAAAGAAGCTCTTGACCTTTTTCGTGGGATGATCTTGACTCACGTTCTACCCAACCATTTTACGTTTTCCAGCACTCTCAAGGCCTGTGCAAATCTAGCCGATCTACGGATTGGCGAACAGGTTTTTACTCATGCAGTAAAGCTCGGTTTCTCAATAGTTAATTGTGTTGCAAACTCACTTATTAGCATGTATGCACGATCTGGCAAAATTGATGATGCAAGGAAAGCGTTCGATATTCTGTTTGAGAAGAATTTGATTTCTTATAATACGGTAATTGATGCATATTCTAAGAACTTAAATTCTGAAGAAGCTTTTGAACTTTTCAATGAGATTGAGGATCAAGGGATGGGGGCTAGTGCTTTCACATTTGCTAGCCTTCTGAGTGGAGCAGCCAGCATTGGGATAAGAATGGTGAATGATTCTGAGAAGAGACCATTGGTTCTGTTGCAACTACATTCTGAATACACC

Coding sequence (CDS)

ATGTTGTGTAATGGCCTTCCACCTATTAACATAGCTCCTTTAAGAAATGCTTCTTTTTCTCCTCTTGCTCTGTTTAGTCTGTTTCCTCTCCCTGCATCCACCTCTGAACTCTGTTTCCACTCCAGCTGCGATGGCGATTTTCAGACCATTCAATTCCCTTTCAGAATTGAAAACCAACAGCCCAAATCATGTGGGTATCCAGGCTTTGATTTGACCTGCCCCTCCACAGGTCAACCACTTCTCCATCTTCCTTCTTCAGGGGATTTCACTGTTCAGTACATAGATTACGAGAACCAAGAAATTTTGGTCAATGATCCAAACAAATGTCTTCCCCGAAAGATTCTGTCTCTCGAGCTTTCTGGGTCGCCGTTTCATGGCACAAATAGTGAAGATTTCACCTTCTTTAATTGTTCGTGGAGTGATCCAATCCCATCTGAGTTTAACTTGAATCCAATATATTGCCTCAGTGGGTTGTCGTATGCAGTTTTCGCATCACCTTCTTCATTTGTAAATGAGATTTTGTCGTCGAGTTGTGTTGCGATGAAGACTGTATCGGTGCCGTATTCATGGTCGTTTTCGACGGATTTGACGAATGATCTTCGATTGGGGTGGAAAAAGCCTAATTGTAGAAGGTGCGAGTCACACGGTGGAATATGTGGCCTCAAACCCAATTCTACCGATCAAATCCAATGCAAACACAGTGCTCAACCTCGACATGGGATCCCGAGAGGTGCTCGCTATGCTGTATCCATAGGAGTGGGAGTCCCAGCGACCATGTGCATTTTGGGCTTCTTATGTTGCTTTTGTGCTCGAGTCAGATCCTACTCGAGAGGCCGAAACTCCAGCATTGAGGCCCATTGGGTCATCTCTTCTCGGCCCACTTTAATGGGCCTAGATGAGCCCACTATCGATTCCTATCCAAAAATTGAATTGGGTGAAAGCCTGCGTCTACCAAAGCCCAATGACAATATTTGCGCAATTTGCCTGTCGGAATATCGGCCCAAAGAGACTGTGAAGAGTATACCCCAATGTCAACATTTCTTCCATCAAGATTGTATTGATGAATGGCTGCGATTGAATCCCTCCTGCCCTAATGCCTCCCGTCAAATCTCCGCCGTCCGATTCTTCCCTCTAACACCGTCCGTTAGTCTGCCTCCTCCCACTACCCTCAAAATTCCCTCCCTTTCTTCAAATCCCTCCCCTTCTCTCCAATTCCCCACATTTTCAACGAATCGTTTGATCCGCCAAATCAACGATGGCCGCCTTCGTACAGCGATCTCAACTCTCGAACACATGGTGCAGCACGGAACCCATCCTGATCTTCAAACCTATTCACTTTTCCTCAAAAGATGTATTAGAACTCGTAGTTTTGATCTTGGTAGGCTCGTTCATGAAAACCTCACTCAGTCGGACCTCCAGCTCGACTCTGTGACTCTCAATTCTTTGATTAGCTTGTACTCCAAGAGCGGGCAGTGGGAGAAAGCAAAATCCATTTTTGAGCGCATGGGAAATAGTAGGGATTTGATTTCGTGGAGTGCGATGGTCTCTTGCTTTGCCAATAACAAGATGGGGTTTGAGGCGCTTCATACGTTTCTTGATATGATCCAAAATGGTTATTACCCAAATGAGTATTGCTTTTCCGCCGCAATTCGCGCGTGTTCCACTGTTGAATTTGCATCGGTGGGTGACTCTATTTTTGGATATGTTATTAAAACTGGATATTTTGCTTCAGATGTATGCGTTGGGTGTGGCTTGATTGATATGTTTGTGAAGGGCCGCGGCGATTTGGTTTCTGCCTTTGAGGTGTTTGAGAAAATGCCGGAAAGAAATGCAGTTACTTGGACACTGATGATTACTAGATTTATGCAATTTGGGTACGCAGGGGAAGCCATTGATGTATTTTTGGATATGATATTAAGTGGATACGAACCTGATAGATTCACATTAAGTGCTGTGATATCAGCTAGTGCAAAGCTAGAATTGTTATCGTTAGGGCAGCAGTTGCACTCTCAAGCCATAAAACATGGGTTGACTCTGGATCGTTGTGTTGGTTGTTGTCTAATAAATATGTATGCTAAATGCTCTGTGGATGGATCCATGAGTGAATCCAGAAAGATTTTTGATCAGATTCTGGATCACAACGTCATCTCTTGGACTGCAATGATCACAGGATATGTTCAAAAAGGGGGATATGATAAAGAAGCTCTTGACCTTTTTCGTGGGATGATCTTGACTCACGTTCTACCCAACCATTTTACGTTTTCCAGCACTCTCAAGGCCTGTGCAAATCTAGCCGATCTACGGATTGGCGAACAGGTTTTTACTCATGCAGTAAAGCTCGGTTTCTCAATAGTTAATTGTGTTGCAAACTCACTTATTAGCATGTATGCACGATCTGGCAAAATTGATGATGCAAGGAAAGCGTTCGATATTCTGTTTGAGAAGAATTTGATTTCTTATAATACGGTAATTGATGCATATTCTAAGAACTTAAATTCTGAAGAAGCTTTTGAACTTTTCAATGAGATTGAGGATCAAGGGATGGGGGCTAGTGCTTTCACATTTGCTAGCCTTCTGAGTGGAGCAGCCAGCATTGGGATAAGAATGGTGAATGATTCTGAGAAGAGACCATTGGTTCTGTTGCAACTACATTCTGAATACACC

Protein sequence

MLCNGLPPINIAPLRNASFSPLALFSLFPLPASTSELCFHSSCDGDFQTIQFPFRIENQQPKSCGYPGFDLTCPSTGQPLLHLPSSGDFTVQYIDYENQEILVNDPNKCLPRKILSLELSGSPFHGTNSEDFTFFNCSWSDPIPSEFNLNPIYCLSGLSYAVFASPSSFVNEILSSSCVAMKTVSVPYSWSFSTDLTNDLRLGWKKPNCRRCESHGGICGLKPNSTDQIQCKHSAQPRHGIPRGARYAVSIGVGVPATMCILGFLCCFCARVRSYSRGRNSSIEAHWVISSRPTLMGLDEPTIDSYPKIELGESLRLPKPNDNICAICLSEYRPKETVKSIPQCQHFFHQDCIDEWLRLNPSCPNASRQISAVRFFPLTPSVSLPPPTTLKIPSLSSNPSPSLQFPTFSTNRLIRQINDGRLRTAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFASLLSGAASIGIRMVNDSEKRPLVLLQLHSEYT
BLAST of Cp4.1LG14g06660 vs. Swiss-Prot
Match: PP272_ARATH (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 1.1e-137
Identity = 260/491 (52.95%), Postives = 343/491 (69.86%), Query Frame = 1

Query: 381 SVSLPPPTTLKIPSLSSNPSPSLQFPTFSTNRLI-RQINDGRLRTAISTLEHMVQHGTHP 440
           S S P P  L I S      PS+       +RLI R +N G LR A+S L+ M + G  P
Sbjct: 5   SFSFPSPAKLPIKS-----QPSVSNRINVADRLILRHLNAGDLRGAVSALDLMARDGIRP 64

Query: 441 -DLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSI 500
            D  T+S  LK CIR R F LG+LVH  L + D++ DSV  NSLISLYSKSG   KA+ +
Sbjct: 65  MDSVTFSSLLKSCIRARDFRLGKLVHARLIEFDIEPDSVLYNSLISLYSKSGDSAKAEDV 124

Query: 501 FERMGN--SRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTV 560
           FE M     RD++SWSAM++C+ NN    +A+  F++ ++ G  PN+YC++A IRACS  
Sbjct: 125 FETMRRFGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGLVPNDYCYTAVIRACSNS 184

Query: 561 EFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTL 620
           +F  VG    G+++KTG+F SDVCVGC LIDMFVKG     +A++VF+KM E N VTWTL
Sbjct: 185 DFVGVGRVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTL 244

Query: 621 MITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHG 680
           MITR MQ G+  EAI  FLDM+LSG+E D+FTLS+V SA A+LE LSLG+QLHS AI+ G
Sbjct: 245 MITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELENLSLGKQLHSWAIRSG 304

Query: 681 LTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEAL 740
           L  D  V C L++MYAKCS DGS+ + RK+FD++ DH+V+SWTA+ITGY++      EA+
Sbjct: 305 LVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKNCNLATEAI 364

Query: 741 DLFRGMILT-HVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISM 800
           +LF  MI   HV PNHFTFSS  KAC NL+D R+G+QV   A K G +  + VANS+ISM
Sbjct: 365 NLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVANSVISM 424

Query: 801 YARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTF 860
           + +S +++DA++AF+ L EKNL+SYNT +D   +NLN E+AF+L +EI ++ +G SAFTF
Sbjct: 425 FVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTF 484

Query: 861 ASLLSGAASIG 867
           ASLLSG A++G
Sbjct: 485 ASLLSGVANVG 488

BLAST of Cp4.1LG14g06660 vs. Swiss-Prot
Match: PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 6.6e-61
Identity = 158/472 (33.47%), Postives = 245/472 (51.91%), Query Frame = 1

Query: 456 SFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMV 515
           S + GR V + + Q ++     T NS+++  +K G  ++A S+F  M   RD  +W++MV
Sbjct: 70  SLEDGRQVFDKMPQRNIY----TWNSVVTGLTKLGFLDEADSLFRSMPE-RDQCTWNSMV 129

Query: 516 SCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYF 575
           S FA +    EAL  F  M + G+  NEY F++ + ACS +   + G  +   + K+  F
Sbjct: 130 SGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSP-F 189

Query: 576 ASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFL 635
            SDV +G  L+DM+ K  G++  A  VF++M +RN V+W  +IT F Q G A EA+DVF 
Sbjct: 190 LSDVYIGSALVDMYSKC-GNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQ 249

Query: 636 DMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHG-LTLDRCVGCCLINMYAKC 695
            M+ S  EPD  TL++VISA A L  + +GQ++H + +K+  L  D  +    ++MYAKC
Sbjct: 250 MMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC 309

Query: 696 S----------------------------VDGSMSESRKIFDQILDHNVISWTAMITGYV 755
           S                            +  S   +R +F ++ + NV+SW A+I GY 
Sbjct: 310 SRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYT 369

Query: 756 QKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVN 815
           Q G  ++EAL LF  +    V P H++F++ LKACA+LA+L +G Q   H +K GF   +
Sbjct: 370 QNGE-NEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQS 429

Query: 816 ------CVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELF 875
                  V NSLI MY + G +++    F  + E++ +S+N +I  +++N    EA ELF
Sbjct: 430 GEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELF 489

Query: 876 NEIEDQGMGASAFTFASLLSGAASIGI----RMVNDSEKRPLVLLQLHSEYT 889
            E+ + G      T   +LS     G     R    S  R   +  L   YT
Sbjct: 490 REMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYT 533

BLAST of Cp4.1LG14g06660 vs. Swiss-Prot
Match: PP319_ARATH (Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN=PCMP-A2 PE=2 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 1.5e-60
Identity = 144/442 (32.58%), Postives = 237/442 (53.62%), Query Frame = 1

Query: 425 AISTLEHMVQHGTH-PDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLI 484
           A +  E  V+HG    + + +   L  C R   F+LGR VH N+ +  +  + +  +SL+
Sbjct: 167 AFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQVHGNMVKVGVG-NLIVESSLV 226

Query: 485 SLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNE 544
             Y++ G+   A   F+ M   +D+ISW+A++S  +    G +A+  F+ M+ + + PNE
Sbjct: 227 YFYAQCGELTSALRAFDMM-EEKDVISWTAVISACSRKGHGIKAIGMFIGMLNHWFLPNE 286

Query: 545 YCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVF 604
           +   + ++ACS  +    G  +   V+K     +DV VG  L+DM+ K  G++    +VF
Sbjct: 287 FTVCSILKACSEEKALRFGRQVHSLVVKR-MIKTDVFVGTSLMDMYAKC-GEISDCRKVF 346

Query: 605 EKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLS 664
           + M  RN VTWT +I    + G+  EAI +F  M       +  T+ +++ A   +  L 
Sbjct: 347 DGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNLTVVSILRACGSVGALL 406

Query: 665 LGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMIT 724
           LG++LH+Q IK+ +  +  +G  L+ +Y KC   G   ++  +  Q+   +V+SWTAMI+
Sbjct: 407 LGKELHAQIIKNSIEKNVYIGSTLVWLYCKC---GESRDAFNVLQQLPSRDVVSWTAMIS 466

Query: 725 GYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFS 784
           G     G++ EALD  + MI   V PN FT+SS LKACAN   L IG  + + A K    
Sbjct: 467 G-CSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIGRSIHSIAKKNHAL 526

Query: 785 IVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEI 844
               V ++LI MYA+ G + +A + FD + EKNL+S+  +I  Y++N    EA +L   +
Sbjct: 527 SNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNGFCREALKLMYRM 586

Query: 845 EDQGMGASAFTFASLLSGAASI 866
           E +G     + FA++LS    I
Sbjct: 587 EAEGFEVDDYIFATILSTCGDI 600

BLAST of Cp4.1LG14g06660 vs. Swiss-Prot
Match: PP280_ARATH (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 1.1e-58
Identity = 137/451 (30.38%), Postives = 239/451 (52.99%), Query Frame = 1

Query: 419 DGRLRTAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVT 478
           +G+   AI     M+Q    PD   +   +K C  +    LG+ +H  + + +     + 
Sbjct: 146 NGQGAEAIRLYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIA 205

Query: 479 LNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNG 538
            N+LI++Y +  Q   A  +F  +   +DLISWS++++ F+     FEAL    +M+  G
Sbjct: 206 QNALIAMYVRFNQMSDASRVFYGIP-MKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFG 265

Query: 539 -YYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLV 598
            ++PNEY F ++++ACS++     G  I G  IK+   A +   GC L DM+ +  G L 
Sbjct: 266 VFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKS-ELAGNAIAGCSLCDMYAR-CGFLN 325

Query: 599 SAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASA 658
           SA  VF+++   +  +W ++I      GYA EA+ VF  M  SG+ PD  +L +++ A  
Sbjct: 326 SARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQT 385

Query: 659 KLELLSLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDH-NVI 718
           K   LS G Q+HS  IK G   D  V   L+ MY  CS    +     +F+   ++ + +
Sbjct: 386 KPMALSQGMQIHSYIIKWGFLADLTVCNSLLTMYTFCS---DLYCCFNLFEDFRNNADSV 445

Query: 719 SWTAMITGYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTH 778
           SW  ++T  +Q      E L LF+ M+++   P+H T  + L+ C  ++ L++G QV  +
Sbjct: 446 SWNTILTACLQH-EQPVEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCY 505

Query: 779 AVKLGFSIVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEA 838
           ++K G +    + N LI MYA+ G +  AR+ FD +  ++++S++T+I  Y+++   EEA
Sbjct: 506 SLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEA 565

Query: 839 FELFNEIEDQGMGASAFTFASLLSGAASIGI 868
             LF E++  G+  +  TF  +L+  + +G+
Sbjct: 566 LILFKEMKSAGIEPNHVTFVGVLTACSHVGL 589

BLAST of Cp4.1LG14g06660 vs. Swiss-Prot
Match: PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 228.8 bits (582), Expect = 2.3e-58
Identity = 145/465 (31.18%), Postives = 246/465 (52.90%), Query Frame = 1

Query: 408 FSTNRLIRQINDGRLRTAISTL-EHMVQHGTHPDLQTYSLFLKRCIR-TRSFDLGRLVHE 467
           F+ N++I+++    L   +  L   MV     P+  T+S  L+ C   + +FD+   +H 
Sbjct: 152 FTWNKMIKELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHA 211

Query: 468 NLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGF 527
            +    L+  +V  N LI LYS++G  + A+ +F+ +   +D  SW AM+S  + N+   
Sbjct: 212 RILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGL-RLKDHSSWVAMISGLSKNECEA 271

Query: 528 EALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGL 587
           EA+  F DM   G  P  Y FS+ + AC  +E   +G+ + G V+K G F+SD  V   L
Sbjct: 272 EAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNAL 331

Query: 588 IDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPD 647
           + ++    G+L+SA  +F  M +R+AVT+  +I    Q GY  +A+++F  M L G EPD
Sbjct: 332 VSLYFH-LGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPD 391

Query: 648 RFTLSAVISASAKLELLSLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRK 707
             TL++++ A +    L  GQQLH+   K G   +  +   L+N+YAKC+    +  +  
Sbjct: 392 SNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCA---DIETALD 451

Query: 708 IFDQILDHNVISWTAMITGYVQKGGYD--KEALDLFRGMILTHVLPNHFTFSSTLKACAN 767
            F +    NV+ W  M+  Y   G  D  + +  +FR M +  ++PN +T+ S LK C  
Sbjct: 452 YFLETEVENVVLWNVMLVAY---GLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIR 511

Query: 768 LADLRIGEQVFTHAVKLGFSIVNCVANSLISMYARSGKIDDARKAFDILFE---KNLISY 827
           L DL +GEQ+ +  +K  F +   V + LI MYA+ GK+D    A+DIL     K+++S+
Sbjct: 512 LGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLD---TAWDILIRFAGKDVVSW 571

Query: 828 NTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFASLLSGAASI 866
            T+I  Y++    ++A   F ++ D+G+ +      + +S  A +
Sbjct: 572 TTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 604

BLAST of Cp4.1LG14g06660 vs. TrEMBL
Match: A0A0A0LBE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G603610 PE=4 SV=1)

HSP 1 Score: 851.7 bits (2199), Expect = 8.2e-244
Identity = 427/489 (87.32%), Postives = 451/489 (92.23%), Query Frame = 1

Query: 383 SLPPPTTLKIPSLSSNPSPSLQFPTFS-----TNRLIRQINDGRLRTAISTLEHMVQHGT 442
           SLP PTTLKIP  SSNPS SLQFPTF+     T RLI++IN+GRL  AISTLEHMV  G+
Sbjct: 3   SLPLPTTLKIPFPSSNPSSSLQFPTFTNPNPLTGRLIQEINNGRLHKAISTLEHMVHQGS 62

Query: 443 HPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKS 502
           HPDLQTYSLFLK+CIRTRSFD+G LVHE LTQSDLQLDSVTLNSLISLYSK GQWEKA S
Sbjct: 63  HPDLQTYSLFLKKCIRTRSFDIGTLVHEKLTQSDLQLDSVTLNSLISLYSKCGQWEKATS 122

Query: 503 IFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVE 562
           IF+ MG+SRDLISWSAMVSCFANN MGF AL TF+DMI+NGYYPNEYCF+AA RACST E
Sbjct: 123 IFQLMGSSRDLISWSAMVSCFANNNMGFRALLTFVDMIENGYYPNEYCFAAATRACSTAE 182

Query: 563 FASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLM 622
           F SVGDSIFG+V+KTGY  SDVCVGCGLIDMFVKGRGDLVSAF+VFEKMPERNAVTWTLM
Sbjct: 183 FVSVGDSIFGFVVKTGYLQSDVCVGCGLIDMFVKGRGDLVSAFKVFEKMPERNAVTWTLM 242

Query: 623 ITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGL 682
           ITR MQFGYAGEAID+FL+MILSGYEPDRFTLS VISA A +ELL LGQQLHSQAI+HGL
Sbjct: 243 ITRLMQFGYAGEAIDLFLEMILSGYEPDRFTLSGVISACANMELLLLGQQLHSQAIRHGL 302

Query: 683 TLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALD 742
           TLDRCVGCCLINMYAKCSVDGSM  +RKIFDQILDHNV SWTAMITGYVQKGGYD+EALD
Sbjct: 303 TLDRCVGCCLINMYAKCSVDGSMCAARKIFDQILDHNVFSWTAMITGYVQKGGYDEEALD 362

Query: 743 LFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYA 802
           LFRGMILTHV+PNHFTFSSTLKACANLA LRIGEQVFTHAVKLGFS VNCVANSLISMYA
Sbjct: 363 LFRGMILTHVIPNHFTFSSTLKACANLAALRIGEQVFTHAVKLGFSSVNCVANSLISMYA 422

Query: 803 RSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFAS 862
           RSG+IDDARKAFDILFEKNLISYNTVIDAY+KNLNSEEA ELFNEIEDQGMGASAFTFAS
Sbjct: 423 RSGRIDDARKAFDILFEKNLISYNTVIDAYAKNLNSEEALELFNEIEDQGMGASAFTFAS 482

Query: 863 LLSGAASIG 867
           LLSGAASIG
Sbjct: 483 LLSGAASIG 491

BLAST of Cp4.1LG14g06660 vs. TrEMBL
Match: B9IDW4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s02060g PE=4 SV=2)

HSP 1 Score: 612.5 bits (1578), Expect = 8.4e-172
Identity = 288/444 (64.86%), Postives = 360/444 (81.08%), Query Frame = 1

Query: 423 RTAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSL 482
           + AISTL+ M   GTHPDL TYSL LK CIR+ ++ LG LVH  LTQS L+LDSV LNSL
Sbjct: 130 KKAISTLDQMSLQGTHPDLITYSLLLKSCIRSHNYQLGHLVHHRLTQSGLELDSVILNSL 189

Query: 483 ISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPN 542
           ISLYSK G W++A  IFE MGN RDL+SWSA++SC+ANN+  FEA+  F DM++ G+YPN
Sbjct: 190 ISLYSKCGDWQQAHEIFESMGNKRDLVSWSALISCYANNEKAFEAISAFFDMLECGFYPN 249

Query: 543 EYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEV 602
           EYCF+   RACS  E  S+G  IFG+++KTGYF SDVCVGC LIDMFVKG GDL SA++V
Sbjct: 250 EYCFTGVFRACSNKENISLGKIIFGFLLKTGYFESDVCVGCALIDMFVKGNGDLESAYKV 309

Query: 603 FEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELL 662
           F++MP+RN VTWTLMITRF Q G++ +A+D+FLDM+LSGY PDRFTLS V+SA A++ LL
Sbjct: 310 FDRMPDRNVVTWTLMITRFQQLGFSRDAVDLFLDMVLSGYVPDRFTLSGVVSACAEMGLL 369

Query: 663 SLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMI 722
           SLG+Q H   +K GL LD CVGC L++MYAKC  DGS+ ++RK+FD++  HNV+SWTA+I
Sbjct: 370 SLGRQFHCLVMKSGLDLDVCVGCSLVDMYAKCVADGSVDDARKVFDRMPVHNVMSWTAII 429

Query: 723 TGYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGF 782
           TGYVQ GG D+EA++LF  M+   V PNHFTFSS LKACANL+D+ +GEQV+   VK+  
Sbjct: 430 TGYVQSGGCDREAIELFLEMVQGQVKPNHFTFSSVLKACANLSDIWLGEQVYALVVKMRL 489

Query: 783 SIVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNE 842
           + +NCV NSLISMY+R G +++ARKAFD+LFEKNL+SYNT+++AY+K+LNSEEAFELFNE
Sbjct: 490 ASINCVGNSLISMYSRCGNMENARKAFDVLFEKNLVSYNTIVNAYAKSLNSEEAFELFNE 549

Query: 843 IEDQGMGASAFTFASLLSGAASIG 867
           IE  G G +AFTFASLLSGA+SIG
Sbjct: 550 IEGAGTGVNAFTFASLLSGASSIG 573

BLAST of Cp4.1LG14g06660 vs. TrEMBL
Match: M5WX51_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001611mg PE=4 SV=1)

HSP 1 Score: 609.4 bits (1570), Expect = 7.1e-171
Identity = 293/434 (67.51%), Postives = 355/434 (81.80%), Query Frame = 1

Query: 432 MVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQ 491
           M Q GTHPDL  YSL LK CIR+R+FDLGRLVH  L  S L+LD V LNSLISLYSKS  
Sbjct: 1   MAQRGTHPDLPIYSLLLKSCIRSRNFDLGRLVHARLVHSQLELDPVVLNSLISLYSKSRD 60

Query: 492 WEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIR 551
           W+KA SIFE MGN R+L+SWSAMVSCFANN MG EA+ TFLDM+++G+YPNEYCF++ IR
Sbjct: 61  WKKANSIFENMGNKRNLVSWSAMVSCFANNDMGLEAILTFLDMLEDGFYPNEYCFASVIR 120

Query: 552 ACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNA 611
           ACS  +   +G+ IFG VIK+GY  SDVCVGC LIDMF KG G+L  A++VFE MPE +A
Sbjct: 121 ACSNAQNIRIGNIIFGSVIKSGYLGSDVCVGCSLIDMFAKGSGELDDAYKVFETMPETDA 180

Query: 612 VTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQ 671
           VTWTLMITR  Q G  GEAID+++DM+ SG  PD+FTLS VISA  KL+ LSLGQQLHS 
Sbjct: 181 VTWTLMITRLAQMGCPGEAIDLYVDMLWSGLMPDQFTLSGVISACTKLDSLSLGQQLHSW 240

Query: 672 AIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGY 731
            I+ GL L  CVGCCL++MYAKC+ DGSM ++RK+FD++ +HNV+SWT++I GYVQ G  
Sbjct: 241 VIRSGLALGHCVGCCLVDMYAKCAADGSMDDARKVFDRMPNHNVLSWTSIINGYVQSGEG 300

Query: 732 DKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANS 791
           D+EA+ LF GM+  HV PNHFTFSS LKACANL+DLR G+QV + AVKLG + VNCV NS
Sbjct: 301 DEEAIKLFVGMMTGHVPPNHFTFSSILKACANLSDLRKGDQVHSLAVKLGLASVNCVGNS 360

Query: 792 LISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGAS 851
           LISMY+RSG+++DARKAFDIL+EKNLISYNT++DAY+K+ ++EEAF +F+EI+D G GAS
Sbjct: 361 LISMYSRSGQVEDARKAFDILYEKNLISYNTIVDAYAKHSDTEEAFGIFHEIQDTGFGAS 420

Query: 852 AFTFASLLSGAASI 866
           AFTF+SLLSGAASI
Sbjct: 421 AFTFSSLLSGAASI 434

BLAST of Cp4.1LG14g06660 vs. TrEMBL
Match: A0A061G281_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_015369 PE=4 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 4.6e-170
Identity = 302/494 (61.13%), Postives = 379/494 (76.72%), Query Frame = 1

Query: 380 PSVSLPPPTTLKI---PSLSSNPSPSLQFPTFST--NRLIRQINDGRLRTAISTLEHMVQ 439
           PS + PPP +LK    P  +  P   ++   F T  NRLI  +++G L  A+STL+ M +
Sbjct: 9   PSPAKPPPHSLKPSTRPRQTLAPPSVIRPVNFETLRNRLINHLDEGHLHKAVSTLDVMAR 68

Query: 440 HGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEK 499
             THPDL TYSL LK CIR+R F LG++VH NL QS L+LDSV  NSLISLYSKSG W +
Sbjct: 69  QNTHPDLITYSLLLKACIRSRDFQLGKIVHTNLNQSKLELDSVLFNSLISLYSKSGDWAR 128

Query: 500 AKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACS 559
           A  IF+RM + RDL+SWSAM+SCFANNKM F+A+ TFLDM++NG+YPNEYCF+A +RACS
Sbjct: 129 AHKIFQRMEDKRDLVSWSAMISCFANNKMEFKAILTFLDMLENGFYPNEYCFTAVVRACS 188

Query: 560 TVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTW 619
             EF S+G+ I G+++K+GY  SD  VGC LIDMFVKG  DL SAF+VF+KMP +N V W
Sbjct: 189 KAEFFSIGEIILGFLVKSGYLESDTNVGCALIDMFVKGNSDLASAFKVFDKMPAKNVVAW 248

Query: 620 TLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKL--ELLSLGQQLHSQA 679
           TLMITR  Q GY  +AID+FLDM+L GY PDRFTLS +ISA  +L  E LSLG+QLHS  
Sbjct: 249 TLMITRCTQLGYPRDAIDLFLDMVLGGYVPDRFTLSGIISACTELESESLSLGKQLHSWV 308

Query: 680 IKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYD 739
           I+ G  LD C+GC L++MYAKC+V GS+ +SRK+F ++ +HNV+SWTA+ITGYVQ GG D
Sbjct: 309 IRSGFALDVCIGCSLVDMYAKCTVGGSLDDSRKVFGRMEEHNVMSWTAIITGYVQCGGRD 368

Query: 740 KEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSL 799
           KEAL+LF  M+   V PNHFTFSS LKAC NL+D   GEQ + HAVK GF+  +CV NSL
Sbjct: 369 KEALELFSKMMGGPVQPNHFTFSSVLKACGNLSDSCTGEQFYAHAVKHGFASDDCVGNSL 428

Query: 800 ISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASA 859
           ISMYARSG++D+A+KAF+ LFEKNL+SYNT++DA +KNL+SE AFELF+E+ D  +  +A
Sbjct: 429 ISMYARSGRMDNAQKAFESLFEKNLVSYNTIVDACAKNLDSEGAFELFHELTDSKIELNA 488

Query: 860 FTFASLLSGAASIG 867
           FTFASLLSGA+S+G
Sbjct: 489 FTFASLLSGASSVG 502

BLAST of Cp4.1LG14g06660 vs. TrEMBL
Match: V4RMD3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10006927mg PE=4 SV=1)

HSP 1 Score: 602.8 bits (1553), Expect = 6.6e-169
Identity = 307/503 (61.03%), Postives = 385/503 (76.54%), Query Frame = 1

Query: 381 SVSLPPPTTLKIPSLSS----NPS----------PSLQFPTFS---TNRLIRQINDGRLR 440
           ++SLP P   KIP LSS    NPS          P +  PT S   +NRLI  +N+GR++
Sbjct: 3   TLSLPAPA--KIPPLSSFKPSNPSRQNLPPSSSPPFIAQPTTSEPLSNRLIYHLNEGRVQ 62

Query: 441 TAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLI 500
            AI TL+ M Q G HPDL TYSL LK CIR+R+F LG+LVH  LT+S L+ +SV LNSLI
Sbjct: 63  KAIFTLDLMTQKGNHPDLDTYSLLLKSCIRSRNFHLGKLVHSLLTRSKLEPNSVILNSLI 122

Query: 501 SLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNE 560
           SLYSK G   +A  IF+ MGN RD++SWS+M+S + N     +A+H F++M++ G+ PNE
Sbjct: 123 SLYSKCGDLNEANKIFKSMGNKRDIVSWSSMISSYVNRGKQVDAIHMFVEMLELGFCPNE 182

Query: 561 YCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVF 620
           YCFSA IRACS  E  ++G  I+G+++K GYF SDVCVGC LIDMFVKG  DL SA++VF
Sbjct: 183 YCFSAVIRACSNTENVAIGHIIYGFLLKCGYFDSDVCVGCALIDMFVKGSVDLESAYKVF 242

Query: 621 EKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLS 680
           +KM E+N V WTLMITR  Q G   +AI +FLDMILSG+ PDRFTLS V+SA ++LEL +
Sbjct: 243 DKMTEKNTVGWTLMITRCTQLGCPRDAIRLFLDMILSGFLPDRFTLSGVVSACSELELFT 302

Query: 681 LGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMIT 740
            G+QLHS AI+ GL LD CVGC L++MYAKC+VDGS+ +SRK+FD++LDHNV+SWTA+IT
Sbjct: 303 SGKQLHSWAIRTGLALDVCVGCSLVDMYAKCTVDGSVDDSRKVFDRMLDHNVMSWTAIIT 362

Query: 741 GYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFS 800
           GYVQ GG DKEA+ LF  MI   V PNHFTF+S LKAC NL D  + EQV+THAVK G +
Sbjct: 363 GYVQSGGRDKEAVKLFSDMIQGQVAPNHFTFASVLKACGNLLDSSVAEQVYTHAVKRGRA 422

Query: 801 IVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEI 860
           + +CV NSLISMYARSG+++DARKAF+ LFEKNL+SYNT++DAY+KNLNSE+AFEL +EI
Sbjct: 423 LDDCVGNSLISMYARSGRMEDARKAFESLFEKNLVSYNTMVDAYAKNLNSEKAFELLHEI 482

Query: 861 EDQGMGASAFTFASLLSGAASIG 867
           ED G+G SA+TFASLLSGA+SIG
Sbjct: 483 EDTGVGTSAYTFASLLSGASSIG 503

BLAST of Cp4.1LG14g06660 vs. TAIR10
Match: AT3G49170.1 (AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 492.3 bits (1266), Expect = 6.4e-139
Identity = 260/491 (52.95%), Postives = 343/491 (69.86%), Query Frame = 1

Query: 381 SVSLPPPTTLKIPSLSSNPSPSLQFPTFSTNRLI-RQINDGRLRTAISTLEHMVQHGTHP 440
           S S P P  L I S      PS+       +RLI R +N G LR A+S L+ M + G  P
Sbjct: 5   SFSFPSPAKLPIKS-----QPSVSNRINVADRLILRHLNAGDLRGAVSALDLMARDGIRP 64

Query: 441 -DLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSI 500
            D  T+S  LK CIR R F LG+LVH  L + D++ DSV  NSLISLYSKSG   KA+ +
Sbjct: 65  MDSVTFSSLLKSCIRARDFRLGKLVHARLIEFDIEPDSVLYNSLISLYSKSGDSAKAEDV 124

Query: 501 FERMGN--SRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTV 560
           FE M     RD++SWSAM++C+ NN    +A+  F++ ++ G  PN+YC++A IRACS  
Sbjct: 125 FETMRRFGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGLVPNDYCYTAVIRACSNS 184

Query: 561 EFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTL 620
           +F  VG    G+++KTG+F SDVCVGC LIDMFVKG     +A++VF+KM E N VTWTL
Sbjct: 185 DFVGVGRVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTL 244

Query: 621 MITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHG 680
           MITR MQ G+  EAI  FLDM+LSG+E D+FTLS+V SA A+LE LSLG+QLHS AI+ G
Sbjct: 245 MITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELENLSLGKQLHSWAIRSG 304

Query: 681 LTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEAL 740
           L  D  V C L++MYAKCS DGS+ + RK+FD++ DH+V+SWTA+ITGY++      EA+
Sbjct: 305 LVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKNCNLATEAI 364

Query: 741 DLFRGMILT-HVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISM 800
           +LF  MI   HV PNHFTFSS  KAC NL+D R+G+QV   A K G +  + VANS+ISM
Sbjct: 365 NLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVANSVISM 424

Query: 801 YARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTF 860
           + +S +++DA++AF+ L EKNL+SYNT +D   +NLN E+AF+L +EI ++ +G SAFTF
Sbjct: 425 FVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTF 484

Query: 861 ASLLSGAASIG 867
           ASLLSG A++G
Sbjct: 485 ASLLSGVANVG 488

BLAST of Cp4.1LG14g06660 vs. TAIR10
Match: AT5G53110.1 (AT5G53110.1 RING/U-box superfamily protein)

HSP 1 Score: 338.6 bits (867), Expect = 1.2e-92
Identity = 182/375 (48.53%), Postives = 239/375 (63.73%), Query Frame = 1

Query: 18  SFSPLALFSLFPLPASTSEL--CFHSSCDGDFQTIQFPFRIENQQPKSCGYP-GFDLTCP 77
           SFS L L  L P   +T+    C ++ C  D   I+FPFR+++QQ  SCGY  GFDLTC 
Sbjct: 8   SFSLLFLSLLIPTTTTTTSTVTCTNAVCRRDGPIIRFPFRLKHQQSHSCGYDKGFDLTCD 67

Query: 78  --STGQPLLHLPSSGDFTVQYIDYENQEILVNDPNKCLPRKILSLELSGSPFHGTNSEDF 137
             +  +  + LP SG+FTV+ IDY  QEI +NDPN CLP++IL L L+ +PF G     F
Sbjct: 68  INAGNRTTITLPFSGNFTVEEIDYAAQEIWINDPNNCLPQRILQLNLNSTPFSGVYMRQF 127

Query: 138 TFFNCSWSDPIPSEFNLNPIYCLSGLSYAVFASPSS-FVNEILSSSCVAMKTVSVPYSWS 197
           TFFNC  S+ +     LNPI CLSG +  VFA+PS   +N + S SC  MKTV VP  W 
Sbjct: 128 TFFNCPTSEYLRFR-PLNPITCLSGKNSTVFATPSPRVINYLSSQSCRLMKTVYVPVRWP 187

Query: 198 F------STDLTNDLRLGWKKPNCRRCESHGGICGLKPNSTDQIQCKHSAQPRHGIPRGA 257
           F      S+DL+++L L W+ P C RCE  GG CG+K NS+ +I C H  +P   IPR A
Sbjct: 188 FYEQIVSSSDLSDNLWLTWRVPRCSRCEIKGGKCGIKSNSSREIICSHVHKP--AIPRRA 247

Query: 258 RYAVSIGVGVPATMCILGFLCCFCARVRSYSRGRN-------SSIEAHWVISSRPTLMGL 317
           RYA+++G G+P  + + G  C   +++ S  + R        ++ +AH++ SS   +MGL
Sbjct: 248 RYAIAVGAGIPGALIVFGLFCFVYSKISSCIKRRRLVPTPEINNAQAHYLHSS-VIVMGL 307

Query: 318 DEPTIDSYPKIELGESLRLPKPNDNICAICLSEYRPKETVKSIPQCQHFFHQDCIDEWLR 371
           D PTI+SYPKI LGES RLPK +D  CAICLSEY PKET+++IPQCQH FH DCIDEWL+
Sbjct: 308 DGPTIESYPKIVLGESKRLPKVDDATCAICLSEYEPKETLRTIPQCQHCFHADCIDEWLK 367

BLAST of Cp4.1LG14g06660 vs. TAIR10
Match: AT2G13600.1 (AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 237.3 bits (604), Expect = 3.7e-62
Identity = 158/472 (33.47%), Postives = 245/472 (51.91%), Query Frame = 1

Query: 456 SFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMV 515
           S + GR V + + Q ++     T NS+++  +K G  ++A S+F  M   RD  +W++MV
Sbjct: 70  SLEDGRQVFDKMPQRNIY----TWNSVVTGLTKLGFLDEADSLFRSMPE-RDQCTWNSMV 129

Query: 516 SCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYF 575
           S FA +    EAL  F  M + G+  NEY F++ + ACS +   + G  +   + K+  F
Sbjct: 130 SGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSP-F 189

Query: 576 ASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFL 635
            SDV +G  L+DM+ K  G++  A  VF++M +RN V+W  +IT F Q G A EA+DVF 
Sbjct: 190 LSDVYIGSALVDMYSKC-GNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQ 249

Query: 636 DMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHG-LTLDRCVGCCLINMYAKC 695
            M+ S  EPD  TL++VISA A L  + +GQ++H + +K+  L  D  +    ++MYAKC
Sbjct: 250 MMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC 309

Query: 696 S----------------------------VDGSMSESRKIFDQILDHNVISWTAMITGYV 755
           S                            +  S   +R +F ++ + NV+SW A+I GY 
Sbjct: 310 SRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYT 369

Query: 756 QKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVN 815
           Q G  ++EAL LF  +    V P H++F++ LKACA+LA+L +G Q   H +K GF   +
Sbjct: 370 QNGE-NEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQS 429

Query: 816 ------CVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELF 875
                  V NSLI MY + G +++    F  + E++ +S+N +I  +++N    EA ELF
Sbjct: 430 GEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELF 489

Query: 876 NEIEDQGMGASAFTFASLLSGAASIGI----RMVNDSEKRPLVLLQLHSEYT 889
            E+ + G      T   +LS     G     R    S  R   +  L   YT
Sbjct: 490 REMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYT 533

BLAST of Cp4.1LG14g06660 vs. TAIR10
Match: AT4G18520.1 (AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 236.1 bits (601), Expect = 8.3e-62
Identity = 144/442 (32.58%), Postives = 237/442 (53.62%), Query Frame = 1

Query: 425 AISTLEHMVQHGTH-PDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLI 484
           A +  E  V+HG    + + +   L  C R   F+LGR VH N+ +  +  + +  +SL+
Sbjct: 167 AFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQVHGNMVKVGVG-NLIVESSLV 226

Query: 485 SLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNE 544
             Y++ G+   A   F+ M   +D+ISW+A++S  +    G +A+  F+ M+ + + PNE
Sbjct: 227 YFYAQCGELTSALRAFDMM-EEKDVISWTAVISACSRKGHGIKAIGMFIGMLNHWFLPNE 286

Query: 545 YCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVF 604
           +   + ++ACS  +    G  +   V+K     +DV VG  L+DM+ K  G++    +VF
Sbjct: 287 FTVCSILKACSEEKALRFGRQVHSLVVKR-MIKTDVFVGTSLMDMYAKC-GEISDCRKVF 346

Query: 605 EKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLS 664
           + M  RN VTWT +I    + G+  EAI +F  M       +  T+ +++ A   +  L 
Sbjct: 347 DGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNLTVVSILRACGSVGALL 406

Query: 665 LGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMIT 724
           LG++LH+Q IK+ +  +  +G  L+ +Y KC   G   ++  +  Q+   +V+SWTAMI+
Sbjct: 407 LGKELHAQIIKNSIEKNVYIGSTLVWLYCKC---GESRDAFNVLQQLPSRDVVSWTAMIS 466

Query: 725 GYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFS 784
           G     G++ EALD  + MI   V PN FT+SS LKACAN   L IG  + + A K    
Sbjct: 467 G-CSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIGRSIHSIAKKNHAL 526

Query: 785 IVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEI 844
               V ++LI MYA+ G + +A + FD + EKNL+S+  +I  Y++N    EA +L   +
Sbjct: 527 SNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNGFCREALKLMYRM 586

Query: 845 EDQGMGASAFTFASLLSGAASI 866
           E +G     + FA++LS    I
Sbjct: 587 EAEGFEVDDYIFATILSTCGDI 600

BLAST of Cp4.1LG14g06660 vs. TAIR10
Match: AT3G53360.1 (AT3G53360.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 229.9 bits (585), Expect = 5.9e-60
Identity = 137/451 (30.38%), Postives = 239/451 (52.99%), Query Frame = 1

Query: 419 DGRLRTAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVT 478
           +G+   AI     M+Q    PD   +   +K C  +    LG+ +H  + + +     + 
Sbjct: 146 NGQGAEAIRLYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIA 205

Query: 479 LNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNG 538
            N+LI++Y +  Q   A  +F  +   +DLISWS++++ F+     FEAL    +M+  G
Sbjct: 206 QNALIAMYVRFNQMSDASRVFYGIP-MKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFG 265

Query: 539 -YYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLV 598
            ++PNEY F ++++ACS++     G  I G  IK+   A +   GC L DM+ +  G L 
Sbjct: 266 VFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKS-ELAGNAIAGCSLCDMYAR-CGFLN 325

Query: 599 SAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASA 658
           SA  VF+++   +  +W ++I      GYA EA+ VF  M  SG+ PD  +L +++ A  
Sbjct: 326 SARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQT 385

Query: 659 KLELLSLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDH-NVI 718
           K   LS G Q+HS  IK G   D  V   L+ MY  CS    +     +F+   ++ + +
Sbjct: 386 KPMALSQGMQIHSYIIKWGFLADLTVCNSLLTMYTFCS---DLYCCFNLFEDFRNNADSV 445

Query: 719 SWTAMITGYVQKGGYDKEALDLFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTH 778
           SW  ++T  +Q      E L LF+ M+++   P+H T  + L+ C  ++ L++G QV  +
Sbjct: 446 SWNTILTACLQH-EQPVEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCY 505

Query: 779 AVKLGFSIVNCVANSLISMYARSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEA 838
           ++K G +    + N LI MYA+ G +  AR+ FD +  ++++S++T+I  Y+++   EEA
Sbjct: 506 SLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEA 565

Query: 839 FELFNEIEDQGMGASAFTFASLLSGAASIGI 868
             LF E++  G+  +  TF  +L+  + +G+
Sbjct: 566 LILFKEMKSAGIEPNHVTFVGVLTACSHVGL 589

BLAST of Cp4.1LG14g06660 vs. NCBI nr
Match: gi|449467092|ref|XP_004151259.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic [Cucumis sativus])

HSP 1 Score: 851.7 bits (2199), Expect = 1.2e-243
Identity = 427/489 (87.32%), Postives = 451/489 (92.23%), Query Frame = 1

Query: 383 SLPPPTTLKIPSLSSNPSPSLQFPTFS-----TNRLIRQINDGRLRTAISTLEHMVQHGT 442
           SLP PTTLKIP  SSNPS SLQFPTF+     T RLI++IN+GRL  AISTLEHMV  G+
Sbjct: 3   SLPLPTTLKIPFPSSNPSSSLQFPTFTNPNPLTGRLIQEINNGRLHKAISTLEHMVHQGS 62

Query: 443 HPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKS 502
           HPDLQTYSLFLK+CIRTRSFD+G LVHE LTQSDLQLDSVTLNSLISLYSK GQWEKA S
Sbjct: 63  HPDLQTYSLFLKKCIRTRSFDIGTLVHEKLTQSDLQLDSVTLNSLISLYSKCGQWEKATS 122

Query: 503 IFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVE 562
           IF+ MG+SRDLISWSAMVSCFANN MGF AL TF+DMI+NGYYPNEYCF+AA RACST E
Sbjct: 123 IFQLMGSSRDLISWSAMVSCFANNNMGFRALLTFVDMIENGYYPNEYCFAAATRACSTAE 182

Query: 563 FASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLM 622
           F SVGDSIFG+V+KTGY  SDVCVGCGLIDMFVKGRGDLVSAF+VFEKMPERNAVTWTLM
Sbjct: 183 FVSVGDSIFGFVVKTGYLQSDVCVGCGLIDMFVKGRGDLVSAFKVFEKMPERNAVTWTLM 242

Query: 623 ITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGL 682
           ITR MQFGYAGEAID+FL+MILSGYEPDRFTLS VISA A +ELL LGQQLHSQAI+HGL
Sbjct: 243 ITRLMQFGYAGEAIDLFLEMILSGYEPDRFTLSGVISACANMELLLLGQQLHSQAIRHGL 302

Query: 683 TLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALD 742
           TLDRCVGCCLINMYAKCSVDGSM  +RKIFDQILDHNV SWTAMITGYVQKGGYD+EALD
Sbjct: 303 TLDRCVGCCLINMYAKCSVDGSMCAARKIFDQILDHNVFSWTAMITGYVQKGGYDEEALD 362

Query: 743 LFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYA 802
           LFRGMILTHV+PNHFTFSSTLKACANLA LRIGEQVFTHAVKLGFS VNCVANSLISMYA
Sbjct: 363 LFRGMILTHVIPNHFTFSSTLKACANLAALRIGEQVFTHAVKLGFSSVNCVANSLISMYA 422

Query: 803 RSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFAS 862
           RSG+IDDARKAFDILFEKNLISYNTVIDAY+KNLNSEEA ELFNEIEDQGMGASAFTFAS
Sbjct: 423 RSGRIDDARKAFDILFEKNLISYNTVIDAYAKNLNSEEALELFNEIEDQGMGASAFTFAS 482

Query: 863 LLSGAASIG 867
           LLSGAASIG
Sbjct: 483 LLSGAASIG 491

BLAST of Cp4.1LG14g06660 vs. NCBI nr
Match: gi|659082006|ref|XP_008441615.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic [Cucumis melo])

HSP 1 Score: 843.6 bits (2178), Expect = 3.2e-241
Identity = 424/489 (86.71%), Postives = 450/489 (92.02%), Query Frame = 1

Query: 383 SLPPPTTLKIPSLSSNPSPSLQFPTFS-----TNRLIRQINDGRLRTAISTLEHMVQHGT 442
           SLP PTTLKIP  S NPS SLQFP+F+     T+RLI++IN+GRL  AISTLEHMV  G+
Sbjct: 3   SLPLPTTLKIPFPSPNPSSSLQFPSFTNPNPLTDRLIQEINNGRLHKAISTLEHMVHQGS 62

Query: 443 HPDLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKS 502
           HPDLQTYSLFLK+CIRTRSFDLG LVHE LT+S+LQLDSVTLNSLISLYSK GQWEKA S
Sbjct: 63  HPDLQTYSLFLKKCIRTRSFDLGTLVHEKLTRSNLQLDSVTLNSLISLYSKCGQWEKATS 122

Query: 503 IFERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVE 562
           IF+RMG+SRDLISWSAMVSCFANN MGF AL TF+DMI+NGYYPNEYCF+AA RACS+ E
Sbjct: 123 IFQRMGSSRDLISWSAMVSCFANNNMGFRALLTFVDMIENGYYPNEYCFAAATRACSSAE 182

Query: 563 FASVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLM 622
           F SVGDSIFG+VIKTGYF SDVCVGCGLIDMFVKGRGDLVSAF+VFEKMPERNAVTWTLM
Sbjct: 183 FVSVGDSIFGFVIKTGYFESDVCVGCGLIDMFVKGRGDLVSAFKVFEKMPERNAVTWTLM 242

Query: 623 ITRFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGL 682
           ITR MQFG AGEAID+FLDMILSGYEPDRFTLS VISA A +ELL LGQQLHSQAIKHGL
Sbjct: 243 ITRLMQFGCAGEAIDLFLDMILSGYEPDRFTLSGVISACANMELLLLGQQLHSQAIKHGL 302

Query: 683 TLDRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALD 742
           TLDRCVGCCLINMYAKCSVDGSM  +RK+FDQILDHNV SWTAMITGYVQKGGYD+EALD
Sbjct: 303 TLDRCVGCCLINMYAKCSVDGSMCPARKVFDQILDHNVFSWTAMITGYVQKGGYDEEALD 362

Query: 743 LFRGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYA 802
           LFRGMI THV+PNHFTFSSTLKACANLA LRIGEQVFTHAVKLGFS VNCVANSLISMYA
Sbjct: 363 LFRGMISTHVIPNHFTFSSTLKACANLAALRIGEQVFTHAVKLGFSSVNCVANSLISMYA 422

Query: 803 RSGKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFAS 862
           RSG+IDDARKAFDILFEKNLISYNTVIDAY+ NLNSEEAF LFNEIEDQGMGASAFTFAS
Sbjct: 423 RSGRIDDARKAFDILFEKNLISYNTVIDAYATNLNSEEAFVLFNEIEDQGMGASAFTFAS 482

Query: 863 LLSGAASIG 867
           LLSGAASIG
Sbjct: 483 LLSGAASIG 491

BLAST of Cp4.1LG14g06660 vs. NCBI nr
Match: gi|645248512|ref|XP_008230331.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic [Prunus mume])

HSP 1 Score: 632.1 bits (1629), Expect = 1.5e-177
Identity = 316/485 (65.15%), Postives = 383/485 (78.97%), Query Frame = 1

Query: 381 SVSLPPPTTLKIPSLSSNPSPSLQFPTFSTNRLIRQINDGRLRTAISTLEHMVQHGTHPD 440
           S+SL  P  L +P  S  P  S  F + + NRLI  IN G LR AI+TL+ M Q GTHPD
Sbjct: 5   SLSLHAPAKLPLPP-SLRPQKSPNFDSLN-NRLISHINVGHLRKAITTLDLMAQRGTHPD 64

Query: 441 LQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFE 500
           L  YSL LK CIR+R+FDLGRLVH  L  S L+LD V LNSLISLYSKS  W+ A SIFE
Sbjct: 65  LPIYSLLLKSCIRSRNFDLGRLVHARLVHSQLELDPVVLNSLISLYSKSRDWKMANSIFE 124

Query: 501 RMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFAS 560
            MGN R+L+SWSAMVSCFANN MG EA+ TFLDM++NG+YPNEYCF++ IRACS  +   
Sbjct: 125 NMGNKRNLVSWSAMVSCFANNDMGLEAILTFLDMLENGFYPNEYCFASVIRACSKAQNIR 184

Query: 561 VGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITR 620
           +G+ IFG VIK+GY  SDVCVGC LIDMF KG G+L  A++VFE MPE +AVTWTLMITR
Sbjct: 185 IGNIIFGSVIKSGYLGSDVCVGCSLIDMFAKGSGELDDAYKVFETMPETDAVTWTLMITR 244

Query: 621 FMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGLTLD 680
             Q G  GEAID+++DM+ SG  PD+FTLS VISA  KL+ LSLGQQLHS  I+ GL L 
Sbjct: 245 LAQMGCPGEAIDLYVDMLWSGLMPDQFTLSGVISACTKLDSLSLGQQLHSWVIRSGLALG 304

Query: 681 RCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALDLFR 740
            CVGCCL++MYAKC+ DGSM ++RK+FD++ +HNV+SWT++I GYVQ G  D+EA+ LF 
Sbjct: 305 HCVGCCLVDMYAKCAADGSMDDARKVFDRMPNHNVMSWTSIINGYVQSGEGDEEAIKLFV 364

Query: 741 GMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYARSG 800
           GM+  +V PNHFTFSS LKACANL+DLR G+QV + AVKLG + VNCV NSLISMY+RSG
Sbjct: 365 GMMTGYVPPNHFTFSSILKACANLSDLRKGDQVHSLAVKLGLASVNCVGNSLISMYSRSG 424

Query: 801 KIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFASLLS 860
           +++DARKAFDIL+EKNLISYNT++DAY+K+ ++EEAF LF+EI+D G GASAFTF+SLLS
Sbjct: 425 QVEDARKAFDILYEKNLISYNTIVDAYAKHSDTEEAFGLFHEIQDTGFGASAFTFSSLLS 484

Query: 861 GAASI 866
           GAASI
Sbjct: 485 GAASI 487

BLAST of Cp4.1LG14g06660 vs. NCBI nr
Match: gi|296083564|emb|CBI23556.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 630.9 bits (1626), Expect = 3.3e-177
Identity = 313/469 (66.74%), Postives = 377/469 (80.38%), Query Frame = 1

Query: 400 SPSLQFPTFST--NRLIRQINDGRLRTAISTLEHMVQHGTHPDLQTYSLFLKRCIRTRSF 459
           S SL+ P F    NRLIRQ++ GRL  A STL+ M Q    PDL TYS+ LK CIR R+F
Sbjct: 1   SLSLKNPNFEPLKNRLIRQLDVGRLHHAFSTLDLMTQQNAPPDLTTYSILLKSCIRFRNF 60

Query: 460 DLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIFERMGNSRDLISWSAMVSC 519
            LG+LVH  L QS L+LDSV LN+LISLYSK G  E A+ IFE MGN RDL+SWSAMVSC
Sbjct: 61  QLGKLVHRKLMQSGLELDSVVLNTLISLYSKCGDTETARLIFEGMGNKRDLVSWSAMVSC 120

Query: 520 FANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFASVGDSIFGYVIKTGYFAS 579
           FANN M ++A+ TFLDM++ G+YPNEYCF+A IRACS   +A VG+ I+G+V+KTGY  +
Sbjct: 121 FANNSMEWQAIWTFLDMLELGFYPNEYCFAAVIRACSNANYAWVGEIIYGFVVKTGYLEA 180

Query: 580 DVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMITRFMQFGYAGEAIDVFLDM 639
           DVCVGC LIDMFVKG GDL SA++VF+KMPERN VTWTLMITRF Q G A +AID+FLDM
Sbjct: 181 DVCVGCELIDMFVKGSGDLGSAYKVFDKMPERNLVTWTLMITRFAQLGCARDAIDLFLDM 240

Query: 640 ILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGLTLDRCVGCCLINMYAKCSVD 699
            LSGY PDRFT S+V+SA  +L LL+LG+QLHS+ I+ GL LD CVGC L++MYAKC+ D
Sbjct: 241 ELSGYVPDRFTYSSVLSACTELGLLALGKQLHSRVIRLGLALDVCVGCSLVDMYAKCAAD 300

Query: 700 GSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALDLFRGMILTHVLPNHFTFSST 759
           GS+ +SRK+F+Q+ +HNV+SWTA+IT YVQ G  DKEA++LF  MI  H+ PNHF+FSS 
Sbjct: 301 GSVDDSRKVFEQMPEHNVMSWTAIITAYVQSGECDKEAIELFCKMISGHIRPNHFSFSSV 360

Query: 760 LKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYARSGKIDDARKAFDILFEKNL 819
           LKAC NL+D   GEQV+++AVKLG + VNCV NSLISMYARSG+++DARKAFDILFEKNL
Sbjct: 361 LKACGNLSDPYTGEQVYSYAVKLGIASVNCVGNSLISMYARSGRMEDARKAFDILFEKNL 420

Query: 820 ISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFASLLSGAASIG 867
           +SYN ++D Y+KNL SEEAF LFNEI D G+G SAFTFASLLSGAASIG
Sbjct: 421 VSYNAIVDGYAKNLKSEEAFLLFNEIADTGIGISAFTFASLLSGAASIG 469

BLAST of Cp4.1LG14g06660 vs. NCBI nr
Match: gi|694396949|ref|XP_009373739.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 628.2 bits (1619), Expect = 2.1e-176
Identity = 309/486 (63.58%), Postives = 383/486 (78.81%), Query Frame = 1

Query: 380 PSVSLPPPTTLKIPSLSSNPSPSLQFPTFSTNRLIRQINDGRLRTAISTLEHMVQHGTHP 439
           P +SL  P   K+P        +  F   + NRLI QIN G LR AI+TL+ + Q G HP
Sbjct: 2   PGLSLSLPAPAKLPPPPPLGPKATNFELLN-NRLINQINVGHLRKAITTLDLLAQRGIHP 61

Query: 440 DLQTYSLFLKRCIRTRSFDLGRLVHENLTQSDLQLDSVTLNSLISLYSKSGQWEKAKSIF 499
           DL TYSL +K CIR+R+FDLG+LVH+ L  S L+ D V LNSLISLYSKSG W+KA SIF
Sbjct: 62  DLPTYSLLIKSCIRSRNFDLGKLVHDRLAHSQLEPDPVLLNSLISLYSKSGDWKKANSIF 121

Query: 500 ERMGNSRDLISWSAMVSCFANNKMGFEALHTFLDMIQNGYYPNEYCFSAAIRACSTVEFA 559
           E MG+ R+L+SWSAMVSCFANN MGFEA+ TFLDM+++G+YPNEYCF++ IRACS     
Sbjct: 122 ENMGSERNLVSWSAMVSCFANNDMGFEAITTFLDMLEHGFYPNEYCFASVIRACSNARNI 181

Query: 560 SVGDSIFGYVIKTGYFASDVCVGCGLIDMFVKGRGDLVSAFEVFEKMPERNAVTWTLMIT 619
            +G  IFG VIK GY  SDVCVGC LIDMF KG GDL  A++VFE+MPE +AVTWTLMIT
Sbjct: 182 GIGKIIFGSVIKGGYLGSDVCVGCSLIDMFAKGGGDLGEAYKVFEEMPETDAVTWTLMIT 241

Query: 620 RFMQFGYAGEAIDVFLDMILSGYEPDRFTLSAVISASAKLELLSLGQQLHSQAIKHGLTL 679
           RF Q G+  EAI +++DM+LSG+ PD+F LS VISA  KLE LSLGQQLHS  I+ GL L
Sbjct: 242 RFAQMGFPREAIGLYVDMLLSGFMPDQFALSGVISACTKLESLSLGQQLHSWVIRSGLAL 301

Query: 680 DRCVGCCLINMYAKCSVDGSMSESRKIFDQILDHNVISWTAMITGYVQKGGYDKEALDLF 739
             CVGCCL++MYAKC+ DGSM+++RK+FD++ +HNV+SWTA+I GYVQ G  D+EA+ LF
Sbjct: 302 GHCVGCCLVDMYAKCAADGSMNDARKVFDRMPNHNVMSWTAIINGYVQSGKGDEEAIKLF 361

Query: 740 RGMILTHVLPNHFTFSSTLKACANLADLRIGEQVFTHAVKLGFSIVNCVANSLISMYARS 799
             M+  HV PNHFTFSS LKACANL+DLR GEQ+ + AVK G + VNCV NSLI+MY++S
Sbjct: 362 VEMMSGHVPPNHFTFSSILKACANLSDLRKGEQIHSLAVKSGLASVNCVGNSLITMYSKS 421

Query: 800 GKIDDARKAFDILFEKNLISYNTVIDAYSKNLNSEEAFELFNEIEDQGMGASAFTFASLL 859
           G+++DARK+FD+L+EKNLISYNT++DAY+K+L++EEAF LF+EI+D G GASAFTF+SLL
Sbjct: 422 GQVEDARKSFDVLYEKNLISYNTIVDAYAKHLDAEEAFGLFHEIQDTGYGASAFTFSSLL 481

Query: 860 SGAASI 866
           SGAASI
Sbjct: 482 SGAASI 486

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP272_ARATH1.1e-13752.95Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop... [more]
PP151_ARATH6.6e-6133.47Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN... [more]
PP319_ARATH1.5e-6032.58Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN... [more]
PP280_ARATH1.1e-5830.38Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... [more]
PP307_ARATH2.3e-5831.18Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0LBE4_CUCSA8.2e-24487.32Uncharacterized protein OS=Cucumis sativus GN=Csa_3G603610 PE=4 SV=1[more]
B9IDW4_POPTR8.4e-17264.86Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s02060g PE=4 SV=2[more]
M5WX51_PRUPE7.1e-17167.51Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001611mg PE=4 SV=1[more]
A0A061G281_THECC4.6e-17061.13Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0153... [more]
V4RMD3_9ROSI6.6e-16961.03Uncharacterized protein OS=Citrus clementina GN=CICLE_v10006927mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G49170.16.4e-13952.95 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G53110.11.2e-9248.53 RING/U-box superfamily protein[more]
AT2G13600.13.7e-6233.47 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G18520.18.3e-6232.58 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53360.15.9e-6030.38 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449467092|ref|XP_004151259.1|1.2e-24387.32PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic ... [more]
gi|659082006|ref|XP_008441615.1|3.2e-24186.71PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic ... [more]
gi|645248512|ref|XP_008230331.1|1.5e-17765.15PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic ... [more]
gi|296083564|emb|CBI23556.3|3.3e-17766.74unnamed protein product [Vitis vinifera][more]
gi|694396949|ref|XP_009373739.1|2.1e-17663.58PREDICTED: pentatricopeptide repeat-containing protein At3g49170, chloroplastic-... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0030247polysaccharide binding
GO:0008270zinc ion binding
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR025287WAK_GUB
IPR013083Znf_RING/FYVE/PHD
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
IPR001841Znf_RING
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0006468 protein phosphorylation
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005524 ATP binding
molecular_function GO:0030247 polysaccharide binding
molecular_function GO:0005515 protein binding
molecular_function GO:0004672 protein kinase activity
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g06660.1Cp4.1LG14g06660.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001841Zinc finger, RING-typePFAMPF13639zf-RING_2coord: 324..365
score: 4.3
IPR001841Zinc finger, RING-typeSMARTSM00184ring_2coord: 325..366
score: 7.
IPR001841Zinc finger, RING-typePROFILEPS50089ZF_RING_2coord: 325..364
score: 10
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 477..503
score: 6.8E-7coord: 509..538
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 610..657
score: 4.9E-8coord: 815..862
score: 3.3E-10coord: 714..762
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 612..645
score: 0.0019coord: 818..848
score: 2.3E-6coord: 509..543
score: 3.8E-5coord: 477..502
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 542..576
score: 5.13coord: 816..850
score: 10.819coord: 578..609
score: 6.577coord: 440..474
score: 6.906coord: 507..541
score: 9.756coord: 610..644
score: 10.106coord: 750..784
score: 7.169coord: 785..815
score: 6.971coord: 475..505
score: 10.731coord: 645..679
score: 6.818coord: 714..749
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 478..538
score: 4.0E-6coord: 749..842
score: 4.0E-6coord: 360..375
score: 4.
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3DG3DSA:3.30.40.10coord: 321..359
score: 2.2
IPR025287Wall-associated receptor kinase, galacturonan-binding domainPFAMPF13947GUB_WAK_bindcoord: 40..137
score: 5.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 786..866
score: 2.6E-216coord: 379..407
score: 2.6E-216coord: 464..750
score: 2.6E
NoneNo IPR availablePANTHERPTHR24015:SF154SUBFAMILY NOT NAMEDcoord: 464..750
score: 2.6E-216coord: 786..866
score: 2.6E-216coord: 379..407
score: 2.6E
NoneNo IPR availableunknownSSF57850RING/U-boxcoord: 319..365
score: 1.44

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG14g06660Cp4.1LG08g00720Cucurbita pepo (Zucchini)cpecpeB254