Cp4.1LG14g01660 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG14g01660
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG14 : 3539003 .. 3540259 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCTCCATCGCCACCCGCCACATCCGTCGTCTATCGACGGCAGCCACCGCCACCGCCACCGCAACCGCTGCTGCTAAGGCCACAGCAACGACGTCCTCTTCAATATCCATTTCAAAAGCGAAATCTAAACTAAGAACCGAGTATGATCCAGATAAAGCTCTGAACATTTACTCTTCTGTTTCCAGTCACTACACTTCCCCTGTCTCCTCTCGCTATGCTCAGGAACTCACCATTCGGCGCCTTGCAAAGTCTCGTCGATTCGATGACATCGAATCCCTGATCGAATCCCATAAAAATGACCCGAAGATCACTCAGGAGCCCTTTTTGTCCACCTTGATTCGATCCTACGGTCGAGCTGGTATGTTCGAGCACGCTATGAGGACTTATAATCAGATGGAAGATTTCGGCACTCCTCGATCGGTGATTTCCTTCAATGCGCTATTATGTGCATTTAACCATTCGAAGCAATTCGACAAAGTTCCTCAACTGTTCGATGAAATTCCGAAGAAATACAGTTTCTCTCCCAATAAGATCTCGTACGGGATCCTGGTCAAATCCTATTGCGAATCCGGTTCCCCTGAAAAGGCCATGCAAATCGTAAGGGAGATGGAGGAAAACGATGTGGAGGTAACTGCGGTGACATTCACAACCATTTTAGATGCTCTGTACAAGAAGGGCGAGAGCGAAGAGGCAGAGAAAATCTGGAACAAGATGATATCAAAAGGGTGTGAACTCGATGTGGGTGCCTATAACGTTAGATTGATGCACGAGCACGGCGGCAAGCCAGAACACGTTGAAGCATTGATCGAGGAAATGGCTAATTCAGGTATGAAACCCGACACCATTAGCTATAATTACTTAATGACTTGTTATTGCAAGAATGGGATGATTGATGAAGCAAAGAAGGTGTATGATGATATGGAGATAAATGGGTGTAACAAGAACGCTGCAACTTTTAGAACATTTATGTATTACCTCTGTAGAAATGGGGATTATGAAAAAGGGTATATGGTTTTCAAGGAGAGTGTGAAGGTTCATAAGATTCCTGATGTTAACACAGTGAAGTATTTGGTGGAGGGGCTGATGGAGAAGAAGAAGATGAAAGAGGCCAAGGGTTTAATCAGGACCATAAGGAAGAAGTTCCCTCCTGATTCTTTGAAGGCCTGGAGAAAAGTTGAGGAGGCTCTTGGTTTGGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAAGGCTGATGATGAAACTGAAGAATAA

mRNA sequence

ATGTCCTCCATCGCCACCCGCCACATCCGTCGTCTATCGACGGCAGCCACCGCCACCGCCACCGCAACCGCTGCTGCTAAGGCCACAGCAACGACGTCCTCTTCAATATCCATTTCAAAAGCGAAATCTAAACTAAGAACCGAGTATGATCCAGATAAAGCTCTGAACATTTACTCTTCTGTTTCCAGTCACTACACTTCCCCTGTCTCCTCTCGCTATGCTCAGGAACTCACCATTCGGCGCCTTGCAAAGTCTCGTCGATTCGATGACATCGAATCCCTGATCGAATCCCATAAAAATGACCCGAAGATCACTCAGGAGCCCTTTTTGTCCACCTTGATTCGATCCTACGGTCGAGCTGGTATGTTCGAGCACGCTATGAGGACTTATAATCAGATGGAAGATTTCGGCACTCCTCGATCGGTGATTTCCTTCAATGCGCTATTATGTGCATTTAACCATTCGAAGCAATTCGACAAAGTTCCTCAACTGTTCGATGAAATTCCGAAGAAATACAGTTTCTCTCCCAATAAGATCTCGTACGGGATCCTGGTCAAATCCTATTGCGAATCCGGTTCCCCTGAAAAGGCCATGCAAATCGTAAGGGAGATGGAGGAAAACGATGTGGAGGTAACTGCGGTGACATTCACAACCATTTTAGATGCTCTGTACAAGAAGGGCGAGAGCGAAGAGGCAGAGAAAATCTGGAACAAGATGATATCAAAAGGGTGTGAACTCGATGTGGGTGCCTATAACGTTAGATTGATGCACGAGCACGGCGGCAAGCCAGAACACGTTGAAGCATTGATCGAGGAAATGGCTAATTCAGGTATGAAACCCGACACCATTAGCTATAATTACTTAATGACTTGTTATTGCAAGAATGGGATGATTGATGAAGCAAAGAAGGTGTATGATGATATGGAGATAAATGGGTGTAACAAGAACGCTGCAACTTTTAGAACATTTATGTATTACCTCTGTAGAAATGGGGATTATGAAAAAGGGTATATGGTTTTCAAGGAGAGTGTGAAGGTTCATAAGATTCCTGATGTTAACACAGTGAAGTATTTGGTGGAGGGGCTGATGGAGAAGAAGAAGATGAAAGAGGCCAAGGGTTTAATCAGGACCATAAGGAAGAAGTTCCCTCCTGATTCTTTGAAGGCCTGGAGAAAAGTTGAGGAGGCTCTTGGTTTGGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAAGGCTGATGATGAAACTGAAGAATAA

Coding sequence (CDS)

ATGTCCTCCATCGCCACCCGCCACATCCGTCGTCTATCGACGGCAGCCACCGCCACCGCCACCGCAACCGCTGCTGCTAAGGCCACAGCAACGACGTCCTCTTCAATATCCATTTCAAAAGCGAAATCTAAACTAAGAACCGAGTATGATCCAGATAAAGCTCTGAACATTTACTCTTCTGTTTCCAGTCACTACACTTCCCCTGTCTCCTCTCGCTATGCTCAGGAACTCACCATTCGGCGCCTTGCAAAGTCTCGTCGATTCGATGACATCGAATCCCTGATCGAATCCCATAAAAATGACCCGAAGATCACTCAGGAGCCCTTTTTGTCCACCTTGATTCGATCCTACGGTCGAGCTGGTATGTTCGAGCACGCTATGAGGACTTATAATCAGATGGAAGATTTCGGCACTCCTCGATCGGTGATTTCCTTCAATGCGCTATTATGTGCATTTAACCATTCGAAGCAATTCGACAAAGTTCCTCAACTGTTCGATGAAATTCCGAAGAAATACAGTTTCTCTCCCAATAAGATCTCGTACGGGATCCTGGTCAAATCCTATTGCGAATCCGGTTCCCCTGAAAAGGCCATGCAAATCGTAAGGGAGATGGAGGAAAACGATGTGGAGGTAACTGCGGTGACATTCACAACCATTTTAGATGCTCTGTACAAGAAGGGCGAGAGCGAAGAGGCAGAGAAAATCTGGAACAAGATGATATCAAAAGGGTGTGAACTCGATGTGGGTGCCTATAACGTTAGATTGATGCACGAGCACGGCGGCAAGCCAGAACACGTTGAAGCATTGATCGAGGAAATGGCTAATTCAGGTATGAAACCCGACACCATTAGCTATAATTACTTAATGACTTGTTATTGCAAGAATGGGATGATTGATGAAGCAAAGAAGGTGTATGATGATATGGAGATAAATGGGTGTAACAAGAACGCTGCAACTTTTAGAACATTTATGTATTACCTCTGTAGAAATGGGGATTATGAAAAAGGGTATATGGTTTTCAAGGAGAGTGTGAAGGTTCATAAGATTCCTGATGTTAACACAGTGAAGTATTTGGTGGAGGGGCTGATGGAGAAGAAGAAGATGAAAGAGGCCAAGGGTTTAATCAGGACCATAAGGAAGAAGTTCCCTCCTGATTCTTTGAAGGCCTGGAGAAAAGTTGAGGAGGCTCTTGGTTTGGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAAGGCTGATGATGAAACTGAAGAATAA

Protein sequence

MSSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSKADDETEE
BLAST of Cp4.1LG14g01660 vs. Swiss-Prot
Match: PP352_ARATH (Pentatricopeptide repeat-containing protein At4g36680, mitochondrial OS=Arabidopsis thaliana GN=At4g36680 PE=2 SV=1)

HSP 1 Score: 505.4 bits (1300), Expect = 6.1e-142
Identity = 259/411 (63.02%), Postives = 323/411 (78.59%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           S I+ R +RR ++AA    T       TA +S  IS+SKAKS LR E+DPDKAL IY++V
Sbjct: 4   SRISLRLVRRFASAAADGTT-------TAPSSGKISVSKAKSTLRKEHDPDKALKIYANV 63

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S H  SPVSSRYAQELT+RRLAK RRF DIE+LIESHKNDPKI +EPF STLIRSYG+A 
Sbjct: 64  SDHSASPVSSRYAQELTVRRLAKCRRFSDIETLIESHKNDPKIKEEPFYSTLIRSYGQAS 123

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYS-FSPNKIS 181
           MF HAMRT+ QM+ +GTPRS +SFNALL A  HSK FDKVPQLFDEIP++Y+   P+KIS
Sbjct: 124 MFNHAMRTFEQMDQYGTPRSAVSFNALLNACLHSKNFDKVPQLFDEIPQRYNKIIPDKIS 183

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YGIL+KSYC+SG+PEKA++I+R+M+   +EVT + FTTIL +LYKKGE E A+ +WN+M+
Sbjct: 184 YGILIKSYCDSGTPEKAIEIMRQMQGKGMEVTTIAFTTILSSLYKKGELEVADNLWNEMV 243

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
            KGCELD  AYNVR+M      PE V+ LIEEM++ G+KPDTISYNYLMT YC+ GM+DE
Sbjct: 244 KKGCELDNAAYNVRIMSAQKESPERVKELIEEMSSMGLKPDTISYNYLMTAYCERGMLDE 303

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E N C  NAATFRT +++LC +  YE+GY +FK+SV +HKIPD NT+K+LV 
Sbjct: 304 AKKVYEGLEGNNCAPNAATFRTLIFHLCYSRLYEQGYAIFKKSVYMHKIPDFNTLKHLVV 363

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSK 412
           GL+E KK  +AKGLIRT++KKFPP  L AW+K+EE LGL S + +  SS+K
Sbjct: 364 GLVENKKRDDAKGLIRTVKKKFPPSFLNAWKKLEEELGLYSKTDAFPSSAK 407

BLAST of Cp4.1LG14g01660 vs. Swiss-Prot
Match: PP162_ARATH (Pentatricopeptide repeat-containing protein At2g18520, mitochondrial OS=Arabidopsis thaliana GN=At2g18520 PE=2 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 4.9e-123
Identity = 238/424 (56.13%), Postives = 311/424 (73.35%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           S +  R +RR STA    +  TA   A       I++SKAKSKLR   DPDKAL IY SV
Sbjct: 4   SRLYLRFLRRFSTATGIDSQTTAYPGA-------ITMSKAKSKLRKVQDPDKALAIYKSV 63

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S++ TSP+SSRYA ELT++RLAKS+RF DIE+LIESHKN+PKI  E FLSTLIRSYGRA 
Sbjct: 64  SNNSTSPLSSRYAMELTVQRLAKSQRFSDIEALIESHKNNPKIKTETFLSTLIRSYGRAS 123

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKY-SFSPNKIS 181
           MF+HAM+ + +M+  GTPR+V+SFNALL A  HS  F++VPQLFDE P++Y + +P+KIS
Sbjct: 124 MFDHAMKMFEEMDKLGTPRTVVSFNALLAACLHSDLFERVPQLFDEFPQRYNNITPDKIS 183

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YG+L+KSYC+SG PEKAM+I+R+ME   VEVT + FTTIL +LYK G  +EAE +W +M+
Sbjct: 184 YGMLIKSYCDSGKPEKAMEIMRDMEVKGVEVTIIAFTTILGSLYKNGLVDEAESLWIEMV 243

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
           +KGC+LD   YNVRLM+     PE V+ L+EEM++ G+KPDT+SYNYLMT YC  GM+ E
Sbjct: 244 NKGCDLDNTVYNVRLMNAAKESPERVKELMEEMSSVGLKPDTVSYNYLMTAYCVKGMMSE 303

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E      NAATFRT +++LC NG Y++G  VFK+S  VHKIPD  T K+L E
Sbjct: 304 AKKVYEGLE----QPNAATFRTLIFHLCINGLYDQGLTVFKKSAIVHKIPDFKTCKHLTE 363

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGL------ASSSSSSSSSSKADD 419
           GL++  +M++A+G+ R ++KKFPP  +  W+K+EE LGL      A+ SSSS +    D 
Sbjct: 364 GLVKNNRMEDARGVARIVKKKFPPRLVTEWKKLEEKLGLYSKGNAAAVSSSSQTREVLDQ 416

BLAST of Cp4.1LG14g01660 vs. Swiss-Prot
Match: PPR87_ARATH (Pentatricopeptide repeat-containing protein At1g61870, mitochondrial OS=Arabidopsis thaliana GN=PPR336 PE=2 SV=2)

HSP 1 Score: 198.7 bits (504), Expect = 1.2e-49
Identity = 125/395 (31.65%), Postives = 218/395 (55.19%), Query Frame = 1

Query: 5   ATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSVSSH 64
           A+  IR LS+A+T  +  +     T  TS   S   A S L++E DPD+ L I  + S  
Sbjct: 19  ASPQIRSLSSASTILSPDSK----TPLTSKEKS-KAALSLLKSEKDPDRILEICRAASLT 78

Query: 65  YTSPVSSRYAQELTIRRLAKSRRFDDIESLIESH-KNDPKITQEPFLSTLIRSYGRAGMF 124
               +  R A    +  LA+ + F  + +L++   +N P +  E F +  I  Y +A M 
Sbjct: 79  PDCRID-RIAFSAAVENLAEKKHFSAVSNLLDGFIENRPDLKSERFAAHAIVLYAQANML 138

Query: 125 EHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGI 184
           +H++R +  +E F   R+V S NALL A   +K + +  +++ E+PK Y   P+  +Y  
Sbjct: 139 DHSLRVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKRVYIEMPKMYGIEPDLETYNR 198

Query: 185 LVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKG 244
           ++K +CESGS   +  IV EME   ++  + +F  ++   Y + +S+E  K+   M  +G
Sbjct: 199 MIKVFCESGSASSSYSIVAEMERKGIKPNSSSFGLMISGFYAEDKSDEVGKVLAMMKDRG 258

Query: 245 CELDVGAYNVRLMHE-HGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAK 304
             + V  YN+R+       K +  +AL++ M ++GMKP+T++Y++L+  +C     +EAK
Sbjct: 259 VNIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLIHGFCNEDDFEEAK 318

Query: 305 KVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGL 364
           K++  M   GC  ++  + T +YYLC+ GD+E    + KES++ + +P  + +K LV GL
Sbjct: 319 KLFKIMVNRGCKPDSECYFTLIYYLCKGGDFETALSLCKESMEKNWVPSFSIMKSLVNGL 378

Query: 365 MEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEAL 398
            +  K++EAK LI  +++KF   +++ W +VE AL
Sbjct: 379 AKDSKVEEAKELIGQVKEKF-TRNVELWNEVEAAL 406

BLAST of Cp4.1LG14g01660 vs. Swiss-Prot
Match: PPR82_ARATH (Pentatricopeptide repeat-containing protein At1g55890, mitochondrial OS=Arabidopsis thaliana GN=At1g55890 PE=2 SV=1)

HSP 1 Score: 198.7 bits (504), Expect = 1.2e-49
Identity = 120/371 (32.35%), Postives = 208/371 (56.06%), Query Frame = 1

Query: 9   IRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV--SSHYT 68
           IRR S+AAT  +  TA   A +    S++     S +  E +P + +  +     S  + 
Sbjct: 17  IRRFSSAATVVSEPTAVTAAISPPQKSLT-----SLVNGERNPKRIVEKFKKACESERFR 76

Query: 69  SPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHA 128
           + ++     + T+RRL  ++R   +E ++E  K    +++E F + +I  YG+AGMFE+A
Sbjct: 77  TNIA---VYDRTVRRLVAAKRLHYVEEILEEQKKYRDMSKEGFAARIISLYGKAGMFENA 136

Query: 129 MRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVK 188
            + + +M +    RSV+SFNALL A+  SK+FD V +LF+E+P K S  P+ +SY  L+K
Sbjct: 137 QKVFEEMPNRDCKRSVLSFNALLSAYRLSKKFDVVEELFNELPGKLSIKPDIVSYNTLIK 196

Query: 189 SYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCEL 248
           + CE  S  +A+ ++ E+E   ++   VTF T+L + Y KG+ E  E+IW KM+ K   +
Sbjct: 197 ALCEKDSLPEAVALLDEIENKGLKPDIVTFNTLLLSSYLKGQFELGEEIWAKMVEKNVAI 256

Query: 249 DVGAYNVRLMH-EHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVY 308
           D+  YN RL+   +  K + +  L  E+  SG+KPD  S+N ++      G +DEA+  Y
Sbjct: 257 DIRTYNARLLGLANEAKSKELVNLFGELKASGLKPDVFSFNAMIRGSINEGKMDEAEAWY 316

Query: 309 DDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEK 368
            ++  +G   + ATF   +  +C+ GD+E    +FKE+     +    T++ LV+ L++ 
Sbjct: 317 KEIVKHGYRPDKATFALLLPAMCKAGDFESAIELFKETFSKRYLVGQTTLQQLVDELVKG 376

Query: 369 KKMKEAKGLIR 377
            K +EA+ +++
Sbjct: 377 SKREEAEEIVK 379

BLAST of Cp4.1LG14g01660 vs. Swiss-Prot
Match: PP226_ARATH (Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidopsis thaliana GN=At3g13160 PE=2 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 1.3e-48
Identity = 98/301 (32.56%), Postives = 173/301 (57.48%), Query Frame = 1

Query: 76  ELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMED 135
           E T+RRLA +++F+ +E ++E     P +++E F++ +I  YGR GMFE+A + +++M +
Sbjct: 75  ERTVRRLAAAKKFEWVEEILEEQNKYPNMSKEGFVARIINLYGRVGMFENAQKVFDEMPE 134

Query: 136 FGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPE 195
               R+ +SFNALL A  +SK+FD V  +F E+P K S  P+  SY  L+K  C  GS  
Sbjct: 135 RNCKRTALSFNALLNACVNSKKFDLVEGIFKELPGKLSIEPDVASYNTLIKGLCGKGSFT 194

Query: 196 KAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRL 255
           +A+ ++ E+E   ++   +TF  +L   Y KG+ EE E+IW +M+ K  + D+ +YN RL
Sbjct: 195 EAVALIDEIENKGLKPDHITFNILLHESYTKGKFEEGEQIWARMVEKNVKRDIRSYNARL 254

Query: 256 MH-EHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCN 315
           +      K E + +L +++  + +KPD  ++  ++  +   G +DEA   Y ++E NGC 
Sbjct: 255 LGLAMENKSEEMVSLFDKLKGNELKPDVFTFTAMIKGFVSEGKLDEAITWYKEIEKNGCR 314

Query: 316 KNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGL 375
                F + +  +C+ GD E  Y + KE      + D   ++ +V+ L++  K  EA+ +
Sbjct: 315 PLKFVFNSLLPAICKAGDLESAYELCKEIFAKRLLVDEAVLQEVVDALVKGSKQDEAEEI 374

BLAST of Cp4.1LG14g01660 vs. TrEMBL
Match: A0A0A0KXX6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G038790 PE=4 SV=1)

HSP 1 Score: 642.9 bits (1657), Expect = 2.7e-181
Identity = 322/386 (83.42%), Postives = 354/386 (91.71%), Query Frame = 1

Query: 31  TTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYTSPVSSRYAQELTIRRLAKSRRFDD 90
           T S+S+S   +KSKLRTEYDPDKA+ IYSSVSSHYTSPV+SRYAQE+TIRRLAK+RRF D
Sbjct: 63  TPSTSVS---SKSKLRTEYDPDKAVEIYSSVSSHYTSPVTSRYAQEITIRRLAKARRFKD 122

Query: 91  IESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMEDFGTPRSVISFNALLC 150
           IESLIESHKNDPKITQEPFLSTLIRSYGR GMFEHAMRTYNQM D GTPRS +SFNALL 
Sbjct: 123 IESLIESHKNDPKITQEPFLSTLIRSYGRVGMFEHAMRTYNQMGDLGTPRSALSFNALLT 182

Query: 151 AFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPEKAMQIVREMEENDVE 210
           A N+SKQFDKVPQLFDE+PK+Y+FSPNK SYGILVKSYC++GSPEKAM+IVREMEEN VE
Sbjct: 183 ACNNSKQFDKVPQLFDEMPKRYNFSPNKFSYGILVKSYCDAGSPEKAMEIVREMEENGVE 242

Query: 211 VTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRLMHEHGGKPEHVEALI 270
           V AVTFTTIL+ALYKKG+S EAEKIW  MISKGCELDVGAYNVRLMHEHGGKPEHV+ALI
Sbjct: 243 VNAVTFTTILNALYKKGDSAEAEKIWETMISKGCELDVGAYNVRLMHEHGGKPEHVQALI 302

Query: 271 EEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCNKNAATFRTFMYYLCRN 330
           EEMANSG+KPD ISYNYLMTCYCKNGM DEAKKVY+DMEINGCNKNAATFRT +Y+LCRN
Sbjct: 303 EEMANSGLKPDAISYNYLMTCYCKNGMFDEAKKVYNDMEINGCNKNAATFRTLIYHLCRN 362

Query: 331 GDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGLIRTIRKKFPPDSLKAW 390
           G+YEKGY VFKESVK++KIPD NT+KYLVEGL+EKK M+EAKGLIRTIRKKFPPD+LKAW
Sbjct: 363 GEYEKGYKVFKESVKMNKIPDFNTLKYLVEGLVEKKMMREAKGLIRTIRKKFPPDTLKAW 422

Query: 391 RKVEEALGLASSSSSSSSSSKADDET 417
           R+VEE +GLA  S+    SSK DDET
Sbjct: 423 REVEEGVGLA--SAGDDVSSKDDDET 443

BLAST of Cp4.1LG14g01660 vs. TrEMBL
Match: A0A061DFV5_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TCM_000426 PE=4 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 1.0e-159
Identity = 287/416 (68.99%), Postives = 350/416 (84.13%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           SSI  RH+R LST  T          ATA++S SISIS+AK+KLRTEYDPDKAL IYSSV
Sbjct: 3   SSIPLRHLRHLSTTTTTA--------ATASSSISISISQAKNKLRTEYDPDKALEIYSSV 62

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S HY+SP SSRYAQ+LT+RRLAKSRRF DIESLIESHK DPKITQEPFLSTLIRSYG AG
Sbjct: 63  SKHYSSPSSSRYAQDLTVRRLAKSRRFSDIESLIESHKTDPKITQEPFLSTLIRSYGIAG 122

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKY-SFSPNKIS 181
           M +HA++T++QM+ FGTPRS ISFN+LL A NHS+QFDKVPQLF+EIPKKY S SP+K+S
Sbjct: 123 MLDHAIKTFDQMDQFGTPRSTISFNSLLSAGNHSRQFDKVPQLFEEIPKKYGSVSPDKVS 182

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YGIL+KSYCE+G P+K ++++REME   VEVTAVTFTTIL+ALYKKG++EEAEK+W+ M+
Sbjct: 183 YGILIKSYCEAGHPDKGIEVLREMERKSVEVTAVTFTTILNALYKKGKTEEAEKLWSDMM 242

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
             GCELDV +YNVR+M+  GG PE V+ LI+EM+  G+KPDTISYNYLMTCYCKNGM+DE
Sbjct: 243 KNGCELDVASYNVRIMNLQGGDPEKVKELIDEMSTMGLKPDTISYNYLMTCYCKNGMLDE 302

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E NGCN NAATFRT ++YLC NG +E+GY VFKESV++HKIPD NT+K+LVE
Sbjct: 303 AKKVYEGLEGNGCNPNAATFRTLVFYLCLNGLHEQGYKVFKESVRLHKIPDFNTLKHLVE 362

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSKADDET 417
           GL++ KK+KEAKGLIRT++KKFPP+ L AW+K+EE LGL S ++    + +A + T
Sbjct: 363 GLVKNKKIKEAKGLIRTVKKKFPPNFLNAWKKLEEELGLVSGNAGGGEAQEAKEAT 410

BLAST of Cp4.1LG14g01660 vs. TrEMBL
Match: A0A0D2PQ35_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G068700 PE=4 SV=1)

HSP 1 Score: 558.1 bits (1437), Expect = 8.8e-156
Identity = 282/412 (68.45%), Postives = 344/412 (83.50%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           SSI  RH+R  S        AT+ A A A  SSSIS+S+AKSKLRTEYDPDKAL IYSSV
Sbjct: 3   SSIRLRHLRHFS--------ATSNAAAAAAYSSSISVSQAKSKLRTEYDPDKALEIYSSV 62

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S HY+SP SSRYAQ+LT+RRLAKSRRF DIESLIESHK DPKI+QEPFLSTLIRSYG AG
Sbjct: 63  SKHYSSPSSSRYAQDLTVRRLAKSRRFSDIESLIESHKTDPKISQEPFLSTLIRSYGIAG 122

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKY-SFSPNKIS 181
           M +HA++T++QM+ FGTPRS ISFNALL A N S+QFD+VPQLFDEIPKKY   SP+K+S
Sbjct: 123 MLDHAIKTFHQMDQFGTPRSTISFNALLSACNQSRQFDRVPQLFDEIPKKYIGLSPDKVS 182

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YGILVKSYCE+G PEK ++++REME   VEVTAVT TTIL+ALYKKG++EEAEK+W +M+
Sbjct: 183 YGILVKSYCEAGHPEKGLEVLREMERKSVEVTAVTSTTILNALYKKGKTEEAEKLWFEMM 242

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
             GCELDV +YNVR+ +  GG+PE V+ LI++M+  G+KPDTISYNYLMTCYCK GM+DE
Sbjct: 243 KTGCELDVASYNVRISNFQGGEPEKVKELIDDMSTLGLKPDTISYNYLMTCYCKRGMLDE 302

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E NGCN NAATFRT ++YLC NG YE+GY VFKESV++HKIPD NT+K+LVE
Sbjct: 303 AKKVYEGLEGNGCNPNAATFRTLVFYLCLNGLYEQGYKVFKESVRLHKIPDFNTLKHLVE 362

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSKA 413
           GL+ KKK+K+AKGLIRT++K FPP+ LKAW+K+EE LGL S ++ +  + ++
Sbjct: 363 GLVMKKKIKDAKGLIRTVKKTFPPNFLKAWKKLEEELGLVSGNAEAREAKES 406

BLAST of Cp4.1LG14g01660 vs. TrEMBL
Match: A5BUJ7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032192 PE=4 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 1.6e-152
Identity = 279/411 (67.88%), Postives = 344/411 (83.70%), Query Frame = 1

Query: 7   RHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYT 66
           RH+R LSTAA A A A+ AA   + +SSSIS+S+AKS LR+E+DPD+AL IYSSVS HYT
Sbjct: 10  RHVRHLSTAAAAAAAASTAA--ASASSSSISVSRAKSILRSEFDPDRALEIYSSVSKHYT 69

Query: 67  SPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHA 126
           SP++SRYAQ+LT++RLAKSRRF DIE+LIESHKNDPKITQEP+LSTLIRSYG AGMF+HA
Sbjct: 70  SPLASRYAQDLTVKRLAKSRRFADIETLIESHKNDPKITQEPYLSTLIRSYGIAGMFQHA 129

Query: 127 MRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVK 186
           +RT+NQME+ GTPRS ISFNALL A N SK FD+VP+ F+EIP++Y   P+KISYGILVK
Sbjct: 130 LRTFNQMEELGTPRSSISFNALLSACNQSKLFDQVPKFFEEIPRRYGIXPDKISYGILVK 189

Query: 187 SYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCEL 246
           SYCESG  +KA+ +++EMEE  VE+TAVTFTTILDALYK+G+S+ AEK+W++M  KGC L
Sbjct: 190 SYCESGLSDKAISMLKEMEEKGVEITAVTFTTILDALYKQGQSDRAEKVWHEMAKKGC-L 249

Query: 247 DVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYD 306
           DVGAYNV++M  HGG PE+V+ALI+EM+N+G+KPDTISYNYLMT YCK+GM+DEAKKVY 
Sbjct: 250 DVGAYNVKIMFAHGGDPENVKALIDEMSNAGLKPDTISYNYLMTSYCKSGMMDEAKKVYA 309

Query: 307 DMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKK 366
           ++E  GC+ NAATFRT +YYLCR+GD+E GY VFK+S    KIPD  T+++LVEGL++KK
Sbjct: 310 ELEETGCHPNAATFRTLIYYLCRSGDFETGYKVFKQSAFRRKIPDFGTLRHLVEGLVQKK 369

Query: 367 KMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSKADDETE 418
           K KEAKGLIRT++K FP + L  WRK+EE LGLA   SS +    ADD  E
Sbjct: 370 KTKEAKGLIRTVKKNFPANFLNVWRKLEEDLGLAGVDSSPA----ADDVQE 413

BLAST of Cp4.1LG14g01660 vs. TrEMBL
Match: A0A067GIE9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015673mg PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 2.0e-152
Identity = 267/393 (67.94%), Postives = 337/393 (85.75%), Query Frame = 1

Query: 7   RHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYT 66
           RHIRRL TA TA  ++        TT+SSIS+SKAKSKLR+E+DPDKAL+IYSSVS HY 
Sbjct: 5   RHIRRLCTATTAAGSS--------TTASSISVSKAKSKLRSEFDPDKALDIYSSVSKHYA 64

Query: 67  SPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHA 126
           SPVSSRYAQ+LT+RRLAKS+RF DIE+LIESHKNDPKITQEP+L  LIRSYG+AGMF+HA
Sbjct: 65  SPVSSRYAQDLTVRRLAKSKRFSDIETLIESHKNDPKITQEPYLCNLIRSYGQAGMFDHA 124

Query: 127 MRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVK 186
           MRT++QM++ GTPRSVISFNALL A   S+ +DKVP LFDEIPKKY+ SP+KISYG+L+K
Sbjct: 125 MRTFDQMDELGTPRSVISFNALLFACTRSRLYDKVPILFDEIPKKYNLSPDKISYGLLLK 184

Query: 187 SYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCEL 246
           S+C+SGS +KA++++ EME   VEVT VT+TT+L+ LYK+G +EEAE++W++M  KG +L
Sbjct: 185 SHCDSGSSDKALELLNEMENKGVEVTTVTYTTVLNCLYKQGNAEEAERLWSEMEKKGVDL 244

Query: 247 DVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYD 306
           DV AYNVR+ + +GG PE ++ LI+EM ++G+KPDTISYN+LMTCYCKN M+DEAKKVY+
Sbjct: 245 DVAAYNVRITNTYGGDPERLKELIDEMRDAGLKPDTISYNFLMTCYCKNEMMDEAKKVYE 304

Query: 307 DMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKK 366
            +E NGC+ NA TFRT++Y+LC +G+++K Y VFKESV VHKIPD NTVK LVEGL++KK
Sbjct: 305 GLEENGCSPNATTFRTWIYHLCGSGNFDKAYKVFKESVMVHKIPDFNTVKLLVEGLVKKK 364

Query: 367 KMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGL 400
           K+KEAKG+IRTI+KKFPP+ L+AW+KVEE LGL
Sbjct: 365 KIKEAKGVIRTIKKKFPPNVLRAWKKVEEELGL 389

BLAST of Cp4.1LG14g01660 vs. TAIR10
Match: AT4G36680.1 (AT4G36680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 505.4 bits (1300), Expect = 3.4e-143
Identity = 259/411 (63.02%), Postives = 323/411 (78.59%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           S I+ R +RR ++AA    T       TA +S  IS+SKAKS LR E+DPDKAL IY++V
Sbjct: 4   SRISLRLVRRFASAAADGTT-------TAPSSGKISVSKAKSTLRKEHDPDKALKIYANV 63

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S H  SPVSSRYAQELT+RRLAK RRF DIE+LIESHKNDPKI +EPF STLIRSYG+A 
Sbjct: 64  SDHSASPVSSRYAQELTVRRLAKCRRFSDIETLIESHKNDPKIKEEPFYSTLIRSYGQAS 123

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYS-FSPNKIS 181
           MF HAMRT+ QM+ +GTPRS +SFNALL A  HSK FDKVPQLFDEIP++Y+   P+KIS
Sbjct: 124 MFNHAMRTFEQMDQYGTPRSAVSFNALLNACLHSKNFDKVPQLFDEIPQRYNKIIPDKIS 183

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YGIL+KSYC+SG+PEKA++I+R+M+   +EVT + FTTIL +LYKKGE E A+ +WN+M+
Sbjct: 184 YGILIKSYCDSGTPEKAIEIMRQMQGKGMEVTTIAFTTILSSLYKKGELEVADNLWNEMV 243

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
            KGCELD  AYNVR+M      PE V+ LIEEM++ G+KPDTISYNYLMT YC+ GM+DE
Sbjct: 244 KKGCELDNAAYNVRIMSAQKESPERVKELIEEMSSMGLKPDTISYNYLMTAYCERGMLDE 303

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E N C  NAATFRT +++LC +  YE+GY +FK+SV +HKIPD NT+K+LV 
Sbjct: 304 AKKVYEGLEGNNCAPNAATFRTLIFHLCYSRLYEQGYAIFKKSVYMHKIPDFNTLKHLVV 363

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSK 412
           GL+E KK  +AKGLIRT++KKFPP  L AW+K+EE LGL S + +  SS+K
Sbjct: 364 GLVENKKRDDAKGLIRTVKKKFPPSFLNAWKKLEEELGLYSKTDAFPSSAK 407

BLAST of Cp4.1LG14g01660 vs. TAIR10
Match: AT2G18520.1 (AT2G18520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 442.6 bits (1137), Expect = 2.7e-124
Identity = 238/424 (56.13%), Postives = 311/424 (73.35%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           S +  R +RR STA    +  TA   A       I++SKAKSKLR   DPDKAL IY SV
Sbjct: 4   SRLYLRFLRRFSTATGIDSQTTAYPGA-------ITMSKAKSKLRKVQDPDKALAIYKSV 63

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S++ TSP+SSRYA ELT++RLAKS+RF DIE+LIESHKN+PKI  E FLSTLIRSYGRA 
Sbjct: 64  SNNSTSPLSSRYAMELTVQRLAKSQRFSDIEALIESHKNNPKIKTETFLSTLIRSYGRAS 123

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKY-SFSPNKIS 181
           MF+HAM+ + +M+  GTPR+V+SFNALL A  HS  F++VPQLFDE P++Y + +P+KIS
Sbjct: 124 MFDHAMKMFEEMDKLGTPRTVVSFNALLAACLHSDLFERVPQLFDEFPQRYNNITPDKIS 183

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YG+L+KSYC+SG PEKAM+I+R+ME   VEVT + FTTIL +LYK G  +EAE +W +M+
Sbjct: 184 YGMLIKSYCDSGKPEKAMEIMRDMEVKGVEVTIIAFTTILGSLYKNGLVDEAESLWIEMV 243

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
           +KGC+LD   YNVRLM+     PE V+ L+EEM++ G+KPDT+SYNYLMT YC  GM+ E
Sbjct: 244 NKGCDLDNTVYNVRLMNAAKESPERVKELMEEMSSVGLKPDTVSYNYLMTAYCVKGMMSE 303

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E      NAATFRT +++LC NG Y++G  VFK+S  VHKIPD  T K+L E
Sbjct: 304 AKKVYEGLE----QPNAATFRTLIFHLCINGLYDQGLTVFKKSAIVHKIPDFKTCKHLTE 363

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGL------ASSSSSSSSSSKADD 419
           GL++  +M++A+G+ R ++KKFPP  +  W+K+EE LGL      A+ SSSS +    D 
Sbjct: 364 GLVKNNRMEDARGVARIVKKKFPPRLVTEWKKLEEKLGLYSKGNAAAVSSSSQTREVLDQ 416

BLAST of Cp4.1LG14g01660 vs. TAIR10
Match: AT1G61870.1 (AT1G61870.1 pentatricopeptide repeat 336)

HSP 1 Score: 198.7 bits (504), Expect = 6.9e-51
Identity = 125/395 (31.65%), Postives = 218/395 (55.19%), Query Frame = 1

Query: 5   ATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSVSSH 64
           A+  IR LS+A+T  +  +     T  TS   S   A S L++E DPD+ L I  + S  
Sbjct: 19  ASPQIRSLSSASTILSPDSK----TPLTSKEKS-KAALSLLKSEKDPDRILEICRAASLT 78

Query: 65  YTSPVSSRYAQELTIRRLAKSRRFDDIESLIESH-KNDPKITQEPFLSTLIRSYGRAGMF 124
               +  R A    +  LA+ + F  + +L++   +N P +  E F +  I  Y +A M 
Sbjct: 79  PDCRID-RIAFSAAVENLAEKKHFSAVSNLLDGFIENRPDLKSERFAAHAIVLYAQANML 138

Query: 125 EHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGI 184
           +H++R +  +E F   R+V S NALL A   +K + +  +++ E+PK Y   P+  +Y  
Sbjct: 139 DHSLRVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKRVYIEMPKMYGIEPDLETYNR 198

Query: 185 LVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKG 244
           ++K +CESGS   +  IV EME   ++  + +F  ++   Y + +S+E  K+   M  +G
Sbjct: 199 MIKVFCESGSASSSYSIVAEMERKGIKPNSSSFGLMISGFYAEDKSDEVGKVLAMMKDRG 258

Query: 245 CELDVGAYNVRLMHE-HGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAK 304
             + V  YN+R+       K +  +AL++ M ++GMKP+T++Y++L+  +C     +EAK
Sbjct: 259 VNIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLIHGFCNEDDFEEAK 318

Query: 305 KVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGL 364
           K++  M   GC  ++  + T +YYLC+ GD+E    + KES++ + +P  + +K LV GL
Sbjct: 319 KLFKIMVNRGCKPDSECYFTLIYYLCKGGDFETALSLCKESMEKNWVPSFSIMKSLVNGL 378

Query: 365 MEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEAL 398
            +  K++EAK LI  +++KF   +++ W +VE AL
Sbjct: 379 AKDSKVEEAKELIGQVKEKF-TRNVELWNEVEAAL 406

BLAST of Cp4.1LG14g01660 vs. TAIR10
Match: AT1G55890.1 (AT1G55890.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 198.7 bits (504), Expect = 6.9e-51
Identity = 120/371 (32.35%), Postives = 208/371 (56.06%), Query Frame = 1

Query: 9   IRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV--SSHYT 68
           IRR S+AAT  +  TA   A +    S++     S +  E +P + +  +     S  + 
Sbjct: 17  IRRFSSAATVVSEPTAVTAAISPPQKSLT-----SLVNGERNPKRIVEKFKKACESERFR 76

Query: 69  SPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHA 128
           + ++     + T+RRL  ++R   +E ++E  K    +++E F + +I  YG+AGMFE+A
Sbjct: 77  TNIA---VYDRTVRRLVAAKRLHYVEEILEEQKKYRDMSKEGFAARIISLYGKAGMFENA 136

Query: 129 MRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVK 188
            + + +M +    RSV+SFNALL A+  SK+FD V +LF+E+P K S  P+ +SY  L+K
Sbjct: 137 QKVFEEMPNRDCKRSVLSFNALLSAYRLSKKFDVVEELFNELPGKLSIKPDIVSYNTLIK 196

Query: 189 SYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCEL 248
           + CE  S  +A+ ++ E+E   ++   VTF T+L + Y KG+ E  E+IW KM+ K   +
Sbjct: 197 ALCEKDSLPEAVALLDEIENKGLKPDIVTFNTLLLSSYLKGQFELGEEIWAKMVEKNVAI 256

Query: 249 DVGAYNVRLMH-EHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVY 308
           D+  YN RL+   +  K + +  L  E+  SG+KPD  S+N ++      G +DEA+  Y
Sbjct: 257 DIRTYNARLLGLANEAKSKELVNLFGELKASGLKPDVFSFNAMIRGSINEGKMDEAEAWY 316

Query: 309 DDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEK 368
            ++  +G   + ATF   +  +C+ GD+E    +FKE+     +    T++ LV+ L++ 
Sbjct: 317 KEIVKHGYRPDKATFALLLPAMCKAGDFESAIELFKETFSKRYLVGQTTLQQLVDELVKG 376

Query: 369 KKMKEAKGLIR 377
            K +EA+ +++
Sbjct: 377 SKREEAEEIVK 379

BLAST of Cp4.1LG14g01660 vs. TAIR10
Match: AT3G13160.1 (AT3G13160.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 195.3 bits (495), Expect = 7.6e-50
Identity = 98/301 (32.56%), Postives = 173/301 (57.48%), Query Frame = 1

Query: 76  ELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMED 135
           E T+RRLA +++F+ +E ++E     P +++E F++ +I  YGR GMFE+A + +++M +
Sbjct: 75  ERTVRRLAAAKKFEWVEEILEEQNKYPNMSKEGFVARIINLYGRVGMFENAQKVFDEMPE 134

Query: 136 FGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPE 195
               R+ +SFNALL A  +SK+FD V  +F E+P K S  P+  SY  L+K  C  GS  
Sbjct: 135 RNCKRTALSFNALLNACVNSKKFDLVEGIFKELPGKLSIEPDVASYNTLIKGLCGKGSFT 194

Query: 196 KAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRL 255
           +A+ ++ E+E   ++   +TF  +L   Y KG+ EE E+IW +M+ K  + D+ +YN RL
Sbjct: 195 EAVALIDEIENKGLKPDHITFNILLHESYTKGKFEEGEQIWARMVEKNVKRDIRSYNARL 254

Query: 256 MH-EHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCN 315
           +      K E + +L +++  + +KPD  ++  ++  +   G +DEA   Y ++E NGC 
Sbjct: 255 LGLAMENKSEEMVSLFDKLKGNELKPDVFTFTAMIKGFVSEGKLDEAITWYKEIEKNGCR 314

Query: 316 KNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGL 375
                F + +  +C+ GD E  Y + KE      + D   ++ +V+ L++  K  EA+ +
Sbjct: 315 PLKFVFNSLLPAICKAGDLESAYELCKEIFAKRLLVDEAVLQEVVDALVKGSKQDEAEEI 374

BLAST of Cp4.1LG14g01660 vs. NCBI nr
Match: gi|659107543|ref|XP_008453729.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial [Cucumis melo])

HSP 1 Score: 684.1 bits (1764), Expect = 1.5e-193
Identity = 353/421 (83.85%), Postives = 387/421 (91.92%), Query Frame = 1

Query: 3   SIATRH-IRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 62
           SI+TRH IRRLSTA  ATA A AA  AT TTSSS+SISKAKSKLR EYDPDKAL IYSSV
Sbjct: 4   SISTRHHIRRLSTA--ATAAAAAATNATETTSSSLSISKAKSKLRNEYDPDKALEIYSSV 63

Query: 63  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 122
           SSHYTSPV+SRYAQE+TIRRLAKSRRF DIESLIESHKNDPKITQEPFLSTLIRSYGR G
Sbjct: 64  SSHYTSPVTSRYAQEITIRRLAKSRRFKDIESLIESHKNDPKITQEPFLSTLIRSYGRVG 123

Query: 123 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISY 182
           MFEHAMRTYNQM D GTPRS +SFNALL A NHSKQFDKVPQLFDE+PK+Y+FSPNKISY
Sbjct: 124 MFEHAMRTYNQMGDLGTPRSALSFNALLSACNHSKQFDKVPQLFDEMPKRYNFSPNKISY 183

Query: 183 GILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMIS 242
           GILVKSYC++GSPEKA+QI+REMEENDVEVTAVTFTTI++ALYKKGES EAEKIW+KM+S
Sbjct: 184 GILVKSYCDAGSPEKALQILREMEENDVEVTAVTFTTIINALYKKGESAEAEKIWDKMMS 243

Query: 243 KGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEA 302
           KGCELDVGAYNVRLMHEHGGKPE V+A+IEEMANSG+KPD ISYNYLMTCYCKNGMIDEA
Sbjct: 244 KGCELDVGAYNVRLMHEHGGKPERVQAIIEEMANSGLKPDAISYNYLMTCYCKNGMIDEA 303

Query: 303 KKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEG 362
           KKVY+DMEINGCNKNAATFRTF+Y+LCRNG+YEKGY VFKESVK++KIPD NT+KYLVEG
Sbjct: 304 KKVYNDMEINGCNKNAATFRTFIYHLCRNGEYEKGYKVFKESVKMNKIPDFNTLKYLVEG 363

Query: 363 LMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLAS-----SSSSSSSSSKADDET 418
           L+EKK M+EAKGLIRT+RKKFPPD+LKAWR+VEE +GLAS     SS   + SSK DDET
Sbjct: 364 LVEKKMMREAKGLIRTVRKKFPPDTLKAWREVEEGVGLASAGDDVSSKDDNVSSKDDDET 422

BLAST of Cp4.1LG14g01660 vs. NCBI nr
Match: gi|778697504|ref|XP_011654338.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 653.3 bits (1684), Expect = 2.9e-184
Identity = 329/390 (84.36%), Postives = 361/390 (92.56%), Query Frame = 1

Query: 28  ATA-TTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYTSPVSSRYAQELTIRRLAKSR 87
           ATA T+SSS+SIS+AKSKLRTEYDPDKA+ IYSSVSSHYTSPV+SRYAQE+TIRRLAK+R
Sbjct: 29  ATATTSSSSLSISRAKSKLRTEYDPDKAVEIYSSVSSHYTSPVTSRYAQEITIRRLAKAR 88

Query: 88  RFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMEDFGTPRSVISFN 147
           RF DIESLIESHKNDPKITQEPFLSTLIRSYGR GMFEHAMRTYNQM D GTPRS +SFN
Sbjct: 89  RFKDIESLIESHKNDPKITQEPFLSTLIRSYGRVGMFEHAMRTYNQMGDLGTPRSALSFN 148

Query: 148 ALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPEKAMQIVREMEE 207
           ALL A N+SKQFDKVPQLFDE+PK+Y+FSPNK SYGILVKSYC++GSPEKAM+IVREMEE
Sbjct: 149 ALLTACNNSKQFDKVPQLFDEMPKRYNFSPNKFSYGILVKSYCDAGSPEKAMEIVREMEE 208

Query: 208 NDVEVTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRLMHEHGGKPEHV 267
           N VEV AVTFTTIL+ALYKKG+S EAEKIW  MISKGCELDVGAYNVRLMHEHGGKPEHV
Sbjct: 209 NGVEVNAVTFTTILNALYKKGDSAEAEKIWETMISKGCELDVGAYNVRLMHEHGGKPEHV 268

Query: 268 EALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCNKNAATFRTFMYY 327
           +ALIEEMANSG+KPD ISYNYLMTCYCKNGM DEAKKVY+DMEINGCNKNAATFRT +Y+
Sbjct: 269 QALIEEMANSGLKPDAISYNYLMTCYCKNGMFDEAKKVYNDMEINGCNKNAATFRTLIYH 328

Query: 328 LCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGLIRTIRKKFPPDS 387
           LCRNG+YEKGY VFKESVK++KIPD NT+KYLVEGL+EKK M+EAKGLIRTIRKKFPPD+
Sbjct: 329 LCRNGEYEKGYKVFKESVKMNKIPDFNTLKYLVEGLVEKKMMREAKGLIRTIRKKFPPDT 388

Query: 388 LKAWRKVEEALGLASSSSSSSSSSKADDET 417
           LKAWR+VEE +GLA  S+    SSK DDET
Sbjct: 389 LKAWREVEEGVGLA--SAGDDVSSKDDDET 416

BLAST of Cp4.1LG14g01660 vs. NCBI nr
Match: gi|700198104|gb|KGN53262.1| (hypothetical protein Csa_4G038790 [Cucumis sativus])

HSP 1 Score: 642.9 bits (1657), Expect = 3.9e-181
Identity = 322/386 (83.42%), Postives = 354/386 (91.71%), Query Frame = 1

Query: 31  TTSSSISISKAKSKLRTEYDPDKALNIYSSVSSHYTSPVSSRYAQELTIRRLAKSRRFDD 90
           T S+S+S   +KSKLRTEYDPDKA+ IYSSVSSHYTSPV+SRYAQE+TIRRLAK+RRF D
Sbjct: 63  TPSTSVS---SKSKLRTEYDPDKAVEIYSSVSSHYTSPVTSRYAQEITIRRLAKARRFKD 122

Query: 91  IESLIESHKNDPKITQEPFLSTLIRSYGRAGMFEHAMRTYNQMEDFGTPRSVISFNALLC 150
           IESLIESHKNDPKITQEPFLSTLIRSYGR GMFEHAMRTYNQM D GTPRS +SFNALL 
Sbjct: 123 IESLIESHKNDPKITQEPFLSTLIRSYGRVGMFEHAMRTYNQMGDLGTPRSALSFNALLT 182

Query: 151 AFNHSKQFDKVPQLFDEIPKKYSFSPNKISYGILVKSYCESGSPEKAMQIVREMEENDVE 210
           A N+SKQFDKVPQLFDE+PK+Y+FSPNK SYGILVKSYC++GSPEKAM+IVREMEEN VE
Sbjct: 183 ACNNSKQFDKVPQLFDEMPKRYNFSPNKFSYGILVKSYCDAGSPEKAMEIVREMEENGVE 242

Query: 211 VTAVTFTTILDALYKKGESEEAEKIWNKMISKGCELDVGAYNVRLMHEHGGKPEHVEALI 270
           V AVTFTTIL+ALYKKG+S EAEKIW  MISKGCELDVGAYNVRLMHEHGGKPEHV+ALI
Sbjct: 243 VNAVTFTTILNALYKKGDSAEAEKIWETMISKGCELDVGAYNVRLMHEHGGKPEHVQALI 302

Query: 271 EEMANSGMKPDTISYNYLMTCYCKNGMIDEAKKVYDDMEINGCNKNAATFRTFMYYLCRN 330
           EEMANSG+KPD ISYNYLMTCYCKNGM DEAKKVY+DMEINGCNKNAATFRT +Y+LCRN
Sbjct: 303 EEMANSGLKPDAISYNYLMTCYCKNGMFDEAKKVYNDMEINGCNKNAATFRTLIYHLCRN 362

Query: 331 GDYEKGYMVFKESVKVHKIPDVNTVKYLVEGLMEKKKMKEAKGLIRTIRKKFPPDSLKAW 390
           G+YEKGY VFKESVK++KIPD NT+KYLVEGL+EKK M+EAKGLIRTIRKKFPPD+LKAW
Sbjct: 363 GEYEKGYKVFKESVKMNKIPDFNTLKYLVEGLVEKKMMREAKGLIRTIRKKFPPDTLKAW 422

Query: 391 RKVEEALGLASSSSSSSSSSKADDET 417
           R+VEE +GLA  S+    SSK DDET
Sbjct: 423 REVEEGVGLA--SAGDDVSSKDDDET 443

BLAST of Cp4.1LG14g01660 vs. NCBI nr
Match: gi|1009145187|ref|XP_015890196.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial [Ziziphus jujuba])

HSP 1 Score: 578.2 bits (1489), Expect = 1.2e-161
Identity = 288/410 (70.24%), Postives = 352/410 (85.85%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           SS   RH+R L   ++  AT T+AAK       SISIS+AKSKLR+E+DPDKAL IYSS+
Sbjct: 3   SSTPIRHLRHLRHLSSTAATTTSAAKP------SISISRAKSKLRSEHDPDKALEIYSSL 62

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S +Y SP SSRYAQ+LT+RRLAK+RRF DIE+LIESHK DPKIT+EP+LSTLIRSYG AG
Sbjct: 63  SKNYCSPTSSRYAQDLTVRRLAKARRFSDIETLIESHKTDPKITEEPYLSTLIRSYGLAG 122

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKYSFSPNKISY 181
           MF+HA+RT+ QM++ GT RSVISFN LL A NHSK FDKVP LF++IP KY F+PNKISY
Sbjct: 123 MFDHALRTFEQMDELGTSRSVISFNTLLSACNHSKLFDKVPVLFNDIPAKYGFTPNKISY 182

Query: 182 GILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMIS 241
           GIL+K+YCE+GSPEKA++ +REME+  +EVTAVTFTTILD LYKKGE+EEAEK+W+ M++
Sbjct: 183 GILIKAYCEAGSPEKAIETLREMEKKGIEVTAVTFTTILDTLYKKGETEEAEKLWSTMVN 242

Query: 242 KGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDEA 301
           K CE+DV AYNVR+MH  GGKPE V+ALI+EM+N+G+KPDTISYNYLMTCYCKNGM++EA
Sbjct: 243 KDCEIDVAAYNVRIMHCQGGKPEKVKALIDEMSNAGLKPDTISYNYLMTCYCKNGMMEEA 302

Query: 302 KKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVEG 361
           K VY+ +E NGCN NAATFRT +YYLCR+GD+E+GY VFK SV VHKIPD NT+K+LVEG
Sbjct: 303 KNVYEGLEDNGCNPNAATFRTLIYYLCRSGDFERGYKVFKTSVGVHKIPDFNTLKHLVEG 362

Query: 362 LMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSK 412
           L++KKK+KEAKGLIRTI+KKFPP+ L +W+KVEE+LGLAS+S +SS S +
Sbjct: 363 LVKKKKIKEAKGLIRTIKKKFPPNVLNSWKKVEESLGLASASDASSISDE 406

BLAST of Cp4.1LG14g01660 vs. NCBI nr
Match: gi|590703842|ref|XP_007046988.1| (Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao])

HSP 1 Score: 571.2 bits (1471), Expect = 1.4e-159
Identity = 287/416 (68.99%), Postives = 350/416 (84.13%), Query Frame = 1

Query: 2   SSIATRHIRRLSTAATATATATAAAKATATTSSSISISKAKSKLRTEYDPDKALNIYSSV 61
           SSI  RH+R LST  T          ATA++S SISIS+AK+KLRTEYDPDKAL IYSSV
Sbjct: 3   SSIPLRHLRHLSTTTTTA--------ATASSSISISISQAKNKLRTEYDPDKALEIYSSV 62

Query: 62  SSHYTSPVSSRYAQELTIRRLAKSRRFDDIESLIESHKNDPKITQEPFLSTLIRSYGRAG 121
           S HY+SP SSRYAQ+LT+RRLAKSRRF DIESLIESHK DPKITQEPFLSTLIRSYG AG
Sbjct: 63  SKHYSSPSSSRYAQDLTVRRLAKSRRFSDIESLIESHKTDPKITQEPFLSTLIRSYGIAG 122

Query: 122 MFEHAMRTYNQMEDFGTPRSVISFNALLCAFNHSKQFDKVPQLFDEIPKKY-SFSPNKIS 181
           M +HA++T++QM+ FGTPRS ISFN+LL A NHS+QFDKVPQLF+EIPKKY S SP+K+S
Sbjct: 123 MLDHAIKTFDQMDQFGTPRSTISFNSLLSAGNHSRQFDKVPQLFEEIPKKYGSVSPDKVS 182

Query: 182 YGILVKSYCESGSPEKAMQIVREMEENDVEVTAVTFTTILDALYKKGESEEAEKIWNKMI 241
           YGIL+KSYCE+G P+K ++++REME   VEVTAVTFTTIL+ALYKKG++EEAEK+W+ M+
Sbjct: 183 YGILIKSYCEAGHPDKGIEVLREMERKSVEVTAVTFTTILNALYKKGKTEEAEKLWSDMM 242

Query: 242 SKGCELDVGAYNVRLMHEHGGKPEHVEALIEEMANSGMKPDTISYNYLMTCYCKNGMIDE 301
             GCELDV +YNVR+M+  GG PE V+ LI+EM+  G+KPDTISYNYLMTCYCKNGM+DE
Sbjct: 243 KNGCELDVASYNVRIMNLQGGDPEKVKELIDEMSTMGLKPDTISYNYLMTCYCKNGMLDE 302

Query: 302 AKKVYDDMEINGCNKNAATFRTFMYYLCRNGDYEKGYMVFKESVKVHKIPDVNTVKYLVE 361
           AKKVY+ +E NGCN NAATFRT ++YLC NG +E+GY VFKESV++HKIPD NT+K+LVE
Sbjct: 303 AKKVYEGLEGNGCNPNAATFRTLVFYLCLNGLHEQGYKVFKESVRLHKIPDFNTLKHLVE 362

Query: 362 GLMEKKKMKEAKGLIRTIRKKFPPDSLKAWRKVEEALGLASSSSSSSSSSKADDET 417
           GL++ KK+KEAKGLIRT++KKFPP+ L AW+K+EE LGL S ++    + +A + T
Sbjct: 363 GLVKNKKIKEAKGLIRTVKKKFPPNFLNAWKKLEEELGLVSGNAGGGEAQEAKEAT 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP352_ARATH6.1e-14263.02Pentatricopeptide repeat-containing protein At4g36680, mitochondrial OS=Arabidop... [more]
PP162_ARATH4.9e-12356.13Pentatricopeptide repeat-containing protein At2g18520, mitochondrial OS=Arabidop... [more]
PPR87_ARATH1.2e-4931.65Pentatricopeptide repeat-containing protein At1g61870, mitochondrial OS=Arabidop... [more]
PPR82_ARATH1.2e-4932.35Pentatricopeptide repeat-containing protein At1g55890, mitochondrial OS=Arabidop... [more]
PP226_ARATH1.3e-4832.56Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KXX6_CUCSA2.7e-18183.42Uncharacterized protein OS=Cucumis sativus GN=Csa_4G038790 PE=4 SV=1[more]
A0A061DFV5_THECC1.0e-15968.99Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TC... [more]
A0A0D2PQ35_GOSRA8.8e-15668.45Uncharacterized protein OS=Gossypium raimondii GN=B456_008G068700 PE=4 SV=1[more]
A5BUJ7_VITVI1.6e-15267.88Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032192 PE=4 SV=1[more]
A0A067GIE9_CITSI2.0e-15267.94Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015673mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G36680.13.4e-14363.02 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G18520.12.7e-12456.13 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G61870.16.9e-5131.65 pentatricopeptide repeat 336[more]
AT1G55890.16.9e-5132.35 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G13160.17.6e-5032.56 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659107543|ref|XP_008453729.1|1.5e-19383.85PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial ... [more]
gi|778697504|ref|XP_011654338.1|2.9e-18484.36PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial-... [more]
gi|700198104|gb|KGN53262.1|3.9e-18183.42hypothetical protein Csa_4G038790 [Cucumis sativus][more]
gi|1009145187|ref|XP_015890196.1|1.2e-16170.24PREDICTED: pentatricopeptide repeat-containing protein At4g36680, mitochondrial ... [more]
gi|590703842|ref|XP_007046988.1|1.4e-15968.99Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006626 protein targeting to mitochondrion
biological_process GO:0008150 biological_process
cellular_component GO:0005622 intracellular
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g01660.1Cp4.1LG14g01660.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 111..137
score: 4.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 280..329
score: 9.8E-13coord: 141..189
score: 4.8E-8coord: 213..253
score: 6.8
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 143..177
score: 3.3E-4coord: 179..212
score: 1.2E-7coord: 111..138
score: 1.3E-6coord: 319..345
score: 5.4E-4coord: 283..315
score: 3.6E-10coord: 214..248
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 351..381
score: 6.095coord: 316..350
score: 8.396coord: 212..246
score: 11.312coord: 177..211
score: 11.422coord: 281..315
score: 12.693coord: 106..140
score: 8.955coord: 141..171
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 107..242
score: 1.8E-9coord: 279..378
score: 1.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 152..344
score: 1.07E-5coord: 281..385
score: 1.8
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 10..397
score: 1.2E
NoneNo IPR availablePANTHERPTHR24015:SF506SUBFAMILY NOT NAMEDcoord: 10..397
score: 1.2E