Tan0004674 (gene) Snake gourd v1

Overview
NameTan0004674
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG07: 2547556 .. 2549662 (+)
RNA-Seq ExpressionTan0004674
SyntenyTan0004674
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGGTTTGGCACATCAACAGCAACTGGAAGTCGCATCTCTTTCAATTCCCCATGGCGCTTTCAGGTGGCAAATTCATCTCTTCAATGGTTCATTCATGGTGAACTCCTTCCTTCTCTCTTGATTCAGAATCTCAAATCTCCATCATTTGGGGCTCGAATTCCCGTAGGATTCTGATTTTCTAAGAAAATCCGCTTCAGAAATGTCCAAATGCCGCAATTTCAGAGCCTCTCATGTCTCTCTAGCTTGAGGTTTGGGAACATGTGATTTCTTGTCGAACTTTCTTGATTTCTTCTTTCTATCCTGAGGAGTACTAGGATTTTGTTCTTCTTCCTCCTTGCAGAGTGTGTGCTGAATCTGGATTCTTGAACTAGTTGTATAATGGTTCCATGTCTCTCGTATACGCATGACGTTTTTCCGAGGAAGAATTTACTCCTGAGCTCAAGAGGAAACATCGGAGCCAAGAAGGCTCTCTTCTTACTTCAAAAGTGCAAGAACTTTCAACAGCTTAAGCAAATTCATGCCAAGATAATTCGTAGTGGCCTTTCTAACGATCAATTACTTACTAGGAAACTGATTCATCTCTACTCTTCTCATGGAAGGATAGTGTATGCGATATTTCTGTTTCATCAAATCCAGAATCCTTGCACGTTTACTTGGAATCTGATAATTAGGGCTAACACTATCAATGGTCTCTCTGAACAAGCCCTCTTGTTGTATAAGAATATGGTCTGTCAAGGAATTGTAGCCGATAAGTTTACATTTCCATTTGTGATCAAAGCTTGTACAGCTTCCTTTGCAATTGAGCTTGGGAAAGTGGTTCATGGGTCGTTAATCAAATACGGGTTTTCAGGGGATACATTTGTGCAGAACAATCTGATAGATTTTTACTTGAAGTGTGGACATAAACATTGTGGTTTTAAGGTGTTTGAGAAAATGCGTGTTTGTAATGTGGTGTCATGGACAACCATGATATCTGGGCTAGTTTCTTGTGGTGATGTACAGGCAGCAAGAAGGATATTTGATGAGATGCCATTTAAAAATGTTGTTTCATGGACAGCAATGATAAATGGGCATATTAGAAGTCAACAGCCTGAAGAAGCTCTTGAACTATTTAGGAGAATGCAGGCAGGGAATGTTTTCCCGAACGAGTATACGATGGTGAGCTTGATCAAAGCATGTACTGAAATGGGAATCCTAAGTCTTGGTCGTGGGATTCATGATTATGCCATCAAGAACGATTTCGAAATTGGCGTTTATCTTGGCACAGCTCTGATTGACATGTACAGTAAATGTGGTAGTATCAAGGATGCGATAGAAGTGTTCGAGATGATGCCGAGAAGAAGTTTGCCCACATGGAACTCGATGATCGCTAGCTTAGGGGTGCATGGGTTGGGGCAGGAAGCTCTTAATCTTTTCAGTGAGATGGAAAGAGTAAATGTGAAGCCAGATGCAATCACTTTCGTGGGCATTTTATGTGCTTGTGTACATATGAAGAATGTGAAGGCAGGCTGTGATTACTTCAAACAAATGACACAACATTATGGTATTGCACCAATTCCTGAGCATTACAAGTGCATGGCTGAGCTATACGCTCGTTCTAATTCCTTGGATGAAGCCTCAAGATCAACAAAAGCGATTTCGATGGAAGCAGATGAGGATGTCTTGGCTTTACTTTGGATGAAATGGCACGACTTGGGTACTGATGATGAGTAAAGAAATATGCAAATGCAGGAGATGGCAAGTTTGGCCAGCAGCCCTTCTCTTTGTTTTGATGCCAGAGCTGATCAGGTGCGGACTTTCTGTTGAATATTTGGGCTTACTACTTGCTTTTAACTGTCATTTTGTACACTTGAAAAAGGGTTTTTCTTGCTCTTGCATAGTATACGACAAACATAATTGTAATTGATTCTTATTTCAATAGGCATATACCCTGTTGATTTCAATGAATTTCATTGTCGATAGCAAAATAGATGTAATATACTAGATATTGGACTTGCATTCATTGTTCTTATGCTTGTGCAAAAGTGGAAATTGCAATGCTTTGCTTGGAATTCCCACATGGACAATCTTCTCTTCAATTTGAAGTTCTTTTCAAATTTAAGTAGAC

mRNA sequence

CGGTTTGGCACATCAACAGCAACTGGAAGTCGCATCTCTTTCAATTCCCCATGGCGCTTTCAGGTGGCAAATTCATCTCTTCAATGGTTCATTCATGGTGAACTCCTTCCTTCTCTCTTGATTCAGAATCTCAAATCTCCATCATTTGGGGCTCGAATTCCCGTAGGATTCTGATTTTCTAAGAAAATCCGCTTCAGAAATGTCCAAATGCCGCAATTTCAGAGCCTCTCATGTCTCTCTAGCTTGAGAGTGTGTGCTGAATCTGGATTCTTGAACTAGTTGTATAATGGTTCCATGTCTCTCGTATACGCATGACGTTTTTCCGAGGAAGAATTTACTCCTGAGCTCAAGAGGAAACATCGGAGCCAAGAAGGCTCTCTTCTTACTTCAAAAGTGCAAGAACTTTCAACAGCTTAAGCAAATTCATGCCAAGATAATTCGTAGTGGCCTTTCTAACGATCAATTACTTACTAGGAAACTGATTCATCTCTACTCTTCTCATGGAAGGATAGTGTATGCGATATTTCTGTTTCATCAAATCCAGAATCCTTGCACGTTTACTTGGAATCTGATAATTAGGGCTAACACTATCAATGGTCTCTCTGAACAAGCCCTCTTGTTGTATAAGAATATGGTCTGTCAAGGAATTGTAGCCGATAAGTTTACATTTCCATTTGTGATCAAAGCTTGTACAGCTTCCTTTGCAATTGAGCTTGGGAAAGTGGTTCATGGGTCGTTAATCAAATACGGGTTTTCAGGGGATACATTTGTGCAGAACAATCTGATAGATTTTTACTTGAAGTGTGGACATAAACATTGTGGTTTTAAGGTGTTTGAGAAAATGCGTGTTTGTAATGTGGTGTCATGGACAACCATGATATCTGGGCTAGTTTCTTGTGGTGATGTACAGGCAGCAAGAAGGATATTTGATGAGATGCCATTTAAAAATGTTGTTTCATGGACAGCAATGATAAATGGGCATATTAGAAGTCAACAGCCTGAAGAAGCTCTTGAACTATTTAGGAGAATGCAGGCAGGGAATGTTTTCCCGAACGAGTATACGATGGTGAGCTTGATCAAAGCATGTACTGAAATGGGAATCCTAAGTCTTGGTCGTGGGATTCATGATTATGCCATCAAGAACGATTTCGAAATTGGCGTTTATCTTGGCACAGCTCTGATTGACATGTACAGTAAATGTGGTAGTATCAAGGATGCGATAGAAGTGTTCGAGATGATGCCGAGAAGAAGTTTGCCCACATGGAACTCGATGATCGCTAGCTTAGGGGTGCATGGGTTGGGGCAGGAAGCTCTTAATCTTTTCAGTGAGATGGAAAGAGTAAATGTGAAGCCAGATGCAATCACTTTCGTGGGCATTTTATGTGCTTGTGTACATATGAAGAATGTGAAGGCAGGCTGTGATTACTTCAAACAAATGACACAACATTATGGTATTGCACCAATTCCTGAGCATTACAAGTGCATGGCTGAGCTATACGCTCGTTCTAATTCCTTGGATGAAGCCTCAAGATCAACAAAAGCGATTTCGATGGAAGCAGATGAGGATGTCTTGGCTTTACTTTGGATGAAATGGCACGACTTGGGTACTGATGATGAGTAAAGAAATATGCAAATGCAGGAGATGGCAAGTTTGGCCAGCAGCCCTTCTCTTTGTTTTGATGCCAGAGCTGATCAGGTGCGGACTTTCTGTTGAATATTTGGGCTTACTACTTGCTTTTAACTGTCATTTTGTACACTTGAAAAAGGGTTTTTCTTGCTCTTGCATAGTATACGACAAACATAATTGTAATTGATTCTTATTTCAATAGGCATATACCCTGTTGATTTCAATGAATTTCATTGTCGATAGCAAAATAGATGTAATATACTAGATATTGGACTTGCATTCATTGTTCTTATGCTTGTGCAAAAGTGGAAATTGCAATGCTTTGCTTGGAATTCCCACATGGACAATCTTCTCTTCAATTTGAAGTTCTTTTCAAATTTAAGTAGAC

Coding sequence (CDS)

ATGGTTCCATGTCTCTCGTATACGCATGACGTTTTTCCGAGGAAGAATTTACTCCTGAGCTCAAGAGGAAACATCGGAGCCAAGAAGGCTCTCTTCTTACTTCAAAAGTGCAAGAACTTTCAACAGCTTAAGCAAATTCATGCCAAGATAATTCGTAGTGGCCTTTCTAACGATCAATTACTTACTAGGAAACTGATTCATCTCTACTCTTCTCATGGAAGGATAGTGTATGCGATATTTCTGTTTCATCAAATCCAGAATCCTTGCACGTTTACTTGGAATCTGATAATTAGGGCTAACACTATCAATGGTCTCTCTGAACAAGCCCTCTTGTTGTATAAGAATATGGTCTGTCAAGGAATTGTAGCCGATAAGTTTACATTTCCATTTGTGATCAAAGCTTGTACAGCTTCCTTTGCAATTGAGCTTGGGAAAGTGGTTCATGGGTCGTTAATCAAATACGGGTTTTCAGGGGATACATTTGTGCAGAACAATCTGATAGATTTTTACTTGAAGTGTGGACATAAACATTGTGGTTTTAAGGTGTTTGAGAAAATGCGTGTTTGTAATGTGGTGTCATGGACAACCATGATATCTGGGCTAGTTTCTTGTGGTGATGTACAGGCAGCAAGAAGGATATTTGATGAGATGCCATTTAAAAATGTTGTTTCATGGACAGCAATGATAAATGGGCATATTAGAAGTCAACAGCCTGAAGAAGCTCTTGAACTATTTAGGAGAATGCAGGCAGGGAATGTTTTCCCGAACGAGTATACGATGGTGAGCTTGATCAAAGCATGTACTGAAATGGGAATCCTAAGTCTTGGTCGTGGGATTCATGATTATGCCATCAAGAACGATTTCGAAATTGGCGTTTATCTTGGCACAGCTCTGATTGACATGTACAGTAAATGTGGTAGTATCAAGGATGCGATAGAAGTGTTCGAGATGATGCCGAGAAGAAGTTTGCCCACATGGAACTCGATGATCGCTAGCTTAGGGGTGCATGGGTTGGGGCAGGAAGCTCTTAATCTTTTCAGTGAGATGGAAAGAGTAAATGTGAAGCCAGATGCAATCACTTTCGTGGGCATTTTATGTGCTTGTGTACATATGAAGAATGTGAAGGCAGGCTGTGATTACTTCAAACAAATGACACAACATTATGGTATTGCACCAATTCCTGAGCATTACAAGTGCATGGCTGAGCTATACGCTCGTTCTAATTCCTTGGATGAAGCCTCAAGATCAACAAAAGCGATTTCGATGGAAGCAGATGAGGATGTCTTGGCTTTACTTTGGATGAAATGGCACGACTTGGGTACTGATGATGAGTAA

Protein sequence

MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALLWMKWHDLGTDDE
Homology
BLAST of Tan0004674 vs. ExPASy Swiss-Prot
Match: Q38959 (Pentatricopeptide repeat-containing protein At3g26630, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-A6 PE=2 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 3.2e-120
Identity = 217/401 (54.11%), Postives = 290/401 (72.32%), Query Frame = 0

Query: 29  KALFLLQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGRIVYAIFLFHQIQNP 88
           +A + L+ C NF QLKQIH KII+  L+NDQLL R+LI + SS G   YA  +F+Q+Q+P
Sbjct: 22  EASYFLRTCSNFSQLKQIHTKIIKHNLTNDQLLVRQLISVSSSFGETQYASLVFNQLQSP 81

Query: 89  CTFTWNLIIRANTINGLSEQALLLY-KNMVCQGIVADKFTFPFVIKACTASFAIELGKVV 148
            TFTWNL+IR+ ++N    +ALLL+   M+      DKFTFPFVIKAC AS +I LG  V
Sbjct: 82  STFTWNLMIRSLSVNHKPREALLLFILMMISHQSQFDKFTFPFVIKACLASSSIRLGTQV 141

Query: 149 HGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVSCGDV 208
           HG  IK GF  D F QN L+D Y KCG    G KVF+KM   ++VSWTTM+ GLVS   +
Sbjct: 142 HGLAIKAGFFNDVFFQNTLMDLYFKCGKPDSGRKVFDKMPGRSIVSWTTMLYGLVSNSQL 201

Query: 209 QAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFPNEYTMVSLIKAC 268
            +A  +F++MP +NVVSWTAMI  ++++++P+EA +LFRRMQ  +V PNE+T+V+L++A 
Sbjct: 202 DSAEIVFNQMPMRNVVSWTAMITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVNLLQAS 261

Query: 269 TEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSLPTWN 328
           T++G LS+GR +HDYA KN F +  +LGTALIDMYSKCGS++DA +VF++M  +SL TWN
Sbjct: 262 TQLGSLSMGRWVHDYAHKNGFVLDCFLGTALIDMYSKCGSLQDARKVFDVMQGKSLATWN 321

Query: 329 SMIASLGVHGLGQEALNLFSEM-ERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQMTQ 388
           SMI SLGVHG G+EAL+LF EM E  +V+PDAITFVG+L AC +  NVK G  YF +M Q
Sbjct: 322 SMITSLGVHGCGEEALSLFEEMEEEASVEPDAITFVGVLSACANTGNVKDGLRYFTRMIQ 381

Query: 389 HYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADED 428
            YGI+PI EH  CM +L  ++  +++AS   +  SM++D D
Sbjct: 382 VYGISPIREHNACMIQLLEQALEVEKASNLVE--SMDSDPD 420

BLAST of Tan0004674 vs. ExPASy Swiss-Prot
Match: Q9SJG6 (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 1.6e-79
Identity = 152/426 (35.68%), Postives = 248/426 (58.22%), Query Frame = 0

Query: 19  LSSRGNIGAKKALFLLQ-KCKNFQQLKQIHAKIIRSGLSNDQL-LTRKLIHLYSSHGRIV 78
           + S G++     L L+  +C   ++LKQIHA +I++GL +D +  +R L    +S   + 
Sbjct: 16  MPSSGSLSGNTYLRLIDTQCSTMRELKQIHASLIKTGLISDTVTASRVLAFCCASPSDMN 75

Query: 79  YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQ--GIVADKFTFPFVIKA 138
           YA  +F +I +   F WN IIR  + +   E A+ ++ +M+C    +   + T+P V KA
Sbjct: 76  YAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKA 135

Query: 139 CTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSW 198
                    G+ +HG +IK G   D+F++N ++  Y+ CG     +++F  M   +VV+W
Sbjct: 136 YGRLGQARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAW 195

Query: 199 TTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVF 258
            +MI G   CG +  A+ +FDEMP +N VSW +MI+G +R+ + ++AL++FR MQ  +V 
Sbjct: 196 NSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVK 255

Query: 259 PNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEV 318
           P+ +TMVSL+ AC  +G    GR IH+Y ++N FE+   + TALIDMY KCG I++ + V
Sbjct: 256 PDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNV 315

Query: 319 FEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNV 378
           FE  P++ L  WNSMI  L  +G  + A++LFSE+ER  ++PD+++F+G+L AC H   V
Sbjct: 316 FECAPKKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHSGEV 375

Query: 379 KAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALLWM 438
               ++F+ M + Y I P  +HY  M  +   +  L+EA    K + +E D  + + L  
Sbjct: 376 HRADEFFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEEDTVIWSSLLS 435

Query: 439 KWHDLG 441
               +G
Sbjct: 436 ACRKIG 441

BLAST of Tan0004674 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 3.1e-78
Identity = 156/456 (34.21%), Postives = 237/456 (51.97%), Query Frame = 0

Query: 34  LQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTW 93
           L KC N  Q+KQ+HA+IIR  L  D  +  KLI   S   +   A+ +F+Q+Q P     
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSLCRQTNLAVRVFNQVQEPNVHLC 85

Query: 94  NLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIELGKVVHGSLIK 153
           N +IRA+  N    QA  ++  M   G+ AD FT+PF++KAC+    + + K++H  + K
Sbjct: 86  NSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEK 145

Query: 154 YGFSGDTFVQNNLIDFYLKCG--HKHCGFKVFEKMRVCNVVSWTTMISGLVSCGDVQAAR 213
            G S D +V N LID Y +CG        K+FEKM   + VSW +M+ GLV  G+++ AR
Sbjct: 146 LGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDAR 205

Query: 214 RIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFP---------------- 273
           R+FDEMP ++++SW  M++G+ R ++  +A ELF +M   N                   
Sbjct: 206 RLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGDMEM 265

Query: 274 ------------------------------------------------NEYTMVSLIKAC 333
                                                           +   ++S++ AC
Sbjct: 266 ARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISILAAC 325

Query: 334 TEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSLPTWN 393
           TE G+LSLG  IH    +++     Y+  AL+DMY+KCG++K A +VF  +P++ L +WN
Sbjct: 326 TESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDIPKKDLVSWN 385

Query: 394 SMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQMTQH 424
           +M+  LGVHG G+EA+ LFS M R  ++PD +TF+ +LC+C H   +  G DYF  M + 
Sbjct: 386 TMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKV 445

BLAST of Tan0004674 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 3.4e-77
Identity = 157/411 (38.20%), Postives = 243/411 (59.12%), Query Frame = 0

Query: 28  KKALFLLQKCKNFQQLKQIHAKIIRSGL---SNDQLLTRKLIHLYSSHGRIVYAIFLFHQ 87
           +K   L+ K ++  ++ QIHA I+R  L       +L  KL   Y+SHG+I +++ LFHQ
Sbjct: 30  EKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQ 89

Query: 88  IQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIELG 147
             +P  F +   I   +INGL +QA LLY  ++   I  ++FTF  ++K+C+     + G
Sbjct: 90  TIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST----KSG 149

Query: 148 KVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVSC 207
           K++H  ++K+G   D +V   L+D Y K G      KVF++M   ++VS T MI+     
Sbjct: 150 KLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQ 209

Query: 208 GDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQA-GNVFPNEYTMVSL 267
           G+V+AAR +FD M  +++VSW  MI+G+ +   P +AL LF+++ A G   P+E T+V+ 
Sbjct: 210 GNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAA 269

Query: 268 IKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSL 327
           + AC+++G L  GR IH +   +   + V + T LIDMYSKCGS+++A+ VF   PR+ +
Sbjct: 270 LSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPRKDI 329

Query: 328 PTWNSMIASLGVHGLGQEALNLFSEMERV-NVKPDAITFVGILCACVHMKNVKAGCDYFK 387
             WN+MIA   +HG  Q+AL LF+EM+ +  ++P  ITF+G L AC H   V  G   F+
Sbjct: 330 VAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFE 389

Query: 388 QMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALLW 434
            M Q YGI P  EHY C+  L  R+  L  A  + K ++M+AD    ++LW
Sbjct: 390 SMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDAD----SVLW 432

BLAST of Tan0004674 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.3e-76
Identity = 144/409 (35.21%), Postives = 231/409 (56.48%), Query Frame = 0

Query: 31  LFLLQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGR-------IVYAIFLFH 90
           L LLQ C +F  LK IH  ++R+ L +D  +  +L+ L             + YA  +F 
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 91  QIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIEL 150
           QIQNP  F +NL+IR  +      +A   Y  M+   I  D  TFPF+IKA +    + +
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 151 GKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVS 210
           G+  H  ++++GF  D +V+N+L+  Y  CG      ++F +M   +VVSWT+M++G   
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 211 CGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFPNEYTMVSL 270
           CG V+ AR +FDEMP +N+ +W+ MING+ ++   E+A++LF  M+   V  NE  MVS+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 271 IKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSL 330
           I +C  +G L  G   ++Y +K+   + + LGTAL+DM+ +CG I+ AI VFE +P    
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDS 315

Query: 331 PTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQ 390
            +W+S+I  L VHG   +A++ FS+M  +   P  +TF  +L AC H   V+ G + ++ 
Sbjct: 316 LSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYEN 375

Query: 391 MTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALL 433
           M + +GI P  EHY C+ ++  R+  L EA      + ++ +  +L  L
Sbjct: 376 MKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGAL 424

BLAST of Tan0004674 vs. NCBI nr
Match: KAG7017868.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 784.3 bits (2024), Expect = 5.7e-223
Identity = 384/444 (86.49%), Postives = 404/444 (90.99%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QLKQIHAKIIRSG+SNDQL
Sbjct: 1   MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLKQIHAKIIRSGISNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHG+I YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSSHGKIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 121 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 301 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYAR    DEA  STKA+
Sbjct: 361 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAR----DEAFESTKAM 420

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 421 SMEPDSGSLALLGVIENADGTDKE 440

BLAST of Tan0004674 vs. NCBI nr
Match: KAG6581136.1 (Sialyltransferase-like protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 784.3 bits (2024), Expect = 5.7e-223
Identity = 384/444 (86.49%), Postives = 404/444 (90.99%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QLKQIHAKIIRSG+SNDQL
Sbjct: 541 MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLKQIHAKIIRSGISNDQL 600

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHG+I YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 601 LTRKLIHLYSSHGKIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 660

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 661 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 720

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 721 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 780

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 781 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 840

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 841 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 900

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYAR    DEA  STKA+
Sbjct: 901 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAR----DEAFESTKAM 960

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 961 SMEPDSGSLALLGVIENADGTDKE 980

BLAST of Tan0004674 vs. NCBI nr
Match: XP_022934150.1 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 780.4 bits (2014), Expect = 8.2e-222
Identity = 382/444 (86.04%), Postives = 403/444 (90.77%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QL+QIHAKIIRS +SNDQL
Sbjct: 17  MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLRQIHAKIIRSAISNDQL 76

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 77  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 136

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 137 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 196

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 197 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 256

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 257 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 316

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 317 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 376

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+    DEA  STKA+
Sbjct: 377 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQ----DEAFESTKAM 436

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 437 SMEPDSGSLALLGVIENADGTDKE 456

BLAST of Tan0004674 vs. NCBI nr
Match: XP_022934151.1 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 780.4 bits (2014), Expect = 8.2e-222
Identity = 382/444 (86.04%), Postives = 403/444 (90.77%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QL+QIHAKIIRS +SNDQL
Sbjct: 1   MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLRQIHAKIIRSAISNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 121 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 301 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+    DEA  STKA+
Sbjct: 361 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQ----DEAFESTKAM 420

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 421 SMEPDSGSLALLGVIENADGTDKE 440

BLAST of Tan0004674 vs. NCBI nr
Match: XP_023527079.1 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 780.0 bits (2013), Expect = 1.1e-221
Identity = 382/444 (86.04%), Postives = 402/444 (90.54%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+ L+QIHAKIIRS +SNDQL
Sbjct: 1   MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKHLRQIHAKIIRSAISNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 121 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 301 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYAR    DEA  STKA+
Sbjct: 361 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAR----DEAFESTKAM 420

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 421 SMEPDSGSLALLGVIENADGTDKE 440

BLAST of Tan0004674 vs. ExPASy TrEMBL
Match: A0A6J1F6V3 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441407 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 382/444 (86.04%), Postives = 403/444 (90.77%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QL+QIHAKIIRS +SNDQL
Sbjct: 17  MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLRQIHAKIIRSAISNDQL 76

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 77  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 136

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 137 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 196

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 197 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 256

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 257 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 316

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 317 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 376

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+    DEA  STKA+
Sbjct: 377 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQ----DEAFESTKAM 436

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 437 SMEPDSGSLALLGVIENADGTDKE 456

BLAST of Tan0004674 vs. ExPASy TrEMBL
Match: A0A6J1F116 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441407 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 382/444 (86.04%), Postives = 403/444 (90.77%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+QL+QIHAKIIRS +SNDQL
Sbjct: 1   MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKQLRQIHAKIIRSAISNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 121 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 301 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+    DEA  STKA+
Sbjct: 361 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQ----DEAFESTKAM 420

Query: 421 SMEADEDVLALLWMKWHDLGTDDE 445
           SME D   LALL +  +  GTD E
Sbjct: 421 SMEPDSGSLALLGVIENADGTDKE 440

BLAST of Tan0004674 vs. ExPASy TrEMBL
Match: A0A1S3BNB7 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491736 PE=4 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 2.6e-221
Identity = 372/425 (87.53%), Postives = 396/425 (93.18%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLSYTHDVFP KN  L+ RGNI AKKALFLLQ CKNF+ L+QIHAKIIRSGLSNDQL
Sbjct: 1   MVPCLSYTHDVFPSKNFSLTPRGNIRAKKALFLLQNCKNFKHLRQIHAKIIRSGLSNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYS+HGRIVYAIFLF+QIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSTHGRIVYAIFLFYQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT   +I+LGKVVHGS IKYGFSGD FVQNNLIDFY KCGHKHC  
Sbjct: 121 IAADKFTFPFVIKACTNFLSIDLGKVVHGSSIKYGFSGDAFVQNNLIDFYFKCGHKHCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTT+ISGL+SCGD+  ARRIFDEMP KNVVSWTAMING+IR+QQPEE
Sbjct: 181 KVFEKMRVCNVVSWTTVISGLISCGDLLEARRIFDEMPSKNVVSWTAMINGYIRNQQPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+FPNEYTMVSLIKACTEMGILSLGRGIHDY IKN FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENIFPNEYTMVSLIKACTEMGILSLGRGIHDYTIKNCFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVFE MPRRSLPTWNSMI SLGVHGLGQ+ALN+FSEMERVNV+PDAIT
Sbjct: 301 MYSKCGSIKDAIEVFETMPRRSLPTWNSMITSLGVHGLGQQALNIFSEMERVNVEPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAI 420
           FVG+LCACVHMKNVK GC YFK+MTQHYGIAPIPEHY+CM ELYARSN+LDEA +STKAI
Sbjct: 361 FVGVLCACVHMKNVKEGCAYFKRMTQHYGIAPIPEHYECMTELYARSNNLDEAFKSTKAI 420

Query: 421 SMEAD 426
           S+E D
Sbjct: 421 SVEPD 425

BLAST of Tan0004674 vs. ExPASy TrEMBL
Match: A0A6J1IZS9 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482147 PE=4 SV=1)

HSP 1 Score: 757.7 bits (1955), Expect = 2.8e-215
Identity = 362/409 (88.51%), Postives = 381/409 (93.15%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+ L+QIHAKIIR G+SNDQL
Sbjct: 1   MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKHLRQIHAKIIRCGISNDQL 60

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 61  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 120

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 121 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 180

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 240

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 241 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 300

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 301 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 360

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNS 410
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+  +
Sbjct: 361 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQDEA 409

BLAST of Tan0004674 vs. ExPASy TrEMBL
Match: A0A6J1J2R2 (pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482147 PE=4 SV=1)

HSP 1 Score: 757.7 bits (1955), Expect = 2.8e-215
Identity = 362/409 (88.51%), Postives = 381/409 (93.15%), Query Frame = 0

Query: 1   MVPCLSYTHDVFPRKNLLLSSRGNIGAKKALFLLQKCKNFQQLKQIHAKIIRSGLSNDQL 60
           MVPCLS THDVF   NLLLSSRGNIGAKKALFLLQ CKNF+ L+QIHAKIIR G+SNDQL
Sbjct: 17  MVPCLSSTHDVFATTNLLLSSRGNIGAKKALFLLQNCKNFKHLRQIHAKIIRCGISNDQL 76

Query: 61  LTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQG 120
           LTRKLIHLYSSHGRI YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQAL+LYKNMVCQG
Sbjct: 77  LTRKLIHLYSSHGRIAYAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQG 136

Query: 121 IVADKFTFPFVIKACTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGF 180
           I ADKFTFPFVIKACT SFAI++G+V+H SLIKYGFSGDTFVQNNLIDFY KCGHK C  
Sbjct: 137 IAADKFTFPFVIKACTTSFAIDIGRVIHASLIKYGFSGDTFVQNNLIDFYFKCGHKRCAL 196

Query: 181 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEE 240
           KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMP KNVVSWTAMING+IR+Q PEE
Sbjct: 197 KVFEKMRVCNVVSWTTMISGLVSCGDVQAARRIFDEMPCKNVVSWTAMINGYIRNQNPEE 256

Query: 241 ALELFRRMQAGNVFPNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALID 300
           ALELF+RMQA N+ PNEYTMVSLIKACTEMGILSLGRGIHDYAI N FEIGVYLGTALID
Sbjct: 257 ALELFKRMQAENICPNEYTMVSLIKACTEMGILSLGRGIHDYAINNGFEIGVYLGTALID 316

Query: 301 MYSKCGSIKDAIEVFEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAIT 360
           MYSKCGSIKDAIEVF+ MP +SLPTWNSMI SLGVHGLGQEALNLFSEMERVNVKPDAIT
Sbjct: 317 MYSKCGSIKDAIEVFKTMPGKSLPTWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAIT 376

Query: 361 FVGILCACVHMKNVKAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNS 410
           FVG+LCACVHMKNV+AGC YFK+M QHYGIAPIPEHYKCMAELYA+  +
Sbjct: 377 FVGVLCACVHMKNVEAGCAYFKRMAQHYGIAPIPEHYKCMAELYAQDEA 425

BLAST of Tan0004674 vs. TAIR 10
Match: AT3G26630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 433.3 bits (1113), Expect = 2.3e-121
Identity = 217/401 (54.11%), Postives = 290/401 (72.32%), Query Frame = 0

Query: 29  KALFLLQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGRIVYAIFLFHQIQNP 88
           +A + L+ C NF QLKQIH KII+  L+NDQLL R+LI + SS G   YA  +F+Q+Q+P
Sbjct: 22  EASYFLRTCSNFSQLKQIHTKIIKHNLTNDQLLVRQLISVSSSFGETQYASLVFNQLQSP 81

Query: 89  CTFTWNLIIRANTINGLSEQALLLY-KNMVCQGIVADKFTFPFVIKACTASFAIELGKVV 148
            TFTWNL+IR+ ++N    +ALLL+   M+      DKFTFPFVIKAC AS +I LG  V
Sbjct: 82  STFTWNLMIRSLSVNHKPREALLLFILMMISHQSQFDKFTFPFVIKACLASSSIRLGTQV 141

Query: 149 HGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVSCGDV 208
           HG  IK GF  D F QN L+D Y KCG    G KVF+KM   ++VSWTTM+ GLVS   +
Sbjct: 142 HGLAIKAGFFNDVFFQNTLMDLYFKCGKPDSGRKVFDKMPGRSIVSWTTMLYGLVSNSQL 201

Query: 209 QAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFPNEYTMVSLIKAC 268
            +A  +F++MP +NVVSWTAMI  ++++++P+EA +LFRRMQ  +V PNE+T+V+L++A 
Sbjct: 202 DSAEIVFNQMPMRNVVSWTAMITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVNLLQAS 261

Query: 269 TEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSLPTWN 328
           T++G LS+GR +HDYA KN F +  +LGTALIDMYSKCGS++DA +VF++M  +SL TWN
Sbjct: 262 TQLGSLSMGRWVHDYAHKNGFVLDCFLGTALIDMYSKCGSLQDARKVFDVMQGKSLATWN 321

Query: 329 SMIASLGVHGLGQEALNLFSEM-ERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQMTQ 388
           SMI SLGVHG G+EAL+LF EM E  +V+PDAITFVG+L AC +  NVK G  YF +M Q
Sbjct: 322 SMITSLGVHGCGEEALSLFEEMEEEASVEPDAITFVGVLSACANTGNVKDGLRYFTRMIQ 381

Query: 389 HYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADED 428
            YGI+PI EH  CM +L  ++  +++AS   +  SM++D D
Sbjct: 382 VYGISPIREHNACMIQLLEQALEVEKASNLVE--SMDSDPD 420

BLAST of Tan0004674 vs. TAIR 10
Match: AT2G42920.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 298.1 bits (762), Expect = 1.2e-80
Identity = 152/426 (35.68%), Postives = 248/426 (58.22%), Query Frame = 0

Query: 19  LSSRGNIGAKKALFLLQ-KCKNFQQLKQIHAKIIRSGLSNDQL-LTRKLIHLYSSHGRIV 78
           + S G++     L L+  +C   ++LKQIHA +I++GL +D +  +R L    +S   + 
Sbjct: 16  MPSSGSLSGNTYLRLIDTQCSTMRELKQIHASLIKTGLISDTVTASRVLAFCCASPSDMN 75

Query: 79  YAIFLFHQIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQ--GIVADKFTFPFVIKA 138
           YA  +F +I +   F WN IIR  + +   E A+ ++ +M+C    +   + T+P V KA
Sbjct: 76  YAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKA 135

Query: 139 CTASFAIELGKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSW 198
                    G+ +HG +IK G   D+F++N ++  Y+ CG     +++F  M   +VV+W
Sbjct: 136 YGRLGQARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAW 195

Query: 199 TTMISGLVSCGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVF 258
            +MI G   CG +  A+ +FDEMP +N VSW +MI+G +R+ + ++AL++FR MQ  +V 
Sbjct: 196 NSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVK 255

Query: 259 PNEYTMVSLIKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEV 318
           P+ +TMVSL+ AC  +G    GR IH+Y ++N FE+   + TALIDMY KCG I++ + V
Sbjct: 256 PDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNV 315

Query: 319 FEMMPRRSLPTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNV 378
           FE  P++ L  WNSMI  L  +G  + A++LFSE+ER  ++PD+++F+G+L AC H   V
Sbjct: 316 FECAPKKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHSGEV 375

Query: 379 KAGCDYFKQMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALLWM 438
               ++F+ M + Y I P  +HY  M  +   +  L+EA    K + +E D  + + L  
Sbjct: 376 HRADEFFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEEDTVIWSSLLS 435

Query: 439 KWHDLG 441
               +G
Sbjct: 436 ACRKIG 441

BLAST of Tan0004674 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 293.9 bits (751), Expect = 2.2e-79
Identity = 156/456 (34.21%), Postives = 237/456 (51.97%), Query Frame = 0

Query: 34  LQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGRIVYAIFLFHQIQNPCTFTW 93
           L KC N  Q+KQ+HA+IIR  L  D  +  KLI   S   +   A+ +F+Q+Q P     
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSLCRQTNLAVRVFNQVQEPNVHLC 85

Query: 94  NLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIELGKVVHGSLIK 153
           N +IRA+  N    QA  ++  M   G+ AD FT+PF++KAC+    + + K++H  + K
Sbjct: 86  NSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEK 145

Query: 154 YGFSGDTFVQNNLIDFYLKCG--HKHCGFKVFEKMRVCNVVSWTTMISGLVSCGDVQAAR 213
            G S D +V N LID Y +CG        K+FEKM   + VSW +M+ GLV  G+++ AR
Sbjct: 146 LGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDAR 205

Query: 214 RIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFP---------------- 273
           R+FDEMP ++++SW  M++G+ R ++  +A ELF +M   N                   
Sbjct: 206 RLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGDMEM 265

Query: 274 ------------------------------------------------NEYTMVSLIKAC 333
                                                           +   ++S++ AC
Sbjct: 266 ARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISILAAC 325

Query: 334 TEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSLPTWN 393
           TE G+LSLG  IH    +++     Y+  AL+DMY+KCG++K A +VF  +P++ L +WN
Sbjct: 326 TESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDIPKKDLVSWN 385

Query: 394 SMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQMTQH 424
           +M+  LGVHG G+EA+ LFS M R  ++PD +TF+ +LC+C H   +  G DYF  M + 
Sbjct: 386 TMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEKV 445

BLAST of Tan0004674 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 290.4 bits (742), Expect = 2.4e-78
Identity = 157/411 (38.20%), Postives = 243/411 (59.12%), Query Frame = 0

Query: 28  KKALFLLQKCKNFQQLKQIHAKIIRSGL---SNDQLLTRKLIHLYSSHGRIVYAIFLFHQ 87
           +K   L+ K ++  ++ QIHA I+R  L       +L  KL   Y+SHG+I +++ LFHQ
Sbjct: 30  EKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQ 89

Query: 88  IQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIELG 147
             +P  F +   I   +INGL +QA LLY  ++   I  ++FTF  ++K+C+     + G
Sbjct: 90  TIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST----KSG 149

Query: 148 KVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVSC 207
           K++H  ++K+G   D +V   L+D Y K G      KVF++M   ++VS T MI+     
Sbjct: 150 KLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQ 209

Query: 208 GDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQA-GNVFPNEYTMVSL 267
           G+V+AAR +FD M  +++VSW  MI+G+ +   P +AL LF+++ A G   P+E T+V+ 
Sbjct: 210 GNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAA 269

Query: 268 IKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSL 327
           + AC+++G L  GR IH +   +   + V + T LIDMYSKCGS+++A+ VF   PR+ +
Sbjct: 270 LSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPRKDI 329

Query: 328 PTWNSMIASLGVHGLGQEALNLFSEMERV-NVKPDAITFVGILCACVHMKNVKAGCDYFK 387
             WN+MIA   +HG  Q+AL LF+EM+ +  ++P  ITF+G L AC H   V  G   F+
Sbjct: 330 VAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFE 389

Query: 388 QMTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALLW 434
            M Q YGI P  EHY C+  L  R+  L  A  + K ++M+AD    ++LW
Sbjct: 390 SMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDAD----SVLW 432

BLAST of Tan0004674 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 288.5 bits (737), Expect = 9.1e-78
Identity = 144/409 (35.21%), Postives = 231/409 (56.48%), Query Frame = 0

Query: 31  LFLLQKCKNFQQLKQIHAKIIRSGLSNDQLLTRKLIHLYSSHGR-------IVYAIFLFH 90
           L LLQ C +F  LK IH  ++R+ L +D  +  +L+ L             + YA  +F 
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 91  QIQNPCTFTWNLIIRANTINGLSEQALLLYKNMVCQGIVADKFTFPFVIKACTASFAIEL 150
           QIQNP  F +NL+IR  +      +A   Y  M+   I  D  TFPF+IKA +    + +
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 151 GKVVHGSLIKYGFSGDTFVQNNLIDFYLKCGHKHCGFKVFEKMRVCNVVSWTTMISGLVS 210
           G+  H  ++++GF  D +V+N+L+  Y  CG      ++F +M   +VVSWT+M++G   
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 211 CGDVQAARRIFDEMPFKNVVSWTAMINGHIRSQQPEEALELFRRMQAGNVFPNEYTMVSL 270
           CG V+ AR +FDEMP +N+ +W+ MING+ ++   E+A++LF  M+   V  NE  MVS+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 271 IKACTEMGILSLGRGIHDYAIKNDFEIGVYLGTALIDMYSKCGSIKDAIEVFEMMPRRSL 330
           I +C  +G L  G   ++Y +K+   + + LGTAL+DM+ +CG I+ AI VFE +P    
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDS 315

Query: 331 PTWNSMIASLGVHGLGQEALNLFSEMERVNVKPDAITFVGILCACVHMKNVKAGCDYFKQ 390
            +W+S+I  L VHG   +A++ FS+M  +   P  +TF  +L AC H   V+ G + ++ 
Sbjct: 316 LSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYEN 375

Query: 391 MTQHYGIAPIPEHYKCMAELYARSNSLDEASRSTKAISMEADEDVLALL 433
           M + +GI P  EHY C+ ++  R+  L EA      + ++ +  +L  L
Sbjct: 376 MKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGAL 424

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q389593.2e-12054.11Pentatricopeptide repeat-containing protein At3g26630, chloroplastic OS=Arabidop... [more]
Q9SJG61.6e-7935.68Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop... [more]
Q9LS723.1e-7834.21Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
Q9SZT83.4e-7738.20Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
Q9FG161.3e-7635.21Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
KAG7017868.15.7e-22386.49Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
KAG6581136.15.7e-22386.49Sialyltransferase-like protein 1, partial [Cucurbita argyrosperma subsp. sororia... [more]
XP_022934150.18.2e-22286.04pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isofor... [more]
XP_022934151.18.2e-22286.04pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isofor... [more]
XP_023527079.11.1e-22186.04pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isofor... [more]
Match NameE-valueIdentityDescription
A0A6J1F6V34.0e-22286.04pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isofor... [more]
A0A6J1F1164.0e-22286.04pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isofor... [more]
A0A1S3BNB72.6e-22187.53pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X1 ... [more]
A0A6J1IZS92.8e-21588.51pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X2 ... [more]
A0A6J1J2R22.8e-21588.51pentatricopeptide repeat-containing protein At3g26630, chloroplastic isoform X1 ... [more]
Match NameE-valueIdentityDescription
AT3G26630.12.3e-12154.11Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G42920.11.2e-8035.68Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G29230.12.2e-7934.21Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G37380.12.4e-7838.20Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G06540.19.1e-7835.21Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 88..135
e-value: 3.1E-7
score: 30.5
coord: 220..267
e-value: 1.1E-13
score: 51.1
coord: 325..369
e-value: 3.4E-9
score: 36.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 296..321
e-value: 1.1E-4
score: 22.2
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 190..217
e-value: 2.4E-5
score: 24.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 192..221
e-value: 1.7E-6
score: 25.8
coord: 325..358
e-value: 2.1E-6
score: 25.5
coord: 223..257
e-value: 7.4E-8
score: 30.1
coord: 296..322
e-value: 3.4E-4
score: 18.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 322..356
score: 10.917512
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 190..220
score: 9.88715
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 221..255
score: 12.013642
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 150..276
e-value: 5.5E-32
score: 112.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 16..149
e-value: 7.0E-13
score: 50.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 295..435
e-value: 1.3E-26
score: 95.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 170..349
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 20..421
NoneNo IPR availablePANTHERPTHR47926:SF109BNAA06G32750D PROTEINcoord: 20..421

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004674.1Tan0004674.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding