Tan0009518 (gene) Snake gourd v1

Overview
Name: Tan0009518
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG06: 9782304 .. 9784367 (+)
RNA-Seq Expression: Tan0009518
Synteny: Tan0009518
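The location field above can be read programmatically. The following is a minimal sketch, not part of the database itself: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


# Matches strings shaped like "LG06: 9782304 .. 9784367 (+)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG06: 9782304 .. 9784367 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the span on LG06 works out to 2064 bases.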
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAACAGTCTAAAAAAAAATGAAAAAAAAAAATCGATCTCTCTCAAAAATGGTTTTGAAGACCTTGGCTTCCGCTTCTGCGGTTTCTTCCAAAGGTAAGCTCTCTTCTCCCAGAATTCGATCATCAAATCCTCAAAATTCATCTGAAAATCATAGTAAATGGCCGCCATTCCTCTCACGACCAGCTAGACCGGTTGTATCGCCGGAAATTTCGCCGGAAAAGTTCAATACAGGGATATCGTTTCAAGAACGGCTTAGGACGTTTCTTCAGAATTGCAAGACAGGTAATTTTACCGCATCAGAAGCATTACAATTCTTTGACCTAATGATGCGTGCAAATCCTACCCCTGATATGTCTTCATTCAATGTTTTACTTGGTGGACTTGCTAAGATTAAGCATTATTCTGAAGTATTTTCTTTGTATAATAGATTAAGCTTAGCCGGATTTTTGCCTAATCACTGCACACTCAATATTTTGCTTAATTGCCTTTGTAACGTGAATCGGATTAGCGAAGGTCTTGCGGCCATGTCAGGGATTATAAGGAGAGGTTATGTTCCTAATATAGTGGCATATACGACCTTGATTAAAGGGTTGTGTAGGGTGCATAGGATTAGTGAAGCCACACTGTTATTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGGTTTTAATGAAGGGGCTCTGTCAAACAGGCAATACTAACATTGCACTTAAGTTGTTTGAAAATATGCTCAAAGAGACTGGTGGATGTGGGATTAGTTGTAAGCCTGGTCTTATTTGCTATAGTATCATTATAGATGGGCTTTGTAAGGATGGACGGGAAGTTGAGGCAAGAGAACTTTTGGAGGAAATGAAAGCTCAGGGAATGATTCCCAATGTTATTTCTTATAGCTCCTTAATTCATGGATTTTGCTGTGGTGGAAAGTGGGAGGAGGCTAAACGTTTGTTCAATGAGATGATGGATCAAGGTGTTCAACCAAATGTGGTCACATTTAACGTGTTGATGGATATGCTTTGCAAGACAGGAAAGGTTATCGAGGCTAAGGAGTTGCTGGAGGTCATGGTTCAGAGAGGTAATGTTCCTGATTTGGTTACTTATAATATATTGATGCACGGATTCTGTTTGGTTGGTGATCTGAATAGTGCGAGGGAATTGTTCGTTAGTATGGTAAGTAAAGGGTGTGAACCTAATGTGATTAGCTACGGTGTGCTAATCACTGGATATTGTAAAAATCGTAAGGTGGAAGAAGCAATGAAGCTTTACAATGAAATGCTTGGAGTTGGAATGAGGCCATCTGTGCTAACACATAATTTCTTGTTAATGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAACTATTTGGTGCCATTCGAGCTCATGGTCTTGTAACAGATTCATACACTTATAATATTTTCTTAGATGGGCTATGTAAAGCAGGAAAACTTGAAACTGCTTGGGAGCTTTTCGACAAATTGTCTCATGAAGGGCTTCTTCCAAATGTTGTGACTTATTCCATTATGATCCATGGGTGTTGTAAAGAAGGACAAGTGGAAAAAGCAAAGGATTTGTTTCAAAAGATGGAAGAAAATGGTTGTATTCCCAATGTAATTACTTATAATACACTCCTTCGTGGTTTCTGCGAAAGTAATAAATCAGAGGAAGTGGTTCAACTTCTTCACAGGATGGTTCAGAAGAATGTGCCGCCGGATGCTAGCACTTGCGCCGTAGTCGTAGACATGCTTTCCACAGAAGAAAAGTATCGAGAATATCTGGACTTGCTTCCAAGGTTTCCTGTCCAACGGTGTTGAGGTTGATCACAGAATACCTACTGTGATAGACTCATTCATTGAACAGAAACAAATGAAAGGCTTTCATAAAAGTTCTCGTAGAATCAAGAATGAAGTTTCCATTCTGGATATGTCAAATTATGATCTAGTAAATGCATATATCCCATCTCCAAAAAAAAATTCAATTGT
TTCGAACTTTCTCCTTTTGAAATGGAACATTTACAGGATTTTTGTATAGATAACCTATTTTCAC

mRNA sequence

GAACAGTCTAAAAAAAAATGAAAAAAAAAAATCGATCTCTCTCAAAAATGGTTTTGAAGACCTTGGCTTCCGCTTCTGCGGTTTCTTCCAAAGCTAGACCGGTTGTATCGCCGGAAATTTCGCCGGAAAAGTTCAATACAGGGATATCGTTTCAAGAACGGCTTAGGACGTTTCTTCAGAATTGCAAGACAGATTAAGCTTAGCCGGATTTTTGCCTAATCACTGCACACTCAATATTTTGCTTAATTGCCTTTGTAACGTGAATCGGATTAGCGAAGGTCTTGCGGCCATGTCAGGGATTATAAGGAGAGGTTATGTTCCTAATATAGTGGCATATACGACCTTGATTAAAGGGTTGTGTAGGGTGCATAGGATTAGTGAAGCCACACTGTTATTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGGTTTTAATGAAGGGGCTCTGTCAAACAGGCAATACTAACATTGCACTTAAGTTGTTTGAAAATATGCTCAAAGAGACTGGTGGATGTGGGATTAGTTGTAAGCCTGGTCTTATTTGCTATAGTATCATTATAGATGGGCTTTGTAAGGATGGACGGGAAGTTGAGGCAAGAGAACTTTTGGAGGAAATGAAAGCTCAGGGAATGATTCCCAATGTTATTTCTTATAGCTCCTTAATTCATGGATTTTGCTGTGGTGGAAAGTGGGAGGAGGCTAAACGTTTGTTCAATGAGATGATGGATCAAGGTGTTCAACCAAATGTGGTCACATTTAACGTGTTGATGGATATGCTTTGCAAGACAGGAAAGGTTATCGAGGCTAAGGAGTTGCTGGAGGTCATGGTTCAGAGAGGTAATGTTCCTGATTTGGTTACTTATAATATATTGATGCACGGATTCTGTTTGGTTGGTGATCTGAATAGTGCGAGGGAATTGTTCGTTAGTATGGTAAGTAAAGGGTGTGAACCTAATGTGATTAGCTACGGTGTGCTAATCACTGGATATTGTAAAAATCGTAAGGTGGAAGAAGCAATGAAGCTTTACAATGAAATGCTTGGAGTTGGAATGAGGCCATCTGTGCTAACACATAATTTCTTGTTAATGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAACTATTTGGTGCCATTCGAGCTCATGGTCTTGTAACAGATTCATACACTTATAATATTTTCTTAGATGGGCTATGTAAAGCAGGAAAACTTGAAACTGCTTGGGAGCTTTTCGACAAATTGTCTCATGAAGGGCTTCTTCCAAATGTTGTGACTTATTCCATTATGATCCATGGGTGTTGTAAAGAAGGACAAGTGGAAAAAGCAAAGGATTTGTTTCAAAAGATGGAAGAAAATGGTTGTATTCCCAATGTAATTACTTATAATACACTCCTTCGTGGTTTCTGCGAAAGTAATAAATCAGAGGAAGTGGTTCAACTTCTTCACAGGATGGTTCAGAAGAATGTGCCGCCGGATGCTAGCACTTGCGCCGTAGTCGTAGACATGCTTTCCACAGAAGAAAAGTATCGAGAATATCTGGACTTGCTTCCAAGGTTTCCTGTCCAACGGTGTTGAGGTTGATCACAGAATACCTACTGTGATAGACTCATTCATTGAACAGAAACAAATGAAAGGCTTTCATAAAAGTTCTCGTAGAATCAAGAATGAAGTTTCCATTCTGGATATGTCAAATTATGATCTAGTAAATGCATATATCCCATCTCCAAAAAAAAATTCAATTGTTTCGAACTTTCTCCTTTTGAAATGGAACATTTACAGGATTTTTGTATAGATAACCTATTTTCAC

Coding sequence (CDS)

ATGTCAGGGATTATAAGGAGAGGTTATGTTCCTAATATAGTGGCATATACGACCTTGATTAAAGGGTTGTGTAGGGTGCATAGGATTAGTGAAGCCACACTGTTATTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGGTTTTAATGAAGGGGCTCTGTCAAACAGGCAATACTAACATTGCACTTAAGTTGTTTGAAAATATGCTCAAAGAGACTGGTGGATGTGGGATTAGTTGTAAGCCTGGTCTTATTTGCTATAGTATCATTATAGATGGGCTTTGTAAGGATGGACGGGAAGTTGAGGCAAGAGAACTTTTGGAGGAAATGAAAGCTCAGGGAATGATTCCCAATGTTATTTCTTATAGCTCCTTAATTCATGGATTTTGCTGTGGTGGAAAGTGGGAGGAGGCTAAACGTTTGTTCAATGAGATGATGGATCAAGGTGTTCAACCAAATGTGGTCACATTTAACGTGTTGATGGATATGCTTTGCAAGACAGGAAAGGTTATCGAGGCTAAGGAGTTGCTGGAGGTCATGGTTCAGAGAGGTAATGTTCCTGATTTGGTTACTTATAATATATTGATGCACGGATTCTGTTTGGTTGGTGATCTGAATAGTGCGAGGGAATTGTTCGTTAGTATGGTAAGTAAAGGGTGTGAACCTAATGTGATTAGCTACGGTGTGCTAATCACTGGATATTGTAAAAATCGTAAGGTGGAAGAAGCAATGAAGCTTTACAATGAAATGCTTGGAGTTGGAATGAGGCCATCTGTGCTAACACATAATTTCTTGTTAATGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAACTATTTGGTGCCATTCGAGCTCATGGTCTTGTAACAGATTCATACACTTATAATATTTTCTTAGATGGGCTATGTAAAGCAGGAAAACTTGAAACTGCTTGGGAGCTTTTCGACAAATTGTCTCATGAAGGGCTTCTTCCAAATGTTGTGACTTATTCCATTATGATCCATGGGTGTTGTAAAGAAGGACAAGTGGAAAAAGCAAAGGATTTGTTTCAAAAGATGGAAGAAAATGGTTGTATTCCCAATGTAATTACTTATAATACACTCCTTCGTGGTTTCTGCGAAAGTAATAAATCAGAGGAAGTGGTTCAACTTCTTCACAGGATGGTTCAGAAGAATGTGCCGCCGGATGCTAGCACTTGCGCCGTAGTCGTAGACATGCTTTCCACAGAAGAAAAGTATCGAGAATATCTGGACTTGCTTCCAAGGTTTCCTGTCCAACGGTGTTGA

Protein sequence

MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLDGLCKAGKLETAWELFDKLSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQRC
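The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. This is an illustrative sketch that assumes the standard nuclear genetic code; the function name `translate` and the short test fragments are chosen here for illustration, and the fragments are copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases and last 12 bases of the CDS shown above.
cds_start = "ATGTCAGGGATTATAAGGAGAGGTTATGTTCCTAAT"
cds_end = "CAACGGTGTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve and last three residues of the protein sequence, respectively.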
Homology
BLAST of Tan0009518 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 3.5e-95
Identity = 176/452 (38.94%), Positives = 270/452 (59.73%), Query Frame = 0

Query: 11  PNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIALKL 70
           P++V  +TLI GLC   R+SEA +L  RM + G +P+ +TYG ++  LC++GN+ +AL L
Sbjct: 173 PDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDL 232

Query: 71  FENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISYSSL 130
           F  M +       + K  ++ YSI+ID LCKDG   +A  L  EM+ +G+  +V++YSSL
Sbjct: 233 FRKMEER------NIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSL 292

Query: 131 IHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQRGN 190
           I G C  GKW++  ++  EM+ + + P+VVTF+ L+D+  K GK++EAKEL   M+ RG 
Sbjct: 293 IGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGI 352

Query: 191 VPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEEAMK 250
            PD +TYN L+ GFC    L+ A ++F  MVSKGCEP++++Y +LI  YCK ++V++ M+
Sbjct: 353 APDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMR 412

Query: 251 LYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLDGL- 310
           L+ E+   G+ P+ +T+N L++G  Q+GK+  AK+LF  + + G+     TY I LDGL 
Sbjct: 413 LFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLC 472

Query: 311 ----------------------------------CKAGKLETAWELFDKLSHEGLLPNVV 370
                                             C A K++ AW LF  LS +G+ P+VV
Sbjct: 473 DNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVV 532

Query: 371 TYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEVVQLLHRM 428
           TY++MI G CK+G + +A  LF+KM+E+GC P+  TYN L+R     +     V+L+  M
Sbjct: 533 TYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEM 592
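The identity and positives percentages reported for each HSP are the match counts divided by the alignment length. The sketch below reproduces the figures for the HSP above; the helper name `percent` is hypothetical, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP line above: 176 identical and 270 positive
# positions over an alignment length of 452 (gaps included).
identity = percent(176, 452)
positives = percent(270, 452)
print(identity, positives)
```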

BLAST of Tan0009518 vs. ExPASy Swiss-Prot
Match: Q9LQ16 (Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX=3702 GN=At1g62910 PE=2 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 4.8e-92
Identity = 174/459 (37.91%), Positives = 269/459 (58.61%), Query Frame = 0

Query: 4   IIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGN 63
           ++  GY P+   +TTLI GL   ++ SEA  L  +M + GC+P+++TYG ++ GLC+ G+
Sbjct: 179 MVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGD 238

Query: 64  TNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPN 123
            ++AL L + M K         +  ++ Y+ IIDGLCK     +A  L  EM  +G+ P+
Sbjct: 239 IDLALSLLKKMEKG------KIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPD 298

Query: 124 VISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLE 183
           V +YSSLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L +
Sbjct: 299 VFTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYD 358

Query: 184 VMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNR 243
            M++R   PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK +
Sbjct: 359 EMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAK 418

Query: 244 KVEEAMKLYNE-----------------------------------MLGVGMRPSVLTHN 303
           +VEE M+L+ E                                   M+ VG+ P++LT+N
Sbjct: 419 RVEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYN 478

Query: 304 FLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLDGLCKAGKLETAWELFDKLSHE 363
            LL GL + GK+  A  +F  ++   +  D YTYNI ++G+CKAGK+E  WELF  LS +
Sbjct: 479 ILLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLK 538

Query: 364 GLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEV 423
           G+ PNV+ Y+ MI G C++G  E+A  L +KM+E+G +PN  TYNTL+R        E  
Sbjct: 539 GVSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREAS 598

Query: 424 VQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLL 428
            +L+  M       DAST  +V +ML      + +LD+L
Sbjct: 599 AELIKEMRSCGFAGDASTIGLVTNMLHDGRLDKSFLDML 631

BLAST of Tan0009518 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 338.6 bits (867), Expect = 1.1e-91
Identity = 166/397 (41.81%), Positives = 251/397 (63.22%), Query Frame = 0

Query: 8   GYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIA 67
           GY PN V + TLI GL   ++ SEA  L  RM   GC+P+++TYGV++ GLC+ G+T++A
Sbjct: 181 GYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLA 240

Query: 68  LKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISY 127
             L   M  E G      +PG++ Y+ IIDGLCK     +A  L +EM+ +G+ PNV++Y
Sbjct: 241 FNLLNKM--EQG----KLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTY 300

Query: 128 SSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQ 187
           SSLI   C  G+W +A RL ++M+++ + P+V TF+ L+D   K GK++EA++L + MV+
Sbjct: 301 SSLISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVK 360

Query: 188 RGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEE 247
           R   P +VTY+ L++GFC+   L+ A+++F  MVSK C P+V++Y  LI G+CK ++VEE
Sbjct: 361 RSIDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEE 420

Query: 248 AMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLD 307
            M+++ EM   G+  + +T+N L+ GLFQAG    A+++F  + + G+  +  TYN  LD
Sbjct: 421 GMEVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLD 480

Query: 308 GLCKAGKLETAWELFDKLSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIP 367
           GLCK GKLE A  +F+ L    + P + TY+IMI G CK G+VE   DLF  +   G  P
Sbjct: 481 GLCKNGKLEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKP 540

Query: 368 NVITYNTLLRGFCESNKSEEVVQLLHRMVQKNVPPDA 405
           +V+ YNT++ GFC     EE   L   M +    P++
Sbjct: 541 DVVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNS 571

BLAST of Tan0009518 vs. ExPASy Swiss-Prot
Match: Q9CAN0 (Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g63130 PE=2 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 9.0e-91
Identity = 163/403 (40.45%), Positives = 253/403 (62.78%), Query Frame = 0

Query: 4   IIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGN 63
           ++  GY P+   + TLI GL R +R SEA  L  RM   GC+P+++TYG+++ GLC+ G+
Sbjct: 177 MVEMGYQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGD 236

Query: 64  TNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPN 123
            ++AL L + M  E G      +PG++ Y+ IID LC      +A  L  EM  +G+ PN
Sbjct: 237 IDLALSLLKKM--EQG----KIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPN 296

Query: 124 VISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLE 183
           V++Y+SLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L +
Sbjct: 297 VVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYD 356

Query: 184 VMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNR 243
            M++R   PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK +
Sbjct: 357 EMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAK 416

Query: 244 KVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYN 303
           +V+E M+L+ EM   G+  + +T+  L+ G FQA +  +A+ +F  + + G++ D  TY+
Sbjct: 417 RVDEGMELFREMSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYS 476

Query: 304 IFLDGLCKAGKLETAWELFDKLSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEEN 363
           I LDGLC  GK+ETA  +F+ L    + P++ TY+IMI G CK G+VE   DLF  +   
Sbjct: 477 ILLDGLCNNGKVETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLK 536

Query: 364 GCIPNVITYNTLLRGFCESNKSEEVVQLLHRMVQKNVPPDAST 407
           G  PNV+TY T++ GFC     EE   L   M ++   PD+ T
Sbjct: 537 GVKPNVVTYTTMMSGFCRKGLKEEADALFREMKEEGPLPDSGT 573

BLAST of Tan0009518 vs. ExPASy Swiss-Prot
Match: Q9LQ14 (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 331.6 bits (849), Expect = 1.3e-89
Identity = 171/454 (37.67%), Positives = 264/454 (58.15%), Query Frame = 0

Query: 9   YVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIAL 68
           Y PN V + TLI GL   ++ SEA  L  RM   GC+P++ TYG ++ GLC+ G+ ++AL
Sbjct: 181 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 240

Query: 69  KLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISYS 128
            L + M K         +  ++ Y+ IID LC      +A  L  EM  +G+ PNV++Y+
Sbjct: 241 SLLKKMEKG------KIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYN 300

Query: 129 SLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQR 188
           SLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L + M++R
Sbjct: 301 SLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKR 360

Query: 189 GNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEEA 248
              PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK ++VEE 
Sbjct: 361 SIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEG 420

Query: 249 MKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHG-------------- 308
           M+L+ EM   G+  + +T+N L+ GLFQAG    A+K+F  + + G              
Sbjct: 421 MELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDG 480

Query: 309 ---------------------LVTDSYTYNIFLDGLCKAGKLETAWELFDKLSHEGLLPN 368
                                +  D YTYNI ++G+CKAGK+E  W+LF  LS +G+ PN
Sbjct: 481 LCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPN 540

Query: 369 VVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEVVQLLH 428
           V+ Y+ MI G C++G  E+A  LF++M+E+G +PN  TYNTL+R            +L+ 
Sbjct: 541 VIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIK 600

BLAST of Tan0009518 vs. NCBI nr
Match: KAG7014482.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 698.4 bits (1801), Expect = 4.0e-197
Identity = 346/467 (74.09%), Positives = 381/467 (81.58%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRG++PNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +I YS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVISYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA E
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANE 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL TYN LM GFCLV DL+SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PSV+T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVNDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EGLLPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGLLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV+KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPMFPVQ 636

BLAST of Tan0009518 vs. NCBI nr
Match: KAG6575949.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 698.4 bits (1801), Expect = 4.0e-197
Identity = 346/467 (74.09%), Positives = 381/467 (81.58%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRG++PNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +I YS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVISYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA E
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANE 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL TYN LM GFCLV DL+SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PSV+T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVNDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EGLLPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGLLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV+KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPMFPVQ 636

BLAST of Tan0009518 vs. NCBI nr
Match: XP_022992036.1 (pentatricopeptide repeat-containing protein At1g63330-like [Cucurbita maxima])

HSP 1 Score: 697.6 bits (1799), Expect = 6.9e-197
Identity = 346/467 (74.09%), Positives = 378/467 (80.94%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRGYVPNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGYVPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +ICYS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA +
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANK 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL  YN LM GFCLV DL SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFIYNTLMDGFCLVSDLESARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PSV+T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGVKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EG LPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGFLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVNKNVVPDASTCTIVLDMLSEDEKYRGCLNLLPTFPVQ 636

BLAST of Tan0009518 vs. NCBI nr
Match: XP_022953742.1 (putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial [Cucurbita moschata])

HSP 1 Score: 697.2 bits (1798), Expect = 9.0e-197
Identity = 345/467 (73.88%), Positives = 381/467 (81.58%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRG++PNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +I YS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVISYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA E
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANE 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL TYN LM GFCLV DL+SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PS++T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGIKPSMITYNALLTGLFQAGKVNDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EGLLPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGLLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV+KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQ 636

BLAST of Tan0009518 vs. NCBI nr
Match: XP_023548637.1 (pentatricopeptide repeat-containing protein At1g63330-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 696.0 bits (1795), Expect = 2.0e-196
Identity = 345/467 (73.88%), Positives = 380/467 (81.37%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRG++PNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +ICYS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA E
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANE 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGN PDL TYN LM GFCLV DL+SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PSV+T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EGLLPNVVTYSI+IHG CKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGLLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV+KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQ 636

BLAST of Tan0009518 vs. ExPASy TrEMBL
Match: A0A6J1JSG4 (pentatricopeptide repeat-containing protein At1g63330-like OS=Cucurbita maxima OX=3661 GN=LOC111488506 PE=4 SV=1)

HSP 1 Score: 697.6 bits (1799), Expect = 3.3e-197
Identity = 346/467 (74.09%), Positives = 378/467 (80.94%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRGYVPNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGYVPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +ICYS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA +
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANK 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL  YN LM GFCLV DL SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFIYNTLMDGFCLVSDLESARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PSV+T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGVKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EG LPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGFLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVNKNVVPDASTCTIVLDMLSEDEKYRGCLNLLPTFPVQ 636

BLAST of Tan0009518 vs. ExPASy TrEMBL
Match: A0A6J1GP31 (putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111456184 PE=4 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 4.3e-197
Identity = 345/467 (73.88%), Positives = 381/467 (81.58%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRG++PNIV YT+LIKGLC  HRISEAT LFMRMQKLGCRPNVITYG L+KGLCQ
Sbjct: 170 MAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQ 229

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GISCKP +I YS IIDGLCKDGRE +AREL EEMKA+ M
Sbjct: 230 TGNTNIALKLHEEMLNGTGRYGISCKPNVISYSTIIDGLCKDGREDKARELFEEMKARRM 289

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D G+QPN VTFNVLMD+LCK GKVIEA E
Sbjct: 290 LPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANE 349

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           LLEVM+QRGNVPDL TYN LM GFCLV DL+SARELF+SM SKGCEPNVISY VLI GYC
Sbjct: 350 LLEVMIQRGNVPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYC 409

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           KN KVEEAMK+YNEML VG++PS++T+N LL GLFQAGKV DAKK+FG I+AHGLV  S 
Sbjct: 410 KNWKVEEAMKIYNEMLQVGIKPSMITYNALLTGLFQAGKVNDAKKIFGVIQAHGLVPSSS 469

Query: 301 TYNIF----------------------------------LDGLCKAGKLETAWELFDKLS 360
           T +IF                                  +DGLCKAGKLETAWE FDK+S
Sbjct: 470 TLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 529

Query: 361 HEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSE 420
            EGLLPNVVTYSI+IHGCCKEGQVEKA DLF+KMEENGC PNVITYNTLLRGF +SNK E
Sbjct: 530 REGLLPNVVTYSILIHGCCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 589

Query: 421 EVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQ 434
           EVV+LLHRMV+KNV PDASTC +V+DMLS +EKYR  L+LLP FPVQ
Sbjct: 590 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQ 636

BLAST of Tan0009518 vs. ExPASy TrEMBL
Match: A0A6J1DSW3 (pentatricopeptide repeat-containing protein At1g63330-like OS=Momordica charantia OX=3673 GN=LOC111022854 PE=4 SV=1)

HSP 1 Score: 696.0 bits (1795), Expect = 9.7e-197
Identity = 347/471 (73.67%), Positives = 381/471 (80.89%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+GIIRRGY+PNIV YT+LIKGLC  HRISEAT LFMRMQKLGC PNVITYG L+KGLCQ
Sbjct: 172 MAGIIRRGYIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCTPNVITYGTLIKGLCQ 231

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGNTNIALKL E ML  TG  GI+CKP +ICYS IIDGLCKDG E +AREL EEMKAQGM
Sbjct: 232 TGNTNIALKLHEEMLNGTGRYGITCKPNVICYSTIIDGLCKDGLEDKARELFEEMKAQGM 291

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           +P+VISYSSLIHGFC GGKWEEAK LFNEM+D GVQPNVVTFNVLMDMLCK GKVIEAKE
Sbjct: 292 LPDVISYSSLIHGFCYGGKWEEAKSLFNEMVDHGVQPNVVTFNVLMDMLCKAGKVIEAKE 351

Query: 181 LLEVMVQRG-NVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGY 240
           LLE+MVQ G N PDL TYN LM GFCLVGDLNSARELF++M +KGCEPNVISY VLI GY
Sbjct: 352 LLELMVQGGNNAPDLFTYNTLMDGFCLVGDLNSARELFINMPNKGCEPNVISYNVLINGY 411

Query: 241 CKNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDS 300
           CKN K+EEAMKLYNEML VG+RPSV+T+N LL GLFQAG V DAKKLFG I+A+GL   S
Sbjct: 412 CKNWKMEEAMKLYNEMLQVGIRPSVITYNSLLTGLFQAGMVVDAKKLFGVIQANGLAPSS 471

Query: 301 YTYNIFL-----------------------------------DGLCKAGKLETAWELFDK 360
            TY+ FL                                   DGLCKAGKLETAWELFDK
Sbjct: 472 STYSTFLDGLCKNDCLLEAIELFNGLKPYNLKLNIEIFNCLIDGLCKAGKLETAWELFDK 531

Query: 361 LSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNK 420
            S EGLLPNVVTYSIMIHG CK+GQ+EKA DLF+KMEENGC PN+ITYNTL+RGF E+NK
Sbjct: 532 FSLEGLLPNVVTYSIMIHGLCKDGQLEKAIDLFRKMEENGCTPNIITYNTLMRGFYENNK 591

Query: 421 SEEVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQRC 436
            EEVV+LLHRMV+KNV PDASTC +V+DMLS +EKY+E L+LLPRFP Q C
Sbjct: 592 PEEVVELLHRMVKKNVLPDASTCTIVLDMLSEDEKYQECLNLLPRFPAQEC 642

BLAST of Tan0009518 vs. ExPASy TrEMBL
Match: A0A1S4DYL6 (pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103492586 PE=4 SV=1)

HSP 1 Score: 649.8 bits (1675), Expect = 8.0e-183
Identity = 321/469 (68.44%), Positives = 369/469 (78.68%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+G++RRGY+PN+V YTTLIKGLC  HRISEAT LF+RMQKLGC PNV+TYG L+KGLCQ
Sbjct: 98  MAGLLRRGYIPNVVTYTTLIKGLCMEHRISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQ 157

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGN NIALKL + ML +T   GI+CKP +  Y+IIIDGLCK GRE EA EL EEMKAQGM
Sbjct: 158 TGNVNIALKLHQEMLNDTSQYGINCKPNVFNYNIIIDGLCKVGREDEANELFEEMKAQGM 217

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           IPNVISYSSLIHGFCC  KWEE+KRLF+EM+DQGVQP+ VTF+VL+D LCK GKVIEAK+
Sbjct: 218 IPNVISYSSLIHGFCCARKWEESKRLFDEMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKK 277

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           L EVM+QRG VPDL  Y+ LM GFC+VGDLNSARELFVSM SKGCEP+VISY VLI GYC
Sbjct: 278 LFEVMIQRGIVPDLFIYSSLMEGFCMVGDLNSARELFVSMPSKGCEPDVISYTVLINGYC 337

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           K  KVEEAMKLYNEML VG RP+V+T+  LL GLF AGKVGDAKKLF A++A G+  +S+
Sbjct: 338 KTLKVEEAMKLYNEMLLVGKRPNVITYGALLTGLFLAGKVGDAKKLFSAMKARGISANSH 397

Query: 301 -----------------------------------TYNIFLDGLCKAGKLETAWELFDKL 360
                                              TY+  +DGLCK GKLETAWELF+KL
Sbjct: 398 IYGIILDGLCKNGCLFEAMKLFTELKSYNFKLDIETYSCLIDGLCKEGKLETAWELFEKL 457

Query: 361 SHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKS 420
           S EGL PNVVTYSIMIHG C+EGQV+KA  L QKME NGC PN+ITYNTL+RGF ESNK 
Sbjct: 458 SQEGLQPNVVTYSIMIHGLCREGQVDKANVLIQKMETNGCNPNIITYNTLMRGFYESNKL 517

Query: 421 EEVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQR 435
           +EVVQLLH MV+K+V PDA+TC++VVDML  +EKY+E LDLLPRF VQ+
Sbjct: 518 DEVVQLLHGMVKKDVLPDATTCSIVVDMLCKDEKYQECLDLLPRFSVQK 566

BLAST of Tan0009518 vs. ExPASy TrEMBL
Match: A0A5D3C8J0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold886G00440 PE=4 SV=1)

HSP 1 Score: 649.8 bits (1675), Expect = 8.0e-183
Identity = 321/469 (68.44%), Positives = 369/469 (78.68%), Query Frame = 0

Query: 1   MSGIIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQ 60
           M+G++RRGY+PN+V YTTLIKGLC  HRISEAT LF+RMQKLGC PNV+TYG L+KGLCQ
Sbjct: 98  MAGLLRRGYIPNVVTYTTLIKGLCMEHRISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQ 157

Query: 61  TGNTNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGM 120
           TGN NIALKL + ML +T   GI+CKP +  Y+IIIDGLCK GRE EA EL EEMKAQGM
Sbjct: 158 TGNVNIALKLHQEMLNDTSQYGINCKPNVFNYNIIIDGLCKVGREDEANELFEEMKAQGM 217

Query: 121 IPNVISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKE 180
           IPNVISYSSLIHGFCC  KWEE+KRLF+EM+DQGVQP+ VTF+VL+D LCK GKVIEAK+
Sbjct: 218 IPNVISYSSLIHGFCCARKWEESKRLFDEMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKK 277

Query: 181 LLEVMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYC 240
           L EVM+QRG VPDL  Y+ LM GFC+VGDLNSARELFVSM SKGCEP+VISY VLI GYC
Sbjct: 278 LFEVMIQRGIVPDLFIYSSLMEGFCMVGDLNSARELFVSMPSKGCEPDVISYTVLINGYC 337

Query: 241 KNRKVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSY 300
           K  KVEEAMKLYNEML VG RP+V+T+  LL GLF AGKVGDAKKLF A++A G+  +S+
Sbjct: 338 KTLKVEEAMKLYNEMLLVGKRPNVITYGALLTGLFLAGKVGDAKKLFSAMKARGISANSH 397

Query: 301 -----------------------------------TYNIFLDGLCKAGKLETAWELFDKL 360
                                              TY+  +DGLCK GKLETAWELF+KL
Sbjct: 398 IYGIILDGLCKNGCLFEAMKLFTELKSYNFKLDIETYSCLIDGLCKEGKLETAWELFEKL 457

Query: 361 SHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKS 420
           S EGL PNVVTYSIMIHG C+EGQV+KA  L QKME NGC PN+ITYNTL+RGF ESNK 
Sbjct: 458 SQEGLQPNVVTYSIMIHGLCREGQVDKANVLIQKMETNGCNPNIITYNTLMRGFYESNKL 517

Query: 421 EEVVQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLLPRFPVQR 435
           +EVVQLLH MV+K+V PDA+TC++VVDML  +EKY+E LDLLPRF VQ+
Sbjct: 518 DEVVQLLHGMVKKDVLPDATTCSIVVDMLCKDEKYQECLDLLPRFSVQK 566

BLAST of Tan0009518 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 2.5e-96
Identity = 176/452 (38.94%), Positives = 270/452 (59.73%), Query Frame = 0

Query: 11  PNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIALKL 70
           P++V  +TLI GLC   R+SEA +L  RM + G +P+ +TYG ++  LC++GN+ +AL L
Sbjct: 173 PDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDL 232

Query: 71  FENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISYSSL 130
           F  M +       + K  ++ YSI+ID LCKDG   +A  L  EM+ +G+  +V++YSSL
Sbjct: 233 FRKMEER------NIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSL 292

Query: 131 IHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQRGN 190
           I G C  GKW++  ++  EM+ + + P+VVTF+ L+D+  K GK++EAKEL   M+ RG 
Sbjct: 293 IGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGI 352

Query: 191 VPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEEAMK 250
            PD +TYN L+ GFC    L+ A ++F  MVSKGCEP++++Y +LI  YCK ++V++ M+
Sbjct: 353 APDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMR 412

Query: 251 LYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLDGL- 310
           L+ E+   G+ P+ +T+N L++G  Q+GK+  AK+LF  + + G+     TY I LDGL 
Sbjct: 413 LFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLC 472

Query: 311 ----------------------------------CKAGKLETAWELFDKLSHEGLLPNVV 370
                                             C A K++ AW LF  LS +G+ P+VV
Sbjct: 473 DNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVV 532

Query: 371 TYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEVVQLLHRM 428
           TY++MI G CK+G + +A  LF+KM+E+GC P+  TYN L+R     +     V+L+  M
Sbjct: 533 TYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEM 592

BLAST of Tan0009518 vs. TAIR 10
Match: AT1G62910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 339.7 bits (870), Expect = 3.4e-93
Identity = 174/459 (37.91%), Positives = 269/459 (58.61%), Query Frame = 0

Query: 4   IIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGN 63
           ++  GY P+   +TTLI GL   ++ SEA  L  +M + GC+P+++TYG ++ GLC+ G+
Sbjct: 179 MVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGD 238

Query: 64  TNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPN 123
            ++AL L + M K         +  ++ Y+ IIDGLCK     +A  L  EM  +G+ P+
Sbjct: 239 IDLALSLLKKMEKG------KIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPD 298

Query: 124 VISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLE 183
           V +YSSLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L +
Sbjct: 299 VFTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYD 358

Query: 184 VMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNR 243
            M++R   PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK +
Sbjct: 359 EMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAK 418

Query: 244 KVEEAMKLYNE-----------------------------------MLGVGMRPSVLTHN 303
           +VEE M+L+ E                                   M+ VG+ P++LT+N
Sbjct: 419 RVEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYN 478

Query: 304 FLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLDGLCKAGKLETAWELFDKLSHE 363
            LL GL + GK+  A  +F  ++   +  D YTYNI ++G+CKAGK+E  WELF  LS +
Sbjct: 479 ILLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLK 538

Query: 364 GLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEV 423
           G+ PNV+ Y+ MI G C++G  E+A  L +KM+E+G +PN  TYNTL+R        E  
Sbjct: 539 GVSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREAS 598

Query: 424 VQLLHRMVQKNVPPDASTCAVVVDMLSTEEKYREYLDLL 428
            +L+  M       DAST  +V +ML      + +LD+L
Sbjct: 599 AELIKEMRSCGFAGDASTIGLVTNMLHDGRLDKSFLDML 631

BLAST of Tan0009518 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2)

HSP 1 Score: 338.6 bits (867), Expect = 7.5e-93
Identity = 166/397 (41.81%), Positives = 251/397 (63.22%), Query Frame = 0

Query: 8   GYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIA 67
           GY PN V + TLI GL   ++ SEA  L  RM   GC+P+++TYGV++ GLC+ G+T++A
Sbjct: 181 GYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLA 240

Query: 68  LKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISY 127
             L   M  E G      +PG++ Y+ IIDGLCK     +A  L +EM+ +G+ PNV++Y
Sbjct: 241 FNLLNKM--EQG----KLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTY 300

Query: 128 SSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQ 187
           SSLI   C  G+W +A RL ++M+++ + P+V TF+ L+D   K GK++EA++L + MV+
Sbjct: 301 SSLISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVK 360

Query: 188 RGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEE 247
           R   P +VTY+ L++GFC+   L+ A+++F  MVSK C P+V++Y  LI G+CK ++VEE
Sbjct: 361 RSIDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEE 420

Query: 248 AMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYNIFLD 307
            M+++ EM   G+  + +T+N L+ GLFQAG    A+++F  + + G+  +  TYN  LD
Sbjct: 421 GMEVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLD 480

Query: 308 GLCKAGKLETAWELFDKLSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIP 367
           GLCK GKLE A  +F+ L    + P + TY+IMI G CK G+VE   DLF  +   G  P
Sbjct: 481 GLCKNGKLEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKP 540

Query: 368 NVITYNTLLRGFCESNKSEEVVQLLHRMVQKNVPPDA 405
           +V+ YNT++ GFC     EE   L   M +    P++
Sbjct: 541 DVVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNS 571

BLAST of Tan0009518 vs. TAIR 10
Match: AT1G63130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 335.5 bits (859), Expect = 6.4e-92
Identity = 163/403 (40.45%), Positives = 253/403 (62.78%), Query Frame = 0

Query: 4   IIRRGYVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGN 63
           ++  GY P+   + TLI GL R +R SEA  L  RM   GC+P+++TYG+++ GLC+ G+
Sbjct: 177 MVEMGYQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGD 236

Query: 64  TNIALKLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPN 123
            ++AL L + M  E G      +PG++ Y+ IID LC      +A  L  EM  +G+ PN
Sbjct: 237 IDLALSLLKKM--EQG----KIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPN 296

Query: 124 VISYSSLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLE 183
           V++Y+SLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L +
Sbjct: 297 VVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYD 356

Query: 184 VMVQRGNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNR 243
            M++R   PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK +
Sbjct: 357 EMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAK 416

Query: 244 KVEEAMKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHGLVTDSYTYN 303
           +V+E M+L+ EM   G+  + +T+  L+ G FQA +  +A+ +F  + + G++ D  TY+
Sbjct: 417 RVDEGMELFREMSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYS 476

Query: 304 IFLDGLCKAGKLETAWELFDKLSHEGLLPNVVTYSIMIHGCCKEGQVEKAKDLFQKMEEN 363
           I LDGLC  GK+ETA  +F+ L    + P++ TY+IMI G CK G+VE   DLF  +   
Sbjct: 477 ILLDGLCNNGKVETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLK 536

Query: 364 GCIPNVITYNTLLRGFCESNKSEEVVQLLHRMVQKNVPPDAST 407
           G  PNV+TY T++ GFC     EE   L   M ++   PD+ T
Sbjct: 537 GVKPNVVTYTTMMSGFCRKGLKEEADALFREMKEEGPLPDSGT 573

BLAST of Tan0009518 vs. TAIR 10
Match: AT1G62930.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 331.6 bits (849), Expect = 9.2e-91
Identity = 171/454 (37.67%), Positives = 264/454 (58.15%), Query Frame = 0

Query: 9   YVPNIVAYTTLIKGLCRVHRISEATLLFMRMQKLGCRPNVITYGVLMKGLCQTGNTNIAL 68
           Y PN V + TLI GL   ++ SEA  L  RM   GC+P++ TYG ++ GLC+ G+ ++AL
Sbjct: 181 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 240

Query: 69  KLFENMLKETGGCGISCKPGLICYSIIIDGLCKDGREVEARELLEEMKAQGMIPNVISYS 128
            L + M K         +  ++ Y+ IID LC      +A  L  EM  +G+ PNV++Y+
Sbjct: 241 SLLKKMEKG------KIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYN 300

Query: 129 SLIHGFCCGGKWEEAKRLFNEMMDQGVQPNVVTFNVLMDMLCKTGKVIEAKELLEVMVQR 188
           SLI   C  G+W +A RL ++M+++ + PNVVTF+ L+D   K GK++EA++L + M++R
Sbjct: 301 SLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKR 360

Query: 189 GNVPDLVTYNILMHGFCLVGDLNSARELFVSMVSKGCEPNVISYGVLITGYCKNRKVEEA 248
              PD+ TY+ L++GFC+   L+ A+ +F  M+SK C PNV++Y  LI G+CK ++VEE 
Sbjct: 361 SIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEG 420

Query: 249 MKLYNEMLGVGMRPSVLTHNFLLMGLFQAGKVGDAKKLFGAIRAHG-------------- 308
           M+L+ EM   G+  + +T+N L+ GLFQAG    A+K+F  + + G              
Sbjct: 421 MELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDG 480

Query: 309 ---------------------LVTDSYTYNIFLDGLCKAGKLETAWELFDKLSHEGLLPN 368
                                +  D YTYNI ++G+CKAGK+E  W+LF  LS +G+ PN
Sbjct: 481 LCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPN 540

Query: 369 VVTYSIMIHGCCKEGQVEKAKDLFQKMEENGCIPNVITYNTLLRGFCESNKSEEVVQLLH 428
           V+ Y+ MI G C++G  E+A  LF++M+E+G +PN  TYNTL+R            +L+ 
Sbjct: 541 VIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIK 600

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6NQ833.5e-9538.94Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Q9LQ164.8e-9237.91Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX... [more]
Q9SXD11.1e-9141.81Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Q9CAN09.0e-9140.45Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidop... [more]
Q9LQ141.3e-8937.67Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop... [more]

Match Name	E-value	Identity (%)	Description
KAG7014482.1	4.0e-197	74.09	Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
KAG6575949.1	4.0e-197	74.09	Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022992036.1	6.9e-197	74.09	pentatricopeptide repeat-containing protein At1g63330-like [Cucurbita maxima]
XP_022953742.1	9.0e-197	73.88	putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial [C...
XP_023548637.1	2.0e-196	73.88	pentatricopeptide repeat-containing protein At1g63330-like [Cucurbita pepo subsp...

Match Name	E-value	Identity (%)	Description
A0A6J1JSG4	3.3e-197	74.09	pentatricopeptide repeat-containing protein At1g63330-like OS=Cucurbita maxima O...
A0A6J1GP31	4.3e-197	73.88	putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS...
A0A6J1DSW3	9.7e-197	73.67	pentatricopeptide repeat-containing protein At1g63330-like OS=Momordica charanti...
A0A1S4DYL6	8.0e-183	68.44	pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like OS=Cuc...
A0A5D3C8J0	8.0e-183	68.44	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name	E-value	Identity (%)	Description
AT3G22470.1	2.5e-96	38.94	Pentatricopeptide repeat (PPR) superfamily protein
AT1G62910.1	3.4e-93	37.91	Pentatricopeptide repeat (PPR) superfamily protein
AT1G62670.1	7.5e-93	41.81	rna processing factor 2
AT1G63130.1	6.4e-92	40.45	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62930.1	9.2e-91	37.67	Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
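IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 85..116; e-value: 8.2E-10; score: 38.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 367..414; e-value: 2.2E-13; score: 50.2
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 192..241; e-value: 4.7E-16; score: 58.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 11..60; e-value: 2.4E-17; score: 62.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 122..171; e-value: 3.2E-17; score: 62.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 298..346; e-value: 2.7E-18; score: 65.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 125..159; e-value: 9.8E-12; score: 42.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 230..263; e-value: 2.8E-9; score: 34.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 335..369; e-value: 3.4E-14; score: 50.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 90..124; e-value: 2.9E-9; score: 34.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 160..193; e-value: 1.9E-7; score: 28.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 49..77; e-value: 4.2E-5; score: 21.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 370..404; e-value: 1.1E-8; score: 32.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 300..334; e-value: 7.0E-10; score: 36.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 14..48; e-value: 2.1E-8; score: 31.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 195..229; e-value: 1.2E-10; score: 38.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 266..294; e-value: 0.14; score: 12.5
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 158..192; score: 12.441133
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 333..367; score: 15.017038
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 123..157; score: 14.09629
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 298..332; score: 13.42765
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 12..46; score: 12.342482
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 88..122; score: 11.662881
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 193..227; score: 13.274192
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 228..262; score: 13.318037
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 368..402; score: 12.934392
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 47..77; score: 10.128299
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 2..76; e-value: 2.6E-22; score: 81.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 77..150; e-value: 3.1E-21; score: 77.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 151..222; e-value: 6.0E-23; score: 83.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 315..427; e-value: 5.5E-37; score: 128.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 223..314; e-value: 3.6E-27; score: 96.9
None	No IPR available	PANTHER	PTHR47941	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIAL	coord: 4..295
None	No IPR available	PANTHER	PTHR47941	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIAL	coord: 85..427
None	No IPR available	SUPERFAMILY	81901	HCP-like	coord: 138..363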
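
The PROSITE PS51375 (PPR) coordinates listed above are mostly back-to-back, so the individual repeats can be collapsed into contiguous blocks. The following is a minimal sketch in Python, assuming inclusive 1-based protein coordinates; the helper name `merge_ranges` is hypothetical and not part of InterProScan.

```python
def merge_ranges(ranges):
    """Merge inclusive 1-based (start, end) ranges that overlap or abut."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block when this range overlaps or touches it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# PROSITE PS51375 (PPR) hit coordinates, in the order listed in the table.
ppr_hits = [(158, 192), (333, 367), (123, 157), (298, 332), (12, 46),
            (88, 122), (193, 227), (228, 262), (368, 402), (47, 77)]

blocks = merge_ranges(ppr_hits)
covered = sum(end - start + 1 for start, end in blocks)
print(blocks, covered)
```

Under these assumptions the ten repeats collapse into three blocks. The uncovered stretch between the second and third block (residues 263..297) contains the weak PF01535 hit at 266..294 (e-value 0.14), which may indicate a degenerate repeat that the PROSITE profile does not score.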

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0009518.1	Tan0009518.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0050794	regulation of cellular process
molecular_function	GO:0005515	protein binding