Tan0002243 (gene) Snake gourd v1

Overview
Name: Tan0002243
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat
Location: LG01: 15502560 .. 15504962 (-)
RNA-Seq Expression: Tan0002243
Synteny: Tan0002243
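
The Location value above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated here); the names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # linkage group or chromosome, e.g. "LG01"
    start: int   # leftmost coordinate
    end: int     # rightmost coordinate
    strand: str  # "+" or "-"


# Matches strings of the form "LG01: 15502560 .. 15504962 (-)".
_LOC = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence id, start, end and strand."""
    match = _LOC.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the feature, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("LG01: 15502560 .. 15504962 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumption the feature spans 2403 bp on the minus strand of LG01.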
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTATCTTCCTCGGTACGTACAGAAATGTCCCAAACCCCTGATTCTCTCTTTTTCTAGGGTTTAACCATTTCTCTTTCACCTTCCCTTCTCCTTCTTCCGACGGCTTCCGCTTTCTCTCCGCCGCAGAATCATGTCATTTTACCGCCTCCTCCTCTGCTCTCTCCGCCGCTCTTCAACCTTTCCGTCGCATACCCGAGCTCTGAGCATTAGTCCTCTGAACCACCATCTGCAGGCTCCGATTCCACCGTCCTCTCAAAGCTCGTCTCCTATTTCGCTCCTCCATGCCCGCTCATTTGCCTTTTCCTCTGCCGAAGAAGCTGCCGCCGAAAGGCGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTACCCGCCTTCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGCCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCAGGTGATCTTGATGCGGCTTCTGCGGTTGCTCGCCACTCTCTGTTCTCGAACACGCGGCCGACGGTTTTTACTTGTAACGCTATTATTGCTGCTATGTATCGGGCAAAGAGGTATAGTGATGCGATTGCGCTGTTTCAGTACTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGCGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTAACTAAGGGATTGATTGATTCTGCGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAACAAAGGGCATGGAGCTGATTCGCTGGTTTATAATAATTTGATTTCTGGGTTTCTGAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTTTATGACGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAGGGGGAAAGAAAAGGAGGCTATGGAATCGTACAAGTCATTGCTTGATCGGCAATTCAAGATGATTCCAGCAACTTGCAATGTGCTGTTGGAGGTTTTACTTAAACATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAACCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGATACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGCAAGTTCTCAGAGGCAATAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCAATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCATGGAGTGATGACAGATGCAGAGTCTTTCTTTTCTGAACTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGAACATTGATTGAAGCTTATTTAAAGGTTGAGCAGATTGATGATGTATTGAGAGTTTTTAACAGGATGGTCGATGTTGGTTTGAGAGTCGTTGCTAGCTTCGGAAACAGGGTATTCGGTGAATTGATTAAGAATGGCAAGGCCGTTGACTGTGCTCAGATTTTAACAAAAATGGGCGAGAAGGATCCTAAACCAGATCCCACATGCTATGACGTGGTGATTAGAGGGCTATGTAATGAAGGTGCACTGGATGCAAGTCTGGAGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCTGGTCGAAATGAAGAGATTGAAAGACTACTAAATATGAATAGATGGGGACATGCTCCTTATCGCCCCCCCTCCGGACCCCCAAGAATTTTACAATCGCAGGTCCCACCTCAAATGGGTCCGCCTCGTCAGCCACCTCAAGGAACCCCTCAAATGGCAGAATCACATTGGCGACCTTCCATAAACCCTCAAGCAAGAGGAAGTTATG
ATGCCCCTTCAGCACCTCAAATGACAGTTCCTAATCATTTTCAATCAGGATCAGCTCAAATGACAAGAACACAACAGCCATCTTCAGATCCACCACCTCCAATGGAAGAACATCATCACTCACAACAACCCTCTCAAATGGCTGGGCAGGCGGCCGCTTGATTCTTAGATTCCTAAGTTATTGGAGGATGCTATTGTCAAATTTTGACTTTGTGTTGACAGGCAATACTTGATGAATTACATACCACCTCTTTGTGCTACTTGTTCTTTCATTCAAGCTTTCTTGTTAGTACGAGAATTTGGTTATATCCGTGGTGTCTGTCTGAGATATTATGTATTTACCTGATATATCAAGGTGGCTTCCCCTGTTACACTTTTTAATTATTGATAATATTTTGATTCAC

mRNA sequence

TTATCTTCCTCGGTACGTACAGAAATGTCCCAAACCCCTGATTCTCTCTTTTTCTAGGGTTTAACCATTTCTCTTTCACCTTCCCTTCTCCTTCTTCCGACGGCTTCCGCTTTCTCTCCGCCGCAGAATCATGTCATTTTACCGCCTCCTCCTCTGCTCTCTCCGCCGCTCTTCAACCTTTCCGTCGCATACCCGAGCTCTGAGCATTAGTCCTCTGAACCACCATCTGCAGGCTCCGATTCCACCGTCCTCTCAAAGCTCGTCTCCTATTTCGCTCCTCCATGCCCGCTCATTTGCCTTTTCCTCTGCCGAAGAAGCTGCCGCCGAAAGGCGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTACCCGCCTTCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGCCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCAGGTGATCTTGATGCGGCTTCTGCGGTTGCTCGCCACTCTCTGTTCTCGAACACGCGGCCGACGGTTTTTACTTGTAACGCTATTATTGCTGCTATGTATCGGGCAAAGAGGTATAGTGATGCGATTGCGCTGTTTCAGTACTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGCGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTAACTAAGGGATTGATTGATTCTGCGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAACAAAGGGCATGGAGCTGATTCGCTGGTTTATAATAATTTGATTTCTGGGTTTCTGAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTTTATGACGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAGGGGGAAAGAAAAGGAGGCTATGGAATCGTACAAGTCATTGCTTGATCGGCAATTCAAGATGATTCCAGCAACTTGCAATGTGCTGTTGGAGGTTTTACTTAAACATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAACCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGATACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGCAAGTTCTCAGAGGCAATAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCAATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCATGGAGTGATGACAGATGCAGAGTCTTTCTTTTCTGAACTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGAACATTGATTGAAGCTTATTTAAAGGTTGAGCAGATTGATGATGTATTGAGAGTTTTTAACAGGATGGTCGATGTTGGTTTGAGAGTCGTTGCTAGCTTCGGAAACAGGGTATTCGGTGAATTGATTAAGAATGGCAAGGCCGTTGACTGTGCTCAGATTTTAACAAAAATGGGCGAGAAGGATCCTAAACCAGATCCCACATGCTATGACGTGGTGATTAGAGGGCTATGTAATGAAGGTGCACTGGATGCAAGTCTGGAGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCTGGTCGAAATGAAGAGATTGAAAGACTACTAAATATGAATAGATGGGGACATGCTCCTTATCGCCCCCCCTCCGGACCCCCAAGAATTTTACAATCGCAGGTCCCACCTCAAATGGGTCCGCCTCGTCAGCCACCTCAAGGAACCCCTCAAATGGCAGAATCACATTGGCGACCTTCCATAAACCCTCAAGCAAGAGGAAGTTATG
ATGCCCCTTCAGCACCTCAAATGACAGTTCCTAATCATTTTCAATCAGGATCAGCTCAAATGACAAGAACACAACAGCCATCTTCAGATCCACCACCTCCAATGGAAGAACATCATCACTCACAACAACCCTCTCAAATGGCTGGGCAGGCGGCCGCTTGATTCTTAGATTCCTAAGTTATTGGAGGATGCTATTGTCAAATTTTGACTTTGTGTTGACAGGCAATACTTGATGAATTACATACCACCTCTTTGTGCTACTTGTTCTTTCATTCAAGCTTTCTTGTTAGTACGAGAATTTGGTTATATCCGTGGTGTCTGTCTGAGATATTATGTATTTACCTGATATATCAAGGTGGCTTCCCCTGTTACACTTTTTAATTATTGATAATATTTTGATTCAC

Coding sequence (CDS)

ATGTCATTTTACCGCCTCCTCCTCTGCTCTCTCCGCCGCTCTTCAACCTTTCCGTCGCATACCCGAGCTCTGAGCATTAGTCCTCTGAACCACCATCTGCAGGCTCCGATTCCACCGTCCTCTCAAAGCTCGTCTCCTATTTCGCTCCTCCATGCCCGCTCATTTGCCTTTTCCTCTGCCGAAGAAGCTGCCGCCGAAAGGCGCCGTAGAAAGCGCCGTCTTCGTATCGAACCCCCTCTCCATGCTCTTCGCCGCGACAACTACCCGCCTTCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCTCGCCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCAGGTGATCTTGATGCGGCTTCTGCGGTTGCTCGCCACTCTCTGTTCTCGAACACGCGGCCGACGGTTTTTACTTGTAACGCTATTATTGCTGCTATGTATCGGGCAAAGAGGTATAGTGATGCGATTGCGCTGTTTCAGTACTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGTCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGTCGCGTTGATGTGGGTCTTGAGATTTATCGCCATATTATTGCAAATGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTAACTAAGGGATTGATTGATTCTGCGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAACAAAGGGCATGGAGCTGATTCGCTGGTTTATAATAATTTGATTTCTGGGTTTCTGAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTTTATGACGGAGTTGTGAATGCTACGTTCATGGATTGGTTCTTTAATAGGGGGAAAGAAAAGGAGGCTATGGAATCGTACAAGTCATTGCTTGATCGGCAATTCAAGATGATTCCAGCAACTTGCAATGTGCTGTTGGAGGTTTTACTTAAACATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAACCACACTCCTCCAAATTTCCAAGCAGTCAATTCAGATACGTTTAACATAATGGTTAATGAGTGCTTTAAGCTTGGCAAGTTCTCAGAGGCAATAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCAATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCATGGAGTGATGACAGATGCAGAGTCTTTCTTTTCTGAACTTTGCTCGAAGTCCTTGTCCCCTGATGTCCCAACTCATAGAACATTGATTGAAGCTTATTTAAAGGTTGAGCAGATTGATGATGTATTGAGAGTTTTTAACAGGATGGTCGATGTTGGTTTGAGAGTCGTTGCTAGCTTCGGAAACAGGGTATTCGGTGAATTGATTAAGAATGGCAAGGCCGTTGACTGTGCTCAGATTTTAACAAAAATGGGCGAGAAGGATCCTAAACCAGATCCCACATGCTATGACGTGGTGATTAGAGGGCTATGTAATGAAGGTGCACTGGATGCAAGTCTGGAGTTGCTTGACCAGGTAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGTATTTGTAAAGGCTGGTCGAAATGAAGAGATTGAAAGACTACTAAATATGAATAGATGGGGACATGCTCCTTATCGCCCCCCCTCCGGACCCCCAAGAATTTTACAATCGCAGGTCCCACCTCAAATGGGTCCGCCTCGTCAGCCACCTCAAGGAACCCCTCAAATGGCAGAATCACATTGGCGACCTTCCATAAACCCTCAAGCAAGAGGAAGTTATGATGCCCCTTCAGCACCTCAAATGACAGTTCCTAATCATTTTCAATCAGGATCAGCTCAAATGACAAGAACACAACAGCCATCTTCAGATCCACCACCTCCAATGGAAGAACATCATCACTCACAACAACC
CTCTCAAATGGCTGGGCAGGCGGCCGCTTGA
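
The UTR lengths can be derived by locating the coding sequence inside the mRNA. This is a rough sketch, assuming the CDS is a single contiguous substring of the mRNA and that line breaks inside the pasted sequences should be ignored; `locate_cds` is a hypothetical helper, and the sequences in the demo are short toy strings rather than the full records above.

```python
def locate_cds(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) of `cds` within `mrna`.

    Whitespace (including line breaks from copy-pasting) is removed first.
    Only the first occurrence of the CDS is considered.
    """
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    return start, len(mrna) - start - len(cds)


# Toy example: 6 nt of 5' UTR, a 12 nt CDS split over two lines, 7 nt of 3' UTR.
toy_mrna = "GGAATC" + "ATGTCATTTTGA" + "TTCTTAG"
toy_cds = "ATGTCATTT\nTGA"
five_prime, three_prime = locate_cds(toy_mrna, toy_cds)
print(five_prime, three_prime)
```

To apply it to this record, paste the full mRNA and CDS strings in place of the toy values.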

Protein sequence

MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGLTPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPPQGTPQMAESHWRPSINPQARGSYDAPSAPQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEEHHHSQQPSQMAGQAAA
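
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene but is not stated on this page; `translate` is a hypothetical helper. The two demo inputs are the first 24 nucleotides and the last 33 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is stripped; codons with unexpected characters become 'X';
    a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_head = "ATGTCATTTTACCGCCTCCTCCTC"
cds_tail = "CCCTCTCAAATGGCTGGGCAGGCGGCCGCTTGA"
print(translate(cds_head))  # N-terminal residues
print(translate(cds_tail))  # C-terminal residues, stop codon dropped
```

The two fragments translate to `MSFYRLLL` and `PSQMAGQAAA`, matching the start and end of the protein sequence listed above.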
Homology
BLAST of Tan0002243 vs. ExPASy Swiss-Prot
Match: Q9SY69 (Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana OX=3702 GN=GRP23 PE=1 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 9.3e-212
Identity = 392/658 (59.57%), Positives = 484/658 (73.56%), Query Frame = 0

Query: 25  SISPL-NHHLQAPIPPSSQSSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHAL 84
           +++P+ N   Q  IP +     P   +  R+ AFSSAEEAAAERRRRKRRLRIEPPLHAL
Sbjct: 57  NLNPIPNDPSQFQIPQNHTPPIPYPPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL 116

Query: 85  RRD-NYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSLFSNT 144
           RRD + PP +RDPNAPRLPDSTSALVG RLNLHNRVQSLIRA DLDAAS +AR S+FSNT
Sbjct: 117 RRDPSAPPPKRDPNAPRLPDSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNT 176

Query: 145 RPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGL 204
           RPTVFTCNAIIAAMYRAKRYS++I+LFQYFF QSNIVPNVVSYN +INAHCDEG VD  L
Sbjct: 177 RPTVFTCNAIIAAMYRAKRYSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEAL 236

Query: 205 EIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISG 264
           E+YRHI+ANAPF+PS+VTYRHLTKGL+ + RIG+A  LLREML+KG  ADS VYNNLI G
Sbjct: 237 EVYRHILANAPFAPSSVTYRHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRG 296

Query: 265 FLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMI 324
           +L+L + +KA E FDELK +C VYDG+VNATFM+++F +G +KEAMESY+SLLD++F+M 
Sbjct: 297 YLDLGDFDKAVEFFDELKSKCTVYDGIVNATFMEYWFEKGNDKEAMESYRSLLDKKFRMH 356

Query: 325 PATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEA 384
           P T NVLLEV LK GKK EAW LF++MLDNH PPN  +VNSDT  IMVNECFK+G+FSEA
Sbjct: 357 PPTGNVLLEVFLKFGKKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEA 416

Query: 385 IETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSLSPDVPTHRT 444
           I TF+KVG++  S+PF MD  GY NI+ RFCE G++T+AE FF+E  S+SL  D P+HR 
Sbjct: 417 INTFKKVGSKVTSKPFVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRA 476

Query: 445 LIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIKNGKAVDCAQILTKMGEKD 504
           +I+AYLK E+IDD +++ +RMVDV LRVVA FG RVFGELIKNGK  + A++LTKMGE++
Sbjct: 477 MIDAYLKAERIDDAVKMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGERE 536

Query: 505 PKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGLTPTLQEFVKEVFVKAGRNEEIE 564
           PKPDP+ YDVV+RGLC+  ALD + +++ +++R+ +G+T  L+EF+ EVF KAGR EEIE
Sbjct: 537 PKPDPSIYDVVVRGLCDGDALDQAKDIVGEMIRHNVGVTTVLREFIIEVFEKAGRREEIE 596

Query: 565 RLLN-----MNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQP--PQGTPQMAESHWRPSI 624
           ++LN     +   G +   PP  P     +   PQ    R P   QG           + 
Sbjct: 597 KILNSVARPVRNAGQSGNTPPRVPAVFGTTPAAPQQPRDRAPWTSQGVVHSNSGWANGTA 656

Query: 625 NPQARGSYDAPSAP----QMTVPNHFQSGSAQMTRTQQP---SSDPPPPMEEHHHSQQ 667
              A G+Y A +        T  N  Q   +  T  QQP   S   P   ++   SQQ
Sbjct: 657 GQTAGGAYKANNGQNPSWSNTSDNQQQQSWSNQTAGQQPPSWSRQAPGYQQQQSWSQQ 714
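
The HSP summary lines can be parsed to recompute the reported percentages. This is a minimal sketch, assuming the two-line layout shown in the hit above; `parse_hsp` is a hypothetical helper rather than part of any BLAST toolkit, and the pattern tolerates either spelling of the positives label.

```python
import re

_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value and identity/positive fractions for one HSP."""
    s = _SCORE.search(score_line)
    i = _IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like HSP summary lines")
    identical, length, positive, _ = (int(x) for x in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "aligned_length": length,
        "pct_identity": 100.0 * identical / length,
        "pct_positives": 100.0 * positive / length,
    }


hsp = parse_hsp(
    "HSP 1 Score: 738.0 bits (1904), Expect = 9.3e-212",
    "Identity = 392/658 (59.57%), Positives = 484/658 (73.56%), Query Frame = 0",
)
print(f"{hsp['pct_identity']:.2f} {hsp['pct_positives']:.2f}")
```

For the Q9SY69 hit, 392/658 and 484/658 reproduce the 59.57% identity and 73.56% positives printed in the summary.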

BLAST of Tan0002243 vs. ExPASy Swiss-Prot
Match: Q9M3A8 (Pentatricopeptide repeat-containing protein At3g49240, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=EMB1796 PE=1 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.9e-92
Identity = 206/553 (37.25%), Positives = 314/553 (56.78%), Query Frame = 0

Query: 23  ALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHA 82
           ++S +   +HLQ           P   L  R  +F++ EEAAAERRRRKRRLR+EPP+++
Sbjct: 2   SISKAAFLNHLQTLSRSYRHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPVNS 61

Query: 83  LRRDNYPPSQ-----RDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHS 142
             R     SQ     ++PN P+LP+S SALVG RL+LHN +  LIR  DL+ A+   RHS
Sbjct: 62  FNRSQQQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALYTRHS 121

Query: 143 LFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGR 202
           ++SN RPT+FT N ++AA  R  +Y  A+     F NQ+ I PN+++YN +  A+ D  +
Sbjct: 122 VYSNCRPTIFTVNTVLAAQLRQAKYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDVRK 181

Query: 203 VDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYN 262
            ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++  +M  KG   D +VY+
Sbjct: 182 PEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVVYS 241

Query: 263 NLISGFLNLENLEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL 322
            L+ G +   + +   +L+ ELKE+    V DGVV    M  +F +  EKEAME Y+  +
Sbjct: 242 YLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEEAV 301

Query: 323 --DRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC 382
             + + +M     N +LE L ++GK  EA  LFD +   H PP   AVN  TFN+MVN  
Sbjct: 302 GENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVNGY 361

Query: 383 FKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSL 442
              GKF EA+E FR++G   K  P   D   +NN++ + C++ ++ +AE  + E+  K++
Sbjct: 362 CAGGKFEEAMEVFRQMG-DFKCSP---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNV 421

Query: 443 SPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIKNGKAVDCAQ 502
            PD  T+  L++   K  +ID+    +  MV+  LR   +  NR+  +LIK GK  D   
Sbjct: 422 KPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAKS 481

Query: 503 ILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYG-IGLTPTLQEFVKEVF 562
               M  K  K D   Y  ++R L   G LD  L+++D+++    + ++  LQEFVKE  
Sbjct: 482 FFDMMVSK-LKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEEL 541

Query: 563 VKAGRNEEIERLL 566
            K GR  ++E+L+
Sbjct: 542 RKGGREGDLEKLM 548

BLAST of Tan0002243 vs. ExPASy Swiss-Prot
Match: Q9LEX6 (Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g60960 PE=2 SV=2)

HSP 1 Score: 231.9 bits (590), Expect = 2.2e-59
Identity = 149/402 (37.06%), Positives = 223/402 (55.47%), Query Frame = 0

Query: 90  PSQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSL---FSNTRP 149
           P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R ++   F   R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF YFFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +VY+ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK++EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSK----- 449
             FSE    F +           +    Y  +I   CEHG ++DAE  F+E+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ V ++DD ++  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Tan0002243 vs. ExPASy Swiss-Prot
Match: Q9LEX5 (Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g60980 PE=2 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 2.3e-56
Identity = 143/380 (37.63%), Positives = 218/380 (57.37%), Query Frame = 0

Query: 117 RVQSLIR-AGDLDAASAVARHSLFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQYFF 176
           RV  LIR  GDLD A+  AR ++F++  +  T   C +II  M R KR  DA  L+++FF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 ARIGEAVDLLR-EMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLV----- 296
            R+ +A   LR   +N+    D + YNNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK+ EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAIETFRKVGTQP 416
           LK+G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSLS-PDVPTHRTLIEAYLKVEQ 473
           K+     D      II RFCE+ ++++AES F +  +      DV T++T+I+AY+K  +
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Tan0002243 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 7.0e-34
Identity = 104/431 (24.13%), Positives = 191/431 (44.32%), Query Frame = 0

Query: 125 GDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVS 184
           G +  A A+    +    RP + T + +I  +    R S+A+ L      +    P+ V+
Sbjct: 154 GRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMV-EYGFQPDEVT 213

Query: 185 YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREM 244
           Y  ++N  C  G   + L+++R  +       S V Y  +   L       +A+ L  EM
Sbjct: 214 YGPVLNRLCKSGNSALALDLFRK-MEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEM 273

Query: 245 LNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKE 304
             KG  AD + Y++LI G  N    +   ++  E+  R ++ D V  +  +D F   GK 
Sbjct: 274 EMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKL 333

Query: 305 KEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSD 364
            EA E Y  ++ R       T N L++   K     EA  +FD M+     P+       
Sbjct: 334 LEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIV----- 393

Query: 365 TFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESF 424
           T++I++N   K  +  + +  FR++     S+    +   YN ++  FC+ G +  A+  
Sbjct: 394 TYSILINSYCKAKRVDDGMRLFREI----SSKGLIPNTITYNTLVLGFCQSGKLNAAKEL 453

Query: 425 FSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIK 484
           F E+ S+ + P V T+  L++      +++  L +F +M    + +     N +   +  
Sbjct: 454 FQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCN 513

Query: 485 NGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGLTPTL 544
             K  D   +   + +K  KPD   Y+V+I GLC +G+L +  ++L + M+   G TP  
Sbjct: 514 ASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSL-SEADMLFRKMKED-GCTP-- 569

Query: 545 QEFVKEVFVKA 556
            +F   + ++A
Sbjct: 574 DDFTYNILIRA 569

BLAST of Tan0002243 vs. NCBI nr
Match: XP_022984748.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1225.3 bits (3169), Expect = 0.0e+00
Identity = 624/708 (88.14%), Positives = 642/708 (90.68%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH+++LSI PLNHHL +PIPPSSQSSSPISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVI+GLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TP LQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR PP
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYD--------------------------------APSA 660
           QG P MAE HWRPSINPQARGSY                                  PS+
Sbjct: 601 QGHPPMAEPHWRPSINPQARGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 PQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEEHHHSQQPSQMAGQAAA 677
           PQMT PN+FQSGSAQMTR QQP  D P PMEE HHSQQP Q+AGQ  A
Sbjct: 661 PQMTGPNYFQSGSAQMTRPQQPPFD-PSPMEEQHHSQQPPQIAGQTVA 707

BLAST of Tan0002243 vs. NCBI nr
Match: XP_023552663.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1220.7 bits (3157), Expect = 0.0e+00
Identity = 629/741 (84.89%), Positives = 645/741 (87.04%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH++ALSI PLNHHL +PIPPSSQSSSPISLLH RSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPIPPSSQSSSPISLLHVRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TPTLQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR PP
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYD------------------------------------ 660
           QG P MAE HWRPSINPQA GSY                                     
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSS 660

Query: 661 -----------------------------APSAPQMTVPNHFQSGSAQMTRTQQPSSDPP 677
                                         PS+PQMT PN+FQSGSAQMTR QQPS D P
Sbjct: 661 PQMTGPQGHTPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPSFD-P 720

BLAST of Tan0002243 vs. NCBI nr
Match: XP_022984746.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita maxima] >XP_022984747.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1208.4 bits (3125), Expect = 0.0e+00
Identity = 624/741 (84.21%), Positives = 642/741 (86.64%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH+++LSI PLNHHL +PIPPSSQSSSPISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVI+GLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQP- 600
           TP LQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR P 
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 --------------------------------PQGTPQMAESHWRPSINPQARGSYD--- 660
                                           PQG P MAE HWRPSINPQARGSY    
Sbjct: 601 QGHPPMAEPHWQPSINPQAGGSCAPSSPQMTGPQGHPPMAEPHWRPSINPQARGSYAPSS 660

Query: 661 -----------------------------APSAPQMTVPNHFQSGSAQMTRTQQPSSDPP 677
                                         PS+PQMT PN+FQSGSAQMTR QQP  D P
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPPFD-P 720

BLAST of Tan0002243 vs. NCBI nr
Match: XP_023552661.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023552662.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1207.6 bits (3123), Expect = 0.0e+00
Identity = 629/774 (81.27%), Positives = 645/774 (83.33%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH++ALSI PLNHHL +PIPPSSQSSSPISLLH RSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPIPPSSQSSSPISLLHVRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TPTLQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR PP
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYD------------------------------------ 660
           QG P MAE HWRPSINPQA GSY                                     
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 ------------------------------------------------------------ 677
                                                                       
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSSPQMTGPQGHTPMAEPHWRPSINPQAGG 720

BLAST of Tan0002243 vs. NCBI nr
Match: XP_022136811.1 (pentatricopeptide repeat-containing protein At1g10270 [Momordica charantia])

HSP 1 Score: 1206.4 bits (3120), Expect = 0.0e+00
Identity = 612/676 (90.53%), Positives = 637/676 (94.23%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MSF RLLL SLRRSS  PSH+RAL+I PLNHHLQAPIPPSSQ+++PISLL ARSFAFSSA
Sbjct: 1   MSFCRLLLRSLRRSSASPSHSRALTIGPLNHHLQAPIPPSSQTAAPISLLPARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTS+LVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPQQRDPNAPRLPDSTSSLVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLI+GFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLIAGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RG+EKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGQEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECF LGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFNLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEKGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE++F+ELCSKSLSPDV THRTLI+AYLKVEQIDDVLRVFNRMV+VGLRVVASFGNRVFG
Sbjct: 421 AETYFAELCSKSLSPDVTTHRTLIDAYLKVEQIDDVLRVFNRMVEVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGKAVDCAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TPTLQEFVKEVFVKAG +EEIERLLNMNRWGHAPYRPPS PPRI QSQVPPQMGPPRQPP
Sbjct: 541 TPTLQEFVKEVFVKAGLSEEIERLLNMNRWGHAPYRPPSRPPRIPQSQVPPQMGPPRQPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYDAPSAPQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEE 660
            GTPQMAE HWRP INPQ  G+Y APS+ Q+T PN+FQSG+  MTR QQP S PPP  E+
Sbjct: 601 HGTPQMAEPHWRPPINPQVGGNY-APSSHQITGPNYFQSGT--MTRPQQPYSHPPPMEEQ 660

Query: 661 HHHSQQPSQMAGQAAA 677
           H    QP QMA Q  A
Sbjct: 661 HQSPSQPPQMARQGVA 673
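
The percentages on each HSP statistics line follow directly from the counts beside them (matching positions divided by alignment length). The snippet below is a minimal sketch, not part of the database output, that parses such a line and recomputes the values. It assumes the line layout used on this page; the helper names `parse_hsp_stats` and `percent` are hypothetical, and the pattern tolerates both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical pattern for an "Identity = a/b (p%), Positives = c/d (q%)" line.
# "Posi?tives" also accepts the misspelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (count, alignment length, reported percent) for identity and positives."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Recompute a percentage from a count and an alignment length."""
    return round(100.0 * count / length, 2)


# Statistics line of the Momordica charantia hit (XP_022136811.1).
line = "Identity = 612/676 (90.53%), Positives = 637/676 (94.23%), Query Frame = 0"
stats = parse_hsp_stats(line)
identity_pct = percent(stats["identity"][0], stats["identity"][1])
positives_pct = percent(stats["positives"][0], stats["positives"][1])
```

Recomputing the percentage this way is a quick consistency check when alignment statistics are copied between tables.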

BLAST of Tan0002243 vs. ExPASy TrEMBL
Match: A0A6J1JBF6 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482933 PE=4 SV=1)

HSP 1 Score: 1225.3 bits (3169), Expect = 0.0e+00
Identity = 624/708 (88.14%), Positives = 642/708 (90.68%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH+++LSI PLNHHL +PIPPSSQSSSPISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVI+GLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TP LQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR PP
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYD--------------------------------APSA 660
           QG P MAE HWRPSINPQARGSY                                  PS+
Sbjct: 601 QGHPPMAEPHWRPSINPQARGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 PQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEEHHHSQQPSQMAGQAAA 677
           PQMT PN+FQSGSAQMTR QQP  D P PMEE HHSQQP Q+AGQ  A
Sbjct: 661 PQMTGPNYFQSGSAQMTRPQQPPFD-PSPMEEQHHSQQPPQIAGQTVA 707

BLAST of Tan0002243 vs. ExPASy TrEMBL
Match: A0A6J1J312 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482933 PE=4 SV=1)

HSP 1 Score: 1208.4 bits (3125), Expect = 0.0e+00
Identity = 624/741 (84.21%), Positives = 642/741 (86.64%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH+++LSI PLNHHL +PIPPSSQSSSPISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVI+GLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQP- 600
           TP LQEFVKE FVKAGR+EEIERLLNMNRWGHAPYRPPSGPPRI QSQVPPQMGPPR P 
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 --------------------------------PQGTPQMAESHWRPSINPQARGSYD--- 660
                                           PQG P MAE HWRPSINPQARGSY    
Sbjct: 601 QGHPPMAEPHWQPSINPQAGGSCAPSSPQMTGPQGHPPMAEPHWRPSINPQARGSYAPSS 660

Query: 661 -----------------------------APSAPQMTVPNHFQSGSAQMTRTQQPSSDPP 677
                                         PS+PQMT PN+FQSGSAQMTR QQP  D P
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPPFD-P 720

BLAST of Tan0002243 vs. ExPASy TrEMBL
Match: A0A6J1C4J8 (pentatricopeptide repeat-containing protein At1g10270 OS=Momordica charantia OX=3673 GN=LOC111008416 PE=4 SV=1)

HSP 1 Score: 1206.4 bits (3120), Expect = 0.0e+00
Identity = 612/676 (90.53%), Positives = 637/676 (94.23%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MSF RLLL SLRRSS  PSH+RAL+I PLNHHLQAPIPPSSQ+++PISLL ARSFAFSSA
Sbjct: 1   MSFCRLLLRSLRRSSASPSHSRALTIGPLNHHLQAPIPPSSQTAAPISLLPARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTS+LVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPQQRDPNAPRLPDSTSSLVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLI+GFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLIAGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RG+EKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGQEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECF LGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFNLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEKGMMAD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE++F+ELCSKSLSPDV THRTLI+AYLKVEQIDDVLRVFNRMV+VGLRVVASFGNRVFG
Sbjct: 421 AETYFAELCSKSLSPDVTTHRTLIDAYLKVEQIDDVLRVFNRMVEVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGKAVDCAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TPTLQEFVKEVFVKAG +EEIERLLNMNRWGHAPYRPPS PPRI QSQVPPQMGPPRQPP
Sbjct: 541 TPTLQEFVKEVFVKAGLSEEIERLLNMNRWGHAPYRPPSRPPRIPQSQVPPQMGPPRQPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYDAPSAPQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEE 660
            GTPQMAE HWRP INPQ  G+Y APS+ Q+T PN+FQSG+  MTR QQP S PPP  E+
Sbjct: 601 HGTPQMAEPHWRPPINPQVGGNY-APSSHQITGPNYFQSGT--MTRPQQPYSHPPPMEEQ 660

Query: 661 HHHSQQPSQMAGQAAA 677
           H    QP QMA Q  A
Sbjct: 661 HQSPSQPPQMARQGVA 673

BLAST of Tan0002243 vs. ExPASy TrEMBL
Match: A0A6J1EMZ6 (pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata OX=3662 GN=LOC111436060 PE=4 SV=1)

HSP 1 Score: 1203.0 bits (3111), Expect = 0.0e+00
Identity = 623/741 (84.08%), Positives = 639/741 (86.23%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL S RRSST PSH++ALSI PLNHHL +P PPSSQ SSPISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPFPPSSQ-SSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCE G+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMGD 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDD LRVFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGK VDCAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS ELLDQ+MRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TPTLQEFVKE FVKAGR+EEIERLLNMNRWGHAPYR PSGPPRI QSQVPPQMGPP  PP
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRSPSGPPRISQSQVPPQMGPPHPPP 600

Query: 601 QGTPQMAESHWRPSINPQARGSYD------------------------------------ 660
           QG P MAE HWRPSINPQA GSY                                     
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSS 660

Query: 661 -----------------------------APSAPQMTVPNHFQSGSAQMTRTQQPSSDPP 677
                                         PS+PQMT P +FQSGSAQMTR  QP  D P
Sbjct: 661 PQMTGPQGHTPMAEPHWRPSINPQARGSYGPSSPQMTGPKYFQSGSAQMTRPHQPPFD-P 720

BLAST of Tan0002243 vs. ExPASy TrEMBL
Match: A0A6J1FVY7 (pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata OX=3662 GN=LOC111447803 PE=4 SV=1)

HSP 1 Score: 1157.9 bits (2994), Expect = 0.0e+00
Identity = 597/676 (88.31%), Positives = 613/676 (90.68%), Query Frame = 0

Query: 1   MSFYRLLLCSLRRSSTFPSHTRALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSA 60
           MS YRLLL SLRR+ST PSH+R LSI PL+ HLQ PI PSSQSSS ISLLHARSFAFSSA
Sbjct: 1   MSLYRLLLRSLRRTSTSPSHSRPLSIGPLSQHLQTPILPSSQSSSLISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPP QRDPNAPRLPDSTSALVGPRL+LHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLSLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSLFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVP 180
           LIRAGDLDAASAVARHS+FSNTRPTVFTCNAIIAAMYRAKRY DAIALFQ+FFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS RIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTD 420
           VNSDTFNIMVNECFKLGKFSEA+ETFRKVGTQPKSRPFAMDVAGYNNII RFCEHG+M D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIVRFCEHGMMED 420

Query: 421 AESFFSELCSKSLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFG 480
           AE+FF+ELCSKSLSPDVPTHRTLIEAYLK+EQIDDVL+VFNRMVDVGLRVVASFGNRVFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKLEQIDDVLKVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAVDCAQILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGL 540
           ELIKNGKAV+CAQILTKMGE+DPKPDPTCYDVVIRGLCNEGALDAS  LLDQVMRYGIGL
Sbjct: 481 ELIKNGKAVECAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRMLLDQVMRYGIGL 540

Query: 541 TPTLQEFVKEVFVKAGRNEEIERLLNMNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQPP 600
           TP+LQEFVKEVF KAGRNEEIERLL MNR GHAPYRP SGPPRI QSQV           
Sbjct: 541 TPSLQEFVKEVFEKAGRNEEIERLLMMNRGGHAPYRPTSGPPRISQSQV----------- 600

Query: 601 QGTPQMAESHWRPSINPQARGSYDAPSAPQMTVPNHFQSGSAQMTRTQQPSSDPPPPMEE 660
                           PQ RGSY  PSAPQMT PN+ QSGS QMTR QQPSSDPPP MEE
Sbjct: 601 ----------------PQFRGSY-GPSAPQMTGPNYIQSGSVQMTRPQQPSSDPPPSMEE 648

Query: 661 HHHSQQPSQMAGQAAA 677
                QP QMAGQA A
Sbjct: 661 QQQHSQPPQMAGQAVA 648

BLAST of Tan0002243 vs. TAIR 10
Match: AT1G10270.1 (glutamine-rich protein 23 )

HSP 1 Score: 738.0 bits (1904), Expect = 6.6e-213
Identity = 392/658 (59.57%), Positives = 484/658 (73.56%), Query Frame = 0

Query: 25  SISPL-NHHLQAPIPPSSQSSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHAL 84
           +++P+ N   Q  IP +     P   +  R+ AFSSAEEAAAERRRRKRRLRIEPPLHAL
Sbjct: 57  NLNPIPNDPSQFQIPQNHTPPIPYPPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL 116

Query: 85  RRD-NYPPSQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSLFSNT 144
           RRD + PP +RDPNAPRLPDSTSALVG RLNLHNRVQSLIRA DLDAAS +AR S+FSNT
Sbjct: 117 RRDPSAPPPKRDPNAPRLPDSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNT 176

Query: 145 RPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGL 204
           RPTVFTCNAIIAAMYRAKRYS++I+LFQYFF QSNIVPNVVSYN +INAHCDEG VD  L
Sbjct: 177 RPTVFTCNAIIAAMYRAKRYSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEAL 236

Query: 205 EIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISG 264
           E+YRHI+ANAPF+PS+VTYRHLTKGL+ + RIG+A  LLREML+KG  ADS VYNNLI G
Sbjct: 237 EVYRHILANAPFAPSSVTYRHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRG 296

Query: 265 FLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMI 324
           +L+L + +KA E FDELK +C VYDG+VNATFM+++F +G +KEAMESY+SLLD++F+M 
Sbjct: 297 YLDLGDFDKAVEFFDELKSKCTVYDGIVNATFMEYWFEKGNDKEAMESYRSLLDKKFRMH 356

Query: 325 PATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEA 384
           P T NVLLEV LK GKK EAW LF++MLDNH PPN  +VNSDT  IMVNECFK+G+FSEA
Sbjct: 357 PPTGNVLLEVFLKFGKKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEA 416

Query: 385 IETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSLSPDVPTHRT 444
           I TF+KVG++  S+PF MD  GY NI+ RFCE G++T+AE FF+E  S+SL  D P+HR 
Sbjct: 417 INTFKKVGSKVTSKPFVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRA 476

Query: 445 LIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIKNGKAVDCAQILTKMGEKD 504
           +I+AYLK E+IDD +++ +RMVDV LRVVA FG RVFGELIKNGK  + A++LTKMGE++
Sbjct: 477 MIDAYLKAERIDDAVKMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGERE 536

Query: 505 PKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYGIGLTPTLQEFVKEVFVKAGRNEEIE 564
           PKPDP+ YDVV+RGLC+  ALD + +++ +++R+ +G+T  L+EF+ EVF KAGR EEIE
Sbjct: 537 PKPDPSIYDVVVRGLCDGDALDQAKDIVGEMIRHNVGVTTVLREFIIEVFEKAGRREEIE 596

Query: 565 RLLN-----MNRWGHAPYRPPSGPPRILQSQVPPQMGPPRQP--PQGTPQMAESHWRPSI 624
           ++LN     +   G +   PP  P     +   PQ    R P   QG           + 
Sbjct: 597 KILNSVARPVRNAGQSGNTPPRVPAVFGTTPAAPQQPRDRAPWTSQGVVHSNSGWANGTA 656

Query: 625 NPQARGSYDAPSAP----QMTVPNHFQSGSAQMTRTQQP---SSDPPPPMEEHHHSQQ 667
              A G+Y A +        T  N  Q   +  T  QQP   S   P   ++   SQQ
Sbjct: 657 GQTAGGAYKANNGQNPSWSNTSDNQQQQSWSNQTAGQQPPSWSRQAPGYQQQQSWSQQ 714

BLAST of Tan0002243 vs. TAIR 10
Match: AT3G49240.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 341.7 bits (875), Expect = 1.4e-93
Identity = 206/553 (37.25%), Positives = 314/553 (56.78%), Query Frame = 0

Query: 23  ALSISPLNHHLQAPIPPSSQSSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHA 82
           ++S +   +HLQ           P   L  R  +F++ EEAAAERRRRKRRLR+EPP+++
Sbjct: 2   SISKAAFLNHLQTLSRSYRHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPVNS 61

Query: 83  LRRDNYPPSQ-----RDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHS 142
             R     SQ     ++PN P+LP+S SALVG RL+LHN +  LIR  DL+ A+   RHS
Sbjct: 62  FNRSQQQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALYTRHS 121

Query: 143 LFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGR 202
           ++SN RPT+FT N ++AA  R  +Y  A+     F NQ+ I PN+++YN +  A+ D  +
Sbjct: 122 VYSNCRPTIFTVNTVLAAQLRQAKYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDVRK 181

Query: 203 VDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYN 262
            ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++  +M  KG   D +VY+
Sbjct: 182 PEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVVYS 241

Query: 263 NLISGFLNLENLEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL 322
            L+ G +   + +   +L+ ELKE+    V DGVV    M  +F +  EKEAME Y+  +
Sbjct: 242 YLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEEAV 301

Query: 323 --DRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC 382
             + + +M     N +LE L ++GK  EA  LFD +   H PP   AVN  TFN+MVN  
Sbjct: 302 GENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVNGY 361

Query: 383 FKLGKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSL 442
              GKF EA+E FR++G   K  P   D   +NN++ + C++ ++ +AE  + E+  K++
Sbjct: 362 CAGGKFEEAMEVFRQMG-DFKCSP---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNV 421

Query: 443 SPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVASFGNRVFGELIKNGKAVDCAQ 502
            PD  T+  L++   K  +ID+    +  MV+  LR   +  NR+  +LIK GK  D   
Sbjct: 422 KPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAKS 481

Query: 503 ILTKMGEKDPKPDPTCYDVVIRGLCNEGALDASLELLDQVMRYG-IGLTPTLQEFVKEVF 562
               M  K  K D   Y  ++R L   G LD  L+++D+++    + ++  LQEFVKE  
Sbjct: 482 FFDMMVSK-LKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEEL 541

Query: 563 VKAGRNEEIERLL 566
            K GR  ++E+L+
Sbjct: 542 RKGGREGDLEKLM 548

BLAST of Tan0002243 vs. TAIR 10
Match: AT3G60960.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 231.9 bits (590), Expect = 1.5e-60
Identity = 149/402 (37.06%), Positives = 223/402 (55.47%), Query Frame = 0

Query: 90  PSQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSL---FSNTRP 149
           P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R ++   F   R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF YFFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +VY+ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK++EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAIETFRKVGTQPKSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSK----- 449
             FSE    F +           +    Y  +I   CEHG ++DAE  F+E+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIEAYLKVEQIDDVLRVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ V ++DD ++  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Tan0002243 vs. TAIR 10
Match: AT3G60980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 221.9 bits (564), Expect = 1.6e-57
Identity = 143/380 (37.63%), Positives = 218/380 (57.37%), Query Frame = 0

Query: 117 RVQSLIR-AGDLDAASAVARHSLFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQYFF 176
           RV  LIR  GDLD A+  AR ++F++  +  T   C +II  M R KR  DA  L+++FF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 ARIGEAVDLLR-EMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLV----- 296
            R+ +A   LR   +N+    D + YNNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK+ EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAIETFRKVGTQP 416
           LK+G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSKSLS-PDVPTHRTLIEAYLKVEQ 473
           K+     D      II RFCE+ ++++AES F +  +      DV T++T+I+AY+K  +
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Tan0002243 vs. TAIR 10
Match: AT5G28380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 186.8 bits (473), Expect = 5.7e-47
Identity = 113/322 (35.09%), Positives = 176/322 (54.66%), Query Frame = 0

Query: 162 YSDAIALFQYFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTY 221
           Y +AI+LF YFFN+S  +PN++S N +I AHCD+G VD  LE+YRHI+ +   +P   TY
Sbjct: 88  YDEAISLFDYFFNESQTLPNMLSCNLIIKAHCDQGSVDHALELYRHILLDGSLAPGIETY 147

Query: 222 RHLTKGLIDSARIGEAVDLLREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELK- 281
           R LTK L+ + R+ EA D++R M       D  VY+ LI GFL+     +A+++F+ELK 
Sbjct: 148 RILTKALVGAKRLDEACDVVRSMSR----CDFAVYDILIRGFLDKGKFVRASQIFEELKG 207

Query: 282 -------ERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVL 341
                          + N +FMD++F +GK++EAME + +L   +  +   + N +L+ L
Sbjct: 208 PNSKLPWRNYHKAIAIFNVSFMDYWFKQGKDEEAMEIFATLEHAEL-LNTISGNGVLKCL 267

Query: 342 LKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAIETFRKVGTQP 401
           ++HG+KTEAW LF  M+        +  +S+T  I+++   K G F E    F +V    
Sbjct: 268 VEHGRKTEAWELFLDMI--------EICDSETVGIIMS---KEGFFGEKTIPFERVRR-- 327

Query: 402 KSRPFAMDVAGYNNIIARFCEHGVMTDAESFFSELCSK------SLSPDVPTHRTLIEAY 461
                      Y  +IA  C+ G M +AE  F+++ +          PDV T R +I  Y
Sbjct: 328 ---------TCYTRMIASLCQQGNMLEAEKLFADMFADVDGDDLLAGPDVSTFRAMINGY 382

Query: 462 LKVEQIDDVLRVFNRMVDVGLR 470
           +KV ++DD ++  N+M    LR
Sbjct: 388 VKVGRVDDAIKTLNKMKISNLR 382

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SY69 | 9.3e-212 | 59.57 | Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana OX...
Q9M3A8 | 1.9e-92 | 37.25 | Pentatricopeptide repeat-containing protein At3g49240, mitochondrial OS=Arabidop...
Q9LEX6 | 2.2e-59 | 37.06 | Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidop...
Q9LEX5 | 2.3e-56 | 37.63 | Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidop...
Q6NQ83 | 7.0e-34 | 24.13 | Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...

Match Name | E-value | Identity (%) | Description
XP_022984748.1 | 0.0e+00 | 88.14 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita...
XP_023552663.1 | 0.0e+00 | 84.89 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita...
XP_022984746.1 | 0.0e+00 | 84.21 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita...
XP_023552661.1 | 0.0e+00 | 81.27 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita...
XP_022136811.1 | 0.0e+00 | 90.53 | pentatricopeptide repeat-containing protein At1g10270 [Momordica charantia]

Match Name | E-value | Identity (%) | Description
A0A6J1JBF6 | 0.0e+00 | 88.14 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 OS=Cucurbi...
A0A6J1J312 | 0.0e+00 | 84.21 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 OS=Cucurbi...
A0A6J1C4J8 | 0.0e+00 | 90.53 | pentatricopeptide repeat-containing protein At1g10270 OS=Momordica charantia OX=...
A0A6J1EMZ6 | 0.0e+00 | 84.08 | pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata...
A0A6J1FVY7 | 0.0e+00 | 88.31 | pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata...

Match Name | E-value | Identity (%) | Description
AT1G10270.1 | 6.6e-213 | 59.57 | glutamine-rich protein 23
AT3G49240.1 | 1.4e-93 | 37.25 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G60960.1 | 1.5e-60 | 37.06 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G60980.1 | 1.6e-57 | 37.63 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G28380.1 | 5.7e-47 | 35.09 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 110..305; e-value: 3.3E-33; score: 117.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 306..475; e-value: 3.4E-25; score: 91.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 476..575; e-value: 3.1E-8; score: 35.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 401..446; e-value: 2.0E-8; score: 34.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 325..352; e-value: 0.014; score: 15.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 365..388; e-value: 0.012; score: 15.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 509..538; e-value: 0.16; score: 12.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 219..248; e-value: 1.2; score: 9.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 255..282; e-value: 1.4E-4; score: 21.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 177..200; e-value: 9.7E-7; score: 28.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 183..210; e-value: 4.2E-4; score: 18.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 255..287; e-value: 2.0E-4; score: 19.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 325..357; e-value: 5.2E-5; score: 21.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 439..468; e-value: 4.6E-4; score: 18.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 405..437; e-value: 6.2E-4; score: 17.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 401..435; score: 9.591195
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 436..470; score: 9.941957
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 322..356; score: 9.448698
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 181..216; score: 9.350046
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 217..251; score: 8.867749
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 506..540; score: 9.810421
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 252..282; score: 9.371969
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 662..676
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 572..676
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 603..649
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 575..602
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 68..106
None | No IPR available | PANTHER | PTHR47937:SF2 | PENTATRICOPEPTIDE (PPR) REPEAT-CONTAINING PROTEIN, PF01535'-RELATED | coord: 1..670
None | No IPR available | PANTHER | PTHR47937 | PLASTID TRANSCRIPTIONALLY ACTIVE CHROMOSOME 2-LIKE PROTEIN | coord: 1..670
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 155..388
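
The `coord:` ranges above give start and end residue positions of each hit on the protein. As a rough illustration, the sketch below collects the seven PROSITE PS51375 (PPR) ranges listed in this table, merges adjacent or overlapping ones, and counts the residues they span. It is an assumption-laden example rather than part of the InterPro analysis: the helper names `parse_coords` and `merge_intervals` are hypothetical, and coordinates are treated as 1-based and inclusive.

```python
import re

# Matches entries such as "coord: 401..435".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) pairs from 'coord: a..b' entries."""
    return [(int(a), int(b)) for a, b in COORD_RE.findall(text)]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PROSITE PS51375 hits copied from the table above.
prosite_ppr = """
coord: 401..435
coord: 436..470
coord: 322..356
coord: 181..216
coord: 217..251
coord: 506..540
coord: 252..282
"""

hits = parse_coords(prosite_ppr)
merged = merge_intervals(hits)
# Inclusive coordinates, so each block spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in merged)
```

Merging before counting avoids double-counting residues when hits from the same signature overlap, which can happen with other sources in the table.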

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0002243.1 | Tan0002243.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0005515 | protein binding