Tan0019509 (gene) Snake gourd v1

Overview
Name: Tan0019509
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG09: 71500503 .. 71502874 (-)
RNA-Seq Expression: Tan0019509
Synteny: Tan0019509
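
The location field can be read programmatically. The sketch below parses a location string of the form shown in the overview and computes the span it covers. It assumes the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated on this page); `Location`, `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # linkage group / chromosome name, e.g. "LG09"
    start: int   # leftmost coordinate (assumed 1-based, inclusive)
    end: int     # rightmost coordinate (assumed inclusive)
    strand: str  # "+" or "-"


# Matches strings such as "LG09: 71500503 .. 71502874 (-)".
_LOCATION_RE = re.compile(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Parse 'SEQID: START .. END (STRAND)' into a Location tuple."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Number of bases covered, assuming inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("LG09: 71500503 .. 71502874 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the feature spans 2372 bases on the minus strand of LG09.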
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTGGTTTTGAAGAAATACCAACCACGGCCAAATGCGCGCTCGAGAACATAGCAAAAGAGATTCAGAAGGGCCGCCGCTCTGCCATAGATGACTGTAATCTTCCGTCATTGCTCTATCCTTTCTCTGAATCATCATCGTCGTTTTCTTCTTCATCGCTTCGCCTCCACCGCTTCACTTCCTCACTCTCCCATCGACAGAGAGCCACTGATTTCTCTCATTAAATCATGCACCAACAAATCCCAATTGCTCCAAATTCACGCCCACATCATCCGCACTTCTTCCATTTCAGATCCCATTATTTCCCTTCGGTTCTTGACTCGCGCCGCCTCCGCGCCTTTTCGCGAATTGGGCTATTCTCGACGATTCTTCTCTCAGCTGACGAACCCATTTGTTTCTCATTACAATACTATGTTAAGAGCCTACTCTTTGAGCCGCTCACCTCTGGAGGGATTGTACATGTACAGAGATATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCCGTGAAGTCATGTATAAGAATGCTTTCGTTGTTAAGTGGGGTTCAGATTCACGCGAGAATTTTTAGAAATGGACATCAATCGGATAGTCTTCTGCTCACCACTATGATGGACCTGTATTCTCACTGTGGCAAACTTGAGGATGCTTGCAAATTGTTCGATGAAATTCCTCATACAGACGTTGTTGCTTGGAACGTTTTGATTTCGTGTCTAACTCGAAATAAACGAACTAGGGATGCTTTGGGGTTATTTGAGATCATGCAGAGTCCAACGTATCTCTGTGAACCTGATAAAGTTACTTGTTTGCTTCTCCTCCAAGCATGTGCAGACTTGAATGCATTGGAATTCGGCGAAAGAATTCATGGTTATGTTCAAGAGCACGGTTATAATACAGAGAGCAATTTGTGTAATTCGCTGATATCGATGTATTCACGGTGTGGACGTGTGGATAAGGCGTATGAAGTGTTTGAAAAAATGCCAGAGAAAAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACCGGAGTTCAAGATAGTTCCCAACGTCCATCACTATGGATGTATGGTTGATCTATTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCATCTCATAATGTCAATGGAGGTGAACCCTGATGCGACATTGTGGAGGACCCTTCTTGGAGCTTGCAGAATTCATGGCTACGCAAACCTTGGGGAGCGCATAATTGAACATTTGATTGAACTTAAATCTCAAGAAGCAGGAGATTATGTGCTGTTGCTGAACATTTATTCCTCGGCTGGCAACTGGGACAAAGTAACTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGTACCACAATAGAACTGAATGGAGTGGTGCATGAGTTTGCTGTGGATGATGTTTCGCATCCGATGAAGGACGAGATCTACGAGCAGCTGGATGAAATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCATCTGAATTACACAACTTGAAGGCAGAAGAAAAGGGGTATGCACTTTCTCATCATAGTGAGAAACTGGCCATAGCTTTTGGCGTTCTTGCAACTCCACCAGGAAGAACCATCAGAGTGGCGAATAACCTTCGTACTTGTGTGGATTGTCACAACTTTGCAAAGTATATCTCTAGTGTTTATAACAGAAAAGTAGTTGTTAGGGACCGAAGTCGGTTCCATCATTTCCGAGAGGGTCGGTGTTCCTGCAACGATTATTGGTAGCGACAGATATATTTTGAACACCATTGATTCGCAGAATATACACTGTCCCACTAGGAAAATTTCCCCCCTAGTGT
TCTGTTCAGCTTATACTCATGGCCATGGGCTGTTTCTGAAGATAGGAGGGTTGGTCTACTGTCACAGGGGAGAGTATATTATTACGACAGAAGCTGGACAACAGTTGGATACCAAGATCGTCTGCATTTTGCAAGTTGAAGGGAGTTGGAAAAAATAGCAGCTGCTTCCACAAATGGGCTGTGTTGGAATTTGAGGATGTGAATCAATCGATCAAAGGATTACTTATAAACGGAAAGCAAATAACAAATTTCTTTCTTTTGATTCATCATTGATCCCTGTCCATCCATTGCCATCTTGAATTCCACGAAAATGATACGGTAAAAGCTTCGATTTTCAAAAATAGTTTTCACCACATTTTATAGGCTTCCGAG

mRNA sequence

GTGGTTTTGAAGAAATACCAACCACGGCCAAATGCGCGCTCGAGAACATAGCAAAAGAGATTCAGAAGGGCCGCCGCTCTGCCATAGATGACTGTAATCTTCCGTCATTGCTCTATCCTTTCTCTGAATCATCATCGTCGTTTTCTTCTTCATCGCTTCGCCTCCACCGCTTCACTTCCTCACTCTCCCATCGACAGAGAGCCACTGATTTCTCTCATTAAATCATGCACCAACAAATCCCAATTGCTCCAAATTCACGCCCACATCATCCGCACTTCTTCCATTTCAGATCCCATTATTTCCCTTCGGTTCTTGACTCGCGCCGCCTCCGCGCCTTTTCGCGAATTGGGCTATTCTCGACGATTCTTCTCTCAGCTGACGAACCCATTTGTTTCTCATTACAATACTATGTTAAGAGCCTACTCTTTGAGCCGCTCACCTCTGGAGGGATTGTACATGTACAGAGATATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCCGTGAAGTCATGTATAAGAATGCTTTCGTTGTTAAGTGGGGTTCAGATTCACGCGAGAATTTTTAGAAATGGACATCAATCGGATAGTCTTCTGCTCACCACTATGATGGACCTGTATTCTCACTGTGGCAAACTTGAGGATGCTTGCAAATTGTTCGATGAAATTCCTCATACAGACGTTGTTGCTTGGAACGTTTTGATTTCGTGTCTAACTCGAAATAAACGAACTAGGGATGCTTTGGGGTTATTTGAGATCATGCAGAGTCCAACGTATCTCTGTGAACCTGATAAAGTTACTTGTTTGCTTCTCCTCCAAGCATGTGCAGACTTGAATGCATTGGAATTCGGCGAAAGAATTCATGGTTATGTTCAAGAGCACGGTTATAATACAGAGAGCAATTTGTGTAATTCGCTGATATCGATGTATTCACGGTGTGGACGTGTGGATAAGGCGTATGAAGTGTTTGAAAAAATGCCAGAGAAAAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACCGGAGTTCAAGATAGTTCCCAACGTCCATCACTATGGATGTATGGTTGATCTATTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCATCTCATAATGTCAATGGAGGTGAACCCTGATGCGACATTGTGGAGGACCCTTCTTGGAGCTTGCAGAATTCATGGCTACGCAAACCTTGGGGAGCGCATAATTGAACATTTGATTGAACTTAAATCTCAAGAAGCAGGAGATTATGTGCTGTTGCTGAACATTTATTCCTCGGCTGGCAACTGGGACAAAGTAACTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGTACCACAATAGAACTGAATGGAGTGGTGCATGAGTTTGCTGTGGATGATGTTTCGCATCCGATGAAGGACGAGATCTACGAGCAGCTGGATGAAATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCATCTGAATTACACAACTTGAAGGCAGAAGAAAAGGGGTATGCACTTTCTCATCATAGTGAGAAACTGGCCATAGCTTTTGGCGTTCTTGCAACTCCACCAGGAAGAACCATCAGAGTGGCGAATAACCTTCGTACTTGTGTGGATTGTCACAACTTTGCAAAGTATATCTCTAGTGTTTATAACAGAAAAGTAGTTGTTAGGGACCGAAGTCGGTTCCATCATTTCCGAGAGGGTCGGTGTTCCTGCAACGATTATTGGTAGCGACAGATATATTTTGAACACCATTGATTCGCAGAATATACACTGTCCCACTAGGAAAATTTCCCCCCTAGTGT
TCTGTTCAGCTTATACTCATGGCCATGGGCTGTTTCTGAAGATAGGAGGGTTGGTCTACTGTCACAGGGGAGAGTATATTATTACGACAGAAGCTGGACAACAGTTGGATACCAAGATCGTCTGCATTTTGCAAGTTGAAGGGAGTTGGAAAAAATAGCAGCTGCTTCCACAAATGGGCTGTGTTGGAATTTGAGGATGTGAATCAATCGATCAAAGGATTACTTATAAACGGAAAGCAAATAACAAATTTCTTTCTTTTGATTCATCATTGATCCCTGTCCATCCATTGCCATCTTGAATTCCACGAAAATGATACGGTAAAAGCTTCGATTTTCAAAAATAGTTTTCACCACATTTTATAGGCTTCCGAG

Coding sequence (CDS)

ATGACTGTAATCTTCCGTCATTGCTCTATCCTTTCTCTGAATCATCATCGTCGTTTTCTTCTTCATCGCTTCGCCTCCACCGCTTCACTTCCTCACTCTCCCATCGACAGAGAGCCACTGATTTCTCTCATTAAATCATGCACCAACAAATCCCAATTGCTCCAAATTCACGCCCACATCATCCGCACTTCTTCCATTTCAGATCCCATTATTTCCCTTCGGTTCTTGACTCGCGCCGCCTCCGCGCCTTTTCGCGAATTGGGCTATTCTCGACGATTCTTCTCTCAGCTGACGAACCCATTTGTTTCTCATTACAATACTATGTTAAGAGCCTACTCTTTGAGCCGCTCACCTCTGGAGGGATTGTACATGTACAGAGATATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCCGTGAAGTCATGTATAAGAATGCTTTCGTTGTTAAGTGGGGTTCAGATTCACGCGAGAATTTTTAGAAATGGACATCAATCGGATAGTCTTCTGCTCACCACTATGATGGACCTGTATTCTCACTGTGGCAAACTTGAGGATGCTTGCAAATTGTTCGATGAAATTCCTCATACAGACGTTGTTGCTTGGAACGTTTTGATTTCGTGTCTAACTCGAAATAAACGAACTAGGGATGCTTTGGGGTTATTTGAGATCATGCAGAGTCCAACGTATCTCTGTGAACCTGATAAAGTTACTTGTTTGCTTCTCCTCCAAGCATGTGCAGACTTGAATGCATTGGAATTCGGCGAAAGAATTCATGGTTATGTTCAAGAGCACGGTTATAATACAGAGAGCAATTTGTGTAATTCGCTGATATCGATGTATTCACGGTGTGGACGTGTGGATAAGGCGTATGAAGTGTTTGAAAAAATGCCAGAGAAAAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACCGGAGTTCAAGATAGTTCCCAACGTCCATCACTATGGATGTATGGTTGATCTATTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCATCTCATAATGTCAATGGAGGTGAACCCTGATGCGACATTGTGGAGGACCCTTCTTGGAGCTTGCAGAATTCATGGCTACGCAAACCTTGGGGAGCGCATAATTGAACATTTGATTGAACTTAAATCTCAAGAAGCAGGAGATTATGTGCTGTTGCTGAACATTTATTCCTCGGCTGGCAACTGGGACAAAGTAACTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGTACCACAATAGAACTGAATGGAGTGGTGCATGAGTTTGCTGTGGATGATGTTTCGCATCCGATGAAGGACGAGATCTACGAGCAGCTGGATGAAATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCATCTGAATTACACAACTTGAAGGCAGAAGAAAAGGGGTATGCACTTTCTCATCATAGTGAGAAACTGGCCATAGCTTTTGGCGTTCTTGCAACTCCACCAGGAAGAACCATCAGAGTGGCGAATAACCTTCGTACTTGTGTGGATTGTCACAACTTTGCAAAGTATATCTCTAGTGTTTATAACAGAAAAGTAGTTGTTAGGGACCGAAGTCGGTTCCATCATTTCCGAGAGGGTCGGTGTTCCTGCAACGATTATTGGTAG

Protein sequence

MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPIDREPLISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW
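
The protein sequence follows from the coding sequence by codon-wise translation. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the function below translates a CDS string and stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the CDS listed above.
print(translate("ATGACTGTAATCTTCCGTCATTGC"))  # MTVIFRHC
```

Applied to the full coding sequence, the same function should yield the protein sequence listed above, ending at the terminal TAG codon.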
Homology
BLAST of Tan0019509 vs. ExPASy Swiss-Prot
Match: Q9SN85 (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 1.5e-216
Identity = 373/578 (64.53%), Positives = 454/578 (78.55%), Query Frame = 0

Query: 40  LISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTR-AASAPFRELGYSRRFFSQLT 99
           L+SLI S T K  L QIHA ++RTS I +  +   FL+R A S   R++ YS R FSQ  
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 100 NPFVSHYNTMLRAYSLSRSPLEGLYMYRDMER-QGVRADPLSSSFAVKSCIRMLSLLSGV 159
           NP +SH NTM+RA+SLS++P EG  ++R + R   + A+PLSSSFA+K CI+   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 160 QIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNK 219
           QIH +IF +G  SDSLL+TT+MDLYS C    DACK+FDEIP  D V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 220 RTRDALGLFEIMQSPTYLC-EPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESN 279
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H ++ E+G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 280 LCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKG 339
           L N+L+SMYSRCG +DKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 340 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRP-EFKIVPNVHHYGCMVDLLGRAGMLDQ 399
           I P++ T T +LSACSH GLV EGM FFDRMR  EFKI PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 400 AYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSS 459
           AY LI SME+ PD+T+WRTLLGACR+HG   LGER+I HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 460 AGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQ 519
            G W+KVTELR  MKEK I+T PGC+ IEL G VHEF VDDVSHP K+EIY+ L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 520 LKIAGYEAEISSELHNLKA-EEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVD 579
           LKIAGY AEI+SELHNL++ EEKGYAL +HSEKLAIAFG+L TPPG TIRV  NLRTCVD
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 580 CHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           CHNFAK++S VY+R V+VRDRSRFHHF+ G CSCND+W
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591
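
The percentages in an HSP summary line are simply the matched-column counts divided by the alignment length. The sketch below parses such a line and recomputes them. The regular expression and the names `parse_hsp_stats` and `recomputed_percent` are assumptions made for illustration; the parser accepts any label before the `=` sign, so it does not depend on how the label is spelled.

```python
import re

# Matches fragments such as "Identity = 373/578 (64.53%)".
_STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Map each label to (matching columns, alignment length, reported %)."""
    return {
        label: (int(hits), int(length), float(pct))
        for label, hits, length, pct in _STAT_RE.findall(line)
    }


def recomputed_percent(hits: int, length: int) -> float:
    """Percentage of alignment columns that match, to two decimals."""
    return round(100 * hits / length, 2)


line = "Identity = 373/578 (64.53%), Positives = 454/578 (78.55%), Query Frame = 0"
for label, (hits, length, reported) in parse_hsp_stats(line).items():
    print(label, reported, recomputed_percent(hits, length))
```

For the Q9SN85 hit, both recomputed values agree with the reported 64.53% and 78.55%. The denominator is the alignment length including gap columns (578), not the query length.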

BLAST of Tan0019509 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 463.8 bits (1192), Expect = 3.1e-129
Identity = 240/587 (40.89%), Positives = 356/587 (60.65%), Query Frame = 0

Query: 40  LISLIKSCTNKSQ---------LLQIHAHIIRTS-SISDPIISLRFLTRAASAPF-RELG 99
           L+ +++ C N  Q         L QIHA  IR   SISD  +    +    S P    + 
Sbjct: 11  LLPMVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMS 70

Query: 100 YSRRFFSQLTNPF-VSHYNTMLRAYSLSRSPLEGLYMYRDMERQG-VRADPLSSSFAVKS 159
           Y+ + FS++  P  V  +NT++R Y+   + +    +YR+M   G V  D  +  F +K+
Sbjct: 71  YAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKA 130

Query: 160 CIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAW 219
              M  +  G  IH+ + R+G  S   +  +++ LY++CG +  A K+FD++P  D+VAW
Sbjct: 131 VTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAW 190

Query: 220 NVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYV 279
           N +I+    N +  +AL L+  M S     +PD  T + LL ACA + AL  G+R+H Y+
Sbjct: 191 NSVINGFAENGKPEEALALYTEMNSKG--IKPDGFTIVSLLSACAKIGALTLGKRVHVYM 250

Query: 280 QEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAI 339
            + G     +  N L+ +Y+RCGRV++A  +F++M +KN VSW+++I GL++NG G+EAI
Sbjct: 251 IKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAI 310

Query: 340 EAFWEMQK-KGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVD 399
           E F  M+  +G+ P + TF  +L ACSHCG+V EG  +F RMR E+KI P + H+GCMVD
Sbjct: 311 ELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVD 370

Query: 400 LLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGD 459
           LL RAG + +AY  I SM + P+  +WRTLLGAC +HG ++L E     +++L+   +GD
Sbjct: 371 LLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGD 430

Query: 460 YVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEI 519
           YVLL N+Y+S   W  V ++RK M   G+   PG + +E+   VHEF + D SHP  D I
Sbjct: 431 YVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAI 490

Query: 520 YEQLDEINKQLKIAGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRV 579
           Y +L E+  +L+  GY  +IS+   +++ EEK  A+ +HSEK+AIAF +++TP    I V
Sbjct: 491 YAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITV 550

Query: 580 ANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
             NLR C DCH   K +S VYNR++VVRDRSRFHHF+ G CSC DYW
Sbjct: 551 VKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0019509 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 459.9 bits (1182), Expect = 4.4e-128
Identity = 220/526 (41.83%), Positives = 332/526 (63.12%), Query Frame = 0

Query: 90  SRRFFSQLTNPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIR 149
           +++ F ++    V  +N M+  Y+ + +  E L +++DM +  VR D  +    V +C +
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 150 MLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVL 209
             S+  G Q+H  I  +G  S+  ++  ++DLYS CG+LE AC LF+ +P+ DV++WN L
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 210 ISCLTRNKRTRDALGLF-EIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQE 269
           I   T     ++AL LF E+++S      P+ VT L +L ACA L A++ G  IH Y+ +
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 398

Query: 270 H--GYNTESNLCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAI 329
              G    S+L  SLI MY++CG ++ A++VF  +  K++ SW+AMI G +M+G    + 
Sbjct: 399 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 458

Query: 330 EAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDL 389
           + F  M+K GI+PDD TF  +LSACSH G++D G   F  M  ++K+ P + HYGCM+DL
Sbjct: 459 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDL 518

Query: 390 LGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDY 449
           LG +G+  +A  +I  ME+ PD  +W +LL AC++HG   LGE   E+LI+++ +  G Y
Sbjct: 519 LGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSY 578

Query: 450 VLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIY 509
           VLL NIY+SAG W++V + R  + +KG+   PGC++IE++ VVHEF + D  HP   EIY
Sbjct: 579 VLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIY 638

Query: 510 EQLDEINKQLKIAGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVA 569
             L+E+   L+ AG+  + S  L  ++ E K  AL HHSEKLAIAFG+++T PG  + + 
Sbjct: 639 GMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIV 698

Query: 570 NNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
            NLR C +CH   K IS +Y R+++ RDR+RFHHFR+G CSCNDYW
Sbjct: 699 KNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Tan0019509 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 9.3e-126
Identity = 256/743 (34.45%), Positives = 386/743 (51.95%), Query Frame = 0

Query: 4   IFRHCSILSLNHHRRFLLHRFASTASLPHSPIDREPLISLIKSCTNKSQLLQIHAHIIRT 63
           IF     LSL  H  F      S  + P +  +R   ISLI+ C +  QL Q H H+IRT
Sbjct: 3   IFSTAQPLSLPRHPNF------SNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRT 62

Query: 64  SSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSHYNTMLRAY----------- 123
            + SDP  + +    AA + F  L Y+R+ F ++  P    +NT++RAY           
Sbjct: 63  GTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIW 122

Query: 124 -----------------------------SLS---------------------------- 183
                                        SLS                            
Sbjct: 123 AFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCY 182

Query: 184 ----------------------------------RSPLEGLYMYRDMERQGVRADPLSSS 243
                                              SP + L +++ ME + V+A  ++  
Sbjct: 183 FSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMV 242

Query: 244 FAVKSCIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFD----- 303
             + +C ++ +L  G Q+ + I  N    +  L   M+D+Y+ CG +EDA +LFD     
Sbjct: 243 GVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEK 302

Query: 304 --------------------------EIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQ 363
                                      +P  D+VAWN LIS   +N +  +AL +F  +Q
Sbjct: 303 DNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQ 362

Query: 364 SPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGR 423
               + + +++T +  L ACA + ALE G  IH Y+++HG     ++ ++LI MYS+CG 
Sbjct: 363 LQKNM-KLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGD 422

Query: 424 VDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSA 483
           ++K+ EVF  + +++V  WSAMI GL+M+G G EA++ F++MQ+  ++P+  TFT V  A
Sbjct: 423 LEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCA 482

Query: 484 CSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDAT 543
           CSH GLVDE  + F +M   + IVP   HY C+VD+LGR+G L++A   I +M + P  +
Sbjct: 483 CSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTS 542

Query: 544 LWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMK 603
           +W  LLGAC+IH   NL E     L+EL+ +  G +VLL NIY+  G W+ V+ELRK M+
Sbjct: 543 VWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMR 602

Query: 604 EKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELH 613
             G+   PGC++IE++G++HEF   D +HPM +++Y +L E+ ++LK  GYE EIS  L 
Sbjct: 603 VTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQ 662

BLAST of Tan0019509 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 6.0e-125
Identity = 229/613 (37.36%), Positives = 358/613 (58.40%), Query Frame = 0

Query: 37  REPLISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAA-----SAPFRELGYSR 96
           + P ++L++SC++ S L  IH  ++RT  ISD  ++ R L         + P   LGY+ 
Sbjct: 12  KHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAY 71

Query: 97  RFFSQLTNPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRML 156
             FSQ+ NP +  +N ++R +S    P +    Y  M +  +  D ++  F +K+   M 
Sbjct: 72  GIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEME 131

Query: 157 SLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSH--------------------------- 216
            +L G Q H++I R G Q+D  +  +++ +Y++                           
Sbjct: 132 CVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVA 191

Query: 217 ----CGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDK 276
               CG +E+A ++FDE+PH ++  W+++I+   +N     A+ LFE M+    +   ++
Sbjct: 192 GYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVA--NE 251

Query: 277 VTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEK 336
              + ++ +CA L ALEFGER + YV +        L  +L+ M+ RCG ++KA  VFE 
Sbjct: 252 TVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEG 311

Query: 337 MPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEG 396
           +PE + +SWS++I GL+++GH  +A+  F +M   G  P D TFTAVLSACSH GLV++G
Sbjct: 312 LPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKG 371

Query: 397 MAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACR 456
           +  ++ M+ +  I P + HYGC+VD+LGRAG L +A + I+ M V P+A +   LLGAC+
Sbjct: 372 LEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACK 431

Query: 457 IHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGC 516
           I+    + ER+   LI++K + +G YVLL NIY+ AG WDK+  LR  MKEK +   PG 
Sbjct: 432 IYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGW 491

Query: 517 TTIELNGVVHEFAV-DDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELHNLKAEEKGY 576
           + IE++G +++F + DD  HP   +I  + +EI  ++++ GY+        ++  EEK  
Sbjct: 492 SLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKES 551

Query: 577 ALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFH 613
           ++  HSEKLAIA+G++ T PG TIR+  NLR C DCH   K IS VY R+++VRDR+RFH
Sbjct: 552 SIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFH 611

BLAST of Tan0019509 vs. NCBI nr
Match: KAG6589508.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7023195.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1151.7 bits (2978), Expect = 0.0e+00
Identity = 561/628 (89.33%), Positives = 585/628 (93.15%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L RFASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPRFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMER+GV ADPLSSSFAVKSCIRMLSL SGVQIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKLEDACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH ++Q+HGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IYEQLDEINKQLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEQLDEINKQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS+HSEKLAIAFG+LATPPGRTIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSYHSEKLAIAFGILATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRSRFHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. NCBI nr
Match: XP_023515406.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 559/628 (89.01%), Positives = 584/628 (92.99%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L RFASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPRFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMERQGV ADPLSSSFAVKSCIRMLSL SG+QIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFAVKSCIRMLSLFSGIQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKLEDACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH Y+Q++ YNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSYIQQNDYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IY+QLDEIN+QLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINQQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS+HSEKLAIAFGVLATPPGRTIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSYHSEKLAIAFGVLATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRSRFHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. NCBI nr
Match: XP_022921651.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata])

HSP 1 Score: 1145.6 bits (2962), Expect = 0.0e+00
Identity = 560/628 (89.17%), Positives = 582/628 (92.68%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L  FASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPHFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMER+GV ADPLSSSFAVKSCIRMLSL SGVQIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKLEDACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH ++Q+HGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNW KVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWVKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IYEQLDEINKQLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEQLDEINKQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS HSEKLAIAFGVLATPPGRTIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRSRFHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. NCBI nr
Match: XP_022987181.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima])

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 554/628 (88.22%), Positives = 583/628 (92.83%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L RFASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSAYRHPHSLRLPRFASTASLLHSPISLLSSKFRQQNSTLHFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMERQGV ADPLSSSFA+KSCIRMLSL SG+QIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFALKSCIRMLSLFSGIQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKL+DACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLKDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH ++Q+HGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGER+IEHL+ELKSQEAGDYVLLLNIYSSAGNWDKVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERVIEHLVELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IY+QLDEIN+QLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINQQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS HSEKLAIAFGVLATPPG TIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGGTIRVANNLRTCLDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRS+FHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSQFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. NCBI nr
Match: XP_038879600.1 (pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879601.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879603.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879604.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879605.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879606.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >XP_038879607.1 pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 553/622 (88.91%), Positives = 578/622 (92.93%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTA----------SLPHSPIDREPLISLIKSCTNK 60
           MT IFR  SILS  HH     HRFAST+          S PH  ++RE LISLIKSCT+K
Sbjct: 1   MTGIFRRHSILSPKHHH----HRFASTSLLSSKFRQQNSTPH--LERELLISLIKSCTHK 60

Query: 61  SQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSHYNTMLR 120
           SQLLQIHAHIIRTSSI DPI+SLRFLTR ASAPFR+LGYSRRFFS LTNPFVSHYN MLR
Sbjct: 61  SQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLR 120

Query: 121 AYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIFRNGHQS 180
           AYSLS SPL+GLYMYRDMERQGVR DPLSSSFAVKSCIRMLSL SGVQIHA+IFRNGHQS
Sbjct: 121 AYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQS 180

Query: 181 DSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQ 240
           DSLLLT+MMDLYSHCGKLEDACKLFDEIP  DV+AWNVLISCLTRNKRTRDALGLF+IMQ
Sbjct: 181 DSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQ 240

Query: 241 SPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGR 300
           SPTYLCEPDKVTCLLLLQACADLNALEFGERIH YVQ+HGYNTESNLCNSLISMYS CGR
Sbjct: 241 SPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGR 300

Query: 301 VDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSA 360
           VDKAYEVF+KMPEK+VVSWSAMISGLSMNG GREAIEAFWEMQKKGIEPDDHTFTAVLSA
Sbjct: 301 VDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSA 360

Query: 361 CSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDAT 420
           CSHCGLVDEGMAFFDRMR EF+IVPNV+HYGCMVDLLGR GMLDQAY LIMSMEVNPDAT
Sbjct: 361 CSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDAT 420

Query: 421 LWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMK 480
           LWRTLLGACRIHG+ANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKV+ELRKFMK
Sbjct: 421 LWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMK 480

Query: 481 EKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELH 540
           EKGIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IYE+LDEINKQLKIAGYEAEISSELH
Sbjct: 481 EKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELH 540

Query: 541 NLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISSVYNRKV 600
            LKAE+KGYALS+HSEKLAIAFGVLATPPGRTIRVAN +  C+DCHNFAKYISSVYNRKV
Sbjct: 541 RLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKV 600

Query: 601 VVRDRSRFHHFREGRCSCNDYW 613
           VVRDRSR HHFREGRCSCNDYW
Sbjct: 601 VVRDRSRSHHFREGRCSCNDYW 616
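
The percentages in each HSP summary line are the identical (or positive) residue counts divided by the alignment length, e.g. 553/622 = 88.91%. As a minimal sketch, assuming the summary-line layout shown on this page, the following Python snippet parses such a line and recomputes both percentages. The function name `parse_hsp_summary` and the returned dictionary keys are illustrative and not part of any BLAST tool; the pattern accepts any spelling of the positives label that starts with "Pos".

```python
import re

# Pattern for summary lines such as:
#   "Identity = 553/622 (88.91%), Pos... = 578/622 (92.93%), Query Frame = 0"
# The label of the second field is matched loosely (any word starting with "Pos").
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return match counts and recomputed percentages for one HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, positives, pos_length = (int(g) for g in m.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 553/622 (88.91%), Positives = 578/622 (92.93%), Query Frame = 0"
    print(parse_hsp_summary(example))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when the lines are scraped from a page like this one.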

BLAST of Tan0019509 vs. ExPASy TrEMBL
Match: A0A6J1E1Z3 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita moschata OX=3662 GN=LOC111429841 PE=3 SV=1)

HSP 1 Score: 1145.6 bits (2962), Expect = 0.0e+00
Identity = 560/628 (89.17%), Positives = 582/628 (92.68%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L  FASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPHFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMER+GV ADPLSSSFAVKSCIRMLSL SGVQIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKLEDACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH ++Q+HGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNW KVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWVKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IYEQLDEINKQLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEQLDEINKQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS HSEKLAIAFGVLATPPGRTIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRSRFHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. ExPASy TrEMBL
Match: A0A6J1JDE8 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita maxima OX=3661 GN=LOC111484807 PE=3 SV=1)

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 554/628 (88.22%), Positives = 583/628 (92.83%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPI----------------DREPLISLI 60
           MTVIFR C   +  H     L RFASTASL HSPI                DREPLISLI
Sbjct: 1   MTVIFRRCRCSAYRHPHSLRLPRFASTASLLHSPISLLSSKFRQQNSTLHFDREPLISLI 60

Query: 61  KSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSH 120
           KSCT+KSQLLQIHAH+IRTS I DPI+SLRFLTR  SAPFRELGYSRRFFSQLTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIF 180
           YNT+LRAYSLSRSPLEGLYMYRDMERQGV ADPLSSSFA+KSCIRMLSL SG+QIHARIF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFALKSCIRMLSLFSGIQIHARIF 180

Query: 181 RNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLT+MMDLYSHCGKL+DACKLFDEIP  DVVAWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLKDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISM 300
           LFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH ++Q+HGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTF 360
           YSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMR EF IVP VHHYGCMVDLLGRAGMLDQAY L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480
           VNPDAT+WRTLLGACRIHG+ANLGER+IEHL+ELKSQEAGDYVLLLNIYSSAGNWDKVTE
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERVIEHLVELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDD+SHPMKD+IY+QLDEIN+QLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINQQLKIAGYEAE 540

Query: 541 ISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISS 600
           ISSELHNLKAE+KGYALS HSEKLAIAFGVLATPPG TIRVANNLRTC+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGGTIRVANNLRTCLDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           VYNRKVVVRDRS+FHHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSQFHHFREGRCSCNDYW 628

BLAST of Tan0019509 vs. ExPASy TrEMBL
Match: A0A6J1C487 (pentatricopeptide repeat-containing protein At3g47530 OS=Momordica charantia OX=3673 GN=LOC111007242 PE=3 SV=1)

HSP 1 Score: 1112.1 bits (2875), Expect = 0.0e+00
Identity = 551/634 (86.91%), Positives = 574/634 (90.54%), Query Frame = 0

Query: 1   MTVIFRHCSI-LSLNHHRRFLLHRFASTASLPHS---------------------PIDRE 60
           M V+FR   +  SLN      L RFASTASL HS                     PI+RE
Sbjct: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60

Query: 61  PLISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLT 120
           PLISLIKSCT+K QLLQIHAHIIRTSSI DPII+LRFLTRAA+APFREL YSRRFFSQLT
Sbjct: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120

Query: 121 NPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQ 180
           NP VSHYN MLRAYSLSRSP +GLY+YRDMERQG+RADPLSSSFA+KSCIR+ SLLSGVQ
Sbjct: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180

Query: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKR 240
           IHARIFRNGHQSDSLLLTTMMDLYSHCGKLE+ACKLFDEIP  DVVAWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLC 300
           TRDALGLFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH Y+QE GY+TESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300

Query: 301 NSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIE 360
           NSLISMYSRCGRVDKAYEVF+KMPEKNVVSWSA+ISGLSMNGHGREAIEAFWEMQK GIE
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360

Query: 361 PDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYH 420
           PDD TFT VLSACSHCGLVDEGMAFFDRMR EFKI PNVHHYGCMVDLLGRAGMLDQAY 
Sbjct: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQ 420

Query: 421 LIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGN 480
           L MSME+NPDATLWRTLLGAC+IHG+ NLGE II HLIE KSQEAGDYVLLLNIYSSAGN
Sbjct: 421 LAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGN 480

Query: 481 WDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKI 540
           WDKVTELRKFMKE GIYTTP CTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKI
Sbjct: 481 WDKVTELRKFMKENGIYTTPSCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKI 540

Query: 541 AGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNF 600
           AGYE+EISSELHNLKAEEKGYALS HSEKLAIAFGVLATPPGRTIRVANN+RTCVDCHNF
Sbjct: 541 AGYESEISSELHNLKAEEKGYALSCHSEKLAIAFGVLATPPGRTIRVANNIRTCVDCHNF 600

Query: 601 AKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           AKY+SSVYNRKVVVRDRSRFHHFREGRCSCNDYW
Sbjct: 601 AKYVSSVYNRKVVVRDRSRFHHFREGRCSCNDYW 633

BLAST of Tan0019509 vs. ExPASy TrEMBL
Match: A0A0A0LUH9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=3 SV=1)

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 530/612 (86.60%), Positives = 567/612 (92.65%), Query Frame = 0

Query: 1   MTVIFRHCSILSLNHHRRFLLHRFASTASLPHSPIDREPLISLIKSCTNKSQLLQIHAHI 60
           M VIFR  SILSL +H            S+  S  +REPLISLIKSCT+KSQLLQIHAHI
Sbjct: 1   MCVIFRSPSILSLKYHHH----------SISFSHFEREPLISLIKSCTHKSQLLQIHAHI 60

Query: 61  IRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSHYNTMLRAYSLSRSPLE 120
           I TSSI DPI+SLRFLTR ASAPFR+LGYSRR F  LTNPFVSHYN MLRAYSLSRSPLE
Sbjct: 61  ITTSSIQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLE 120

Query: 121 GLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMD 180
           GLYMYRDMERQGVRADPLSSSFAVKSCI++LSLL G+QIHARIF NGHQ+DSLLLT+MMD
Sbjct: 121 GLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMD 180

Query: 181 LYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDK 240
           LYSHCGK E+ACKLFDE+P  DVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC+PDK
Sbjct: 181 LYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDK 240

Query: 241 VTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEK 300
           VTCLLLLQACADLNALEFGERIHGY+Q+HGYNTESNLCNSLISMYSRCGR+DKAYEVF+K
Sbjct: 241 VTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDK 300

Query: 301 MPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEG 360
           M EKNVVSWSAMISGLSMNGHGREAIEAFWEMQK G+EP DHTFTAVLSACSHCGLVDEG
Sbjct: 301 MTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEG 360

Query: 361 MAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACR 420
           MAFFDRMR EF I PNVHHYGC+VDLLGRAGMLDQAY LIMSMEV PDAT+WRTLLGACR
Sbjct: 361 MAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACR 420

Query: 421 IHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGC 480
           IHG+ NLGERI+EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRK MKEKGIYTTP C
Sbjct: 421 IHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCC 480

Query: 481 TTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELHNLKAEEKGYA 540
           TTIELNGVVH+FAVDD+SHPMKD+IY+QLDEINKQLKIAGYEAE+SSELH L+ ++KGYA
Sbjct: 481 TTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYA 540

Query: 541 LSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHH 600
           LS+HSEKLAIAFGVLATPPGRTIR+ANN+RTC+DCHNFAKYISSVYNRKVVVRDRSRFHH
Sbjct: 541 LSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHH 600

Query: 601 FREGRCSCNDYW 613
           F+EGRCSCND+W
Sbjct: 601 FQEGRCSCNDFW 602

BLAST of Tan0019509 vs. ExPASy TrEMBL
Match: A0A1S3BV40 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN=LOC103493993 PE=3 SV=1)

HSP 1 Score: 1077.8 bits (2786), Expect = 0.0e+00
Identity = 515/573 (89.88%), Positives = 547/573 (95.46%), Query Frame = 0

Query: 40  LISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTN 99
           LISLIKSCT+KSQLLQIHAHIIRTSSI DPI+SLRFLTR ASAPFR+LGYSRRF   LTN
Sbjct: 13  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFLDLLTN 72

Query: 100 PFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRMLSLLSGVQI 159
           P VSHYN MLRAYS+SRSPLEGLY+YRDMERQGVRADPLSSSFAVKSCI++LSLL G+QI
Sbjct: 73  PLVSHYNAMLRAYSVSRSPLEGLYVYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQI 132

Query: 160 HARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRT 219
           HARIF  GHQ+DSLLLT+MMDLYSHCGK E+ACKLFDE+P  DVVAWNVLISCLTRNKRT
Sbjct: 133 HARIFIYGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRT 192

Query: 220 RDALGLFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCN 279
           RDALGLFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIHGY+Q+H YNTESNLCN
Sbjct: 193 RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHCYNTESNLCN 252

Query: 280 SLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEP 339
           SLISMYSRCGRVDKAYEVF+KMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQK G+EP
Sbjct: 253 SLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEP 312

Query: 340 DDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHL 399
           DDHTFTAVLSACSHCGLVDEGMAFFDRMR E  I PNVHHYGC+VDLLGRAGMLDQAY L
Sbjct: 313 DDHTFTAVLSACSHCGLVDEGMAFFDRMRQELMIAPNVHHYGCIVDLLGRAGMLDQAYEL 372

Query: 400 IMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNW 459
           IMSMEV PDAT+WRTLLGACRIHG+ANLGERI+EHLIELKSQEAGDYVLLLNIYSSAG W
Sbjct: 373 IMSMEVRPDATMWRTLLGACRIHGHANLGERIVEHLIELKSQEAGDYVLLLNIYSSAGKW 432

Query: 460 DKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIA 519
           DKVTELRK MKEKGIYTTP CTTIELNGVVHEFAVDD+SHPMKD+IY+QLDEINKQLKIA
Sbjct: 433 DKVTELRKLMKEKGIYTTPCCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINKQLKIA 492

Query: 520 GYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFA 579
           GYEAE+SSELH LK E+KGYALS+HSEKLAIAFGVLATPPGRTIRVANN+RTC+DCHNFA
Sbjct: 493 GYEAEMSSELHRLKPEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANNIRTCMDCHNFA 552

Query: 580 KYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           KYISSVYNRKVV+RDRSRFHHF+EGRCSCND+W
Sbjct: 553 KYISSVYNRKVVLRDRSRFHHFQEGRCSCNDFW 585

BLAST of Tan0019509 vs. TAIR 10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 753.8 bits (1945), Expect = 1.1e-217
Identity = 373/578 (64.53%), Positives = 454/578 (78.55%), Query Frame = 0

Query: 40  LISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTR-AASAPFRELGYSRRFFSQLT 99
           L+SLI S T K  L QIHA ++RTS I +  +   FL+R A S   R++ YS R FSQ  
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 100 NPFVSHYNTMLRAYSLSRSPLEGLYMYRDMER-QGVRADPLSSSFAVKSCIRMLSLLSGV 159
           NP +SH NTM+RA+SLS++P EG  ++R + R   + A+PLSSSFA+K CI+   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 160 QIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVLISCLTRNK 219
           QIH +IF +G  SDSLL+TT+MDLYS C    DACK+FDEIP  D V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 220 RTRDALGLFEIMQSPTYLC-EPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESN 279
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H ++ E+G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 280 LCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKG 339
           L N+L+SMYSRCG +DKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 340 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRP-EFKIVPNVHHYGCMVDLLGRAGMLDQ 399
           I P++ T T +LSACSH GLV EGM FFDRMR  EFKI PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 400 AYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSS 459
           AY LI SME+ PD+T+WRTLLGACR+HG   LGER+I HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 460 AGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQ 519
            G W+KVTELR  MKEK I+T PGC+ IEL G VHEF VDDVSHP K+EIY+ L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 520 LKIAGYEAEISSELHNLKA-EEKGYALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVD 579
           LKIAGY AEI+SELHNL++ EEKGYAL +HSEKLAIAFG+L TPPG TIRV  NLRTCVD
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 580 CHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
           CHNFAK++S VY+R V+VRDRSRFHHF+ G CSCND+W
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Tan0019509 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 463.8 bits (1192), Expect = 2.2e-130
Identity = 240/587 (40.89%), Positives = 356/587 (60.65%), Query Frame = 0

Query: 40  LISLIKSCTNKSQ---------LLQIHAHIIRTS-SISDPIISLRFLTRAASAPF-RELG 99
           L+ +++ C N  Q         L QIHA  IR   SISD  +    +    S P    + 
Sbjct: 11  LLPMVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMS 70

Query: 100 YSRRFFSQLTNPF-VSHYNTMLRAYSLSRSPLEGLYMYRDMERQG-VRADPLSSSFAVKS 159
           Y+ + FS++  P  V  +NT++R Y+   + +    +YR+M   G V  D  +  F +K+
Sbjct: 71  YAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKA 130

Query: 160 CIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAW 219
              M  +  G  IH+ + R+G  S   +  +++ LY++CG +  A K+FD++P  D+VAW
Sbjct: 131 VTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAW 190

Query: 220 NVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYV 279
           N +I+    N +  +AL L+  M S     +PD  T + LL ACA + AL  G+R+H Y+
Sbjct: 191 NSVINGFAENGKPEEALALYTEMNSKG--IKPDGFTIVSLLSACAKIGALTLGKRVHVYM 250

Query: 280 QEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAI 339
            + G     +  N L+ +Y+RCGRV++A  +F++M +KN VSW+++I GL++NG G+EAI
Sbjct: 251 IKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAI 310

Query: 340 EAFWEMQK-KGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVD 399
           E F  M+  +G+ P + TF  +L ACSHCG+V EG  +F RMR E+KI P + H+GCMVD
Sbjct: 311 ELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVD 370

Query: 400 LLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGD 459
           LL RAG + +AY  I SM + P+  +WRTLLGAC +HG ++L E     +++L+   +GD
Sbjct: 371 LLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGD 430

Query: 460 YVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEI 519
           YVLL N+Y+S   W  V ++RK M   G+   PG + +E+   VHEF + D SHP  D I
Sbjct: 431 YVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAI 490

Query: 520 YEQLDEINKQLKIAGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRV 579
           Y +L E+  +L+  GY  +IS+   +++ EEK  A+ +HSEK+AIAF +++TP    I V
Sbjct: 491 YAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITV 550

Query: 580 ANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
             NLR C DCH   K +S VYNR++VVRDRSRFHHF+ G CSC DYW
Sbjct: 551 VKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0019509 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 459.9 bits (1182), Expect = 3.2e-129
Identity = 220/526 (41.83%), Positives = 332/526 (63.12%), Query Frame = 0

Query: 90  SRRFFSQLTNPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIR 149
           +++ F ++    V  +N M+  Y+ + +  E L +++DM +  VR D  +    V +C +
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 150 MLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFDEIPHTDVVAWNVL 209
             S+  G Q+H  I  +G  S+  ++  ++DLYS CG+LE AC LF+ +P+ DV++WN L
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 210 ISCLTRNKRTRDALGLF-EIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQE 269
           I   T     ++AL LF E+++S      P+ VT L +L ACA L A++ G  IH Y+ +
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 398

Query: 270 H--GYNTESNLCNSLISMYSRCGRVDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAI 329
              G    S+L  SLI MY++CG ++ A++VF  +  K++ SW+AMI G +M+G    + 
Sbjct: 399 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 458

Query: 330 EAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDL 389
           + F  M+K GI+PDD TF  +LSACSH G++D G   F  M  ++K+ P + HYGCM+DL
Sbjct: 459 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDL 518

Query: 390 LGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDY 449
           LG +G+  +A  +I  ME+ PD  +W +LL AC++HG   LGE   E+LI+++ +  G Y
Sbjct: 519 LGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSY 578

Query: 450 VLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIY 509
           VLL NIY+SAG W++V + R  + +KG+   PGC++IE++ VVHEF + D  HP   EIY
Sbjct: 579 VLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIY 638

Query: 510 EQLDEINKQLKIAGYEAEISSELHNLKAEEKGYALSHHSEKLAIAFGVLATPPGRTIRVA 569
             L+E+   L+ AG+  + S  L  ++ E K  AL HHSEKLAIAFG+++T PG  + + 
Sbjct: 639 GMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIV 698

Query: 570 NNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFHHFREGRCSCNDYW 613
            NLR C +CH   K IS +Y R+++ RDR+RFHHFR+G CSCNDYW
Sbjct: 699 KNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Tan0019509 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 452.2 bits (1162), Expect = 6.6e-127
Identity = 256/743 (34.45%), Positives = 386/743 (51.95%), Query Frame = 0

Query: 4   IFRHCSILSLNHHRRFLLHRFASTASLPHSPIDREPLISLIKSCTNKSQLLQIHAHIIRT 63
           IF     LSL  H  F      S  + P +  +R   ISLI+ C +  QL Q H H+IRT
Sbjct: 3   IFSTAQPLSLPRHPNF------SNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRT 62

Query: 64  SSISDPIISLRFLTRAASAPFRELGYSRRFFSQLTNPFVSHYNTMLRAY----------- 123
            + SDP  + +    AA + F  L Y+R+ F ++  P    +NT++RAY           
Sbjct: 63  GTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIW 122

Query: 124 -----------------------------SLS---------------------------- 183
                                        SLS                            
Sbjct: 123 AFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCY 182

Query: 184 ----------------------------------RSPLEGLYMYRDMERQGVRADPLSSS 243
                                              SP + L +++ ME + V+A  ++  
Sbjct: 183 FSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMV 242

Query: 244 FAVKSCIRMLSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEDACKLFD----- 303
             + +C ++ +L  G Q+ + I  N    +  L   M+D+Y+ CG +EDA +LFD     
Sbjct: 243 GVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEK 302

Query: 304 --------------------------EIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQ 363
                                      +P  D+VAWN LIS   +N +  +AL +F  +Q
Sbjct: 303 DNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQ 362

Query: 364 SPTYLCEPDKVTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGR 423
               + + +++T +  L ACA + ALE G  IH Y+++HG     ++ ++LI MYS+CG 
Sbjct: 363 LQKNM-KLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGD 422

Query: 424 VDKAYEVFEKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSA 483
           ++K+ EVF  + +++V  WSAMI GL+M+G G EA++ F++MQ+  ++P+  TFT V  A
Sbjct: 423 LEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCA 482

Query: 484 CSHCGLVDEGMAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDAT 543
           CSH GLVDE  + F +M   + IVP   HY C+VD+LGR+G L++A   I +M + P  +
Sbjct: 483 CSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTS 542

Query: 544 LWRTLLGACRIHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMK 603
           +W  LLGAC+IH   NL E     L+EL+ +  G +VLL NIY+  G W+ V+ELRK M+
Sbjct: 543 VWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMR 602

Query: 604 EKGIYTTPGCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELH 613
             G+   PGC++IE++G++HEF   D +HPM +++Y +L E+ ++LK  GYE EIS  L 
Sbjct: 603 VTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQ 662

BLAST of Tan0019509 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 449.5 bits (1155), Expect = 4.3e-126
Identity = 229/613 (37.36%), Positives = 358/613 (58.40%), Query Frame = 0

Query: 37  REPLISLIKSCTNKSQLLQIHAHIIRTSSISDPIISLRFLTRAA-----SAPFRELGYSR 96
           + P ++L++SC++ S L  IH  ++RT  ISD  ++ R L         + P   LGY+ 
Sbjct: 12  KHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAY 71

Query: 97  RFFSQLTNPFVSHYNTMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIRML 156
             FSQ+ NP +  +N ++R +S    P +    Y  M +  +  D ++  F +K+   M 
Sbjct: 72  GIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEME 131

Query: 157 SLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSH--------------------------- 216
            +L G Q H++I R G Q+D  +  +++ +Y++                           
Sbjct: 132 CVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVA 191

Query: 217 ----CGKLEDACKLFDEIPHTDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCEPDK 276
               CG +E+A ++FDE+PH ++  W+++I+   +N     A+ LFE M+    +   ++
Sbjct: 192 GYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVA--NE 251

Query: 277 VTCLLLLQACADLNALEFGERIHGYVQEHGYNTESNLCNSLISMYSRCGRVDKAYEVFEK 336
              + ++ +CA L ALEFGER + YV +        L  +L+ M+ RCG ++KA  VFE 
Sbjct: 252 TVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEG 311

Query: 337 MPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEG 396
           +PE + +SWS++I GL+++GH  +A+  F +M   G  P D TFTAVLSACSH GLV++G
Sbjct: 312 LPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKG 371

Query: 397 MAFFDRMRPEFKIVPNVHHYGCMVDLLGRAGMLDQAYHLIMSMEVNPDATLWRTLLGACR 456
           +  ++ M+ +  I P + HYGC+VD+LGRAG L +A + I+ M V P+A +   LLGAC+
Sbjct: 372 LEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACK 431

Query: 457 IHGYANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKEKGIYTTPGC 516
           I+    + ER+   LI++K + +G YVLL NIY+ AG WDK+  LR  MKEK +   PG 
Sbjct: 432 IYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGW 491

Query: 517 TTIELNGVVHEFAV-DDVSHPMKDEIYEQLDEINKQLKIAGYEAEISSELHNLKAEEKGY 576
           + IE++G +++F + DD  HP   +I  + +EI  ++++ GY+        ++  EEK  
Sbjct: 492 SLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKES 551

Query: 577 ALSHHSEKLAIAFGVLATPPGRTIRVANNLRTCVDCHNFAKYISSVYNRKVVVRDRSRFH 613
           ++  HSEKLAIA+G++ T PG TIR+  NLR C DCH   K IS VY R+++VRDR+RFH
Sbjct: 552 SIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFH 611

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9SN85      1.5e-216  64.53     Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
A8MQA3      3.1e-129  40.89     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
Q9LN01      4.4e-128  41.83     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380      9.3e-126  34.45     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9FG16      6.0e-125  37.36     Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX...
Match Name      E-value  Identity  Description
KAG6589508.1    0.0e+00  89.33     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023515406.1  0.0e+00  89.01     pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pep...
XP_022921651.1  0.0e+00  89.17     pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata]
XP_022987181.1  0.0e+00  88.22     pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima]
XP_038879600.1  0.0e+00  88.91     pentatricopeptide repeat-containing protein At3g47530-like [Benincasa hispida] >...
Match Name  E-value  Identity  Description
A0A6J1E1Z3  0.0e+00  89.17     pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita moschata OX=3...
A0A6J1JDE8  0.0e+00  88.22     pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita maxima OX=366...
A0A6J1C487  0.0e+00  86.91     pentatricopeptide repeat-containing protein At3g47530 OS=Momordica charantia OX=...
A0A0A0LUH9  0.0e+00  86.60     DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0097...
A0A1S3BV40  0.0e+00  89.88     pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN...
Match Name   E-value   Identity  Description
AT3G47530.1  1.1e-217  64.53     Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1  2.2e-130  40.89     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1  3.2e-129  41.83     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1  6.6e-127  34.45     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G06540.1  4.3e-126  37.36     Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 446..474; e-value: 0.0063; score: 16.7
    coord: 104..133; e-value: 0.0016; score: 18.5
    coord: 176..198; e-value: 8.0E-4; score: 19.5
    coord: 379..404; e-value: 0.21; score: 11.9
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 304..352; e-value: 2.8E-14; score: 53.0
    coord: 202..251; e-value: 9.1E-8; score: 32.2
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 277..306; e-value: 4.5E-7; score: 27.6
    coord: 105..136; e-value: 7.9E-6; score: 23.7
    coord: 204..231; e-value: 9.4E-5; score: 20.3
    coord: 176..200; e-value: 0.0021; score: 16.1
    coord: 307..340; e-value: 9.5E-7; score: 26.6
    coord: 342..377; e-value: 3.8E-5; score: 21.6
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 279..301; e-value: 6.4E-6; score: 25.8
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 274..304; score: 10.413293
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 305..339; score: 12.199985
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 340..370; score: 9.065053
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 202..236; score: 9.262356
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 101..135; score: 8.867749
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 350..511; e-value: 6.0E-20; score: 73.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 254..349; e-value: 3.5E-28; score: 100.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 102..253; e-value: 5.0E-25; score: 90.5
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 478..602; e-value: 2.8E-38; score: 130.6
None  No IPR available  PANTHER  PTHR47928:SF63  OS08G0434000 PROTEIN
    coord: 33..604
None  No IPR available  PANTHER  PTHR47928  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 33..604

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0019509.1  Tan0019509.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding