Tan0003620 (gene) Snake gourd v1

Overview
NameTan0003620
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG11: 39016875 .. 39018954 (-)
RNA-Seq ExpressionTan0003620
SyntenyTan0003620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCTCTTTCGAATCTTTTGCTTCCGATCATCATCATTTCGTCTCAAGATCTCCACCTTATCTACCTTGCACCTAAGTACAGTCTCTTCTGCCGATTTATTCTACGACCATCTGCAGAAAAACAATGGTAATGTGGAGAAAACCCTTGCCACTGTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTGCATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAATTGATTGGAATTAATCGAAACCCATCTTTGCTTTGTAATGTTATTGAAGATTATAGGAGGGAGGGTTGCCTTGTTGATATTAGGATGTTCAAGGTTATTTTAAATTTGTGTAAAGAAGCTAAACTTGCCAAAGAGGCTTTCTCCATATTAGGAAAATGCCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTAATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAAGCGATGGAATTGATGAAGGAGATGGATTCAGTTGATATTCATCCTAATATGATCACTTATATTGCCATGCTCAAGGGATTTTGTGATGTTGGTCGTTTGGAGGATGCTTATGAGTTATTTAAGGCTATGAAGGAAAATGGATGTTCACCCAACACAGTGGCTTACTCAGTGCTACTCAATGGGGCCAGTCGGCATGGGATTATGGAAAAGCTAATGGAATTGTTGGAGGAGATGGAAAAAGAAGGGGGGAATTGTAGTCCAAATGCTGTCACATACACTTCTATAATCCAGGGTCTCTGTGAATTTGGCCAGCCTCTGGAGGCATTGAAGATGTTGGACAGAATGGAAGACTCTGGCTGTGCTCCAAATCGTGTTACAGTTAGCACTTTACTAAAGGAATTTTGTAAAAATGGTCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCAAGAGGTGGGGCTTCATATGGTGATTGCTATAGCTCCCTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAGGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGATGAAGCCAGATGGTGTGGCTTGTAGTGTCATGATCAAGGAATTGTGCTTAGAGGAGCGGGTGCTAGATGGTTTTAACTTATGCAACGAAGTTGATAGGAATGGATATTTATCTTCCATTGACTCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAGGGGGTTCGTTTAAAACCTCATTATGCTGATAGTATCATCAAACATCTGAAGAAATTTGGAGACCAAGAGTTAGTTATGCATTTGGGTGGAATAAGGAAATGACAAGGAAAACCAACCAAGAACTTAATATGGGAATAGAAGTGTTAATTTGCAACATGCTTTTAAACTAAAGCTTTTAGCGAGCTTCAGTTGATTAGAAACAACAAACGTGTTAGATTCAGTCATTCTAGTTATGGATGTCAAAACAGTAGCAACTGCAGGTTGCAGTTTGATGACATGCGACTCAAGAGATTGAATCTGTGACCTTGCTTATGGGAGAAAGAAGATTTTCATGACCTTGTGACCTTAAACGTTTCACTTTATCGATGGTTGCAAAGTTGTGTTAAAAGTCCTTCTGAACCACAAATCCTATCCTACGAGCCACCCTGAAGTACATCATGCTCGTTTCTCAAATTTCAATGTTGTTGAATCAAGTGAAGATACTTTTAAAATTTAAATATAGTGAACTGAAATACATCTTACCAGGGCCAAGCAACATTGACCCACTATTCTGAAGCTTGCTGCTACCCAAGGTCCTGCTGCATATCGTGAATTGTTGGGTAAGAAGTTCTGGATGTCTTAATTTGTAAGCTGGTTTACGTTTTAATTGTTGGGAAGATGCTTTTGAAATTAGTGTGTACACAAGCCTTTGACACACTCATTTGTGGTTCTGGGGTTGGCTGAACTGAGAGTTGTCTAATTGTGAATTCTAGCGG

mRNA sequence

ATGGCTCTCTTTCGAATCTTTTGCTTCCGATCATCATCATTTCGTCTCAAGATCTCCACCTTATCTACCTTGCACCTAAGTACAGTCTCTTCTGCCGATTTATTCTACGACCATCTGCAGAAAAACAATGGTAATGTGGAGAAAACCCTTGCCACTGTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTGCATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAATTGATTGGAATTAATCGAAACCCATCTTTGCTTTGTAATGTTATTGAAGATTATAGGAGGGAGGGTTGCCTTGTTGATATTAGGATGTTCAAGGTTATTTTAAATTTGAAAATGCCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTAATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAAGCGATGGAATTGATGAAGGAGATGGATTCAGTTGATATTCATCCTAATATGATCACTTATATTGCCATGCTCAAGGGATTTTGTGATGTTGGTCGTTTGGAGGATGCTTATGAGTTATTTAAGGCTATGAAGGAAAATGGATGTTCACCCAACACAGTGGCTTACTCAGTGCTACTCAATGGGGCCAGTCGGCATGGGATTATGGAAAAGCTAATGGAATTGTTGGAGGAGATGGAAAAAGAAGGGGGGAATTGTAGTCCAAATGCTGTCACATACACTTCTATAATCCAGGGTCTCTGTGAATTTGGCCAGCCTCTGGAGGCATTGAAGATGTTGGACAGAATGGAAGACTCTGGCTGTGCTCCAAATCGTGTTACAGTTAGCACTTTACTAAAGGAATTTTGTAAAAATGGTCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCAAGAGGTGGGGCTTCATATGGTGATTGCTATAGCTCCCTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAGGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGATGAAGCCAGATGGTGTGGCTTGTAGTGTCATGATCAAGGAATTGTGCTTAGAGGAGCGGGTGCTAGATGGTTTTAACTTATGCAACGAAGTTGATAGGAATGGATATTTATCTTCCATTGACTCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAGGGGGTTCGTTTAAAACCTCATTATGCTGATAGTATCATCAAACATCTGAAGAAATTTGGAGACCAAGAGTTAGTTATGCATTTGGGTGGAATAAGGAAATGACAAGGAAAACCAACCAAGAACTTAATATGGGAATAGAAGTGTTAATTTGCAACATGCTTTTAAACTAAAGCTTTTAGCGAGCTTCAGTTGATTAGAAACAACAAACGTGTTAGATTCAGTCATTCTAGTTATGGATGTCAAAACAGTAGCAACTGCAGGTTGCAGTTTGATGACATGCGACTCAAGAGATTGAATCTGTGACCTTGCTTATGGGAGAAAGAAGATTTTCATGACCTTGTGACCTTAAACGTTTCACTTTATCGATGGTTGCAAAGTTGTGTTAAAAGTCCTTCTGAACCACAAATCCTATCCTACGAGCCACCCTGAAGTACATCATGCTCGTTTCTCAAATTTCAATGTTGTTGAATCAAGTGAAGATACTTTTAAAATTTAAATATAGTGAACTGAAATACATCTTACCAGGGCCAAGCAACATTGACCCACTATTCTGAAGCTTGCTGCTACCCAAGGTCCTGCTGCATATCGTGAATTGTTGGGTAAGAAGTTCTGGATGTCTTAATTTGTAAGCTGGTTTACGTTTTAATTGTTGGGAAGATGCTTTTGAAATTAGTGTGTACACAAGCCTTTGACACACTCATTTGTGGTTCTGGGGTTGGCTGAACTGAGAGTTGTCTAATTGTGAATTCTAGCGG

Coding sequence (CDS)

ATGGCTCTCTTTCGAATCTTTTGCTTCCGATCATCATCATTTCGTCTCAAGATCTCCACCTTATCTACCTTGCACCTAAGTACAGTCTCTTCTGCCGATTTATTCTACGACCATCTGCAGAAAAACAATGGTAATGTGGAGAAAACCCTTGCCACTGTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTGCATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAATTGATTGGAATTAATCGAAACCCATCTTTGCTTTGTAATGTTATTGAAGATTATAGGAGGGAGGGTTGCCTTGTTGATATTAGGATGTTCAAGGTTATTTTAAATTTGAAAATGCCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTAATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAAGCGATGGAATTGATGAAGGAGATGGATTCAGTTGATATTCATCCTAATATGATCACTTATATTGCCATGCTCAAGGGATTTTGTGATGTTGGTCGTTTGGAGGATGCTTATGAGTTATTTAAGGCTATGAAGGAAAATGGATGTTCACCCAACACAGTGGCTTACTCAGTGCTACTCAATGGGGCCAGTCGGCATGGGATTATGGAAAAGCTAATGGAATTGTTGGAGGAGATGGAAAAAGAAGGGGGGAATTGTAGTCCAAATGCTGTCACATACACTTCTATAATCCAGGGTCTCTGTGAATTTGGCCAGCCTCTGGAGGCATTGAAGATGTTGGACAGAATGGAAGACTCTGGCTGTGCTCCAAATCGTGTTACAGTTAGCACTTTACTAAAGGAATTTTGTAAAAATGGTCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCAAGAGGTGGGGCTTCATATGGTGATTGCTATAGCTCCCTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAGGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGATGAAGCCAGATGGTGTGGCTTGTAGTGTCATGATCAAGGAATTGTGCTTAGAGGAGCGGGTGCTAGATGGTTTTAACTTATGCAACGAAGTTGATAGGAATGGATATTTATCTTCCATTGACTCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAGGGGGTTCGTTTAAAACCTCATTATGCTGATAGTATCATCAAACATCTGAAGAAATTTGGAGACCAAGAGTTAGTTATGCATTTGGGTGGAATAAGGAAATGA

Protein sequence

MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSRCVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDYRREGCLVDIRMFKVILNLKMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK
Homology
BLAST of Tan0003620 vs. ExPASy Swiss-Prot
Match: Q9LVS3 (Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX=3702 GN=At5g47360 PE=2 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 9.4e-115
Identity = 216/461 (46.85%), Postives = 310/461 (67.25%), Query Frame = 0

Query: 11  SSSFRLKISTLSTLH-LSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSRCVNEVLHKC 70
           S S R + S +S L  L+TVS+A+  Y  LQ    N+EK LA+   +LDS C+NEVL +C
Sbjct: 11  SPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINEVLRRC 70

Query: 71  SFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDYRREGCLVDI 130
                Q GLRFFIWAG   ++RHS++MY++AC+++ I   P L+  VIE YR+E C V++
Sbjct: 71  DPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEECFVNV 130

Query: 131 RMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKE 190
           +  +++L L               K PEF++ ADT  YNLVIRLF +KG+++ A  L+KE
Sbjct: 131 KTMRIVLTLCNQANLADEALWVLRKFPEFNVCADTVAYNLVIRLFADKGDLNIADMLIKE 190

Query: 191 MDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGI 250
           MD V ++P++ITY +M+ G+C+ G+++DA+ L K M ++ C  N+V YS +L G  + G 
Sbjct: 191 MDCVGLYPDVITYTSMINGYCNAGKIDDAWRLAKEMSKHDCVLNSVTYSRILEGVCKSGD 250

Query: 251 MEKLMELLEEMEKE--GGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRV 310
           ME+ +ELL EMEKE  GG  SPNAVTYT +IQ  CE  +  EAL +LDRM + GC PNRV
Sbjct: 251 MERALELLAEMEKEDGGGLISPNAVTYTLVIQAFCEKRRVEEALLVLDRMGNRGCMPNRV 310

Query: 311 TVSTLLKEFCKNGH-VEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRN 370
           T   L++   +N   V+   KLID++V  GG S  +C+SS  VSL++MK+  EAE++FR 
Sbjct: 311 TACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWEEAEKIFRL 370

Query: 371 MLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEH 430
           ML  G++PDG+ACS + +ELCL ER LD F L  E+++    S+IDSDI+++LL+GLC+ 
Sbjct: 371 MLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIHAVLLLGLCQQ 430

Query: 431 DHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELV 453
            +S +AAKLA+ ML K +RLK  + + II+ LKK GD++L+
Sbjct: 431 GNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLM 471

BLAST of Tan0003620 vs. ExPASy Swiss-Prot
Match: P0C8A0 (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX=3702 GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 5.0e-39
Identity = 110/420 (26.19%), Postives = 204/420 (48.57%), Query Frame = 0

Query: 36  YDHLQKNNGNVEK-TLATVKTKLDSR--CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRH 95
           Y  L+ ++  V K  LA  ++ +D R   +  VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 96  SSFMYSRACELIGINRNPSLLCNVIEDYRREGC-LVDIRMFKVILNL------------- 155
           S  +      ++   R    +  +IE+ R+    L++  +F V++               
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 156 --KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCD 215
             +MP++ L  D  ++  ++    + G + +A ++ ++M      PN+  + ++L G+C 
Sbjct: 191 LDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE-KFPPNLRYFTSLLYGWCR 250

Query: 216 VGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNA 275
            G+L +A E+   MKE G  P+ V ++ LL+G +  G M    +L+ +M K G    PN 
Sbjct: 251 EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRG--FEPNV 310

Query: 276 VTYTSIIQGLCEFGQPL-EALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLID 335
             YT +IQ LC   + + EA+++   ME  GC  + VT + L+  FCK G +++ Y ++D
Sbjct: 311 NCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLD 370

Query: 336 RVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEE 395
            +  +G       Y  ++V+  K ++  E  EL   M   G  PD +  +V+I+  C   
Sbjct: 371 DMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLG 430

Query: 396 RVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHY 436
            V +   L NE++ NG    +D+  + +++ G       ++A    + M+ +G+   P Y
Sbjct: 431 EVKEAVRLWNEMEANGLSPGVDT--FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQY 484

BLAST of Tan0003620 vs. ExPASy Swiss-Prot
Match: Q9ZVX5 (Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX=3702 GN=At2g16880 PE=2 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 5.5e-38
Identity = 102/368 (27.72%), Postives = 182/368 (49.46%), Query Frame = 0

Query: 93  SSFMYSRACEL------IGINRNPSLLCNVIEDYRREGCLVDIRMFKVILNLKMPEFHLR 152
           SSF  S A E+      IG++ N      ++  Y  EG L D      +L   + EF + 
Sbjct: 181 SSFSISSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLEDA---LGMLERMVSEFKVN 240

Query: 153 ADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYEL 212
            D   YN +++  ++KG +    EL+ +M    + PN +TY  ++ G+C +G L++A+++
Sbjct: 241 PDNVTYNTILKAMSKKGRLSDLKELLLDMKKNGLVPNRVTYNNLVYGYCKLGSLKEAFQI 300

Query: 213 FKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGL 272
            + MK+    P+   Y++L+NG    G M + +EL++ M+       P+ VTY ++I G 
Sbjct: 301 VELMKQTNVLPDLCTYNILINGLCNAGSMREGLELMDAMKSL--KLQPDVVTYNTLIDGC 360

Query: 273 CEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYG 332
            E G  LEA K++++ME+ G   N+VT +  LK  CK    E   + +  +V   G S  
Sbjct: 361 FELGLSLEARKLMEQMENDGVKANQVTHNISLKWLCKEEKREAVTRKVKELVDMHGFSPD 420

Query: 333 -DCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCN 392
              Y +L+ + +K+  ++ A E+ R M   G+K + +  + ++  LC E ++ +  NL N
Sbjct: 421 IVTYHTLIKAYLKVGDLSGALEMMREMGQKGIKMNTITLNTILDALCKERKLDEAHNLLN 480

Query: 393 EVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKK 452
              + G++  +D   Y  L++G    +    A ++   M K  +       +S+I  L  
Sbjct: 481 SAHKRGFI--VDEVTYGTLIMGFFREEKVEKALEMWDEMKKVKITPTVSTFNSLIGGLCH 540

Query: 453 FGDQELVM 454
            G  EL M
Sbjct: 541 HGKTELAM 541

BLAST of Tan0003620 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 9.3e-38
Identity = 84/297 (28.28%), Postives = 157/297 (52.86%), Query Frame = 0

Query: 133 KVILNLKMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLK 192
           K +L+  +  + +  D   YN +I  + ++G +  A+E++ +M +    PN+ +Y  ++ 
Sbjct: 373 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 432

Query: 193 GFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNC 252
           GFC +G++++AY +   M  +G  PNTV ++ L++   +   + + +E+  EM ++G  C
Sbjct: 433 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKG--C 492

Query: 253 SPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYK 312
            P+  T+ S+I GLCE  +   AL +L  M   G   N VT +TL+  F + G ++EA K
Sbjct: 493 KPDVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARK 552

Query: 313 LIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELC 372
           L++ +V +G       Y+SL+  L +  ++ +A  LF  ML +G  P  ++C+++I  LC
Sbjct: 553 LVNEMVFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLC 612

Query: 373 LEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGV 430
               V +      E+   G  S+ D   ++ L+ GLC      D   + R +  +G+
Sbjct: 613 RSGMVEEAVEFQKEMVLRG--STPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAEGI 665

BLAST of Tan0003620 vs. ExPASy Swiss-Prot
Match: Q9FH87 (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana OX=3702 GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 9.3e-38
Identity = 111/419 (26.49%), Postives = 205/419 (48.93%), Query Frame = 0

Query: 30  SSADLFYDHLQKNNGNVEK-TLATVKTKLDSR--CVNEVLHKCSFELSQMGLRFFIWAGR 89
           S  +  Y  L+K +  V K  LA  ++ ++ R   +  VL++C  +   +G RFF+WA +
Sbjct: 81  SDVEKSYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCG-DAGNLGYRFFVWAAK 140

Query: 90  QPNYRHSSFMYSRACELIGINRNPSLLCNVIEDYRREG-CLVDIRMFKVILNL------- 149
           QP Y HS  +Y    +++   R    +  +IE+ R+E   L++  +F V++         
Sbjct: 141 QPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMV 200

Query: 150 --------KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAM 209
                   +MP+F    D  ++  ++    + G +  A +L ++M  +    N+  + ++
Sbjct: 201 KKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDM-RMRFPVNLRYFTSL 260

Query: 210 LKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGG 269
           L G+C VG++ +A  +   M E G  P+ V Y+ LL+G +  G M    +LL +M + G 
Sbjct: 261 LYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRG- 320

Query: 270 NCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEA 329
              PNA  YT +IQ LC+  +  EA+K+   ME   C  + VT + L+  FCK G +++ 
Sbjct: 321 -FEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKC 380

Query: 330 YKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKE 389
           Y ++D ++ +G       Y  ++V+  K +   E  EL   M      PD    +V+I+ 
Sbjct: 381 YIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRL 440

Query: 390 LCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGV 430
            C    V +   L NE++ NG    +D+  + +++ GL      ++A+   + M+ +G+
Sbjct: 441 ACKLGEVKEAVRLWNEMEENGLSPGVDT--FVIMINGLASQGCLLEASDHFKEMVTRGL 493

BLAST of Tan0003620 vs. NCBI nr
Match: KAG7017159.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 831.2 bits (2146), Expect = 4.2e-237
Identity = 412/475 (86.74%), Postives = 438/475 (92.21%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKG+M
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGDM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MK+NGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GG C PN VTYTSIIQ LCE GQPLEALK+LDRMED 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFGDQ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of Tan0003620 vs. NCBI nr
Match: KAG6579716.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 830.1 bits (2143), Expect = 9.4e-237
Identity = 411/475 (86.53%), Postives = 438/475 (92.21%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKG+M
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGDM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MK+NGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GG C PN VTYTSIIQ LCE GQPLEALK+LDRMED 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLE+RV+DGFNLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEDRVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFGDQ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of Tan0003620 vs. NCBI nr
Match: XP_023551479.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 829.3 bits (2141), Expect = 1.6e-236
Identity = 411/475 (86.53%), Postives = 437/475 (92.00%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQKNNGNVEKTL TVKTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLTTVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P L+ NVIEDY
Sbjct: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLVFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDI MFKVILNL               +M EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGEMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MK+NGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GG C PN VTYTSIIQ LCE GQPLEALK+LDRMED 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFGDQ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of Tan0003620 vs. NCBI nr
Match: XP_022969835.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita maxima])

HSP 1 Score: 828.6 bits (2139), Expect = 2.7e-236
Identity = 410/475 (86.32%), Postives = 439/475 (92.42%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQKNNGNVEKTLATV+TKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVRTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCSFELS MGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSPMGLRFFIWAGRQPNYRHSSFMYARACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCL+DI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLLDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MKENGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKENGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GGNC PN VTYTSIIQ LCE GQPLEALK+LDRMEDS
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGNCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDS 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GC+PNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCSPNRVTVSALVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEV+RNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVNRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFG Q+LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGGQDLVMHLGGIRE 475

BLAST of Tan0003620 vs. NCBI nr
Match: XP_022928928.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata])

HSP 1 Score: 827.8 bits (2137), Expect = 4.7e-236
Identity = 411/475 (86.53%), Postives = 436/475 (91.79%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQK NGNVEKTLATVKTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MK+NGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GG C PN VTYTSIIQ LCE GQPLEALK+LDRMED 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFGDQ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of Tan0003620 vs. ExPASy TrEMBL
Match: A0A6J1I125 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita maxima OX=3661 GN=LOC111468919 PE=3 SV=1)

HSP 1 Score: 828.6 bits (2139), Expect = 1.3e-236
Identity = 410/475 (86.32%), Postives = 439/475 (92.42%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQKNNGNVEKTLATV+TKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVRTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCSFELS MGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSPMGLRFFIWAGRQPNYRHSSFMYARACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCL+DI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLLDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MKENGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKENGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GGNC PN VTYTSIIQ LCE GQPLEALK+LDRMEDS
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGNCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDS 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GC+PNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCSPNRVTVSALVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEV+RNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVNRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFG Q+LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGGQDLVMHLGGIRE 475

BLAST of Tan0003620 vs. ExPASy TrEMBL
Match: A0A6J1EQI2 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita moschata OX=3662 GN=LOC111435690 PE=3 SV=1)

HSP 1 Score: 827.8 bits (2137), Expect = 2.3e-236
Identity = 411/475 (86.53%), Postives = 436/475 (91.79%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRIF  R SSFR KISTLSTL LSTVSSADLFYDHLQK NGNVEKTLATVKTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+NR+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDI MFKVILNL               KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMEL+KEMDSVDI PNMITYIAMLKGFCDVGRLEDAY LFK MK+NGC+PNTVAYSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASRHG +EKLMELLEEMEK+GG C PN VTYTSIIQ LCE GQPLEALK+LDRMED 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDRV ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLANG+KPDGVAC++MIKELCLEERV+DGFNLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLML+KG+RLKPHYA+SIIKH+KKFGDQ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of Tan0003620 vs. ExPASy TrEMBL
Match: A0A6J1DNK5 (pentatricopeptide repeat-containing protein At5g47360 OS=Momordica charantia OX=3673 GN=LOC111022875 PE=3 SV=1)

HSP 1 Score: 818.9 bits (2114), Expect = 1.0e-233
Identity = 406/475 (85.47%), Postives = 431/475 (90.74%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALF IF FRS SF LKIS LS LHLSTVSSADLFYDHLQKNNGNVEK LATVKT LDSR
Sbjct: 1   MALFGIFSFRSFSFGLKISKLSALHLSTVSSADLFYDHLQKNNGNVEKILATVKTTLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVN+VLHKCSFELS MGLRFFIWAGRQPNYRHSSFMYSRACELIGI+R+P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSLMGLRFFIWAGRQPNYRHSSFMYSRACELIGIDRSPCLLLNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGC+VDIRMFKV+LNL               KMPEFHLRADTT+YNLV+RLF EKGEM
Sbjct: 121 RREGCVVDIRMFKVMLNLCKEAKLANEALLILGKMPEFHLRADTTIYNLVVRLFIEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAM+LM+EMDS+DIHPNMITYIAMLKGFCDVGRLEDAY LFKAMKENGCSPNT+AYS+L
Sbjct: 181 DKAMKLMEEMDSIDIHPNMITYIAMLKGFCDVGRLEDAYGLFKAMKENGCSPNTLAYSIL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           LNGASR GI EK+MELLEEMEKEGGNCSPN VTYTSIIQ LCE GQPLEALK+LDRME+S
Sbjct: 241 LNGASRQGITEKIMELLEEMEKEGGNCSPNTVTYTSIIQSLCELGQPLEALKILDRMENS 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           GCAPNRVTV TL+KEFCK+GH+EE Y+LI RVVARGG SYGDCYSSLVVSL KMKKIA A
Sbjct: 301 GCAPNRVTVRTLIKEFCKDGHMEEVYELIHRVVARGGTSYGDCYSSLVVSLAKMKKIAAA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           EELFRNMLA+G+KPDGVACSVMIKELCLEERVLDG+NLCNEVDRNGYLSSIDSDIYSLLL
Sbjct: 361 EELFRNMLASGVKPDGVACSVMIKELCLEERVLDGYNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDH +DA KLARLMLKKG+RLKPHYAD +IKHL KFGDQELVM LGGIRK
Sbjct: 421 VGLCEHDHPMDAEKLARLMLKKGIRLKPHYADHVIKHLNKFGDQELVMQLGGIRK 475

BLAST of Tan0003620 vs. ExPASy TrEMBL
Match: A0A0A0LI44 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 6.6e-228
Identity = 399/475 (84.00%), Postives = 431/475 (90.74%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRI C RSSSF L ISTLST HL+T+SS+DLFYDHL+K+NGN++KTLAT+KTKLDSR
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVNEVL+KCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGIN +P LL NVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDIRMFK+ILNL               KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMELMKEMDSVDIHPNMITYI+MLKGFCDVGR EDAY LFK MKENGC+PNTV YSVL
Sbjct: 181 DKAMELMKEMDSVDIHPNMITYISMLKGFCDVGRWEDAYGLFKDMKENGCAPNTVVYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           +NGA R  IM++LME+L+EMEK+GG CSPN VTYTSIIQ LCE G PLEALK+LDRME+ 
Sbjct: 241 VNGAIRLRIMDRLMEMLKEMEKQGGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           G APNRV VS L+KEFCK+GHVEEAYKLIDRVVARGG SYGDCYSSLVV+LVKMKKIAEA
Sbjct: 301 GYAPNRVAVSFLVKEFCKDGHVEEAYKLIDRVVARGGVSYGDCYSSLVVTLVKMKKIAEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           E+LFRNMLANG+KPDGVACS+MI+ELCLEERVLDGFNLC EVDRNGYL SID+DIYSLLL
Sbjct: 361 EKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGLCEHDHSVDAAKLARLMLKKG+RLKPHYA+SIIKHLKKF D+ELVMHLGGIRK
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 475

BLAST of Tan0003620 vs. ExPASy TrEMBL
Match: A0A1S3B4L9 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN=LOC103485755 PE=4 SV=1)

HSP 1 Score: 763.8 bits (1971), Expect = 4.0e-217
Identity = 387/475 (81.47%), Postives = 420/475 (88.42%), Query Frame = 0

Query: 1   MALFRIFCFRSSSFRLKISTLSTLHLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60
           MALFRI   RSSS  L ISTLST HLST+SS+DLFYDHL+KNNGNVEKTLATVKTKLDSR
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDY 120
           CVNEVL+KCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACELIGIN +P LL NVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEM 180
           RREGCLVDIR+F++ILNL               KM EFHLRADTT+YNLVIRL TEKGEM
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRADTTIYNLVIRLCTEKGEM 180

Query: 181 DKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVL 240
           DKAMELMKEMDSVDIHPNMITYI+M+KGFCDVGR EDAY LFKAMKENG +PNTV YSVL
Sbjct: 181 DKAMELMKEMDSVDIHPNMITYISMIKGFCDVGRWEDAYGLFKAMKENGYAPNTVVYSVL 240

Query: 241 LNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDS 300
           +NGA R  IM+KLME+LEEMEK+GG C PN VTYTSIIQ LCE G  LEALK+LDRME+ 
Sbjct: 241 VNGAVRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 301 GCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEA 360
           G APNRV V  L+KEFCK+GHVEEAYKLIDRVVARGGASYGDC SSLV+SLVKMKKI EA
Sbjct: 301 GHAPNRVAVGYLVKEFCKDGHVEEAYKLIDRVVARGGASYGDCCSSLVISLVKMKKIPEA 360

Query: 361 EELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLL 420
           E+LFRNMLANG+KPDGVACS+MI+ELCLEERVLDGF+LC EVDRNGYL  ID+D+YSLLL
Sbjct: 361 EKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELVMHLGGIRK 461
           VGL +HDHSVDAA LARLMLKKG+RLKPHYA+SIIKHLKKF DQEL+MHLGGIRK
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIRK 475

BLAST of Tan0003620 vs. TAIR 10
Match: AT5G47360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 415.2 bits (1066), Expect = 6.7e-116
Identity = 216/461 (46.85%), Postives = 310/461 (67.25%), Query Frame = 0

Query: 11  SSSFRLKISTLSTLH-LSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSRCVNEVLHKC 70
           S S R + S +S L  L+TVS+A+  Y  LQ    N+EK LA+   +LDS C+NEVL +C
Sbjct: 11  SPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINEVLRRC 70

Query: 71  SFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINRNPSLLCNVIEDYRREGCLVDI 130
                Q GLRFFIWAG   ++RHS++MY++AC+++ I   P L+  VIE YR+E C V++
Sbjct: 71  DPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEECFVNV 130

Query: 131 RMFKVILNL---------------KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKE 190
           +  +++L L               K PEF++ ADT  YNLVIRLF +KG+++ A  L+KE
Sbjct: 131 KTMRIVLTLCNQANLADEALWVLRKFPEFNVCADTVAYNLVIRLFADKGDLNIADMLIKE 190

Query: 191 MDSVDIHPNMITYIAMLKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGI 250
           MD V ++P++ITY +M+ G+C+ G+++DA+ L K M ++ C  N+V YS +L G  + G 
Sbjct: 191 MDCVGLYPDVITYTSMINGYCNAGKIDDAWRLAKEMSKHDCVLNSVTYSRILEGVCKSGD 250

Query: 251 MEKLMELLEEMEKE--GGNCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRV 310
           ME+ +ELL EMEKE  GG  SPNAVTYT +IQ  CE  +  EAL +LDRM + GC PNRV
Sbjct: 251 MERALELLAEMEKEDGGGLISPNAVTYTLVIQAFCEKRRVEEALLVLDRMGNRGCMPNRV 310

Query: 311 TVSTLLKEFCKNGH-VEEAYKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRN 370
           T   L++   +N   V+   KLID++V  GG S  +C+SS  VSL++MK+  EAE++FR 
Sbjct: 311 TACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWEEAEKIFRL 370

Query: 371 MLANGMKPDGVACSVMIKELCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEH 430
           ML  G++PDG+ACS + +ELCL ER LD F L  E+++    S+IDSDI+++LL+GLC+ 
Sbjct: 371 MLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIHAVLLLGLCQQ 430

Query: 431 DHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKKFGDQELV 453
            +S +AAKLA+ ML K +RLK  + + II+ LKK GD++L+
Sbjct: 431 GNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLM 471

BLAST of Tan0003620 vs. TAIR 10
Match: AT3G49730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 163.7 bits (413), Expect = 3.5e-40
Identity = 110/420 (26.19%), Postives = 204/420 (48.57%), Query Frame = 0

Query: 36  YDHLQKNNGNVEK-TLATVKTKLDSR--CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRH 95
           Y  L+ ++  V K  LA  ++ +D R   +  VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 96  SSFMYSRACELIGINRNPSLLCNVIEDYRREGC-LVDIRMFKVILNL------------- 155
           S  +      ++   R    +  +IE+ R+    L++  +F V++               
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 156 --KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCD 215
             +MP++ L  D  ++  ++    + G + +A ++ ++M      PN+  + ++L G+C 
Sbjct: 191 LDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE-KFPPNLRYFTSLLYGWCR 250

Query: 216 VGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNA 275
            G+L +A E+   MKE G  P+ V ++ LL+G +  G M    +L+ +M K G    PN 
Sbjct: 251 EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRG--FEPNV 310

Query: 276 VTYTSIIQGLCEFGQPL-EALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLID 335
             YT +IQ LC   + + EA+++   ME  GC  + VT + L+  FCK G +++ Y ++D
Sbjct: 311 NCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLD 370

Query: 336 RVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEE 395
            +  +G       Y  ++V+  K ++  E  EL   M   G  PD +  +V+I+  C   
Sbjct: 371 DMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLG 430

Query: 396 RVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHY 436
            V +   L NE++ NG    +D+  + +++ G       ++A    + M+ +G+   P Y
Sbjct: 431 EVKEAVRLWNEMEANGLSPGVDT--FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQY 484

BLAST of Tan0003620 vs. TAIR 10
Match: AT2G16880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 160.2 bits (404), Expect = 3.9e-39
Identity = 102/368 (27.72%), Postives = 182/368 (49.46%), Query Frame = 0

Query: 93  SSFMYSRACEL------IGINRNPSLLCNVIEDYRREGCLVDIRMFKVILNLKMPEFHLR 152
           SSF  S A E+      IG++ N      ++  Y  EG L D      +L   + EF + 
Sbjct: 181 SSFSISSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLEDA---LGMLERMVSEFKVN 240

Query: 153 ADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCDVGRLEDAYEL 212
            D   YN +++  ++KG +    EL+ +M    + PN +TY  ++ G+C +G L++A+++
Sbjct: 241 PDNVTYNTILKAMSKKGRLSDLKELLLDMKKNGLVPNRVTYNNLVYGYCKLGSLKEAFQI 300

Query: 213 FKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNAVTYTSIIQGL 272
            + MK+    P+   Y++L+NG    G M + +EL++ M+       P+ VTY ++I G 
Sbjct: 301 VELMKQTNVLPDLCTYNILINGLCNAGSMREGLELMDAMKSL--KLQPDVVTYNTLIDGC 360

Query: 273 CEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVVARGGASYG 332
            E G  LEA K++++ME+ G   N+VT +  LK  CK    E   + +  +V   G S  
Sbjct: 361 FELGLSLEARKLMEQMENDGVKANQVTHNISLKWLCKEEKREAVTRKVKELVDMHGFSPD 420

Query: 333 -DCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEERVLDGFNLCN 392
              Y +L+ + +K+  ++ A E+ R M   G+K + +  + ++  LC E ++ +  NL N
Sbjct: 421 IVTYHTLIKAYLKVGDLSGALEMMREMGQKGIKMNTITLNTILDALCKERKLDEAHNLLN 480

Query: 393 EVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHYADSIIKHLKK 452
              + G++  +D   Y  L++G    +    A ++   M K  +       +S+I  L  
Sbjct: 481 SAHKRGFI--VDEVTYGTLIMGFFREEKVEKALEMWDEMKKVKITPTVSTFNSLIGGLCH 540

Query: 453 FGDQELVM 454
            G  EL M
Sbjct: 541 HGKTELAM 541

BLAST of Tan0003620 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 159.5 bits (402), Expect = 6.6e-39
Identity = 86/305 (28.20%), Postives = 163/305 (53.44%), Query Frame = 0

Query: 139 KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAMLKGFCDVG 198
           KM E +++ D   Y+++I    + G +D A  L  EM+      ++ITY  ++ GFC+ G
Sbjct: 253 KMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAG 312

Query: 199 RLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGGNCSPNAVT 258
           R +D  +L + M +   SPN V +SVL++   + G + +  +LL+EM + G   +PN +T
Sbjct: 313 RWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRG--IAPNTIT 372

Query: 259 YTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEAYKLIDRVV 318
           Y S+I G C+  +  EA++M+D M   GC P+ +T + L+  +CK   +++  +L   + 
Sbjct: 373 YNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMS 432

Query: 319 ARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKELCLEERVL 378
            RG  +    Y++LV    +  K+  A++LF+ M++  ++PD V+  +++  LC    + 
Sbjct: 433 LRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELE 492

Query: 379 DGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGVRLKPHYADS 438
               +  +++++     +D  IY +++ G+C      DA  L   +  KGV+L     + 
Sbjct: 493 KALEIFGKIEKS--KMELDIGIYMIIIHGMCNASKVDDAWDLFCSLPLKGVKLDARAYNI 552

Query: 439 IIKHL 444
           +I  L
Sbjct: 553 MISEL 553

BLAST of Tan0003620 vs. TAIR 10
Match: AT5G65820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 159.5 bits (402), Expect = 6.6e-39
Identity = 111/419 (26.49%), Postives = 205/419 (48.93%), Query Frame = 0

Query: 30  SSADLFYDHLQKNNGNVEK-TLATVKTKLDSR--CVNEVLHKCSFELSQMGLRFFIWAGR 89
           S  +  Y  L+K +  V K  LA  ++ ++ R   +  VL++C  +   +G RFF+WA +
Sbjct: 81  SDVEKSYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCG-DAGNLGYRFFVWAAK 140

Query: 90  QPNYRHSSFMYSRACELIGINRNPSLLCNVIEDYRREG-CLVDIRMFKVILNL------- 149
           QP Y HS  +Y    +++   R    +  +IE+ R+E   L++  +F V++         
Sbjct: 141 QPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMV 200

Query: 150 --------KMPEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYIAM 209
                   +MP+F    D  ++  ++    + G +  A +L ++M  +    N+  + ++
Sbjct: 201 KKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDM-RMRFPVNLRYFTSL 260

Query: 210 LKGFCDVGRLEDAYELFKAMKENGCSPNTVAYSVLLNGASRHGIMEKLMELLEEMEKEGG 269
           L G+C VG++ +A  +   M E G  P+ V Y+ LL+G +  G M    +LL +M + G 
Sbjct: 261 LYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRG- 320

Query: 270 NCSPNAVTYTSIIQGLCEFGQPLEALKMLDRMEDSGCAPNRVTVSTLLKEFCKNGHVEEA 329
              PNA  YT +IQ LC+  +  EA+K+   ME   C  + VT + L+  FCK G +++ 
Sbjct: 321 -FEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKC 380

Query: 330 YKLIDRVVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGMKPDGVACSVMIKE 389
           Y ++D ++ +G       Y  ++V+  K +   E  EL   M      PD    +V+I+ 
Sbjct: 381 YIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRL 440

Query: 390 LCLEERVLDGFNLCNEVDRNGYLSSIDSDIYSLLLVGLCEHDHSVDAAKLARLMLKKGV 430
            C    V +   L NE++ NG    +D+  + +++ GL      ++A+   + M+ +G+
Sbjct: 441 ACKLGEVKEAVRLWNEMEENGLSPGVDT--FVIMINGLASQGCLLEASDHFKEMVTRGL 493

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LVS39.4e-11546.85Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX... [more]
P0C8A05.0e-3926.19Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX... [more]
Q9ZVX55.5e-3827.72Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX... [more]
Q9FMF69.3e-3828.28Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Q9FH879.3e-3826.49Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
KAG7017159.14.2e-23786.74Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
KAG6579716.19.4e-23786.53Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_023551479.11.6e-23686.53pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pep... [more]
XP_022969835.12.7e-23686.32pentatricopeptide repeat-containing protein At5g47360 [Cucurbita maxima][more]
XP_022928928.14.7e-23686.53pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1I1251.3e-23686.32pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita maxima OX=366... [more]
A0A6J1EQI22.3e-23686.53pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita moschata OX=3... [more]
A0A6J1DNK51.0e-23385.47pentatricopeptide repeat-containing protein At5g47360 OS=Momordica charantia OX=... [more]
A0A0A0LI446.6e-22884.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1[more]
A0A1S3B4L94.0e-21781.47pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT5G47360.16.7e-11646.85Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G49730.13.5e-4026.19Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G16880.13.9e-3927.72Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G12775.16.6e-3928.20Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G65820.16.6e-3926.49Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 328..360
e-value: 2.2E-6
score: 25.5
coord: 151..183
e-value: 6.8E-6
score: 23.9
coord: 220..249
e-value: 5.6E-4
score: 17.9
coord: 185..219
e-value: 4.2E-10
score: 37.2
coord: 292..321
e-value: 0.0014
score: 16.7
coord: 257..290
e-value: 6.5E-9
score: 33.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 182..229
e-value: 4.6E-15
score: 55.5
coord: 254..303
e-value: 8.3E-15
score: 54.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 328..356
e-value: 0.0051
score: 17.0
coord: 151..177
e-value: 1.0E-4
score: 22.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 290..324
score: 9.689847
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 183..217
score: 13.690722
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 218..252
score: 9.656963
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 397..431
score: 8.692369
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 148..182
score: 11.016164
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 325..359
score: 9.766576
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 255..289
score: 13.131695
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 35..248
e-value: 2.2E-31
score: 111.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 327..457
e-value: 1.1E-15
score: 59.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 249..318
e-value: 7.9E-22
score: 79.7
NoneNo IPR availablePANTHERPTHR45613:SF82PPR CONTAINING PLANT-LIKE PROTEINcoord: 19..456
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 19..456

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003620.1Tan0003620.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding