Tan0006665 (gene) Snake gourd v1

Overview
NameTan0006665
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG08: 6563325 .. 6566313 (+)
RNA-Seq ExpressionTan0006665
SyntenyTan0006665
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAACTCCCCGTTTTCTCTCTGATATCGGGGGAACTTCCCGTCAATGTCTCCAAGTCCAAAGTGAAAAACATGAACGCCATCATTCGCCTCCCGTCGAGAATTTCCCCGGCGAAAATTGATCGGAAGTACAAGTTTTCAACTACCCAACTTCCGTTCTGCACTTACAATTCAACTTCTACGGCTTCCACATCGAAAGATTTCGACCCAGAAATTATCTCACAGCTAATTTCCAGGCAACGATGGTCGAACCTCAAATCCCATTTCAAATTCAAAACTCCCGTCGATTTTCTTCACCAATTGCTCGGTTCGGAAGCCGTCGACCCGTTGCTTGTGCTCAGGTACTTCAACTGGTCCCAGAAAGAGCTTAAAGTTAATTACAACATCGAGCTCTTTTGCAGACTCTTAAATTTGTTAGCCAATGCCAAATGTTACCCCAAAATTCGATCGTTTCTTGACTCTTTTGTAAAGGGTGAAACAAATTCCACGATTTCTTTGATTTTTCATACGCTTTCGGTGTGTGGTGACCAATTTTGTGCTAACTCGATAATTGCTGATATGTTGGTGTTGGCTTATGTGAAAAATTCGAAAACGGCTTTGGGATTGGAGGCGTTTAAGCGTGCTGGTGATTATGGGTATAAGTTATCTGTGTTGTCTTGTAACCCATTGTTGAGTGCTCTGGCGAAGGAGAACGAATTTGGGGATGTGGAATTTATCTATAAAGAGATGATTAGGAGGAAAGTTAGTCCTAATTTGATTACATTTAATATTGTGATTAATGGGTTGTGTAAGGCTGGGAAATTGAATAAAGCTGGTGATGTTATTGATGATATGAAAGTGTGGGGATTCTGGCCCAATGTGGTTACTTATAACACTCTCATTGATGGATACTGTAAAATAGGCAGGGTTGGGAAGATGTACAAAGCTGATGCTATTTTGAAGGAAATGATGGCAAATGAAGTTAGTCCAAATGATGTAACTTTTAATATATTGATTGATGGGTTCTGTAAAGATGAAAACATATCGGCTGCGTTGAAGGTGTTCGAGGAAATGCGGAGTCAGGGTGTGAAGCCAACTGTTATAACATATAATTCCTTGATTAATGGTTTTTGCAACGAAGGAAAACTGAATGAAGCAAAAGTATTGCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTCATTACTTATAATGCTCTGATTAATGGATATTGTAAGAAGAAGCTGTTGGAGGAAGCTAGGGAGTTGTTTGATAAAATTGGAAAACAAGGGCTGGCTCCCAATGTTATAACATTCAATACATTACTTGATGGATATTGCAAGAGTGGAAAGATGGAAGAAGCATTTTTGCTACAAAGCTTAATGTTAGGGAAGGGATTTCTCCCAGATGTCTCAACCTACAATTGCCTTATCGCCGGTTTTTGCAGGGAGGGGAAAATGGAGGAAGCCAAGAATCTTTTAAACGAAATGGAAGATAGAGGTCTGAAAGCTGATTTGGTGACTTACAATATTTTGGTAAGTGCATGGTGTGAGAAAAGGGAGCCAAAAAAGGCAGCGGGACTCGTCAATGAGATGCTTCATAGGGGATTACGACCGAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGTATAGAAGGAAACTTAAGGGCTGCTTTGAATGTAAGGAAACAAATGGAGAAAGAAGGAAGGTGGGCCAACGTGGTGACATATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAGTTGGAAGATGCAAATGGACTTCTGAATGAGATGCTGGAGAAGGGATTGATACCTAACCGAACTACTTATGAAATAATTAGAGAGGAAATGATGGAGAAAGGATTCCTCCCTGATATAGAAGGCCATCTTTATCAAGTCTCCCGGTAGAAGTTAATCAATGACTTCACAACTTTACTGGTGGAATGAGCTGTACAAACTCTTAAATCGCTGATCCGGCCAGCATGATTGCAGCTAGAGAGGGCATTTCATGGTAACTAATATATCCAGGAGGGCAACAGACACCATATTCAGCAATCCTTAATCTTCTACATTTGAGAGCTTTTGATGTTGTGTATTTTTAAACTTACGGATGGTATAAAATTTAATCAATCAGGTAATTTAGTAATCAATCATAGCATGCTTCTTCTTTCCCTTGTCATCCTACCTTACCTTCCTCGCCATTTTATGTCCTTGATGTGCTTATTGTATGATTATTTTTTATTGTCATTGCAGGATTATCCGCTGTAATCCCAACTTGGGATGAAGATCTGCAGTCATCAGTTCGTTTTCTTGGCCTTTTGATGCACTCTGGTTTTACTTTAAGACCAGCCCTTGCAATGGCGCCCAAAGACATCTGACTCGTCTTACATCGAGCAACCTTCGCTTTGCTTCCAATGATATTGATTTGACTATTTAAGTTTTAAGCTAGTCGTTCGACTCGTTGTAGAAGGAAGCAGCAGAAGGGTGGTACTTTGAGAAACAGGATATGAATATGCTGCAATCTCAGCTTGATCACAAACACTCGTTTCCTTATTTTTGTCGCGGTTGGCACATTTGGACCTGCATTTGGTGATTCCATCTCCTGCTTCATACTAGTCGTGGGCATGGGCAGATCGTGGACATTCAGGTCCATAGGTAATTACTCCTGCCGATAAGTTGGAGGTTTGTTCGAGTCCATAAGTAATGACAACTGCTGATGTTTTACTATACTCATCAGTATCCAGTGTCCTTGGATCTTCAAATCAGAAGTAATTTGTATATCATCCACAGGGTGTTGTTGGTCGACCTTGAATTATTATGCGAATGGCGCTATTAAAGGGTCGGTCAATAGCAAGTCTCTTTCTTTCTCGAATCTCTTTGTCGTCCCGGCCATTTCGGGGTTGTTTGGGACACTGGGTGAGTTATAATAATATGTGGGTATTATAACCTGTGGAATCATATAATATTATTTAAAATACAGAATAGTATATTATGG

mRNA sequence

AAAAAACTCCCCGTTTTCTCTCTGATATCGGGGGAACTTCCCGTCAATGTCTCCAAGTCCAAAGTGAAAAACATGAACGCCATCATTCGCCTCCCGTCGAGAATTTCCCCGGCGAAAATTGATCGGAAGTACAAGTTTTCAACTACCCAACTTCCGTTCTGCACTTACAATTCAACTTCTACGGCTTCCACATCGAAAGATTTCGACCCAGAAATTATCTCACAGCTAATTTCCAGGCAACGATGGTCGAACCTCAAATCCCATTTCAAATTCAAAACTCCCGTCGATTTTCTTCACCAATTGCTCGGTTCGGAAGCCGTCGACCCGTTGCTTGTGCTCAGGTACTTCAACTGGTCCCAGAAAGAGCTTAAAGTTAATTACAACATCGAGCTCTTTTGCAGACTCTTAAATTTGTTAGCCAATGCCAAATGTTACCCCAAAATTCGATCGTTTCTTGACTCTTTTGTAAAGGGTGAAACAAATTCCACGATTTCTTTGATTTTTCATACGCTTTCGGTGTGTGGTGACCAATTTTGTGCTAACTCGATAATTGCTGATATGTTGGTGTTGGCTTATGTGAAAAATTCGAAAACGGCTTTGGGATTGGAGGCGTTTAAGCGTGCTGGTGATTATGGGTATAAGTTATCTGTGTTGTCTTGTAACCCATTGTTGAGTGCTCTGGCGAAGGAGAACGAATTTGGGGATGTGGAATTTATCTATAAAGAGATGATTAGGAGGAAAGTTAGTCCTAATTTGATTACATTTAATATTGTGATTAATGGGTTGTGTAAGGCTGGGAAATTGAATAAAGCTGGTGATGTTATTGATGATATGAAAGTGTGGGGATTCTGGCCCAATGTGGTTACTTATAACACTCTCATTGATGGATACTGTAAAATAGGCAGGGTTGGGAAGATGTACAAAGCTGATGCTATTTTGAAGGAAATGATGGCAAATGAAGTTAGTCCAAATGATGTAACTTTTAATATATTGATTGATGGGTTCTGTAAAGATGAAAACATATCGGCTGCGTTGAAGGTGTTCGAGGAAATGCGGAGTCAGGGTGTGAAGCCAACTGTTATAACATATAATTCCTTGATTAATGGTTTTTGCAACGAAGGAAAACTGAATGAAGCAAAAGTATTGCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTCATTACTTATAATGCTCTGATTAATGGATATTGTAAGAAGAAGCTGTTGGAGGAAGCTAGGGAGTTGTTTGATAAAATTGGAAAACAAGGGCTGGCTCCCAATGTTATAACATTCAATACATTACTTGATGGATATTGCAAGAGTGGAAAGATGGAAGAAGCATTTTTGCTACAAAGCTTAATGTTAGGGAAGGGATTTCTCCCAGATGTCTCAACCTACAATTGCCTTATCGCCGGTTTTTGCAGGGAGGGGAAAATGGAGGAAGCCAAGAATCTTTTAAACGAAATGGAAGATAGAGGTCTGAAAGCTGATTTGGTGACTTACAATATTTTGGTAAGTGCATGGTGTGAGAAAAGGGAGCCAAAAAAGGCAGCGGGACTCGTCAATGAGATGCTTCATAGGGGATTACGACCGAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGTATAGAAGGAAACTTAAGGGCTGCTTTGAATGTAAGGAAACAAATGGAGAAAGAAGGAAGGTGGGCCAACGTGGTGACATATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAGTTGGAAGATGCAAATGGACTTCTGAATGAGATGCTGGAGAAGGGATTGATACCTAACCGAACTACTTATGAAATAATTAGAGAGGAAATGATGGAGAAAGGATTCCTCCCTGATATAGAAGGCCATCTTTATCAAGTCTCCCGGTAGAAGTTAATCAATGACTTCACAACTTTACTGGTGGAATGAGCTGTACAAACTCTTAAATCGCTGATCCGGCCAGCATGATTGCAGCTAGAGAGGGCATTTCATGGTAACTAATATATCCAGGAGGGCAACAGACACCATATTCAGCAATCCTTAATCTTCTACATTTGAGAGCTTTTGATGTTGTGTATTTTTAAACTTACGGATGGTATAAAATTTAATCAATCAGGATTATCCGCTGTAATCCCAACTTGGGATGAAGATCTGCAGTCATCAGTTCGTTTTCTTGGCCTTTTGATGCACTCTGGTTTTACTTTAAGACCAGCCCTTGCAATGGCGCCCAAAGACATCTGACTCGTCTTACATCGAGCAACCTTCGCTTTGCTTCCAATGATATTGATTTGACTATTTAAGTTTTAAGCTAGTCGTTCGACTCGTTGTAGAAGGAAGCAGCAGAAGGGTGGTACTTTGAGAAACAGGATATGAATATGCTGCAATCTCAGCTTGATCACAAACACTCGTTTCCTTATTTTTGTCGCGGTTGGCACATTTGGACCTGCATTTGGTGATTCCATCTCCTGCTTCATACTAGTCGTGGGCATGGGCAGATCGTGGACATTCAGGTCCATAGGTAATTACTCCTGCCGATAAGTTGGAGGTTTGTTCGAGTCCATAAGTAATGACAACTGCTGATGTTTTACTATACTCATCAGTATCCAGTGTCCTTGGATCTTCAAATCAGAAGTAATTTGTATATCATCCACAGGGTGTTGTTGGTCGACCTTGAATTATTATGCGAATGGCGCTATTAAAGGGTCGGTCAATAGCAAGTCTCTTTCTTTCTCGAATCTCTTTGTCGTCCCGGCCATTTCGGGGTTGTTTGGGACACTGGGTGAGTTATAATAATATGTGGGTATTATAACCTGTGGAATCATATAATATTATTTAAAATACAGAATAGTATATTATGG

Coding sequence (CDS)

ATGAACGCCATCATTCGCCTCCCGTCGAGAATTTCCCCGGCGAAAATTGATCGGAAGTACAAGTTTTCAACTACCCAACTTCCGTTCTGCACTTACAATTCAACTTCTACGGCTTCCACATCGAAAGATTTCGACCCAGAAATTATCTCACAGCTAATTTCCAGGCAACGATGGTCGAACCTCAAATCCCATTTCAAATTCAAAACTCCCGTCGATTTTCTTCACCAATTGCTCGGTTCGGAAGCCGTCGACCCGTTGCTTGTGCTCAGGTACTTCAACTGGTCCCAGAAAGAGCTTAAAGTTAATTACAACATCGAGCTCTTTTGCAGACTCTTAAATTTGTTAGCCAATGCCAAATGTTACCCCAAAATTCGATCGTTTCTTGACTCTTTTGTAAAGGGTGAAACAAATTCCACGATTTCTTTGATTTTTCATACGCTTTCGGTGTGTGGTGACCAATTTTGTGCTAACTCGATAATTGCTGATATGTTGGTGTTGGCTTATGTGAAAAATTCGAAAACGGCTTTGGGATTGGAGGCGTTTAAGCGTGCTGGTGATTATGGGTATAAGTTATCTGTGTTGTCTTGTAACCCATTGTTGAGTGCTCTGGCGAAGGAGAACGAATTTGGGGATGTGGAATTTATCTATAAAGAGATGATTAGGAGGAAAGTTAGTCCTAATTTGATTACATTTAATATTGTGATTAATGGGTTGTGTAAGGCTGGGAAATTGAATAAAGCTGGTGATGTTATTGATGATATGAAAGTGTGGGGATTCTGGCCCAATGTGGTTACTTATAACACTCTCATTGATGGATACTGTAAAATAGGCAGGGTTGGGAAGATGTACAAAGCTGATGCTATTTTGAAGGAAATGATGGCAAATGAAGTTAGTCCAAATGATGTAACTTTTAATATATTGATTGATGGGTTCTGTAAAGATGAAAACATATCGGCTGCGTTGAAGGTGTTCGAGGAAATGCGGAGTCAGGGTGTGAAGCCAACTGTTATAACATATAATTCCTTGATTAATGGTTTTTGCAACGAAGGAAAACTGAATGAAGCAAAAGTATTGCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTCATTACTTATAATGCTCTGATTAATGGATATTGTAAGAAGAAGCTGTTGGAGGAAGCTAGGGAGTTGTTTGATAAAATTGGAAAACAAGGGCTGGCTCCCAATGTTATAACATTCAATACATTACTTGATGGATATTGCAAGAGTGGAAAGATGGAAGAAGCATTTTTGCTACAAAGCTTAATGTTAGGGAAGGGATTTCTCCCAGATGTCTCAACCTACAATTGCCTTATCGCCGGTTTTTGCAGGGAGGGGAAAATGGAGGAAGCCAAGAATCTTTTAAACGAAATGGAAGATAGAGGTCTGAAAGCTGATTTGGTGACTTACAATATTTTGGTAAGTGCATGGTGTGAGAAAAGGGAGCCAAAAAAGGCAGCGGGACTCGTCAATGAGATGCTTCATAGGGGATTACGACCGAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGTATAGAAGGAAACTTAAGGGCTGCTTTGAATGTAAGGAAACAAATGGAGAAAGAAGGAAGGTGGGCCAACGTGGTGACATATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAGTTGGAAGATGCAAATGGACTTCTGAATGAGATGCTGGAGAAGGGATTGATACCTAACCGAACTACTTATGAAATAATTAGAGAGGAAATGATGGAGAAAGGATTCCTCCCTGATATAGAAGGCCATCTTTATCAAGTCTCCCGGTAG

Protein sequence

MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDIEGHLYQVSR
Homology
BLAST of Tan0006665 vs. ExPASy Swiss-Prot
Match: O04504 (Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX=3702 GN=At1g09820 PE=2 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 3.4e-197
Identity = 337/581 (58.00%), Postives = 433/581 (74.53%), Query Frame = 0

Query: 30  CTYNSTSTASTSKD-FDPEIISQLISRQRWSNLKSHFKFKTPVDFLHQLLGSEAVDPLLV 89
           C+ +ST T S     +D  +I+ LI +Q WS L  H     P +   QL+ SE +DP L 
Sbjct: 26  CSSSSTITGSPCPPRYDVAVIADLIEKQHWSKLGVHVTDINPNELFRQLISSE-LDPDLC 85

Query: 90  LRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSFVKGETNSTISLIFHTLS 149
           LRY++W  K   ++ ++EL  +LL+ LANAK Y KIRSFLD FV+  ++  +  IFH +S
Sbjct: 86  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHSIFHAIS 145

Query: 150 VCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKENE 209
           +C D  C NSIIADMLVLAY  NS+  LG EAFKR+G YGYKLS LSC PL+ AL KEN 
Sbjct: 146 MC-DNVCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKENR 205

Query: 210 FGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTYNT 269
             DVE++YKEMIRRK+ PN+ TFN+VIN LCK GK+NKA DV++DMKV+G  PNVV+YNT
Sbjct: 206 SADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSYNT 265

Query: 270 LIDGYCKIGRVGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMR 329
           LIDGYCK+G  GKMYKADA+LKEM+ N+VSPN  TFNILIDGF KD+N+  ++KVF+EM 
Sbjct: 266 LIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKEML 325

Query: 330 SQGVKPTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLE 389
            Q VKP VI+YNSLING CN GK++EA  + D+M+S+ ++PN+ITYNALING+CK  +L+
Sbjct: 326 DQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDMLK 385

Query: 390 EARELFDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLI 449
           EA ++F  +  QG  P    +N L+D YCK GK+++ F L+  M  +G +PDV TYNCLI
Sbjct: 386 EALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMEREGIVPDVGTYNCLI 445

Query: 450 AGFCREGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLR 509
           AG CR G +E AK L +++  +GL  DLVT++IL+  +C K E +KAA L+ EM   GL+
Sbjct: 446 AGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESRKAAMLLKEMSKMGLK 505

Query: 510 PSHLTYNILLNGYCIEGNLRAALNVRKQMEKEGRW-ANVVTYNVLIQGYCRKGKLEDANG 569
           P HLTYNI++ GYC EGNL+AA N+R QMEKE R   NV +YNVL+QGY +KGKLEDAN 
Sbjct: 506 PRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKLEDANM 565

Query: 570 LLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDIEGHLYQVS 609
           LLNEMLEKGL+PNR TYEI++EEM+++GF+PDIEGHL+ VS
Sbjct: 566 LLNEMLEKGLVPNRITYEIVKEEMVDQGFVPDIEGHLFNVS 603

BLAST of Tan0006665 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 1.3e-92
Identity = 192/556 (34.53%), Postives = 299/556 (53.78%), Query Frame = 0

Query: 69  TPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFL 128
           TP    + LL S+  D  L+L++ NW+       + +   C  L++L   K Y K    L
Sbjct: 47  TPEAASNLLLKSQN-DQALILKFLNWANPH--QFFTLRCKCITLHILTKFKLY-KTAQIL 106

Query: 129 DSFVKGET--NSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGD 188
              V  +T  +   SL+F +L    D   + S + D++V +Y + S     L     A  
Sbjct: 107 AEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQA 166

Query: 189 YGYKLSVLSCNPLLSA-LAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLN 248
           +G+   VLS N +L A +  +      E ++KEM+  +VSPN+ T+NI+I G C AG ++
Sbjct: 167 HGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNID 226

Query: 249 KAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRV-------------------------- 308
            A  + D M+  G  PNVVTYNTLIDGYCK+ ++                          
Sbjct: 227 VALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVI 286

Query: 309 ------GKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVK 368
                 G+M +   +L EM     S ++VT+N LI G+CK+ N   AL +  EM   G+ 
Sbjct: 287 NGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLT 346

Query: 369 PTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEAREL 428
           P+VITY SLI+  C  G +N A   LD+M    L PN  TY  L++G+ +K  + EA  +
Sbjct: 347 PSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRV 406

Query: 429 FDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCR 488
             ++   G +P+V+T+N L++G+C +GKME+A  +   M  KG  PDV +Y+ +++GFCR
Sbjct: 407 LREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCR 466

Query: 489 EGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLT 548
              ++EA  +  EM ++G+K D +TY+ L+  +CE+R  K+A  L  EML  GL P   T
Sbjct: 467 SYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFT 526

Query: 549 YNILLNGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEML 590
           Y  L+N YC+EG+L  AL +  +M ++G   +VVTY+VLI G  ++ +  +A  LL ++ 
Sbjct: 527 YTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLF 586

BLAST of Tan0006665 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 315.8 bits (808), Expect = 1.0e-84
Identity = 188/602 (31.23%), Postives = 316/602 (52.49%), Query Frame = 0

Query: 24  TTQLPFCTYNSTSTASTSKDFDPEIISQLISRQ----RWSNLKSHFKFKTPVDFLHQLLG 83
           T   PF  Y+    +    +F  +I + +  R+    R S      KFKT  D L  +L 
Sbjct: 38  TDTRPFPDYSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT--DHLIWVLM 97

Query: 84  SEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSF-VKGETNS 143
               D  LVL +F+W++   + + N+E  C +++L   +K     +S + SF  + + N 
Sbjct: 98  KIKCDYRLVLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 157

Query: 144 TISLI--FHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSC 203
           T S +  F  L      + ++  + D+     V           F++  +YG  LSV SC
Sbjct: 158 TDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSC 217

Query: 204 NPLLSALAKE-NEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMK 263
           N  L+ L+K+  +      +++E     V  N+ ++NIVI+ +C+ G++ +A  ++  M+
Sbjct: 218 NVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLME 277

Query: 264 VWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILK------------------------- 323
           + G+ P+V++Y+T+++GYC+ G + K++K   ++K                         
Sbjct: 278 LKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLA 337

Query: 324 -------EMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLI 383
                  EM+   + P+ V +  LIDGFCK  +I AA K F EM S+ + P V+TY ++I
Sbjct: 338 EAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAII 397

Query: 384 NGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLA 443
           +GFC  G + EA  L  EM    L+P+ +T+  LINGYCK   +++A  + + + + G +
Sbjct: 398 SGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCS 457

Query: 444 PNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNL 503
           PNV+T+ TL+DG CK G ++ A  L   M   G  P++ TYN ++ G C+ G +EEA  L
Sbjct: 458 PNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKL 517

Query: 504 LNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCI 563
           + E E  GL AD VTY  L+ A+C+  E  KA  ++ EML +GL+P+ +T+N+L+NG+C+
Sbjct: 518 VGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCL 577

Query: 564 EGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTT 586
            G L     +   M  +G   N  T+N L++ YC +  L+ A  +  +M  +G+ P+  T
Sbjct: 578 HGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKT 635

BLAST of Tan0006665 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 312.0 bits (798), Expect = 1.5e-83
Identity = 176/549 (32.06%), Postives = 291/549 (53.01%), Query Frame = 0

Query: 44  FDPE-IISQLISRQRWSNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKE--LK 103
           FD E  I  +++  RW +L         +D+    L    V   L L++  W  K+  L+
Sbjct: 17  FDMEKSIYNILTIDRWGSLNH-------MDYRQARL--RLVHGKLALKFLKWVVKQPGLE 76

Query: 104 VNYNIELFCRLLNLLANAKCYPKIRSFLD--SFVKGETNSTISLIFHTLSVCGDQFCANS 163
            ++ ++L C   ++L  A+ Y   R  L   S + G+++     +  T  +C     +N 
Sbjct: 77  TDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSFVFGALMTTYRLCN----SNP 136

Query: 164 IIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKE 223
            + D+L+  Y++       LE F+  G YG+  SV +CN +L ++ K  E   V    KE
Sbjct: 137 SVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKE 196

Query: 224 MIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGR 283
           M++RK+ P++ TFNI+IN LC  G   K+  ++  M+  G+ P +VTYNT++  YCK GR
Sbjct: 197 MLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGR 256

Query: 284 VGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVIT 343
                 A  +L  M +  V  +  T+N+LI   C+   I+    +  +MR + + P  +T
Sbjct: 257 ---FKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVT 316

Query: 344 YNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIG 403
           YN+LINGF NEGK+  A  LL+EMLS  L PN +T+NALI+G+  +   +EA ++F  + 
Sbjct: 317 YNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMME 376

Query: 404 KQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKME 463
            +GL P+ +++  LLDG CK+ + + A      M   G      TY  +I G C+ G ++
Sbjct: 377 AKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLD 436

Query: 464 EAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILL 523
           EA  LLNEM   G+  D+VTY+ L++ +C+    K A  +V  +   GL P+ + Y+ L+
Sbjct: 437 EAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLI 496

Query: 524 NGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLI 583
              C  G L+ A+ + + M  EG   +  T+NVL+   C+ GK+ +A   +  M   G++
Sbjct: 497 YNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGIL 549

Query: 584 PNRTTYEII 588
           PN  +++ +
Sbjct: 557 PNTVSFDCL 549

BLAST of Tan0006665 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 2.4e-81
Identity = 162/515 (31.46%), Postives = 280/515 (54.37%), Query Frame = 0

Query: 98  ELKVNYNIELFCRLL-----------NLLANAKCYPKIRSFLDSFVKGETNSTISLIFHT 157
           ++KVN  I+LF  ++           N L +A    K    +  F KG   + I    +T
Sbjct: 48  DIKVNDAIDLFESMIQSRPLPTPIDFNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYT 107

Query: 158 LSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKE 217
           +++              ++  Y +  K         RA   GY+   ++ + L++    E
Sbjct: 108 MTI--------------MINCYCRKKKLLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLE 167

Query: 218 NEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTY 277
               +   +   M+  K  P+L+T + +INGLC  G++++A  +ID M  +GF P+ VTY
Sbjct: 168 GRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTY 227

Query: 278 NTLIDGYCKIGRVGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEE 337
             +++  CK G       A  + ++M    +  + V ++I+ID  CKD +   AL +F E
Sbjct: 228 GPVLNRLCKSGNSA---LALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNE 287

Query: 338 MRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKL 397
           M  +G+K  V+TY+SLI G CN+GK ++   +L EM+  N+ P+V+T++ALI+ + K+  
Sbjct: 288 MEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGK 347

Query: 398 LEEARELFDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNC 457
           L EA+EL++++  +G+AP+ IT+N+L+DG+CK   + EA  +  LM+ KG  PD+ TY+ 
Sbjct: 348 LLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSI 407

Query: 458 LIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRG 517
           LI  +C+  ++++   L  E+  +GL  + +TYN LV  +C+  +   A  L  EM+ RG
Sbjct: 408 LINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRG 467

Query: 518 LRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDAN 577
           + PS +TY ILL+G C  G L  AL + ++M+K      +  YN++I G C   K++DA 
Sbjct: 468 VPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAW 527

Query: 578 GLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDIE 602
            L   + +KG+ P+  TY ++   + +KG L + +
Sbjct: 528 SLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEAD 545

BLAST of Tan0006665 vs. NCBI nr
Match: XP_022156382.1 (pentatricopeptide repeat-containing protein At1g09820 [Momordica charantia] >XP_022156383.1 pentatricopeptide repeat-containing protein At1g09820 [Momordica charantia])

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 553/609 (90.80%), Postives = 583/609 (95.73%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+ RLP RI PAKIDRKY FSTT  P CTYNSTSTAS+S DFDP+IIS+LISRQ+WSN
Sbjct: 1   MNAVNRLPWRIFPAKIDRKYIFSTTHPPLCTYNSTSTASSSNDFDPQIISELISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFK+P+DFLH LLGS AVDPLLVLRYFNWSQKELKVNY+IELFCRLLNLLANAKC
Sbjct: 61  LKSHFKFKSPIDFLHLLLGSGAVDPLLVLRYFNWSQKELKVNYSIELFCRLLNLLANAKC 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPKIRSFLDSFVKGETN TISLIF+TLSVCGDQFCANSIIADMLVLAYVKNSKT+LGLEA
Sbjct: 121 YPKIRSFLDSFVKGETNCTISLIFYTLSVCGDQFCANSIIADMLVLAYVKNSKTSLGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKR+GDYGYKLSVLSCNPLLSAL KE+EFGDVEF+YKEMIRRKVSPNLITFNIVINGLCK
Sbjct: 181 FKRSGDYGYKLSVLSCNPLLSALVKESEFGDVEFVYKEMIRRKVSPNLITFNIVINGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            GKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCK+GRVGKMYKADAILKEM+AN+VSPN
Sbjct: 241 VGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVANKVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDENISAALKVFEEMR QG+KP+V+TYNSLING CNEGKLNEA  LLD
Sbjct: 301 DVTFNILIDGFCKDENISAALKVFEEMRIQGLKPSVVTYNSLINGLCNEGKLNEANTLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM +SNLKPNV+TYNALINGYCKKKLLEEARELFD IGKQGLAPNVITFNTLLDGYCK G
Sbjct: 361 EMSNSNLKPNVVTYNALINGYCKKKLLEEARELFDNIGKQGLAPNVITFNTLLDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM+EAFLL+SLML KG  PDVSTYNCLI GFC+EGKMEE +NLLNEME+RGLKADLVTYN
Sbjct: 421 KMDEAFLLRSLMLEKGLPPDVSTYNCLIVGFCKEGKMEEVENLLNEMEERGLKADLVTYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEKREPKKAAGL++EM HRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEK+
Sbjct: 481 ILISAWCEKREPKKAAGLIDEMFHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKD 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. NCBI nr
Match: XP_038891207.1 (pentatricopeptide repeat-containing protein At1g09820 [Benincasa hispida])

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 544/611 (89.03%), Postives = 573/611 (93.78%), Query Frame = 0

Query: 1   MNAIIRL--PSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRW 60
           MN IIR+   S I P KI RK+ FSTT LPFCTYNST TA TS D DP IIS LISRQ+W
Sbjct: 1   MNVIIRISSTSTIFPVKIYRKFIFSTTYLPFCTYNSTCTAPTSNDSDPLIISDLISRQQW 60

Query: 61  SNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANA 120
           S LKSHFKFK+PVDFLHQLLGS AVDPLLVLRYFNWSQKELKVNYNIELFCRL++LLANA
Sbjct: 61  STLKSHFKFKSPVDFLHQLLGSGAVDPLLVLRYFNWSQKELKVNYNIELFCRLIHLLANA 120

Query: 121 KCYPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGL 180
           KCYPKIRSFLDSFVKGETN +ISLIFHTLSVC DQFCANSIIAD+LVLAYVKNSKT LGL
Sbjct: 121 KCYPKIRSFLDSFVKGETNCSISLIFHTLSVCSDQFCANSIIADILVLAYVKNSKTVLGL 180

Query: 181 EAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGL 240
           EAFKRAGDYGYKLSVLSCNPLLSAL KENEFGD+EF+YKEMIRRKV+PNLITFNIVINGL
Sbjct: 181 EAFKRAGDYGYKLSVLSCNPLLSALVKENEFGDMEFVYKEMIRRKVNPNLITFNIVINGL 240

Query: 241 CKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVS 300
           CK GKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCK+G+VGKMYKADAILKEM+AN+VS
Sbjct: 241 CKVGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKMGKVGKMYKADAILKEMVANKVS 300

Query: 301 PNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVL 360
           PNDVTFN LIDGFCKDEN+ AALKVFEEM+SQG+KPTV+TYNSLING CNEGKLNEAKVL
Sbjct: 301 PNDVTFNTLIDGFCKDENVLAALKVFEEMQSQGLKPTVVTYNSLINGLCNEGKLNEAKVL 360

Query: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCK 420
           LD MLSSNLKPNVITYNALINGYCKKKLLEEARELFD I KQGLAPNVITFNTLLDGYCK
Sbjct: 361 LDGMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIRKQGLAPNVITFNTLLDGYCK 420

Query: 421 SGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVT 480
            GKMEEAFLLQ+LML KGFLPD+STYNCLI GFCREGKMEE K LLNEME RG+KAD VT
Sbjct: 421 CGKMEEAFLLQNLMLEKGFLPDISTYNCLIVGFCREGKMEEVKKLLNEMERRGVKADTVT 480

Query: 481 YNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQME 540
           YNIL+SAWCEKREPKKAA LV+EML RGL+PSHLT+NILLNGYC+EGNLRAALNVRKQME
Sbjct: 481 YNILISAWCEKREPKKAARLVDEMLERGLQPSHLTFNILLNGYCMEGNLRAALNVRKQME 540

Query: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLP 600
           KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPN+TTYEII+EEMMEKGFLP
Sbjct: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNQTTYEIIKEEMMEKGFLP 600

Query: 601 DIEGHLYQVSR 610
           DIEGHLYQVSR
Sbjct: 601 DIEGHLYQVSR 611

BLAST of Tan0006665 vs. NCBI nr
Match: XP_023536750.1 (pentatricopeptide repeat-containing protein At1g09820 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1109.0 bits (2867), Expect = 0.0e+00
Identity = 535/609 (87.85%), Postives = 570/609 (93.60%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+IR P RI  AKI R+YKFSTT LPFCT NST TA TS DFDP+IIS LISRQ+WSN
Sbjct: 1   MNAMIRFPPRIFTAKIGRRYKFSTTHLPFCTDNSTYTAPTSNDFDPQIISDLISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFKTP+DFLH+LLGS AVDPLLVLRYF WSQKELKVNYNIELFCRLLNLLANAK 
Sbjct: 61  LKSHFKFKTPIDFLHELLGSGAVDPLLVLRYFKWSQKELKVNYNIELFCRLLNLLANAKY 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPK+RSFLDSFVK ETN TISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA
Sbjct: 121 YPKMRSFLDSFVKRETNCTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKRAGDYGYKLSVLSCNPLL+AL KENEFGDVEF+YKEMIRRK SPNL TFNIVI+GLCK
Sbjct: 181 FKRAGDYGYKLSVLSCNPLLNALVKENEFGDVEFVYKEMIRRKFSPNLFTFNIVIHGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            G+LNKAGDV+DDMK+WGF PNVVTYNTLIDGYCK+GRVGKM+KADA+LKEM+AN VSPN
Sbjct: 241 IGRLNKAGDVLDDMKIWGFLPNVVTYNTLIDGYCKVGRVGKMFKADAMLKEMVANNVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDEN+SAA+KVF+EM+SQGVKPTV+TYNSLING CNEGKLNEA+VLLD
Sbjct: 301 DVTFNILIDGFCKDENVSAAMKVFDEMQSQGVKPTVVTYNSLINGLCNEGKLNEARVLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM S NLKPNV+TYNALINGYCKKKLLEEARELFD IGKQGL PNVITFNTL+DGYCK G
Sbjct: 361 EMSSLNLKPNVVTYNALINGYCKKKLLEEARELFDDIGKQGLDPNVITFNTLVDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM EAFL++SLM  KGFLP+VSTYNCLI GFCR GKMEEAKNLLNEME RGLKAD +TYN
Sbjct: 421 KMNEAFLIRSLMFEKGFLPNVSTYNCLITGFCRAGKMEEAKNLLNEMEGRGLKADTITYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEKRE KKAA L++EML RGLRPSHLT+NILLNGYC EGNLRAALNVRKQMEKE
Sbjct: 481 ILISAWCEKRESKKAARLIDEMLDRGLRPSHLTFNILLNGYCTEGNLRAALNVRKQMEKE 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLIQGYCRKGK+EDANGLLNEMLEKGLIPNRTTYEI+REEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIQGYCRKGKMEDANGLLNEMLEKGLIPNRTTYEIVREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. NCBI nr
Match: XP_022965643.1 (pentatricopeptide repeat-containing protein At1g09820 [Cucurbita maxima] >XP_022965644.1 pentatricopeptide repeat-containing protein At1g09820 [Cucurbita maxima] >XP_022965645.1 pentatricopeptide repeat-containing protein At1g09820 [Cucurbita maxima])

HSP 1 Score: 1102.4 bits (2850), Expect = 0.0e+00
Identity = 533/609 (87.52%), Postives = 569/609 (93.43%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+IR P RI  AKI R+YKFST  LPFCTY+ST TA TS DFDP+IIS LISRQ+WSN
Sbjct: 1   MNAMIRFPPRIFTAKIGRRYKFSTMHLPFCTYHSTCTAPTSNDFDPQIISDLISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFKTP+DFLH+LLGS AVDPLLVLRYF WSQKEL+VNYNIELFCRLLNLLANAK 
Sbjct: 61  LKSHFKFKTPIDFLHELLGSGAVDPLLVLRYFKWSQKELRVNYNIELFCRLLNLLANAKY 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPKIRSFLDSFVK ETN TISLIFHTLSVCGDQFCANSIIADMLVLAYV+NSKTALGLEA
Sbjct: 121 YPKIRSFLDSFVKRETNCTISLIFHTLSVCGDQFCANSIIADMLVLAYVRNSKTALGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKRAGDYGYKLSVLSCNPLL+AL KENEFG VEF+YKEMIRRK SPNL TFNIVI+GLCK
Sbjct: 181 FKRAGDYGYKLSVLSCNPLLNALVKENEFGGVEFVYKEMIRRKFSPNLFTFNIVIHGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            G+LNKAGDVIDDMKVWGF PN VTYNTLIDGYCK+GRVGKM+KADAILKEM+AN VSPN
Sbjct: 241 IGRLNKAGDVIDDMKVWGFLPNEVTYNTLIDGYCKVGRVGKMFKADAILKEMVANNVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDENISAA+KVF+EM+SQGVKPTV+TYNSLING CNEGKLNEA+VLLD
Sbjct: 301 DVTFNILIDGFCKDENISAAMKVFDEMQSQGVKPTVVTYNSLINGLCNEGKLNEARVLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM S NLKPNV+TYNALINGYCKKKLLEEARELFD IGKQGL PNVITFNTL+DGYCK G
Sbjct: 361 EMSSLNLKPNVVTYNALINGYCKKKLLEEARELFDDIGKQGLDPNVITFNTLVDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM+EAFL++SLM  +GFLP+VSTYNCLI GFCR GKMEEAKNLLNEME RGLKAD++TYN
Sbjct: 421 KMDEAFLIRSLMFERGFLPNVSTYNCLITGFCRVGKMEEAKNLLNEMEGRGLKADMITYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEKRE KKAA L++EML RGLRPSHLT+NILLNGYC EGNLRAALNVRKQMEKE
Sbjct: 481 ILISAWCEKRESKKAARLIDEMLDRGLRPSHLTFNILLNGYCTEGNLRAALNVRKQMEKE 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLIQGYCRKGK+E ANGLLNEMLEKGLIPNRTTYEI+REEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIQGYCRKGKMEVANGLLNEMLEKGLIPNRTTYEIVREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. NCBI nr
Match: KAG7021052.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 532/609 (87.36%), Postives = 568/609 (93.27%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+IR P RI  AKI R+YKFSTT L FCTYNST TA T  DFDP+IIS LISRQ+WSN
Sbjct: 1   MNAMIRFPPRIFTAKIGRRYKFSTTHLLFCTYNSTYTAPTLNDFDPQIISDLISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFKTP+DFLH+LLGS AVDPLLVLRYF WSQKELKVNYNIELFCRLLNLLANAK 
Sbjct: 61  LKSHFKFKTPIDFLHELLGSGAVDPLLVLRYFKWSQKELKVNYNIELFCRLLNLLANAKY 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPK+RSFLDSFVK ETN TISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKT+LGLEA
Sbjct: 121 YPKMRSFLDSFVKRETNCTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTSLGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKRAGDYGYKLSVLSCNPLL+AL KENEFGDVEF+YKEMIRRK SPNL TFNIVI+GLCK
Sbjct: 181 FKRAGDYGYKLSVLSCNPLLNALVKENEFGDVEFVYKEMIRRKFSPNLFTFNIVIHGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            G+LNKAGDV+DDMK+WGF PNVVTYNTLIDGYCK+GRVGKM+KADAILKEM+AN VSPN
Sbjct: 241 IGRLNKAGDVLDDMKIWGFLPNVVTYNTLIDGYCKVGRVGKMFKADAILKEMVANNVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDEN SAA+KVF+EM+SQGVKPTV+TYNSLING CNEGKLNEA+VLLD
Sbjct: 301 DVTFNILIDGFCKDENASAAMKVFDEMQSQGVKPTVVTYNSLINGLCNEGKLNEARVLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM S NLKPNV+TYNALINGYCKKKLLEEAR+LFD IGKQGL PNVITFNTL+DGYCK G
Sbjct: 361 EMSSLNLKPNVVTYNALINGYCKKKLLEEARDLFDDIGKQGLDPNVITFNTLVDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM EAFL++SLM  KGFLP+VSTYNCLI GFCREGKMEEAKNLLNEME RGLKAD +TYN
Sbjct: 421 KMNEAFLVRSLMFEKGFLPNVSTYNCLITGFCREGKMEEAKNLLNEMEGRGLKADTITYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEK E KKAA L++EML RGLRPSHLT+NILLNGYC EGNLRAALNVRKQMEKE
Sbjct: 481 ILISAWCEKGESKKAARLIDEMLDRGLRPSHLTFNILLNGYCTEGNLRAALNVRKQMEKE 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLI+GYCRKGK+EDANGLLNEMLEKGLIPNRTTYEI+REEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIRGYCRKGKMEDANGLLNEMLEKGLIPNRTTYEIVREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. ExPASy TrEMBL
Match: A0A6J1DRY0 (pentatricopeptide repeat-containing protein At1g09820 OS=Momordica charantia OX=3673 GN=LOC111023289 PE=4 SV=1)

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 553/609 (90.80%), Postives = 583/609 (95.73%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+ RLP RI PAKIDRKY FSTT  P CTYNSTSTAS+S DFDP+IIS+LISRQ+WSN
Sbjct: 1   MNAVNRLPWRIFPAKIDRKYIFSTTHPPLCTYNSTSTASSSNDFDPQIISELISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFK+P+DFLH LLGS AVDPLLVLRYFNWSQKELKVNY+IELFCRLLNLLANAKC
Sbjct: 61  LKSHFKFKSPIDFLHLLLGSGAVDPLLVLRYFNWSQKELKVNYSIELFCRLLNLLANAKC 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPKIRSFLDSFVKGETN TISLIF+TLSVCGDQFCANSIIADMLVLAYVKNSKT+LGLEA
Sbjct: 121 YPKIRSFLDSFVKGETNCTISLIFYTLSVCGDQFCANSIIADMLVLAYVKNSKTSLGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKR+GDYGYKLSVLSCNPLLSAL KE+EFGDVEF+YKEMIRRKVSPNLITFNIVINGLCK
Sbjct: 181 FKRSGDYGYKLSVLSCNPLLSALVKESEFGDVEFVYKEMIRRKVSPNLITFNIVINGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            GKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCK+GRVGKMYKADAILKEM+AN+VSPN
Sbjct: 241 VGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVANKVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDENISAALKVFEEMR QG+KP+V+TYNSLING CNEGKLNEA  LLD
Sbjct: 301 DVTFNILIDGFCKDENISAALKVFEEMRIQGLKPSVVTYNSLINGLCNEGKLNEANTLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM +SNLKPNV+TYNALINGYCKKKLLEEARELFD IGKQGLAPNVITFNTLLDGYCK G
Sbjct: 361 EMSNSNLKPNVVTYNALINGYCKKKLLEEARELFDNIGKQGLAPNVITFNTLLDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM+EAFLL+SLML KG  PDVSTYNCLI GFC+EGKMEE +NLLNEME+RGLKADLVTYN
Sbjct: 421 KMDEAFLLRSLMLEKGLPPDVSTYNCLIVGFCKEGKMEEVENLLNEMEERGLKADLVTYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEKREPKKAAGL++EM HRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEK+
Sbjct: 481 ILISAWCEKREPKKAAGLIDEMFHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKD 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. ExPASy TrEMBL
Match: A0A6J1HPK5 (pentatricopeptide repeat-containing protein At1g09820 OS=Cucurbita maxima OX=3661 GN=LOC111465480 PE=4 SV=1)

HSP 1 Score: 1102.4 bits (2850), Expect = 0.0e+00
Identity = 533/609 (87.52%), Postives = 569/609 (93.43%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MNA+IR P RI  AKI R+YKFST  LPFCTY+ST TA TS DFDP+IIS LISRQ+WSN
Sbjct: 1   MNAMIRFPPRIFTAKIGRRYKFSTMHLPFCTYHSTCTAPTSNDFDPQIISDLISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFKTP+DFLH+LLGS AVDPLLVLRYF WSQKEL+VNYNIELFCRLLNLLANAK 
Sbjct: 61  LKSHFKFKTPIDFLHELLGSGAVDPLLVLRYFKWSQKELRVNYNIELFCRLLNLLANAKY 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPKIRSFLDSFVK ETN TISLIFHTLSVCGDQFCANSIIADMLVLAYV+NSKTALGLEA
Sbjct: 121 YPKIRSFLDSFVKRETNCTISLIFHTLSVCGDQFCANSIIADMLVLAYVRNSKTALGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKRAGDYGYKLSVLSCNPLL+AL KENEFG VEF+YKEMIRRK SPNL TFNIVI+GLCK
Sbjct: 181 FKRAGDYGYKLSVLSCNPLLNALVKENEFGGVEFVYKEMIRRKFSPNLFTFNIVIHGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            G+LNKAGDVIDDMKVWGF PN VTYNTLIDGYCK+GRVGKM+KADAILKEM+AN VSPN
Sbjct: 241 IGRLNKAGDVIDDMKVWGFLPNEVTYNTLIDGYCKVGRVGKMFKADAILKEMVANNVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDENISAA+KVF+EM+SQGVKPTV+TYNSLING CNEGKLNEA+VLLD
Sbjct: 301 DVTFNILIDGFCKDENISAAMKVFDEMQSQGVKPTVVTYNSLINGLCNEGKLNEARVLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM S NLKPNV+TYNALINGYCKKKLLEEARELFD IGKQGL PNVITFNTL+DGYCK G
Sbjct: 361 EMSSLNLKPNVVTYNALINGYCKKKLLEEARELFDDIGKQGLDPNVITFNTLVDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM+EAFL++SLM  +GFLP+VSTYNCLI GFCR GKMEEAKNLLNEME RGLKAD++TYN
Sbjct: 421 KMDEAFLIRSLMFERGFLPNVSTYNCLITGFCRVGKMEEAKNLLNEMEGRGLKADMITYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEKRE KKAA L++EML RGLRPSHLT+NILLNGYC EGNLRAALNVRKQMEKE
Sbjct: 481 ILISAWCEKRESKKAARLIDEMLDRGLRPSHLTFNILLNGYCTEGNLRAALNVRKQMEKE 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLIQGYCRKGK+E ANGLLNEMLEKGLIPNRTTYEI+REEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIQGYCRKGKMEVANGLLNEMLEKGLIPNRTTYEIVREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. ExPASy TrEMBL
Match: A0A6J1FI86 (pentatricopeptide repeat-containing protein At1g09820 OS=Cucurbita moschata OX=3662 GN=LOC111444357 PE=4 SV=1)

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 529/609 (86.86%), Postives = 565/609 (92.78%), Query Frame = 0

Query: 1   MNAIIRLPSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRWSN 60
           MN +I     I  AK  R+YKFSTT LPFCTYNST TA TS DFDP+IIS LISRQ+WSN
Sbjct: 1   MNGMISFGPIIFTAKFGRRYKFSTTHLPFCTYNSTYTAPTSNDFDPQIISDLISRQQWSN 60

Query: 61  LKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKC 120
           LKSHFKFKTP+DFLH+LLGS AVDPLLVLRYF WSQKELKVNYNIELFCRLLNLLANAK 
Sbjct: 61  LKSHFKFKTPIDFLHELLGSGAVDPLLVLRYFKWSQKELKVNYNIELFCRLLNLLANAKY 120

Query: 121 YPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180
           YPK+RSFLDSFVK ETN TISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA
Sbjct: 121 YPKMRSFLDSFVKRETNCTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEA 180

Query: 181 FKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCK 240
           FKRAGDYGYKLSVLSCNPLL+AL KENEFGDVEF+YKEMIRRK SPNL TFNIVI+GLCK
Sbjct: 181 FKRAGDYGYKLSVLSCNPLLNALVKENEFGDVEFVYKEMIRRKFSPNLFTFNIVIHGLCK 240

Query: 241 AGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVSPN 300
            G+LNKAGDV+DDMK+WGF PNVVTYNTLIDGYCK+GRVGKM+KADAILKEM+AN VSPN
Sbjct: 241 IGRLNKAGDVLDDMKIWGFLPNVVTYNTLIDGYCKVGRVGKMFKADAILKEMVANNVSPN 300

Query: 301 DVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVLLD 360
           DVTFNILIDGFCKDEN SAA+KVF+EM+SQGVKPTV+TYNSLING CNEGKLNEA+VLLD
Sbjct: 301 DVTFNILIDGFCKDENASAAMKVFDEMQSQGVKPTVVTYNSLINGLCNEGKLNEARVLLD 360

Query: 361 EMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCKSG 420
           EM S NLKPNV+TYNALINGYCKKKLLEEAR+LFD IGKQGL PNVITFNTL+DGYCK G
Sbjct: 361 EMSSLNLKPNVVTYNALINGYCKKKLLEEARDLFDDIGKQGLDPNVITFNTLVDGYCKCG 420

Query: 421 KMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVTYN 480
           KM EAFL++SLM  KGFLP+VSTYNCLI GFCREGKMEEAKNLLNEME RGLKAD +TYN
Sbjct: 421 KMNEAFLVRSLMFEKGFLPNVSTYNCLITGFCREGKMEEAKNLLNEMEGRGLKADTITYN 480

Query: 481 ILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQMEKE 540
           IL+SAWCEK E KKAA L++EML RGL+PSHLT+NILLNGYC EGNLRAALNVRKQMEKE
Sbjct: 481 ILISAWCEKGESKKAARLIDEMLDRGLQPSHLTFNILLNGYCTEGNLRAALNVRKQMEKE 540

Query: 541 GRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDI 600
           GRWANVVTYNVLI+GYCRKGK+EDANGLLNEMLEKGLIPNRTTYEI+REEMMEKGFLPDI
Sbjct: 541 GRWANVVTYNVLIRGYCRKGKMEDANGLLNEMLEKGLIPNRTTYEIVREEMMEKGFLPDI 600

Query: 601 EGHLYQVSR 610
           EGHLYQVSR
Sbjct: 601 EGHLYQVSR 609

BLAST of Tan0006665 vs. ExPASy TrEMBL
Match: A0A5D3BCX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G001150 PE=4 SV=1)

HSP 1 Score: 1078.5 bits (2788), Expect = 0.0e+00
Identity = 527/610 (86.39%), Postives = 558/610 (91.48%), Query Frame = 0

Query: 1   MNAIIRL--PSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRW 60
           MN IIRL   SRI P KIDR Y FSTT LPFCTYNST TA TS DFDP IIS LISRQRW
Sbjct: 1   MNVIIRLSSASRIFPVKIDRNYIFSTTHLPFCTYNSTCTAPTSNDFDPLIISDLISRQRW 60

Query: 61  SNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANA 120
           S LKSH KFK+P+DFLHQL+ S AVDPLLVLRYFNWS++ELKVNY+IEL CRLL+LLAN 
Sbjct: 61  SILKSHVKFKSPIDFLHQLMCSGAVDPLLVLRYFNWSRRELKVNYSIELICRLLHLLANV 120

Query: 121 KCYPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGL 180
           K YPKIRS LDSFVKGETN +ISLIFH+LSVC DQFCANSIIADMLVLAYV+NSKT LGL
Sbjct: 121 KYYPKIRSVLDSFVKGETNCSISLIFHSLSVCSDQFCANSIIADMLVLAYVQNSKTVLGL 180

Query: 181 EAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGL 240
           EAFKRAGDY YKLSVLSCNPLLSAL KE+E GDVEF+YKEMIRRK+SPNLITFNIVINGL
Sbjct: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKESEVGDVEFVYKEMIRRKISPNLITFNIVINGL 240

Query: 241 CKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVS 300
           CK GKLNKAGDVIDDMKVWGFWPN VTYNTLIDGYCK+GRVGKMYKADAILKEM+ N+VS
Sbjct: 241 CKVGKLNKAGDVIDDMKVWGFWPNAVTYNTLIDGYCKMGRVGKMYKADAILKEMVGNKVS 300

Query: 301 PNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVL 360
           PN VTFN+LIDGFCKDEN+S ALKVFEEM+SQG+KPTV+TYNSLING CNEGKLNEAKVL
Sbjct: 301 PNIVTFNVLIDGFCKDENVSGALKVFEEMQSQGLKPTVVTYNSLINGMCNEGKLNEAKVL 360

Query: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCK 420
           LDEMLSSNLKPNVITYNALINGYCKKK LEEARELFD IGKQGL PNVITFNTLLDGYCK
Sbjct: 361 LDEMLSSNLKPNVITYNALINGYCKKKKLEEARELFDNIGKQGLTPNVITFNTLLDGYCK 420

Query: 421 SGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVT 480
            GKMEEAFLLQ +ML KGFLPDVSTYNCLI GFCREGKMEE KNLLNEME RG+KAD VT
Sbjct: 421 CGKMEEAFLLQKVMLEKGFLPDVSTYNCLIVGFCREGKMEEVKNLLNEMECRGVKADTVT 480

Query: 481 YNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQME 540
           YNIL+SAWCEK+EPKKAA L++EML RGL+PSHLTYNILLNGYC+EGNLRAALN+RKQME
Sbjct: 481 YNILISAWCEKKEPKKAARLIDEMLDRGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540

Query: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLP 600
           KE  WANVVTYNVLI GYCRKGKLEDANGLLNEMLEKGLIPNRTTYEII+ EMMEKGFLP
Sbjct: 541 KERIWANVVTYNVLILGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKVEMMEKGFLP 600

Query: 601 DIEGHLYQVS 609
           DIEGHLY  S
Sbjct: 601 DIEGHLYHAS 610

BLAST of Tan0006665 vs. ExPASy TrEMBL
Match: A0A5A7U323 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold385G001350 PE=4 SV=1)

HSP 1 Score: 1078.2 bits (2787), Expect = 0.0e+00
Identity = 527/610 (86.39%), Postives = 558/610 (91.48%), Query Frame = 0

Query: 1   MNAIIRL--PSRISPAKIDRKYKFSTTQLPFCTYNSTSTASTSKDFDPEIISQLISRQRW 60
           MN IIRL   SRI P KIDR Y FSTT L FCTYNST TA TS DFDP IIS LISRQRW
Sbjct: 1   MNVIIRLSSASRIFPVKIDRNYIFSTTHLSFCTYNSTCTAPTSNDFDPLIISDLISRQRW 60

Query: 61  SNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANA 120
           S LKSH KFK+P+DFLHQL+ S AVDPLLVLRYFNWS++ELKVNY+IEL CRLL+LLAN 
Sbjct: 61  SILKSHVKFKSPIDFLHQLMCSGAVDPLLVLRYFNWSRRELKVNYSIELICRLLHLLANV 120

Query: 121 KCYPKIRSFLDSFVKGETNSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGL 180
           K YPKIRS LDSFVKGETN +ISLIFH+LSVC DQFCANSIIADMLVLAYV+NSKT LGL
Sbjct: 121 KYYPKIRSVLDSFVKGETNCSISLIFHSLSVCSDQFCANSIIADMLVLAYVQNSKTVLGL 180

Query: 181 EAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGL 240
           EAFKRAGDY YKLSVLSCNPLLSAL KE+EFGDVEF+YKEMIRRK+SPNLITFNIVINGL
Sbjct: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKESEFGDVEFVYKEMIRRKISPNLITFNIVINGL 240

Query: 241 CKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILKEMMANEVS 300
           CK GKLNKAGDVIDDMKVWGFWPN VTYNTLIDGYCK+GRVGKMYKADAILKEM+ N+VS
Sbjct: 241 CKVGKLNKAGDVIDDMKVWGFWPNAVTYNTLIDGYCKMGRVGKMYKADAILKEMVGNKVS 300

Query: 301 PNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLINGFCNEGKLNEAKVL 360
           PN VTFN+LIDGFCKDEN+S ALKVFEEM+SQG+KPTV+TYNSLING CNEGKLNEAKVL
Sbjct: 301 PNIVTFNVLIDGFCKDENVSGALKVFEEMQSQGLKPTVVTYNSLINGMCNEGKLNEAKVL 360

Query: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLAPNVITFNTLLDGYCK 420
           LDEMLSSNLKPNVITYNALINGYCKKK LEEARELFD IGKQGL PNVITFNTLLDGYCK
Sbjct: 361 LDEMLSSNLKPNVITYNALINGYCKKKKLEEARELFDNIGKQGLTPNVITFNTLLDGYCK 420

Query: 421 SGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNLLNEMEDRGLKADLVT 480
            GKMEEAFLLQ +ML KGFLPDVSTYNCLI GFCREGKMEE KNLLNEME RG+KAD VT
Sbjct: 421 CGKMEEAFLLQKVMLEKGFLPDVSTYNCLIVGFCREGKMEEVKNLLNEMECRGVKADTVT 480

Query: 481 YNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCIEGNLRAALNVRKQME 540
           YNIL+SAWCEK+EPKKAA L++EML RGL+PSHLTYNILLNGYC+EGNLRAALN+RKQME
Sbjct: 481 YNILISAWCEKKEPKKAARLIDEMLDRGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540

Query: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIREEMMEKGFLP 600
           KE  WANVVTYNVLI GYCRKGKLEDANGLLNEMLEKGLIPNRTTYEII+ EMMEKGFLP
Sbjct: 541 KERIWANVVTYNVLILGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKVEMMEKGFLP 600

Query: 601 DIEGHLYQVS 609
           DIEGHLY  S
Sbjct: 601 DIEGHLYHAS 610

BLAST of Tan0006665 vs. TAIR 10
Match: AT1G09820.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 689.5 bits (1778), Expect = 2.4e-198
Identity = 337/581 (58.00%), Postives = 433/581 (74.53%), Query Frame = 0

Query: 30  CTYNSTSTASTSKD-FDPEIISQLISRQRWSNLKSHFKFKTPVDFLHQLLGSEAVDPLLV 89
           C+ +ST T S     +D  +I+ LI +Q WS L  H     P +   QL+ SE +DP L 
Sbjct: 26  CSSSSTITGSPCPPRYDVAVIADLIEKQHWSKLGVHVTDINPNELFRQLISSE-LDPDLC 85

Query: 90  LRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSFVKGETNSTISLIFHTLS 149
           LRY++W  K   ++ ++EL  +LL+ LANAK Y KIRSFLD FV+  ++  +  IFH +S
Sbjct: 86  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHSIFHAIS 145

Query: 150 VCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKENE 209
           +C D  C NSIIADMLVLAY  NS+  LG EAFKR+G YGYKLS LSC PL+ AL KEN 
Sbjct: 146 MC-DNVCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKENR 205

Query: 210 FGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTYNT 269
             DVE++YKEMIRRK+ PN+ TFN+VIN LCK GK+NKA DV++DMKV+G  PNVV+YNT
Sbjct: 206 SADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSYNT 265

Query: 270 LIDGYCKIGRVGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMR 329
           LIDGYCK+G  GKMYKADA+LKEM+ N+VSPN  TFNILIDGF KD+N+  ++KVF+EM 
Sbjct: 266 LIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKEML 325

Query: 330 SQGVKPTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLE 389
            Q VKP VI+YNSLING CN GK++EA  + D+M+S+ ++PN+ITYNALING+CK  +L+
Sbjct: 326 DQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDMLK 385

Query: 390 EARELFDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLI 449
           EA ++F  +  QG  P    +N L+D YCK GK+++ F L+  M  +G +PDV TYNCLI
Sbjct: 386 EALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMEREGIVPDVGTYNCLI 445

Query: 450 AGFCREGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLR 509
           AG CR G +E AK L +++  +GL  DLVT++IL+  +C K E +KAA L+ EM   GL+
Sbjct: 446 AGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESRKAAMLLKEMSKMGLK 505

Query: 510 PSHLTYNILLNGYCIEGNLRAALNVRKQMEKEGRW-ANVVTYNVLIQGYCRKGKLEDANG 569
           P HLTYNI++ GYC EGNL+AA N+R QMEKE R   NV +YNVL+QGY +KGKLEDAN 
Sbjct: 506 PRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKLEDANM 565

Query: 570 LLNEMLEKGLIPNRTTYEIIREEMMEKGFLPDIEGHLYQVS 609
           LLNEMLEKGL+PNR TYEI++EEM+++GF+PDIEGHL+ VS
Sbjct: 566 LLNEMLEKGLVPNRITYEIVKEEMVDQGFVPDIEGHLFNVS 603

BLAST of Tan0006665 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 342.0 bits (876), Expect = 9.6e-94
Identity = 192/556 (34.53%), Postives = 299/556 (53.78%), Query Frame = 0

Query: 69  TPVDFLHQLLGSEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFL 128
           TP    + LL S+  D  L+L++ NW+       + +   C  L++L   K Y K    L
Sbjct: 47  TPEAASNLLLKSQN-DQALILKFLNWANPH--QFFTLRCKCITLHILTKFKLY-KTAQIL 106

Query: 129 DSFVKGET--NSTISLIFHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGD 188
              V  +T  +   SL+F +L    D   + S + D++V +Y + S     L     A  
Sbjct: 107 AEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQA 166

Query: 189 YGYKLSVLSCNPLLSA-LAKENEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLN 248
           +G+   VLS N +L A +  +      E ++KEM+  +VSPN+ T+NI+I G C AG ++
Sbjct: 167 HGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNID 226

Query: 249 KAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGRV-------------------------- 308
            A  + D M+  G  PNVVTYNTLIDGYCK+ ++                          
Sbjct: 227 VALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVI 286

Query: 309 ------GKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVK 368
                 G+M +   +L EM     S ++VT+N LI G+CK+ N   AL +  EM   G+ 
Sbjct: 287 NGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLT 346

Query: 369 PTVITYNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEAREL 428
           P+VITY SLI+  C  G +N A   LD+M    L PN  TY  L++G+ +K  + EA  +
Sbjct: 347 PSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRV 406

Query: 429 FDKIGKQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCR 488
             ++   G +P+V+T+N L++G+C +GKME+A  +   M  KG  PDV +Y+ +++GFCR
Sbjct: 407 LREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCR 466

Query: 489 EGKMEEAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLT 548
              ++EA  +  EM ++G+K D +TY+ L+  +CE+R  K+A  L  EML  GL P   T
Sbjct: 467 SYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFT 526

Query: 549 YNILLNGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEML 590
           Y  L+N YC+EG+L  AL +  +M ++G   +VVTY+VLI G  ++ +  +A  LL ++ 
Sbjct: 527 YTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLF 586

BLAST of Tan0006665 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 315.8 bits (808), Expect = 7.3e-86
Identity = 188/602 (31.23%), Postives = 316/602 (52.49%), Query Frame = 0

Query: 24  TTQLPFCTYNSTSTASTSKDFDPEIISQLISRQ----RWSNLKSHFKFKTPVDFLHQLLG 83
           T   PF  Y+    +    +F  +I + +  R+    R S      KFKT  D L  +L 
Sbjct: 38  TDTRPFPDYSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT--DHLIWVLM 97

Query: 84  SEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSF-VKGETNS 143
               D  LVL +F+W++   + + N+E  C +++L   +K     +S + SF  + + N 
Sbjct: 98  KIKCDYRLVLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 157

Query: 144 TISLI--FHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSC 203
           T S +  F  L      + ++  + D+     V           F++  +YG  LSV SC
Sbjct: 158 TDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSC 217

Query: 204 NPLLSALAKE-NEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMK 263
           N  L+ L+K+  +      +++E     V  N+ ++NIVI+ +C+ G++ +A  ++  M+
Sbjct: 218 NVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLME 277

Query: 264 VWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILK------------------------- 323
           + G+ P+V++Y+T+++GYC+ G + K++K   ++K                         
Sbjct: 278 LKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLA 337

Query: 324 -------EMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLI 383
                  EM+   + P+ V +  LIDGFCK  +I AA K F EM S+ + P V+TY ++I
Sbjct: 338 EAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAII 397

Query: 384 NGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLA 443
           +GFC  G + EA  L  EM    L+P+ +T+  LINGYCK   +++A  + + + + G +
Sbjct: 398 SGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCS 457

Query: 444 PNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNL 503
           PNV+T+ TL+DG CK G ++ A  L   M   G  P++ TYN ++ G C+ G +EEA  L
Sbjct: 458 PNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKL 517

Query: 504 LNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCI 563
           + E E  GL AD VTY  L+ A+C+  E  KA  ++ EML +GL+P+ +T+N+L+NG+C+
Sbjct: 518 VGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCL 577

Query: 564 EGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTT 586
            G L     +   M  +G   N  T+N L++ YC +  L+ A  +  +M  +G+ P+  T
Sbjct: 578 HGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKT 635

BLAST of Tan0006665 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 315.8 bits (808), Expect = 7.3e-86
Identity = 188/602 (31.23%), Postives = 316/602 (52.49%), Query Frame = 0

Query: 24  TTQLPFCTYNSTSTASTSKDFDPEIISQLISRQ----RWSNLKSHFKFKTPVDFLHQLLG 83
           T   PF  Y+    +    +F  +I + +  R+    R S      KFKT  D L  +L 
Sbjct: 38  TDTRPFPDYSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT--DHLIWVLM 97

Query: 84  SEAVDPLLVLRYFNWSQKELKVNYNIELFCRLLNLLANAKCYPKIRSFLDSF-VKGETNS 143
               D  LVL +F+W++   + + N+E  C +++L   +K     +S + SF  + + N 
Sbjct: 98  KIKCDYRLVLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 157

Query: 144 TISLI--FHTLSVCGDQFCANSIIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSC 203
           T S +  F  L      + ++  + D+     V           F++  +YG  LSV SC
Sbjct: 158 TDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSC 217

Query: 204 NPLLSALAKE-NEFGDVEFIYKEMIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMK 263
           N  L+ L+K+  +      +++E     V  N+ ++NIVI+ +C+ G++ +A  ++  M+
Sbjct: 218 NVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLME 277

Query: 264 VWGFWPNVVTYNTLIDGYCKIGRVGKMYKADAILK------------------------- 323
           + G+ P+V++Y+T+++GYC+ G + K++K   ++K                         
Sbjct: 278 LKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLA 337

Query: 324 -------EMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVITYNSLI 383
                  EM+   + P+ V +  LIDGFCK  +I AA K F EM S+ + P V+TY ++I
Sbjct: 338 EAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAII 397

Query: 384 NGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIGKQGLA 443
           +GFC  G + EA  L  EM    L+P+ +T+  LINGYCK   +++A  + + + + G +
Sbjct: 398 SGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCS 457

Query: 444 PNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKMEEAKNL 503
           PNV+T+ TL+DG CK G ++ A  L   M   G  P++ TYN ++ G C+ G +EEA  L
Sbjct: 458 PNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKL 517

Query: 504 LNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILLNGYCI 563
           + E E  GL AD VTY  L+ A+C+  E  KA  ++ EML +GL+P+ +T+N+L+NG+C+
Sbjct: 518 VGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCL 577

Query: 564 EGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTT 586
            G L     +   M  +G   N  T+N L++ YC +  L+ A  +  +M  +G+ P+  T
Sbjct: 578 HGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKT 635

BLAST of Tan0006665 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 312.0 bits (798), Expect = 1.1e-84
Identity = 176/549 (32.06%), Postives = 291/549 (53.01%), Query Frame = 0

Query: 44  FDPE-IISQLISRQRWSNLKSHFKFKTPVDFLHQLLGSEAVDPLLVLRYFNWSQKE--LK 103
           FD E  I  +++  RW +L         +D+    L    V   L L++  W  K+  L+
Sbjct: 57  FDMEKSIYNILTIDRWGSLNH-------MDYRQARL--RLVHGKLALKFLKWVVKQPGLE 116

Query: 104 VNYNIELFCRLLNLLANAKCYPKIRSFLD--SFVKGETNSTISLIFHTLSVCGDQFCANS 163
            ++ ++L C   ++L  A+ Y   R  L   S + G+++     +  T  +C     +N 
Sbjct: 117 TDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSFVFGALMTTYRLCN----SNP 176

Query: 164 IIADMLVLAYVKNSKTALGLEAFKRAGDYGYKLSVLSCNPLLSALAKENEFGDVEFIYKE 223
            + D+L+  Y++       LE F+  G YG+  SV +CN +L ++ K  E   V    KE
Sbjct: 177 SVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKE 236

Query: 224 MIRRKVSPNLITFNIVINGLCKAGKLNKAGDVIDDMKVWGFWPNVVTYNTLIDGYCKIGR 283
           M++RK+ P++ TFNI+IN LC  G   K+  ++  M+  G+ P +VTYNT++  YCK GR
Sbjct: 237 MLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGR 296

Query: 284 VGKMYKADAILKEMMANEVSPNDVTFNILIDGFCKDENISAALKVFEEMRSQGVKPTVIT 343
                 A  +L  M +  V  +  T+N+LI   C+   I+    +  +MR + + P  +T
Sbjct: 297 ---FKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVT 356

Query: 344 YNSLINGFCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDKIG 403
           YN+LINGF NEGK+  A  LL+EMLS  L PN +T+NALI+G+  +   +EA ++F  + 
Sbjct: 357 YNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMME 416

Query: 404 KQGLAPNVITFNTLLDGYCKSGKMEEAFLLQSLMLGKGFLPDVSTYNCLIAGFCREGKME 463
            +GL P+ +++  LLDG CK+ + + A      M   G      TY  +I G C+ G ++
Sbjct: 417 AKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLD 476

Query: 464 EAKNLLNEMEDRGLKADLVTYNILVSAWCEKREPKKAAGLVNEMLHRGLRPSHLTYNILL 523
           EA  LLNEM   G+  D+VTY+ L++ +C+    K A  +V  +   GL P+ + Y+ L+
Sbjct: 477 EAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLI 536

Query: 524 NGYCIEGNLRAALNVRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLI 583
              C  G L+ A+ + + M  EG   +  T+NVL+   C+ GK+ +A   +  M   G++
Sbjct: 537 YNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGIL 589

Query: 584 PNRTTYEII 588
           PN  +++ +
Sbjct: 597 PNTVSFDCL 589

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O045043.4e-19758.00Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX... [more]
Q9FIX31.3e-9234.53Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q0WVK71.0e-8431.23Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Q9LVQ51.5e-8332.06Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX... [more]
Q6NQ832.4e-8131.46Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_022156382.10.0e+0090.80pentatricopeptide repeat-containing protein At1g09820 [Momordica charantia] >XP_... [more]
XP_038891207.10.0e+0089.03pentatricopeptide repeat-containing protein At1g09820 [Benincasa hispida][more]
XP_023536750.10.0e+0087.85pentatricopeptide repeat-containing protein At1g09820 [Cucurbita pepo subsp. pep... [more]
XP_022965643.10.0e+0087.52pentatricopeptide repeat-containing protein At1g09820 [Cucurbita maxima] >XP_022... [more]
KAG7021052.10.0e+0087.36Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A6J1DRY00.0e+0090.80pentatricopeptide repeat-containing protein At1g09820 OS=Momordica charantia OX=... [more]
A0A6J1HPK50.0e+0087.52pentatricopeptide repeat-containing protein At1g09820 OS=Cucurbita maxima OX=366... [more]
A0A6J1FI860.0e+0086.86pentatricopeptide repeat-containing protein At1g09820 OS=Cucurbita moschata OX=3... [more]
A0A5D3BCX20.0e+0086.39Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5A7U3230.0e+0086.39Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT1G09820.12.4e-19858.00Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G39710.19.6e-9434.53Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G05670.17.3e-8631.23Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G05670.27.3e-8631.23Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G55840.11.1e-8432.06Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 351..454
e-value: 1.4E-36
score: 127.6
coord: 152..278
e-value: 3.5E-29
score: 103.4
coord: 279..350
e-value: 8.2E-24
score: 85.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 468..538
e-value: 7.3E-18
score: 66.8
coord: 539..604
e-value: 7.1E-13
score: 50.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 195..223
e-value: 0.1
score: 12.9
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 505..538
e-value: 1.4E-8
score: 34.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 299..348
e-value: 1.3E-19
score: 70.1
coord: 369..418
e-value: 8.2E-19
score: 67.6
coord: 545..587
e-value: 6.2E-15
score: 55.1
coord: 439..487
e-value: 4.6E-18
score: 65.2
coord: 226..275
e-value: 9.1E-18
score: 64.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 477..510
e-value: 1.0E-6
score: 26.5
coord: 443..475
e-value: 2.0E-10
score: 38.2
coord: 513..541
e-value: 3.8E-6
score: 24.7
coord: 229..261
e-value: 1.2E-7
score: 29.5
coord: 195..227
e-value: 6.7E-6
score: 24.0
coord: 337..371
e-value: 2.7E-10
score: 37.8
coord: 407..441
e-value: 8.4E-9
score: 33.1
coord: 264..300
e-value: 1.5E-6
score: 26.0
coord: 372..406
e-value: 1.8E-9
score: 35.2
coord: 547..580
e-value: 3.5E-10
score: 37.4
coord: 302..336
e-value: 5.5E-11
score: 39.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 475..509
score: 12.101333
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 192..226
score: 9.722731
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 370..404
score: 13.778412
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 262..299
score: 11.553267
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 227..261
score: 11.8273
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 335..369
score: 13.383805
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 545..579
score: 14.600509
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 510..544
score: 10.095415
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 300..334
score: 13.59207
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 405..439
score: 12.780933
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 440..474
score: 14.01956
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 38..604
NoneNo IPR availablePANTHERPTHR47932:SF2OS10G0484300 PROTEINcoord: 38..604
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 310..540

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0006665.1Tan0006665.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding