Tan0021339 (gene) Snake gourd v1

Overview
Name: Tan0021339
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG08: 15720343 .. 15722808 (+)
RNA-Seq Expression: Tan0021339
Synteny: Tan0021339
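The Location field above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive (the GFF convention), which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a string like 'LG08: 15720343 .. 15722808 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF-style).
    return end - start + 1


seqid, start, end, strand = parse_location("LG08: 15720343 .. 15722808 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 2466 bp on the plus strand of LG08.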
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCTGCCGGCTTAGGGGACTTAACAGAATCCATTGGTTGAAATGGCCAGCGATGAGTTTGAAATTGGTTTTTTGGTTTCGTCATTACTTTAATTCTCTGTTTCGAATTCTATTTTCTCAGGCGCCAAACACGAGCTGGGAAGAATCATCACTTTCTTTCACCATCTACATTATTGAATAAGGTGGTATAAGATGAGATGGATGAATCTGGGCAGCGGATGCTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTCTGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAAGCCATGTTGCTACCACTCTTCAAATGATTCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAAAATGGTTCTATTTTGGATTCGGGAAAGTTCGTTTTGAGTTCTTATGTGAAATCTGAGAAATTAGACGAAGCACAGAAAATGTTCGACGAAATGCCTAACAGAGATGTACTTACATGGACGATACTTATATCGGGTTTTGCTAGATTAAATTATTCTGAAGTGGCATTGAAACTCTTTAGAGAAATGCTGGATGAAGGGGTTTGTCCAAATCATTTTACTTTGTCGTGTGTTTTTAAACTTTGCTCTAGAGTAGGTGATGTGCAAATGGGTAAGGGGATTCATGGATGGATACTCAGAAGTGGGGTTAATTTAGATGTTGTCTTGGAAAATTCTATACTTGATTTGTATGCAAAGTTTGATGCATTTGATTATGCCAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAATATAATGCTTGGTGTGTATGTTCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATCTGAATAAAGCATTGGAACTACTCTATGAGATGGTGGCAAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTAAGTGTGGTTTCTTCTTTATTGATTATTGAGCTAGGTAGACAAGTACATGGCCGAATTTTCAGGTTCGGTTTTCATAATGATGGATTTGTGAAGAGTTCATTGATAAATATGTACATTAAGTGTGGCAATTTGGAAAAAGCTTCGGTGATATATAGTCAAATGCCTTCAGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGTGACATGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTCCGTGAACGGGTTGTGATGGACAAATTTACCATTGCAAGCATTGTATCCGCTTGTTCTAATGCTGGTGTCCTGGAGCTTGGCCGTCAAATCCATGCATATATTCAGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGAGCAAACGACTTACTTAAATGTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGTCAAGGTAAAGAAGCCATTCGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGAGGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTACGCAATCAAGCCTAGGGTTGAGCATTTCACTTGTATGGTAGATCTTTATGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTTCTATCATCTTGTCGGATTTACAAGGACATTGAAATGGGAAATTGGGTTTCTGAAAAATTGTTTAGACTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAA
GTGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAGAATCAAGTCCACTCTTTTGTTGCGGGAGATCGATCACACCCTCAACACACTCAGATATATACATATCTGGACACGCTCATTGGAAGATTGAAGGAAATAGGGTACTTGTATGATGTAAAAATGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTCATTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCTCTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGTGCCGACTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCACCGTTTCCATCATTTTAACTCTGGTCGTTGCTCTTGTGATGATTATTGGTGA

mRNA sequence

ATTCTGCCGGCTTAGGGGACTTAACAGAATCCATTGGTTGAAATGGCCAGCGATGAGTTTGAAATTGGTTTTTTGGTTTCGTCATTACTTTAATTCTCTGTTTCGAATTCTATTTTCTCAGGCGCCAAACACGAGCTGGGAAGAATCATCACTTTCTTTCACCATCTACATTATTGAATAAGGTGGTATAAGATGAGATGGATGAATCTGGGCAGCGGATGCTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTCTGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAAGCCATGTTGCTACCACTCTTCAAATGATTCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAAAATGGTTCTATTTTGGATTCGGGAAAGTTCGTTTTGAGTTCTTATGTGAAATCTGAGAAATTAGACGAAGCACAGAAAATGTTCGACGAAATGCCTAACAGAGATGTACTTACATGGACGATACTTATATCGGGTTTTGCTAGATTAAATTATTCTGAAGTGGCATTGAAACTCTTTAGAGAAATGCTGGATGAAGGGGTTTGTCCAAATCATTTTACTTTGTCGTGTGTTTTTAAACTTTGCTCTAGAGTAGGTGATGTGCAAATGGGTAAGGGGATTCATGGATGGATACTCAGAAGTGGGGTTAATTTAGATGTTGTCTTGGAAAATTCTATACTTGATTTGTATGCAAAGTTTGATGCATTTGATTATGCCAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAATATAATGCTTGGTGTGTATGTTCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATCTGAATAAAGCATTGGAACTACTCTATGAGATGGTGGCAAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTAAGTGTGGTTTCTTCTTTATTGATTATTGAGCTAGGTAGACAAGTACATGGCCGAATTTTCAGGTTCGGTTTTCATAATGATGGATTTGTGAAGAGTTCATTGATAAATATGTACATTAAGTGTGGCAATTTGGAAAAAGCTTCGGTGATATATAGTCAAATGCCTTCAGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGTGACATGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTCCGTGAACGGGTTGTGATGGACAAATTTACCATTGCAAGCATTGTATCCGCTTGTTCTAATGCTGGTGTCCTGGAGCTTGGCCGTCAAATCCATGCATATATTCAGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGAGCAAACGACTTACTTAAATGTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGTCAAGGTAAAGAAGCCATTCGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGAGGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTACGCAATCAAGCCTAGGGTTGAGCATTTCACTTGTATGGTAGATCTTTATGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTTCTATCATCTTGTCGGATTTACAAGGACATTGAAATGGGAAATTGGGTTTCTGAAAAATTGTTTAGACTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAA
GTGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAGAATCAAGTCCACTCTTTTGTTGCGGGAGATCGATCACACCCTCAACACACTCAGATATATACATATCTGGACACGCTCATTGGAAGATTGAAGGAAATAGGGTACTTGTATGATGTAAAAATGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTCATTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCTCTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGTGCCGACTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCACCGTTTCCATCATTTTAACTCTGGTCGTTGCTCTTGTGATGATTATTGGTGA

Coding sequence (CDS)

ATGAGATGGATGAATCTGGGCAGCGGATGCTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTCTGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAAGCCATGTTGCTACCACTCTTCAAATGATTCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAAAATGGTTCTATTTTGGATTCGGGAAAGTTCGTTTTGAGTTCTTATGTGAAATCTGAGAAATTAGACGAAGCACAGAAAATGTTCGACGAAATGCCTAACAGAGATGTACTTACATGGACGATACTTATATCGGGTTTTGCTAGATTAAATTATTCTGAAGTGGCATTGAAACTCTTTAGAGAAATGCTGGATGAAGGGGTTTGTCCAAATCATTTTACTTTGTCGTGTGTTTTTAAACTTTGCTCTAGAGTAGGTGATGTGCAAATGGGTAAGGGGATTCATGGATGGATACTCAGAAGTGGGGTTAATTTAGATGTTGTCTTGGAAAATTCTATACTTGATTTGTATGCAAAGTTTGATGCATTTGATTATGCCAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAATATAATGCTTGGTGTGTATGTTCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATCTGAATAAAGCATTGGAACTACTCTATGAGATGGTGGCAAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTAAGTGTGGTTTCTTCTTTATTGATTATTGAGCTAGGTAGACAAGTACATGGCCGAATTTTCAGGTTCGGTTTTCATAATGATGGATTTGTGAAGAGTTCATTGATAAATATGTACATTAAGTGTGGCAATTTGGAAAAAGCTTCGGTGATATATAGTCAAATGCCTTCAGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGTGACATGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTCCGTGAACGGGTTGTGATGGACAAATTTACCATTGCAAGCATTGTATCCGCTTGTTCTAATGCTGGTGTCCTGGAGCTTGGCCGTCAAATCCATGCATATATTCAGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGAGCAAACGACTTACTTAAATGTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGTCAAGGTAAAGAAGCCATTCGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGAGGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTACGCAATCAAGCCTAGGGTTGAGCATTTCACTTGTATGGTAGATCTTTATGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTTCTATCATCTTGTCGGATTTACAAGGACATTGAAATGGGAAATTGGGTTTCTGAAAAATTGTTTAGACTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAAGTGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAGAATCAAGTCCACTCTTTTGTTGCGGGAGATCGATCACACCCTCAACACACTCAGATATATACATATCTGGACACGCTCATTGGAAGATTGAAGGAAATAGGGTACTT
GTATGATGTAAAAATGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTCATTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCTCTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGTGCCGACTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCACCGTTTCCATCATTTTAACTCTGGTCGTTGCTCTTGTGATGATTATTGGTGA

Protein sequence

MRWMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLHAKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW
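
The relationship between the coding sequence and the protein sequence listed above can be checked by translating codons with the standard genetic code. The following is a minimal sketch in plain Python rather than the database's own pipeline: the `translate` helper is hypothetical, it assumes the standard nuclear codon table and unambiguous A/C/G/T bases, and only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS shown above.
print(translate("ATGAGATGGATGAATCTGGGC"))
print(translate("GATGATTATTGGTGA"))
```

Passing the full CDS string to `translate` in the same way would allow a comparison against the complete protein sequence.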
Homology
BLAST of Tan0021339 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 520.0 bits (1338), Expect = 4.5e-146
Identity = 262/710 (36.90%), Positives = 416/710 (58.59%), Query Frame = 0

Query: 54  SLANTLHAKMVKNGSIL-DSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFA 113
           S A  LHA+ ++  S+   S   V+S Y   + L EA  +F  + +  VL W  +I  F 
Sbjct: 22  SQAKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFT 81

Query: 114 RLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVV 173
             +    AL  F EM   G CP+H     V K C+ + D++ G+ +HG+I+R G++ D+ 
Sbjct: 82  DQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLY 141

Query: 174 LENSILDLYAKFDAFD---YAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLDLFRN 233
             N+++++YAK            +FD M  R  ++   ++     +    ++    +F  
Sbjct: 142 TGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEV 201

Query: 234 LPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELG 293
           +P +D  S+NTII G  Q G    AL ++ EM   + + +  T S  L + S  + +  G
Sbjct: 202 MPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKG 261

Query: 294 RQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMM 353
           +++HG + R G  +D ++ SSL++MY K   +E +  ++S+            + C D  
Sbjct: 262 KEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSR------------LYCRDG- 321

Query: 354 TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQI 413
              +S +S+V+GYV+NG+Y +A + F  MV  +V       +S++ AC++   L LG+Q+
Sbjct: 322 ---ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQL 381

Query: 414 HAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQG 473
           H Y+ + G   +  +AS+L+DMY+K G++  A +IF++   L+ V WT++I G ALHG G
Sbjct: 382 HGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHG 441

Query: 474 KEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTC 533
            EA+ LFE+M+ +G+ PN+V F+ VLTACSH GL++E   YFN M  VY +   +EH+  
Sbjct: 442 HEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAA 501

Query: 534 MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQD 593
           + DL GRAG L E   FI +  +    +VW   LSSC ++K++E+   V+EK+F ++ ++
Sbjct: 502 VADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN 561

Query: 594 EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 653
            G YVL+ NM +SN +W+E ++ R  M+++G+ K P  SWI +KN+ H FV+GDRSHP  
Sbjct: 562 MGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSM 621

Query: 654 TQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIP 713
            +I  +L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+   G  
Sbjct: 622 DKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTT 681

Query: 714 IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           IR+ KN+R+C DCH  +K  S++  REIIVRD  RFHHFN G CSC DYW
Sbjct: 682 IRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715
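
The percentages in the HSP summary line for this hit follow directly from the reported counts divided by the alignment length. A minimal sketch of that arithmetic, using a hypothetical `percent` helper and the counts reported for Q9LW63 (262 identical and 416 positive positions over 710 aligned columns):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the aligned length."""
    return 100.0 * matches / aligned_length


identities, positives, aligned = 262, 416, 710
print(f"Identity = {identities}/{aligned} ({percent(identities, aligned):.2f}%)")
print(f"Positives = {positives}/{aligned} ({percent(positives, aligned):.2f}%)")
```

The same calculation applies to the other hits by substituting their counts.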

BLAST of Tan0021339 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 1.2e-143
Identity = 264/701 (37.66%), Positives = 419/701 (59.77%), Query Frame = 0

Query: 76  VLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEGVCPN 135
           VLS+Y K   +D   + FD++P RD ++WT +I G+  +     A+++  +M+ EG+ P 
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 136 HFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFD 195
            FTL+ V    +    ++ GK +H +I++ G+  +V + NS+L++YAK      AK +FD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 196 SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNKALEL 255
            M  +  +++N M+ ++++   ++ ++  F  +  RD  +WN++I G  Q GY  +AL++
Sbjct: 206 RMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDI 265

Query: 256 LYEMVANE-PEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLINMYI 315
             +M+ +     ++ T +  LS  ++L  + +G+Q+H  I   GF   G V ++LI+MY 
Sbjct: 266 FSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYS 325

Query: 316 KCGNLEKASVIYSQMPSDFAR-----------------KQDSNIVCSDMMTEIVSRSSMV 375
           +CG +E A  +  Q  +   +                  Q  NI  S    ++V+ ++M+
Sbjct: 326 RCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMI 385

Query: 376 SGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQ 435
            GY ++G Y +A   F SMV      + +T+A+++S  S+   L  G+QIH    K+GE 
Sbjct: 386 VGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEI 445

Query: 436 LDAHLASSLIDMYAKGGSLDCAHRIFEQ-TTYLNVVIWTSMIAGCALHGQGKEAIRLFEQ 495
               ++++LI MYAK G++  A R F+      + V WTSMI   A HG  +EA+ LFE 
Sbjct: 446 YSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFET 505

Query: 496 MRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAG 555
           M  EG+ P+ +T++GV +AC+HAGL+ +GR YF+MMKDV  I P + H+ CMVDL+GRAG
Sbjct: 506 MLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAG 565

Query: 556 CLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSN 615
            L E +EFI +  +      W + LS+CR++K+I++G   +E+L  LEP++ G Y  L+N
Sbjct: 566 LLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALAN 625

Query: 616 MCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDT 675
           + S+  KWEEA++ R+SM+   + K  G SWI VK++VH F   D +HP+  +IY  +  
Sbjct: 626 LYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKK 685

Query: 676 LIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRV 735
           +   +K++GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLRV
Sbjct: 686 IWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRV 745

Query: 736 CADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           C DCH  +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 746 CNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of Tan0021339 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 480.3 bits (1235), Expect = 3.9e-134
Identity = 235/682 (34.46%), Positives = 388/682 (56.89%), Query Frame = 0

Query: 76   VLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEGVCPN 135
            +L+ Y K   ++ A   F E    +V+ W +++  +  L+    + ++FR+M  E + PN
Sbjct: 430  LLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPN 489

Query: 136  HFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFD 195
             +T   + K C R+GD+++G+ IH  I+++   L+  + + ++D+YAK    D A     
Sbjct: 490  QYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTA----- 549

Query: 196  SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNKALEL 255
                                       D+      +D  SW T+I G  Q  + +KAL  
Sbjct: 550  --------------------------WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTT 609

Query: 256  LYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLINMYIK 315
              +M+      ++V  + A+S  + L  ++ G+Q+H +    GF +D   +++L+ +Y +
Sbjct: 610  FRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSR 669

Query: 316  CGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFVS 375
            CG +E++ + + Q  +                 + ++ +++VSG+ ++G  E+A + FV 
Sbjct: 670  CGKIEESYLAFEQTEAG----------------DNIAWNALVSGFQQSGNNEEALRVFVR 729

Query: 376  MVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGS 435
            M RE +  + FT  S V A S    ++ G+Q+HA I KTG   +  + ++LI MYAK GS
Sbjct: 730  MNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGS 789

Query: 436  LDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLTA 495
            +  A + F + +  N V W ++I   + HG G EA+  F+QM +  + PN VT +GVL+A
Sbjct: 790  ISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSA 849

Query: 496  CSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSA 555
            CSH GL+++G  YF  M   Y + P+ EH+ C+VD+  RAG L+  KEFI E  +   + 
Sbjct: 850  CSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDAL 909

Query: 556  VWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQ 615
            VW+  LS+C ++K++E+G + +  L  LEP+D   YVLLSN+ + ++KW+    TR+ M+
Sbjct: 910  VWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMK 969

Query: 616  RRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYLYDVKMVMQ 675
             +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y   L  R  EIGY+ D   ++ 
Sbjct: 970  EKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLN 1029

Query: 676  DVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREI 735
            +++ EQ + ++  HSEKLA+++G++SL + +PI +MKNLRVC DCH ++K  S++  REI
Sbjct: 1030 ELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVSNREI 1064

Query: 736  IVRDIHRFHHFNSGRCSCDDYW 758
            IVRD +RFHHF  G CSC DYW
Sbjct: 1090 IVRDAYRFHHFEGGACSCKDYW 1064

BLAST of Tan0021339 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 479.6 bits (1233), Expect = 6.7e-134
Identity = 246/708 (34.75%), Positives = 403/708 (56.92%), Query Frame = 0

Query: 59  LHAKMVKNGSILDSGKFVLSSYVK-------SEKLDEAQKMFDEMPNRDVLTWTILISGF 118
           +HA+M+K G  L +  + LS  ++        E L  A  +F  +   ++L W  +  G 
Sbjct: 52  IHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGH 111

Query: 119 ARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDV 178
           A  +    ALKL+  M+  G+ PN +T   V K C++    + G+ IHG +L+ G +LD+
Sbjct: 112 ALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDL 171

Query: 179 VLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCR 238
            +  S++ +Y +    + A K+FD    +   +Y  ++  Y     +  +  LF  +P +
Sbjct: 172 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 231

Query: 239 DTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVH 298
           D  SWN +I G  + G   +ALEL  +M+      ++ T    +S  +    IELGRQVH
Sbjct: 232 DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVH 291

Query: 299 GRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIV 358
             I   GF ++  + ++LI++Y KCG LE A  ++ ++P                  +++
Sbjct: 292 LWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP----------------YKDVI 351

Query: 359 SRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYI 418
           S ++++ GY     Y++A   F  M+R     +  T+ SI+ AC++ G +++GR IH YI
Sbjct: 352 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 411

Query: 419 QK--TGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKE 478
            K   G    + L +SLIDMYAK G ++ AH++F    + ++  W +MI G A+HG+   
Sbjct: 412 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 471

Query: 479 AIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMV 538
           +  LF +MR  GI P+++TF+G+L+ACSH+G+L+ GR  F  M   Y + P++EH+ CM+
Sbjct: 472 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 531

Query: 539 DLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEG 598
           DL G +G   E +E I   ++     +W + L +C+++ ++E+G   +E L ++EP++ G
Sbjct: 532 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 591

Query: 599 PYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQ 658
            YVLLSN+ +S  +W E ++TR  +  +G+ K PG S I + + VH F+ GD+ HP++ +
Sbjct: 592 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 651

Query: 659 IYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIR 718
           IY  L+ +   L++ G++ D   V+Q++EEE  E  L  HSEKLA+A+G+IS   G  + 
Sbjct: 652 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 711

Query: 719 IMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           I+KNLRVC +CH   KL S++  REII RD  RFHHF  G CSC+DYW
Sbjct: 712 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Tan0021339 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 479.2 bits (1232), Expect = 8.8e-134
Identity = 251/694 (36.17%), Positives = 387/694 (55.76%), Query Frame = 0

Query: 73  GKFVLSSYVKSE-KLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEG 132
           G  ++  +VK E   + A K+FD+M   +V+TWT++I+   ++ +   A++ F +M+  G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDA---FD 192
              + FTLS VF  C+ + ++ +GK +H W +RSG+  DV  E S++D+YAK  A    D
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGY 252
             +K+FD M + S  ++  ++  Y+++C++                              
Sbjct: 325 DCRKVFDRMEDHSVMSWTALITGYMKNCNL------------------------------ 384

Query: 253 LNKALELLYEMVA-NEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKS 312
             +A+ L  EM+     E N  T S A     +L    +G+QV G+ F+ G  ++  V +
Sbjct: 385 ATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVAN 444

Query: 313 SLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYE 372
           S+I+M++K   +E A   +  +                    +VS ++ + G  RN  +E
Sbjct: 445 SVISMFVKSDRMEDAQRAFESLSE----------------KNLVSYNTFLDGTCRNLNFE 504

Query: 373 DAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLI 432
            AFK    +    + +  FT AS++S  +N G +  G QIH+ + K G   +  + ++LI
Sbjct: 505 QAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALI 564

Query: 433 DMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEV 492
            MY+K GS+D A R+F      NV+ WTSMI G A HG     +  F QM  EG+ PNEV
Sbjct: 565 SMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEV 624

Query: 493 TFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYE 552
           T++ +L+ACSH GL+ EG  +FN M + + IKP++EH+ CMVDL  RAG L +  EFI  
Sbjct: 625 TYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINT 684

Query: 553 NDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEA 612
                   VW+ FL +CR++ + E+G   + K+  L+P +   Y+ LSN+ +   KWEE+
Sbjct: 685 MPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEES 744

Query: 613 SRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYL 672
           +  RR M+ R + K  G SWI V +++H F  GD +HP   QIY  LD LI  +K  GY+
Sbjct: 745 TEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYV 804

Query: 673 YDVKMVMQDVEEEQGEV----LLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNF 732
            D  +V+  +EEE  E     LL+ HSEK+AVA+G+IS +   P+R+ KNLRVC DCHN 
Sbjct: 805 PDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCHNA 850

Query: 733 MKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           MK  S + GREI++RD++RFHHF  G+CSC+DYW
Sbjct: 865 MKYISTVSGREIVLRDLNRFHHFKDGKCSCNDYW 850

BLAST of Tan0021339 vs. NCBI nr
Match: KAG6586149.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1385.9 bits (3586), Expect = 0.0e+00
Identity = 673/757 (88.90%), Positives = 716/757 (94.58%), Query Frame = 0

Query: 1   MRWMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLH 60
           MRWMN  SG FASTAFLKL+HSVSQV MAQ+I+ FNL EHQLFK C YHSSND  +NTLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVFMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVA 120
           AKMVKNGSIL  GK V+SSYVKSEKLD+AQK+FDEMP+RDVL+WT+LISGFAR+N SE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDL 180
           L+LFREML EGVCPNHFTLSCV KLCSRVGD+QMGKGIHGWILRSGVNLDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           Y KFDAFDYA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFH 300
           CGLMQGGYLN A+ELLYEMV NEPEFN+VTSSIALSVVSSLLIIELGRQVHGRIFRFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +++DSNIVCS+ MTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDA 420
           V+NGKYED+F+TFVSMVRER VMD+FTIASI+SACSNAGVLELGRQIHAYIQKTGEQLDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYE 480
           HLASS+IDMYAKGGSLDCAH++FEQTTYLNVV WTSMI GCALHGQGKEAIRLFEQMRYE
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNE 540
           GIIPNEVTFIGVLTACSHAGLL+EGRLYFNMMKDVYAI+P+VEHFTCMVD+YGRAGCLNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGCLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLSSCR+YKDIEMGNWVSEKLF+LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGR 660
           NQKWEEAS+TRRSMQ RGISKTPGQSWIHVKNQVHSF+AGDRSH QH QIY YLD LIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADC 720
           LKEIGY  DVK+VMQDVEEEQGEVLL WHSEKLAVAYGII+LASGIPIRIMKNLRVC DC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSC DYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Tan0021339 vs. NCBI nr
Match: KAG7020981.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1382.5 bits (3577), Expect = 0.0e+00
Identity = 672/757 (88.77%), Positives = 716/757 (94.58%), Query Frame = 0

Query: 1   MRWMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLH 60
           MRWMN  SG FASTAFLKL+HSVSQV+MAQ+I+ FNL EHQLFK C YHSSND  +NTLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVSMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVA 120
           AKMVKNGSIL  GK V+SSYVKSEKLD+AQK+FDEMP+RDVL+WT+LISGFAR+N SE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDL 180
           L+LFREML EGVCPNHFTLSCV KLCSRVGD+QMGKGIHGWILRSGVNLDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           Y KFDAFDYA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFH 300
           CGLMQGGYLN A+ELLYEMV NEPEFN+VTSSIALSVVSSLLIIELGRQVHGRIFRFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +++DSNIVCS+ MTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDA 420
           V+NGKYED+F+TFVSMVRER VMD+FTIASI+SACSNAGVLELGRQIHAYIQKTGEQLDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYE 480
           HLASS+IDMYAKGGSLDCAH++FEQTTYLNVV WTSMI GCALHGQGKEAIRLFEQMRYE
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNE 540
           GIIPNEVTFIGVLTACSHAGLL+EGRLYFNMMKDVYAI+P+VEHFTCMVD+YGRAG LNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGRLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLSSCR+YKDIEMGNWVSEKLF+LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGR 660
           NQKWEEAS+TRRSMQ RGISKTPGQSWIHVKNQVHSF+AGDRSH QH QIY YLD LIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADC 720
           LKEIGY  DVK+VMQDVEEEQGEVLL WHSEKLAVAYGII+LASGIPIRIMKNLRVC DC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSC DYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Tan0021339 vs. NCBI nr
Match: XP_038889548.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida] >XP_038889549.1 putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida])

HSP 1 Score: 1372.8 bits (3552), Expect = 0.0e+00
Identity = 674/758 (88.92%), Positives = 712/758 (93.93%), Query Frame = 0

Query: 1   MRWMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLH 60
           MR MNL S CFA TAFLKL H + QVTMAQ+I+SFNL EHQLFK CCYH+SNDSL NTLH
Sbjct: 1   MRLMNLSSCCFA-TAFLKLPHPICQVTMAQKIISFNLSEHQLFKSCCYHTSNDSLVNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVA 120
           AKMVKNGSIL+SGKFVLSSYVKSEKL++AQK+FDEMP+RDVLTWT+LISGF+R+N SE+A
Sbjct: 61  AKMVKNGSILESGKFVLSSYVKSEKLNDAQKLFDEMPSRDVLTWTVLISGFSRINCSEMA 120

Query: 121 LKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDL 180
           L+LFR+ML EGVCPNHFTLS V KLCSRVGD+QMGKGIHGWILR+GVNLDVVLENS+LDL
Sbjct: 121 LQLFRKMLVEGVCPNHFTLSTVLKLCSRVGDMQMGKGIHGWILRNGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           YAKFD F  AKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN+PCR+TASWNTII
Sbjct: 181 YAKFDDFYCAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNMPCRNTASWNTII 240

Query: 241 CGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFH 300
           CGLMQGG+LN ALELLYEMV NEPEFNKVTSSIALSVV+SLLIIELGRQVHGRI R G H
Sbjct: 241 CGLMQGGHLNAALELLYEMVENEPEFNKVTSSIALSVVASLLIIELGRQVHGRIIRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFVKSSLINMYIKCGNLEKASVIYSQMPS F  KQDSNIVCSDMMTEIVSRSSMVSGY
Sbjct: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFVTKQDSNIVCSDMMTEIVSRSSMVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDA 420
           + NGKYE+AFKT VSMVRERV+MDKFTIAS+VSACSNAGVLELGRQIH YIQKTGEQLDA
Sbjct: 361 IWNGKYENAFKTVVSMVRERVLMDKFTIASVVSACSNAGVLELGRQIHGYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFEQTT-YLNVVIWTSMIAGCALHGQGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIFEQTT YLNVV+WTSMIAG ALHGQGKEAIRLFE+MRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTNYLNVVLWTSMIAGYALHGQGKEAIRLFERMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLN 540
           EGIIPNEVTF+GVLTACSHAGLLE GRLYFNMMKDVYAIKP+VEHFTCMVDLYGRAGCLN
Sbjct: 481 EGIIPNEVTFVGVLTACSHAGLLEHGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCR+YK++EMGNWVSEKLF LE QDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKNLEMGNWVSEKLFSLEQQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIG 660
            +QKWEEASRTRRSMQ RGI+KTPGQSWIHVKNQVHSFVAGD+SHPQH QIY YLD LIG
Sbjct: 601 GSQKWEEASRTRRSMQHRGINKTPGQSWIHVKNQVHSFVAGDQSHPQHVQIYEYLDKLIG 660

Query: 661 RLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCAD 720
           RLKEIGYLYDVK+VMQDVEEEQGEVLL WHSEKLA+AYGIISL S IPIRIMKNLRVC D
Sbjct: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLGWHSEKLALAYGIISLGSAIPIRIMKNLRVCTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           CHNFMKLTSQLLGREIIVRDIHRFH FNSG CSC DYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIHRFHRFNSGHCSCGDYW 757
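
The middle line of each alignment block repeats a residue where Query and Sbjct are identical, shows `+` for a conservative substitution, and leaves a space otherwise. As a minimal sketch of how the identity count can be recovered from one block, the snippet below compares the two 38-residue segments shown directly above; the function name `count_identities` is illustrative and not part of any BLAST tool.

```python
def count_identities(query: str, sbjct: str) -> int:
    """Count aligned positions where both sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned segments must have equal length")
    # Gap characters ('-') never count as identities.
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


# Final alignment block of the hit above (query residues 721-758).
query = "CHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW"
sbjct = "CHNFMKLTSQLLGREIIVRDIHRFHRFNSGHCSCGDYW"

identities = count_identities(query, sbjct)
print(f"{identities}/{len(query)} identical positions")
```

Summing this count over every block of an HSP should give the numerator of the `Identity` field in that HSP's header.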

BLAST of Tan0021339 vs. NCBI nr
Match: KAG7029890.1 (putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 650/755 (86.09%), Positives = 700/755 (92.72%), Query Frame = 0

Query: 3    WMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLHAK 62
            ++    G  ASTAFLKL  SVSQVTMAQ+I+ FN   H LF+ C +HSSNDSL NTLHAK
Sbjct: 285  FLGFSFGYSASTAFLKLFRSVSQVTMAQKIIPFNFSAHHLFESCSFHSSNDSLPNTLHAK 344

Query: 63   MVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALK 122
            MVKNGSI +S KF+LSSYVKSEKL++A+K+FDEMP+RDVLTWT+LISGFAR+N SE+AL+
Sbjct: 345  MVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQ 404

Query: 123  LFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYA 182
            LFREML EGVCPN FTLS V KLCSRVGDV+MGKGIHGWILRSG++LDVVLENS+LDLYA
Sbjct: 405  LFREMLVEGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGISLDVVLENSMLDLYA 464

Query: 183  KFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICG 242
            KFD FDY  KLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRNLPCRDTA+WNT+ICG
Sbjct: 465  KFDEFDYVTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTATWNTVICG 524

Query: 243  LMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHND 302
            LMQGGYLN+ALELLYEMV NEPEFNKVTSSIALSVVSSLL+ ELGRQVHGRI R GFHND
Sbjct: 525  LMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLVSELGRQVHGRIVRCGFHND 584

Query: 303  GFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVR 362
            GFVKSSLINMYIKCGNLEKAS IYSQMPS FA++QD +IVCSD MTEIVSRSSMVSGYVR
Sbjct: 585  GFVKSSLINMYIKCGNLEKASAIYSQMPSGFAKRQDFDIVCSDAMTEIVSRSSMVSGYVR 644

Query: 363  NGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHL 422
            NG YEDAFKTFVSMVRERV+MDKFTIAS+VSACSNAGV ELGRQIHAYIQKTGEQLDAHL
Sbjct: 645  NGNYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHL 704

Query: 423  ASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGI 482
             SSLIDMYAKGGSLDCA +IFEQTTYLNVVIWTSMI GCALHGQGKEAIRLFE+MRYEG+
Sbjct: 705  TSSLIDMYAKGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGM 764

Query: 483  IPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVK 542
            IPNEVTFIGVL ACSHAGLLE+GRLYFNMMKDVYAIKP+VEHFTCMVDLYGRAG LNEVK
Sbjct: 765  IPNEVTFIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGRLNEVK 824

Query: 543  EFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQ 602
            +FIYEND+SHL+AVWKAFLSSC++YKDIEMGNWVSE+LFRLEP DEGPYVLLSNMCSSN+
Sbjct: 825  KFIYENDISHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNK 884

Query: 603  KWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLK 662
            KWEEA RTRRSMQ RGISKTPGQSWIHVKN+VHSFVAGDRSHPQH QIY YLD LIGRLK
Sbjct: 885  KWEEAFRTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLK 944

Query: 663  EIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHN 722
            EIGYL+DVK+VMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IPIRIMKNLR+C DCHN
Sbjct: 945  EIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHN 1004

Query: 723  FMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
            FMKLTSQLL REIIVRDIHRFHHFNSG CSC DYW
Sbjct: 1005 FMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1038
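
The percentages in an HSP header are the match counts divided by the alignment length. A minimal sketch, using the counts reported for KAG7029890.1 above (650 identities and 700 positives over 755 aligned columns); the helper name `percent` is illustrative only.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header of the KAG7029890.1 hit.
identity_pct = percent(650, 755)
positives_pct = percent(700, 755)

print(f"Identity  = {identity_pct:.2f}%")
print(f"Positives = {positives_pct:.2f}%")
```

The same arithmetic applies to every other hit in this report; only the counts and the alignment length change.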

BLAST of Tan0021339 vs. NCBI nr
Match: XP_022965499.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita maxima])

HSP 1 Score: 1317.0 bits (3407), Expect = 0.0e+00
Identity = 637/716 (88.97%), Positives = 681/716 (95.11%), Query Frame = 0

Query: 42   LFKPCCYHSSNDSLANTLHAKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDV 101
            LFK CCYH+SN + A+TLHAKMVKNGSIL  GKF++SS+VKSE+LD+AQK+FDEMP+RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTILISGFARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGW 161
            L+WT+LISGFAR+N SE+AL+LFREML EGVCPNHFTLSCV KLCSRVGD+QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  ILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            ILRSGVNLDVVL NS+LDLYAKFDAFDYAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LDLFRNLPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSL 281
            LDLFRNLPCRD ASWNTIICGLMQGGYLN A+ELLYEMV NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIIELGRQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNI 341
            LII+LGRQVHGRIFRFGFHNDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +K+DSNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVL 401
            VCS+ MTEIVSRSS+VSGYV+NGKYED+FKTFVSM+RER VMD+FTIASI+SACSNAGVL
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGC 461
            ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCA++IF QTTYLNVV WTSMI GC
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPR 521
            ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVL ACSHAGLL+EGRLYFNMMKDVYAI+P+
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLF 581
            VEHFTCMVDLYGRAG LNEVKEFIY+N+LSH SAVWKAFLSSCR+YKDI+MGNWVSEKLF
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  RLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGD 641
            +LEP+DEGPYVLLSNMCSSNQKWEEAS+TRRSMQ RGISKTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHTQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIIS 701
            RSH QH QIY YLD LIGRLKEIGY  DVK+VMQDVEEEQGEVLL WHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  LASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
            LASGIPIRIMKNLRVC DCHNFMKLTSQLL REIIVRDIHRFHHF SGRCSC DYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Tan0021339 vs. ExPASy TrEMBL
Match: A0A6J1HR62 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111465385 PE=3 SV=1)

HSP 1 Score: 1317.0 bits (3407), Expect = 0.0e+00
Identity = 637/716 (88.97%), Positives = 681/716 (95.11%), Query Frame = 0

Query: 42   LFKPCCYHSSNDSLANTLHAKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDV 101
            LFK CCYH+SN + A+TLHAKMVKNGSIL  GKF++SS+VKSE+LD+AQK+FDEMP+RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTILISGFARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGW 161
            L+WT+LISGFAR+N SE+AL+LFREML EGVCPNHFTLSCV KLCSRVGD+QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  ILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            ILRSGVNLDVVL NS+LDLYAKFDAFDYAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LDLFRNLPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSL 281
            LDLFRNLPCRD ASWNTIICGLMQGGYLN A+ELLYEMV NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIIELGRQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNI 341
            LII+LGRQVHGRIFRFGFHNDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +K+DSNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVL 401
            VCS+ MTEIVSRSS+VSGYV+NGKYED+FKTFVSM+RER VMD+FTIASI+SACSNAGVL
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGC 461
            ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCA++IF QTTYLNVV WTSMI GC
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPR 521
            ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVL ACSHAGLL+EGRLYFNMMKDVYAI+P+
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLF 581
            VEHFTCMVDLYGRAG LNEVKEFIY+N+LSH SAVWKAFLSSCR+YKDI+MGNWVSEKLF
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  RLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGD 641
            +LEP+DEGPYVLLSNMCSSNQKWEEAS+TRRSMQ RGISKTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHTQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIIS 701
            RSH QH QIY YLD LIGRLKEIGY  DVK+VMQDVEEEQGEVLL WHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  LASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
            LASGIPIRIMKNLRVC DCHNFMKLTSQLL REIIVRDIHRFHHF SGRCSC DYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Tan0021339 vs. ExPASy TrEMBL
Match: A0A0A0LKI4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=3 SV=1)

HSP 1 Score: 1312.7 bits (3396), Expect = 0.0e+00
Identity = 643/758 (84.83%), Positives = 689/758 (90.90%), Query Frame = 0

Query: 1   MRWMNLGSGCFASTAFLKLSHSVSQVTMAQRIMSFNLFEHQLFKPCCYHSSNDSLANTLH 60
           MRWMNL S CF S AFLKLSHS+SQ TM  +I+SFNL EH LFK   YH+SN   +NTLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVA 120
           AKMVK GSI  SGKFVL+SYVKSEKL++AQK+FDEMPNRDVLTWT LISGF+R+N S +A
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDL 180
           L+LFREML EGV PNHFTLS V KLCS+VGDV+MGKGIHGWILR+GV LDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           YAKFD F YA+KL+DSMREKST T NI+LGVYVRSCDVNKSL LFRNLPCR+ ASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFH 300
           CGLMQGGYLN ALELLYEMV NE EFN  TSSIALSVVSSLLI+ELGRQVHGRI R G H
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFVKS+LINMYIKCGNLEKASVIYS++PS FA KQ SNIVCSD MTEIVSRSSMV GY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDA 420
           VRNGKYEDAFKTFVSMVRERV+MDKFTIA++VSACSNAGVLELGRQ+H +I KT EQLDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFEQ-TTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIF+Q T YLNVVIWTSMI GCALHG GKEAIRLFEQMRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLN 540
           EGIIPNEVTFIGVLTACSHAGLLE+G LYFNMMKDVYAIKP+VEH+TCMVDLYGRAG LN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCR+Y+D+EMG WVSEKLFRL+PQDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIG 660
            +QKWEEASR RRSMQ  GI+KTPGQSWIH+KNQVHSFVAGD+SHPQH QIY YLD LIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCAD 720
           RLKEIGYL+DVK+VMQDVEEEQGEVLL WHSEKLAVAYGIISL S IPIRIMKNLR+C D
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           CHNFMKLTSQLLGREIIVRDI+RFHHFNSG CSC DYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 758

BLAST of Tan0021339 vs. ExPASy TrEMBL
Match: A0A6J1EPP7 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita moschata OX=3662 GN=LOC111436248 PE=3 SV=1)

HSP 1 Score: 1299.6 bits (3362), Expect = 0.0e+00
Identity = 634/710 (89.30%), Positives = 671/710 (94.51%), Query Frame = 0

Query: 48   YHSSNDSLANTLHAKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTIL 107
            +HSSNDSL NTLHAKMVKNGSI +S KF+LSSYVKSEKL++A+K+FDEMP+RDVLTWT+L
Sbjct: 306  FHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGV 167
            ISGFAR+N SE+AL+LFREML EGVCPN FTLS V KLCSRVGDV+MGKGIHGWILRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN 227
            +LDVVLENS+LDLYAKFD FDY  KLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRN
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELG 287
            LPCRDTASWNT+ICGLMQGGYLN+ALELLYEMV NEPEFNKVTSSIALSVVSSLLIIELG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMM 347
            RQVHGRI R G HNDGFVKSSLINMYIKCGNLEKASVIYSQMPS FA KQD NIVCSD M
Sbjct: 546  RQVHGRIVRCGLHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFATKQDFNIVCSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQI 407
            TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERV+MDKFTIAS+VSACSNAGV ELGRQI
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQG 467
            HAYIQKTGEQLDAHL SSLIDMYAKGGSLDCA +IFEQTTYLNVVIWTSMI GCALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTC 527
            KEAIRLFE+MRYEG+IPNEVTFIGVL ACSHAGLLE+GRLYFNMMKDVYAIKP+VEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQD 587
            MVDLYGRAG LNEVK+FIYENDLSHL+AVWKAFLSSC++YKDIEMGNWVSE+LFRLEP D
Sbjct: 786  MVDLYGRAGHLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPYVLLSNMCSSNQKWEEA RTRRSMQ RGISKTPGQSWIHVKN+VHSFVAGDRSHPQH
Sbjct: 846  EGPYVLLSNMCSSNQKWEEAFRTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQH 905

Query: 648  TQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIP 707
             QIY YLD LIGRLKEIGYL+DVK+VMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IP
Sbjct: 906  AQIYEYLDKLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIP 965

Query: 708  IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
            IRIMKNLR+C DCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSC DYW
Sbjct: 966  IRIMKNLRICTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Tan0021339 vs. ExPASy TrEMBL
Match: A0A6J1KA70 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111492492 PE=3 SV=1)

HSP 1 Score: 1288.9 bits (3334), Expect = 0.0e+00
Identity = 629/710 (88.59%), Positives = 669/710 (94.23%), Query Frame = 0

Query: 48   YHSSNDSLANTLHAKMVKNGSILDSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTIL 107
            YHSSNDSL NTLHAKMVKNGSI +S KF+LSSYVKSEKL++A+K+FDEMP+RDVLTWT+L
Sbjct: 306  YHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGV 167
            ISGFAR+N SE+AL+LFREML EGV PN FTLS V KLCSRVGDV+MGKGIHGWILRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVYPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN 227
            +LDVVLENS+LDLYAKFD FDY KKLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRN
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELG 287
            LPCRDTASWNT+ICGLMQGGYLN+ALELLYEMV N+PEFNKVTSSIALSVVSSLLIIELG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENQPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMM 347
            RQVHGRI R GFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPS F +KQD +IV SD M
Sbjct: 546  RQVHGRILRCGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFGKKQDFDIVYSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQI 407
            TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERV+MDKFTIAS+VSACSNAGV ELGRQI
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQG 467
            HAYIQKTGEQLDAHL SSLIDMYAKGGSLDCA +IFEQ TYLNVVIWTSMI GCALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQMTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTC 527
            KEAIRLFE+MRYEG+IPNEVTFIGVL ACSHAGL+E+GRLYFNMMKDVYAIKP+VEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLIEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQD 587
            MVDLYGRAG LNEVK+FIYENDLSHL+AVWKAFLSSC++YKDIEMGNWVSE+LFRLEP D
Sbjct: 786  MVDLYGRAGRLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPY+LLSNMCSSNQKWEEA RTRR MQ RGISKTPGQSWIHVKNQVHSFVAGDRSHPQH
Sbjct: 846  EGPYILLSNMCSSNQKWEEAFRTRRFMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 905

Query: 648  TQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIP 707
             QIY YLD LIGRLKEIGYL+DVK+VMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IP
Sbjct: 906  AQIYEYLDNLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLDSAIP 965

Query: 708  IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
            IRIMKNLR+C DCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSC DYW
Sbjct: 966  IRIMKNLRMCTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Tan0021339 vs. ExPASy TrEMBL
Match: A0A1S3B4E3 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=3 SV=1)

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 616/747 (82.46%), Positives = 668/747 (89.42%), Query Frame = 0

Query: 19   LSHSVSQVTMAQ----RIMSFNLFEHQLFKPC---CYHSSNDSLANTLHAKMVKNGSILD 78
            L+  +SQ T+A       + F+L  +  F P    CYH+SN   +NTLHAKMVK GSI++
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSY-FFPPLXKFCYHTSNSFSSNTLHAKMVKIGSIIE 326

Query: 79   SGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEG 138
            SGKFVL+SYVKS+KL++AQK+FDEMPNRDVLTWT +ISGF+R+N S +AL+LFREML EG
Sbjct: 327  SGKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEG 386

Query: 139  VCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAK 198
            VCPNHFTLS V KLCS+VGDV+MGKGIHGWILR+GV LDVVLENS+LDLYAKFD F YA+
Sbjct: 387  VCPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYAR 446

Query: 199  KLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNK 258
            KL+DSM EKST T NI+LGVYVRSCDVNKSL LFRNLPCR+ ASWNTIICGLMQGGYLN 
Sbjct: 447  KLYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNA 506

Query: 259  ALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLIN 318
            ALELLYEMV NE EFN  TSSIALSV SSLLI+ELGRQVHGRI R G HNDGFVKS+LIN
Sbjct: 507  ALELLYEMVENESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALIN 566

Query: 319  MYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFK 378
            MYIKCGNLEKASVIYSQ+PS FA KQ SNIVCSD MTEIVSRSSMV GYVRNGKYEDAFK
Sbjct: 567  MYIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFK 626

Query: 379  TFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLIDMYA 438
            TFVSMVRERV+MDKFTIAS+VSAC+NAGVLELGRQ+H +IQK+ EQLDAHLASSLIDMYA
Sbjct: 627  TFVSMVRERVLMDKFTIASVVSACANAGVLELGRQVHGFIQKSVEQLDAHLASSLIDMYA 686

Query: 439  KGGSLDCAHRIFEQTT-YLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEVTFI 498
            KGGSLDCAHRIF+Q T YLNVVIWTSMI GC+LHG GKEAIRLFEQMRYEGIIPNEVTFI
Sbjct: 687  KGGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGHGKEAIRLFEQMRYEGIIPNEVTFI 746

Query: 499  GVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYENDL 558
            GVLTACSHAGLLE+G LYFNMMKDVYAIKP+VEH+TCMVDLYGRAG LNEVKEFIYENDL
Sbjct: 747  GVLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDL 806

Query: 559  SHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEASRT 618
            SHLS VWKAFLSSC +Y+D+EMG WVSEKLFRLEPQDEG YVLLSNMCS +QKW+EASR 
Sbjct: 807  SHLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQDEGSYVLLSNMCSGSQKWQEASRA 866

Query: 619  RRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYLYDV 678
            R SMQ  GI+KTPGQSWIH+KNQVHSFVAGDRSHPQH QIY YLD LIGRLKEIGYL+DV
Sbjct: 867  RSSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLHDV 926

Query: 679  KMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQL 738
            K+VMQDVEEEQGEVLL WHSEKLAVAYGIISL S IPIRIMKNLR+C DCHNFMKLTSQL
Sbjct: 927  KLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQL 986

Query: 739  LGREIIVRDIHRFHHFNSGRCSCDDYW 758
            LGREIIVRDI RFHHFNSG CSC DYW
Sbjct: 987  LGREIIVRDICRFHHFNSGHCSCGDYW 1012

BLAST of Tan0021339 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 520.0 bits (1338), Expect = 3.2e-147
Identity = 262/710 (36.90%), Positives = 416/710 (58.59%), Query Frame = 0

Query: 54  SLANTLHAKMVKNGSIL-DSGKFVLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFA 113
           S A  LHA+ ++  S+   S   V+S Y   + L EA  +F  + +  VL W  +I  F 
Sbjct: 22  SQAKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFT 81

Query: 114 RLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVV 173
             +    AL  F EM   G CP+H     V K C+ + D++ G+ +HG+I+R G++ D+ 
Sbjct: 82  DQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLY 141

Query: 174 LENSILDLYAKFDAFD---YAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLDLFRN 233
             N+++++YAK            +FD M  R  ++   ++     +    ++    +F  
Sbjct: 142 TGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEV 201

Query: 234 LPCRDTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELG 293
           +P +D  S+NTII G  Q G    AL ++ EM   + + +  T S  L + S  + +  G
Sbjct: 202 MPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKG 261

Query: 294 RQVHGRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMM 353
           +++HG + R G  +D ++ SSL++MY K   +E +  ++S+            + C D  
Sbjct: 262 KEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSR------------LYCRDG- 321

Query: 354 TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQI 413
              +S +S+V+GYV+NG+Y +A + F  MV  +V       +S++ AC++   L LG+Q+
Sbjct: 322 ---ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQL 381

Query: 414 HAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQG 473
           H Y+ + G   +  +AS+L+DMY+K G++  A +IF++   L+ V WT++I G ALHG G
Sbjct: 382 HGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHG 441

Query: 474 KEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTC 533
            EA+ LFE+M+ +G+ PN+V F+ VLTACSH GL++E   YFN M  VY +   +EH+  
Sbjct: 442 HEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAA 501

Query: 534 MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQD 593
           + DL GRAG L E   FI +  +    +VW   LSSC ++K++E+   V+EK+F ++ ++
Sbjct: 502 VADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN 561

Query: 594 EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 653
            G YVL+ NM +SN +W+E ++ R  M+++G+ K P  SWI +KN+ H FV+GDRSHP  
Sbjct: 562 MGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSM 621

Query: 654 TQIYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIP 713
            +I  +L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+   G  
Sbjct: 622 DKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTT 681

Query: 714 IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           IR+ KN+R+C DCH  +K  S++  REIIVRD  RFHHFN G CSC DYW
Sbjct: 682 IRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Tan0021339 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 511.9 bits (1317), Expect = 8.7e-145
Identity = 264/701 (37.66%), Positives = 419/701 (59.77%), Query Frame = 0

Query: 76  VLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEGVCPN 135
           VLS+Y K   +D   + FD++P RD ++WT +I G+  +     A+++  +M+ EG+ P 
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 136 HFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFD 195
            FTL+ V    +    ++ GK +H +I++ G+  +V + NS+L++YAK      AK +FD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 196 SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNKALEL 255
            M  +  +++N M+ ++++   ++ ++  F  +  RD  +WN++I G  Q GY  +AL++
Sbjct: 206 RMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDI 265

Query: 256 LYEMVANE-PEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLINMYI 315
             +M+ +     ++ T +  LS  ++L  + +G+Q+H  I   GF   G V ++LI+MY 
Sbjct: 266 FSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYS 325

Query: 316 KCGNLEKASVIYSQMPSDFAR-----------------KQDSNIVCSDMMTEIVSRSSMV 375
           +CG +E A  +  Q  +   +                  Q  NI  S    ++V+ ++M+
Sbjct: 326 RCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMI 385

Query: 376 SGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQ 435
            GY ++G Y +A   F SMV      + +T+A+++S  S+   L  G+QIH    K+GE 
Sbjct: 386 VGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEI 445

Query: 436 LDAHLASSLIDMYAKGGSLDCAHRIFEQ-TTYLNVVIWTSMIAGCALHGQGKEAIRLFEQ 495
               ++++LI MYAK G++  A R F+      + V WTSMI   A HG  +EA+ LFE 
Sbjct: 446 YSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFET 505

Query: 496 MRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAG 555
           M  EG+ P+ +T++GV +AC+HAGL+ +GR YF+MMKDV  I P + H+ CMVDL+GRAG
Sbjct: 506 MLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAG 565

Query: 556 CLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSN 615
            L E +EFI +  +      W + LS+CR++K+I++G   +E+L  LEP++ G Y  L+N
Sbjct: 566 LLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALAN 625

Query: 616 MCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDT 675
           + S+  KWEEA++ R+SM+   + K  G SWI VK++VH F   D +HP+  +IY  +  
Sbjct: 626 LYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKK 685

Query: 676 LIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRV 735
           +   +K++GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLRV
Sbjct: 686 IWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRV 745

Query: 736 CADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           C DCH  +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 746 CNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of Tan0021339 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 480.3 bits (1235), Expect = 2.8e-135
Identity = 235/682 (34.46%), Positives = 388/682 (56.89%), Query Frame = 0

Query: 76   VLSSYVKSEKLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEGVCPN 135
            +L+ Y K   ++ A   F E    +V+ W +++  +  L+    + ++FR+M  E + PN
Sbjct: 430  LLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPN 489

Query: 136  HFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDAFDYAKKLFD 195
             +T   + K C R+GD+++G+ IH  I+++   L+  + + ++D+YAK    D A     
Sbjct: 490  QYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTA----- 549

Query: 196  SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLNKALEL 255
                                       D+      +D  SW T+I G  Q  + +KAL  
Sbjct: 550  --------------------------WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTT 609

Query: 256  LYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKSSLINMYIK 315
              +M+      ++V  + A+S  + L  ++ G+Q+H +    GF +D   +++L+ +Y +
Sbjct: 610  FRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSR 669

Query: 316  CGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFVS 375
            CG +E++ + + Q  +                 + ++ +++VSG+ ++G  E+A + FV 
Sbjct: 670  CGKIEESYLAFEQTEAG----------------DNIAWNALVSGFQQSGNNEEALRVFVR 729

Query: 376  MVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGS 435
            M RE +  + FT  S V A S    ++ G+Q+HA I KTG   +  + ++LI MYAK GS
Sbjct: 730  MNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGS 789

Query: 436  LDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLTA 495
            +  A + F + +  N V W ++I   + HG G EA+  F+QM +  + PN VT +GVL+A
Sbjct: 790  ISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSA 849

Query: 496  CSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSA 555
            CSH GL+++G  YF  M   Y + P+ EH+ C+VD+  RAG L+  KEFI E  +   + 
Sbjct: 850  CSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDAL 909

Query: 556  VWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQ 615
            VW+  LS+C ++K++E+G + +  L  LEP+D   YVLLSN+ + ++KW+    TR+ M+
Sbjct: 910  VWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMK 969

Query: 616  RRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYLYDVKMVMQ 675
             +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y   L  R  EIGY+ D   ++ 
Sbjct: 970  EKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLN 1029

Query: 676  DVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREI 735
            +++ EQ + ++  HSEKLA+++G++SL + +PI +MKNLRVC DCH ++K  S++  REI
Sbjct: 1030 ELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVSNREI 1064

Query: 736  IVRDIHRFHHFNSGRCSCDDYW 758
            IVRD +RFHHF  G CSC DYW
Sbjct: 1090 IVRDAYRFHHFEGGACSCKDYW 1064

BLAST of Tan0021339 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 479.6 bits (1233), Expect = 4.8e-135
Identity = 246/708 (34.75%), Positives = 403/708 (56.92%), Query Frame = 0

Query: 59  LHAKMVKNGSILDSGKFVLSSYVK-------SEKLDEAQKMFDEMPNRDVLTWTILISGF 118
           +HA+M+K G  L +  + LS  ++        E L  A  +F  +   ++L W  +  G 
Sbjct: 52  IHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGH 111

Query: 119 ARLNYSEVALKLFREMLDEGVCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDV 178
           A  +    ALKL+  M+  G+ PN +T   V K C++    + G+ IHG +L+ G +LD+
Sbjct: 112 ALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDL 171

Query: 179 VLENSILDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCR 238
            +  S++ +Y +    + A K+FD    +   +Y  ++  Y     +  +  LF  +P +
Sbjct: 172 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 231

Query: 239 DTASWNTIICGLMQGGYLNKALELLYEMVANEPEFNKVTSSIALSVVSSLLIIELGRQVH 298
           D  SWN +I G  + G   +ALEL  +M+      ++ T    +S  +    IELGRQVH
Sbjct: 232 DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVH 291

Query: 299 GRIFRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIV 358
             I   GF ++  + ++LI++Y KCG LE A  ++ ++P                  +++
Sbjct: 292 LWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP----------------YKDVI 351

Query: 359 SRSSMVSGYVRNGKYEDAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYI 418
           S ++++ GY     Y++A   F  M+R     +  T+ SI+ AC++ G +++GR IH YI
Sbjct: 352 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 411

Query: 419 QK--TGEQLDAHLASSLIDMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKE 478
            K   G    + L +SLIDMYAK G ++ AH++F    + ++  W +MI G A+HG+   
Sbjct: 412 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 471

Query: 479 AIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMV 538
           +  LF +MR  GI P+++TF+G+L+ACSH+G+L+ GR  F  M   Y + P++EH+ CM+
Sbjct: 472 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 531

Query: 539 DLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEG 598
           DL G +G   E +E I   ++     +W + L +C+++ ++E+G   +E L ++EP++ G
Sbjct: 532 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 591

Query: 599 PYVLLSNMCSSNQKWEEASRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQ 658
            YVLLSN+ +S  +W E ++TR  +  +G+ K PG S I + + VH F+ GD+ HP++ +
Sbjct: 592 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 651

Query: 659 IYTYLDTLIGRLKEIGYLYDVKMVMQDVEEEQGEVLLHWHSEKLAVAYGIISLASGIPIR 718
           IY  L+ +   L++ G++ D   V+Q++EEE  E  L  HSEKLA+A+G+IS   G  + 
Sbjct: 652 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 711

Query: 719 IMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           I+KNLRVC +CH   KL S++  REII RD  RFHHF  G CSC+DYW
Sbjct: 712 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741
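
The percentages in the HSP header for AT1G08070.1 can be reproduced from the raw counts. The following is a minimal Python sketch, assuming the usual convention that each percentage is the count divided by the alignment length; the helper name `pct` is illustrative and not part of any BLAST tool.

```python
def pct(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts copied from the AT1G08070.1 HSP header (246/708 and 403/708).
identity = pct(246, 708)
positives = pct(403, 708)
print(identity, positives)
```

The same helper applies to any other HSP header on this page, for example 251/694 for AT3G49170.1.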

BLAST of Tan0021339 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 479.2 bits (1232), Expect = 6.2e-135
Identity = 251/694 (36.17%), Positives = 387/694 (55.76%), Query Frame = 0

Query: 73  GKFVLSSYVKSE-KLDEAQKMFDEMPNRDVLTWTILISGFARLNYSEVALKLFREMLDEG 132
           G  ++  +VK E   + A K+FD+M   +V+TWT++I+   ++ +   A++ F +M+  G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VCPNHFTLSCVFKLCSRVGDVQMGKGIHGWILRSGVNLDVVLENSILDLYAKFDA---FD 192
              + FTLS VF  C+ + ++ +GK +H W +RSG+  DV  E S++D+YAK  A    D
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGY 252
             +K+FD M + S  ++  ++  Y+++C++                              
Sbjct: 325 DCRKVFDRMEDHSVMSWTALITGYMKNCNL------------------------------ 384

Query: 253 LNKALELLYEMVA-NEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIFRFGFHNDGFVKS 312
             +A+ L  EM+     E N  T S A     +L    +G+QV G+ F+ G  ++  V +
Sbjct: 385 ATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVAN 444

Query: 313 SLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDMMTEIVSRSSMVSGYVRNGKYE 372
           S+I+M++K   +E A   +  +                    +VS ++ + G  RN  +E
Sbjct: 445 SVISMFVKSDRMEDAQRAFESLSE----------------KNLVSYNTFLDGTCRNLNFE 504

Query: 373 DAFKTFVSMVRERVVMDKFTIASIVSACSNAGVLELGRQIHAYIQKTGEQLDAHLASSLI 432
            AFK    +    + +  FT AS++S  +N G +  G QIH+ + K G   +  + ++LI
Sbjct: 505 QAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALI 564

Query: 433 DMYAKGGSLDCAHRIFEQTTYLNVVIWTSMIAGCALHGQGKEAIRLFEQMRYEGIIPNEV 492
            MY+K GS+D A R+F      NV+ WTSMI G A HG     +  F QM  EG+ PNEV
Sbjct: 565 SMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEV 624

Query: 493 TFIGVLTACSHAGLLEEGRLYFNMMKDVYAIKPRVEHFTCMVDLYGRAGCLNEVKEFIYE 552
           T++ +L+ACSH GL+ EG  +FN M + + IKP++EH+ CMVDL  RAG L +  EFI  
Sbjct: 625 TYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINT 684

Query: 553 NDLSHLSAVWKAFLSSCRIYKDIEMGNWVSEKLFRLEPQDEGPYVLLSNMCSSNQKWEEA 612
                   VW+ FL +CR++ + E+G   + K+  L+P +   Y+ LSN+ +   KWEE+
Sbjct: 685 MPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEES 744

Query: 613 SRTRRSMQRRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHTQIYTYLDTLIGRLKEIGYL 672
           +  RR M+ R + K  G SWI V +++H F  GD +HP   QIY  LD LI  +K  GY+
Sbjct: 745 TEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYV 804

Query: 673 YDVKMVMQDVEEEQGEV----LLHWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNF 732
            D  +V+  +EEE  E     LL+ HSEK+AVA+G+IS +   P+R+ KNLRVC DCHN 
Sbjct: 805 PDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCHNA 850

Query: 733 MKLTSQLLGREIIVRDIHRFHHFNSGRCSCDDYW 758
           MK  S + GREI++RD++RFHHF  G+CSC+DYW
Sbjct: 865 MKYISTVSGREIVLRDLNRFHHFKDGKCSCNDYW 850

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9LW63 | 4.5e-146 | 36.90 | Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q9SHZ8 | 1.2e-143 | 37.66 | Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9SVP7 | 3.9e-134 | 34.46 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Q9LN01 | 6.7e-134 | 34.75 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q5G1T1 | 8.8e-134 | 36.17 | Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
KAG6586149.1 | 0.0e+00 | 88.90 | putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
KAG7020981.1 | 0.0e+00 | 88.77 | putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
XP_038889548.1 | 0.0e+00 | 88.92 | putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispid...
KAG7029890.1 | 0.0e+00 | 86.09 | putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma sub...
XP_022965499.1 | 0.0e+00 | 88.97 | LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...

Match Name | E-value | Identity | Description
A0A6J1HR62 | 0.0e+00 | 88.97 | LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...
A0A0A0LKI4 | 0.0e+00 | 84.83 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0742...
A0A6J1EPP7 | 0.0e+00 | 89.30 | putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita mosc...
A0A6J1KA70 | 0.0e+00 | 88.59 | putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxi...
A0A1S3B4E3 | 0.0e+00 | 82.46 | LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...

Match Name | E-value | Identity | Description
AT3G23330.1 | 3.2e-147 | 36.90 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1 | 8.7e-145 | 37.66 | pentatricopeptide (PPR) repeat-containing protein
AT4G13650.1 | 2.8e-135 | 34.46 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 4.8e-135 | 34.75 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G49170.1 | 6.2e-135 | 36.17 | Tetratricopeptide repeat (TPR)-like superfamily protein
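
The Arabidopsis (AT...) hits in the last table can be ranked by E-value, where a smaller value indicates a more significant match. The following is a minimal Python sketch under the assumption that the E-value strings are in standard scientific notation and parse directly as floats; the variable names are illustrative.

```python
# (match name, E-value string) pairs copied from the Arabidopsis hit table.
hits = [
    ("AT3G23330.1", "3.2e-147"),
    ("AT2G22070.1", "8.7e-145"),
    ("AT4G13650.1", "2.8e-135"),
    ("AT1G08070.1", "4.8e-135"),
    ("AT3G49170.1", "6.2e-135"),
]

# Sort ascending by numeric E-value; the first entry is the most significant hit.
ranked = sorted(hits, key=lambda hit: float(hit[1]))
best_name = ranked[0][0]
print(best_name)
```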
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
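IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 100..146; e-value: 3.3E-11; score: 43.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 450..497; e-value: 6.2E-11; score: 42.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 204..227; e-value: 0.041; score: 14.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 235..261; e-value: 1.0E-4; score: 22.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 524..546; e-value: 0.73; score: 10.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 307..331; e-value: 0.015; score: 15.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 175..200; e-value: 0.097; score: 13.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 352..380; e-value: 2.7E-5; score: 24.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 103..135; e-value: 3.9E-8; score: 31.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 452..486; e-value: 3.1E-6; score: 25.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 235..262; e-value: 6.3E-5; score: 20.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 204..227; e-value: 0.0014; score: 16.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 354..384; e-value: 7.5E-6; score: 23.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 232..266; score: 9.13082
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 349..383; score: 9.448698
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 100..134; score: 12.167101
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 450..484; score: 12.002681
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 425..641; e-value: 1.5E-35; score: 125.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 279..424; e-value: 3.3E-18; score: 68.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 35..159; e-value: 7.9E-21; score: 76.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 169..278; e-value: 1.7E-19; score: 71.8
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 623..746; e-value: 3.6E-40; score: 136.8
None | No IPR available | PANTHER | PTHR24015:SF1922 | OS07G0239600 PROTEIN | coord: 42..739
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 42..739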
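
The `coord` entries above give the extent of each predicted domain on the protein. The following is a minimal Python sketch for turning one into a length, assuming the coordinates are 1-based and inclusive (the page does not state this explicitly); `span_length` is an illustrative helper name.

```python
import re


def span_length(coord: str) -> int:
    """Return the number of residues covered by a 'coord: START..END' entry.

    Assumes START and END are 1-based, inclusive positions.
    """
    match = re.fullmatch(r"\s*coord:\s*(\d+)\.\.(\d+)\s*", coord)
    if match is None:
        raise ValueError(f"unrecognised coord entry: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# DYW_deaminase (PF14432) hit copied from the InterPro annotations.
dyw = span_length("coord: 623..746")
print(dyw)
```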

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0021339.1 | Tan0021339.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding