Tan0003857 (gene) Snake gourd v1

Overview
NameTan0003857
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG09: 60031217 .. 60033828 (-)
RNA-Seq ExpressionTan0003857
SyntenyTan0003857
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGTATTGAACTCAACATGGCGAAATGCAGGGCCGTCATTGCTGCCCTCAGGAGGCTGTTCTTCAAGCTTCTTCAGTAAGTCCCTGTCTTCAGAAAAATGATACGATGCGACTTCACCTCTAAGCTCTTCTTTTTACTCGATTCCTGCAAATCAATCCACCAAATCAAACAAGCCCATGCCCAATTGATCACCACTGGCCTTATTCTACACCCAATCGCCACTAATAAACTCCTCAAACTTCTTTCCTTCTCAAGATTCGCTTCCATTTCTTATGCCCAAATGGTGTTCGACCATTTTCCCCATCCAGACCTCTTCCTTTACAACACCATCATCAAGGCCCACGCACTTTCAGCCACTTCCTCTGCAGATTTCTTCACCAGGTTTCGTTCTCTGATCCGCGACGAAAGGTTAGTGCCCAATCAGTATTCCTTCGCATTTGCCTTCAAGGGGTGTGGCAATGGTGTTGGGATTGTGGAAGGGGAGCAAGTTCGCGTTCATGCTGTTAAACTTGGTCAGGAGAACAATTTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGGTCTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGCCCAAATAGAGATATGTACTCGTGGAATATTATGCTTAGTGGGTATGTGAGATTGGGGAAAATGGATGAAGCTCGGGAACTGTTTGATGAAATGCCTGAAAGAGATGTTGTGTCGTGGACAACAATGATTGCTGGATGTGTCCAGGTAATTAGATAAAAACACAACTGCATTTACTTGTTTTATAAAAGCTTATATAAATTTGGTCTTGTGTCATTTGTAACTGCCAACTTCATTTTTTTTTCTTTTTTTTTTTTTTTTCAACATACATGTGGGTGAAGATTTGAATCTCTAGCCTTGTAGTCATATATGCTTTAATCAGTTGAAGTATGTTCGGGTTAACATTTGTACTTGTGAATTAGTAGTGTTTATTTAGTCTTACTTTATTCTAATAAACCCTTCAACTTTTAATTACGTGTCAATGATGGAAAGTTATAAAATACAAAATTGAAAGTTCTAAAAACTAGATAAATACAACTTTGGAAAAGTTCAACGACCATAGTATAACACGTTTGGGAATTTAACTTTCTATGATTTTAATTACTTCTTGGCACAGTAATGCTATGCTACCTACGGTATACGCTATGCCATAGTGAGAAGCCATTTTTTACTGTGCGAGACAGAAGCATCTTTCTGTCTCGTTTCCTCAAGCATGCCTCTGCATCCGTGGGGAAATCAAATGGTAAGCAATTACTCAGTGTAATGTGAGTTCGGTTTAGTTCAAGCAGGGATTATTATGCTTCTTAAAGGCTGAAATATGATCTTCATCCATAGGTTGGTCATTTCATGGAGGCGTTGGATATCTTCCACAAGATGTTGGAAAGAGGGGTGAGCCCGAATGAGTACACTTTGGCCAGTGCCCTTGCAGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCATGTTTATATTCGAAAGAATGATATACAGTTGAATGATCGGTTGCTGGCCGGACTCATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTTGCATCAAAGCTTTTCAACAGTGAACAACAGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCAACGCATGGGAAGTCGAAGGAAGCAATTGAGGTTTTTGAAAAAATGAAGGTAGAAAAGGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGAAATAGAGTTGAGGAAGGAAGAGGCTACTTTGAATCAATGACAGGTTGCTATGGAGTCGAACCTGAGTTAGAGCATTACGGATGTATGGTGGATCTACTGGGACGTGCCGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGGCTTTGACACCAGATATTGCTATTTGGGGTGCATTGCTTAGTGCTTGCAAAATCCATAAGGACATTGAAATGGCAGAGAGAATTGGGAAAATTGTTAGAGAGTTAGATTCTGACCATCTGGGTTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCCAGGACATTGAGGGAGAAGATTGCAGTAAGTGGGAAAAAGAAAACTCCAGGTTGCAGCTCCATCGAGTTGAATGGGACATTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAGCTCTATTTGTTCTTGGATGAGATGACCACCAAGTTGAAGATTGCTGGTTACGTTCCTGAATCTGGAGAAGTTTTGCTCGATATTGATGACAATGAGGACAGAGAAACAGCTCTGTTAAAGCACAGTGAGAAGTTAGCAATTGCCTTTGGGCTGATGAATACAGCACCTGGAACTCCAATCCGCATTGTGAAGAACTTGAGAGTATGTGGAGACTGTCATCAAGCGATAAAGTTCATTTCGAAGGTATACGATAGGGAGATCATTGTAAGGGACCGAATTAGATATCACCATTTTAAAGACGGAACTTGTTCGTGTAACGATTACTGGTAGCATTTGTTGTGTATATAAAAAATTGCTTCATTTTCAA

mRNA sequence

AAGTATTGAACTCAACATGGCGAAATGCAGGGCCGTCATTGCTGCCCTCAGGAGGCTGTTCTTCAAGCTTCTTCAGTAAGTCCCTGTCTTCAGAAAAATGATACGATGCGACTTCACCTCTAAGCTCTTCTTTTTACTCGATTCCTGCAAATCAATCCACCAAATCAAACAAGCCCATGCCCAATTGATCACCACTGGCCTTATTCTACACCCAATCGCCACTAATAAACTCCTCAAACTTCTTTCCTTCTCAAGATTCGCTTCCATTTCTTATGCCCAAATGGTGTTCGACCATTTTCCCCATCCAGACCTCTTCCTTTACAACACCATCATCAAGGCCCACGCACTTTCAGCCACTTCCTCTGCAGATTTCTTCACCAGGTTTCGTTCTCTGATCCGCGACGAAAGGTTAGTGCCCAATCAGTATTCCTTCGCATTTGCCTTCAAGGGGTGTGGCAATGGTGTTGGGATTGTGGAAGGGGAGCAAGTTCGCGTTCATGCTGTTAAACTTGGTCAGGAGAACAATTTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGGTCTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGCCCAAATAGAGATATGTACTCGTGGAATATTATGCTTAGTGGGTATGTGAGATTGGGGAAAATGGATGAAGCTCGGGAACTGTTTGATGAAATGCCTGAAAGAGATGTTGTGTCGTGGACAACAATGATTGCTGGATGTGTCCAGGTTGGTCATTTCATGGAGGCGTTGGATATCTTCCACAAGATGTTGGAAAGAGGGGTGAGCCCGAATGAGTACACTTTGGCCAGTGCCCTTGCAGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCATGTTTATATTCGAAAGAATGATATACAGTTGAATGATCGGTTGCTGGCCGGACTCATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTTGCATCAAAGCTTTTCAACAGTGAACAACAGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCAACGCATGGGAAGTCGAAGGAAGCAATTGAGGTTTTTGAAAAAATGAAGGTAGAAAAGGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGAAATAGAGTTGAGGAAGGAAGAGGCTACTTTGAATCAATGACAGGTTGCTATGGAGTCGAACCTGAGTTAGAGCATTACGGATGTATGGTGGATCTACTGGGACGTGCCGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGGCTTTGACACCAGATATTGCTATTTGGGGTGCATTGCTTAGTGCTTGCAAAATCCATAAGGACATTGAAATGGCAGAGAGAATTGGGAAAATTGTTAGAGAGTTAGATTCTGACCATCTGGGTTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCCAGGACATTGAGGGAGAAGATTGCAGTAAGTGGGAAAAAGAAAACTCCAGGTTGCAGCTCCATCGAGTTGAATGGGACATTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAGCTCTATTTGTTCTTGGATGAGATGACCACCAAGTTGAAGATTGCTGGTTACGTTCCTGAATCTGGAGAAGTTTTGCTCGATATTGATGACAATGAGGACAGAGAAACAGCTCTGTTAAAGCACAGTGAGAAGTTAGCAATTGCCTTTGGGCTGATGAATACAGCACCTGGAACTCCAATCCGCATTGTGAAGAACTTGAGAGTATGTGGAGACTGTCATCAAGCGATAAAGTTCATTTCGAAGGTATACGATAGGGAGATCATTGTAAGGGACCGAATTAGATATCACCATTTTAAAGACGGAACTTGTTCGTGTAACGATTACTGGTAGCATTTGTTGTGTATATAAAAAATTGCTTCATTTTCAA

Coding sequence (CDS)

ATGATACGATGCGACTTCACCTCTAAGCTCTTCTTTTTACTCGATTCCTGCAAATCAATCCACCAAATCAAACAAGCCCATGCCCAATTGATCACCACTGGCCTTATTCTACACCCAATCGCCACTAATAAACTCCTCAAACTTCTTTCCTTCTCAAGATTCGCTTCCATTTCTTATGCCCAAATGGTGTTCGACCATTTTCCCCATCCAGACCTCTTCCTTTACAACACCATCATCAAGGCCCACGCACTTTCAGCCACTTCCTCTGCAGATTTCTTCACCAGGTTTCGTTCTCTGATCCGCGACGAAAGGTTAGTGCCCAATCAGTATTCCTTCGCATTTGCCTTCAAGGGGTGTGGCAATGGTGTTGGGATTGTGGAAGGGGAGCAAGTTCGCGTTCATGCTGTTAAACTTGGTCAGGAGAACAATTTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGGTCTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGCCCAAATAGAGATATGTACTCGTGGAATATTATGCTTAGTGGGTATGTGAGATTGGGGAAAATGGATGAAGCTCGGGAACTGTTTGATGAAATGCCTGAAAGAGATGTTGTGTCGTGGACAACAATGATTGCTGGATGTGTCCAGGTTGGTCATTTCATGGAGGCGTTGGATATCTTCCACAAGATGTTGGAAAGAGGGGTGAGCCCGAATGAGTACACTTTGGCCAGTGCCCTTGCAGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCATGTTTATATTCGAAAGAATGATATACAGTTGAATGATCGGTTGCTGGCCGGACTCATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTTGCATCAAAGCTTTTCAACAGTGAACAACAGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCAACGCATGGGAAGTCGAAGGAAGCAATTGAGGTTTTTGAAAAAATGAAGGTAGAAAAGGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGAAATAGAGTTGAGGAAGGAAGAGGCTACTTTGAATCAATGACAGGTTGCTATGGAGTCGAACCTGAGTTAGAGCATTACGGATGTATGGTGGATCTACTGGGACGTGCCGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGGCTTTGACACCAGATATTGCTATTTGGGGTGCATTGCTTAGTGCTTGCAAAATCCATAAGGACATTGAAATGGCAGAGAGAATTGGGAAAATTGTTAGAGAGTTAGATTCTGACCATCTGGGTTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCCAGGACATTGAGGGAGAAGATTGCAGTAAGTGGGAAAAAGAAAACTCCAGGTTGCAGCTCCATCGAGTTGAATGGGACATTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAGCTCTATTTGTTCTTGGATGAGATGACCACCAAGTTGAAGATTGCTGGTTACGTTCCTGAATCTGGAGAAGTTTTGCTCGATATTGATGACAATGAGGACAGAGAAACAGCTCTGTTAAAGCACAGTGAGAAGTTAGCAATTGCCTTTGGGCTGATGAATACAGCACCTGGAACTCCAATCCGCATTGTGAAGAACTTGAGAGTATGTGGAGACTGTCATCAAGCGATAAAGTTCATTTCGAAGGTATACGATAGGGAGATCATTGTAAGGGACCGAATTAGATATCACCATTTTAAAGACGGAACTTGTTCGTGTAACGATTACTGGTAG

Protein sequence

MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYAQMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGNGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPNEYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRIRYHHFKDGTCSCNDYW
Homology
BLAST of Tan0003857 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 9.6e-155
Identity = 277/609 (45.48%), Postives = 382/609 (62.73%), Query Frame = 0

Query: 14  LDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKL-LSFSRFASISYAQMVFDHFPHPDL 73
           L  C    ++KQ HA+++ TGL+    A  K L   +S +    + YAQ+VFD F  PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 74  FLYNTIIKAHALSATSSADFFTRFRSLIRDERLV-----PNQYSFAFAFKGCGNGVGIVE 133
           FL+N +I+  + S           RSL+  +R++      N Y+F    K C N     E
Sbjct: 81  FLWNLMIRGFSCSDEPE-------RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEE 140

Query: 134 GEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVR 193
             Q+     KLG EN+++  N+LI  Y   G    A  +FD  P  D  SWN ++ GYV+
Sbjct: 141 TTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVK 200

Query: 194 LGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPNEYTLASA 253
            GKMD A  LF +M E++ +SWTTMI+G VQ     EAL +FH+M    V P+  +LA+A
Sbjct: 201 AGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANA 260

Query: 254 LAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKR 313
           L+ACA L AL+QG+W+H Y+ K  I+++  L   LIDMYAKCGE+E A ++F + +  K+
Sbjct: 261 LSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK--KK 320

Query: 314 KVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYF 373
            V  W A+I G+A HG  +EAI  F +M+   + PN +TF A+L ACS+   VEEG+  F
Sbjct: 321 SVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIF 380

Query: 374 ESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKD 433
            SM   Y ++P +EHYGC+VDLLGRAG L EA+  I  M L P+  IWGALL AC+IHK+
Sbjct: 381 YSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKN 440

Query: 434 IEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIE 493
           IE+ E IG+I+  +D  H G +V  ANI+++   W++A   R  +   G  K PGCS+I 
Sbjct: 441 IELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTIS 500

Query: 494 LNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLK 553
           L GT H+FL GDRSHP+ +++      M  KL+  GYVPE  E+LLD+ D+++RE  + +
Sbjct: 501 LEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQ 560

Query: 554 HSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRIRYHHFKD 613
           HSEKLAI +GL+ T PGT IRI+KNLRVC DCH+  K ISK+Y R+I++RDR R+HHF+D
Sbjct: 561 HSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRD 620

Query: 614 GTCSCNDYW 617
           G CSC DYW
Sbjct: 621 GKCSCGDYW 620

BLAST of Tan0003857 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 1.6e-149
Identity = 279/710 (39.30%), Postives = 403/710 (56.76%), Query Frame = 0

Query: 13  LLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFS-RFASISYAQMVFDHFPHPD 72
           LL +CK++  ++  HAQ+I  GL     A +KL++    S  F  + YA  VF     P+
Sbjct: 39  LLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPN 98

Query: 73  LFLYNTIIKAHALSA--TSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGNGVGIVEGE 132
           L ++NT+ + HALS+   S+   +    SL     L+PN Y+F F  K C       EG+
Sbjct: 99  LLIWNTMFRGHALSSDPVSALKLYVCMISL----GLLPNSYTFPFVLKSCAKSKAFKEGQ 158

Query: 133 QVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVRLG 192
           Q+  H +KLG + +L+V  +LI MYV  G + DA KVFD SP+RD+ S+  ++ GY   G
Sbjct: 159 QIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG 218

Query: 193 KMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALD---------------------- 252
            ++ A++LFDE+P +DVVSW  MI+G  + G++ EAL+                      
Sbjct: 219 YIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVS 278

Query: 253 ------------------------------------------------------------ 312
                                                                       
Sbjct: 279 ACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVIS 338

Query: 313 -------------------IFHKMLERGVSPNEYTLASALAACANLVALDQGRWMHVYI- 372
                              +F +ML  G +PN+ T+ S L ACA+L A+D GRW+HVYI 
Sbjct: 339 WNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYID 398

Query: 373 -RKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSK 432
            R   +     L   LIDMYAKCG++E A ++FNS   L + +  WNAMI GFA HG++ 
Sbjct: 399 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS--ILHKSLSSWNAMIFGFAMHGRAD 458

Query: 433 EAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCM 492
            + ++F +M+   + P+ +TFV LL+ACSH   ++ GR  F +MT  Y + P+LEHYGCM
Sbjct: 459 ASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCM 518

Query: 493 VDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHL 552
           +DLLG +G  KEAEE+I+ M + PD  IW +LL ACK+H ++E+ E   + + +++ ++ 
Sbjct: 519 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 578

Query: 553 GCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTK 612
           G +VLL+NIY+  G WNE    R  +   G KK PGCSSIE++   H+F++GD+ HP+ +
Sbjct: 579 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNR 638

Query: 613 QLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTP 617
           ++Y  L+EM   L+ AG+VP++ EVL ++++ E +E AL  HSEKLAIAFGL++T PGT 
Sbjct: 639 EIYGMLEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTK 698

BLAST of Tan0003857 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 1.3e-148
Identity = 266/626 (42.49%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 8   SKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFS--RFASISYAQMVFD 67
           S LF  +++C++I  + Q HA  I +G +   +A  ++L+  + S      + YA  +F+
Sbjct: 24  SSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFN 83

Query: 68  HFPHPDLFLYNTIIKAHALSATSSADF-FTRFRSLIRDERLVPNQYSFAFAFKGCGNGVG 127
             P  + F +NTII+  + S    A    T F  ++ DE + PN+++F    K C     
Sbjct: 84  QMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGK 143

Query: 128 IVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVF--------------DWS 187
           I EG+Q+   A+K G   + FV + L+ MYV  G + DAR +F                 
Sbjct: 144 IQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRK 203

Query: 188 PNRDMYSWNIMLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFH 247
            + ++  WN+M+ GY+RLG    AR LFD+M +R VVSW TMI+G    G F +A+++F 
Sbjct: 204 RDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFR 263

Query: 248 KMLERGVSPNEYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCG 307
           +M +  + PN  TL S L A + L +L+ G W+H+Y   + I+++D L + LIDMY+KCG
Sbjct: 264 EMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCG 323

Query: 308 ELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVAL 367
            +E A  +F  E+  +  V  W+AMI GFA HG++ +AI+ F KM+   V P+ V ++ L
Sbjct: 324 IIEKAIHVF--ERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINL 383

Query: 368 LNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTP 427
           L ACSHG  VEEGR YF  M    G+EP +EHYGCMVDLLGR+G L EAEE I +M + P
Sbjct: 384 LTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKP 443

Query: 428 DIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLRE 487
           D  IW ALL AC++  ++EM +R+  I+ ++     G +V L+N+Y+  GNW+E   +R 
Sbjct: 444 DDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRL 503

Query: 488 KIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGE 547
           ++     +K PGCS I+++G  H+F+V D SHP+ K++   L E++ KL++AGY P + +
Sbjct: 504 RMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQ 563

Query: 548 VLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVY 607
           VLL++++ ED+E  L  HSEK+A AFGL++T+PG PIRIVKNLR+C DCH +IK ISKVY
Sbjct: 564 VLLNLEE-EDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 623

Query: 608 DREIIVRDRIRYHHFKDGTCSCNDYW 617
            R+I VRDR R+HHF+DG+CSC DYW
Sbjct: 624 KRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Tan0003857 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 1.3e-143
Identity = 268/711 (37.69%), Postives = 393/711 (55.27%), Query Frame = 0

Query: 8   SKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYAQMVFDHF 67
           S+   L++ C S+ Q+KQ H  +I TG    P + +KL  + + S FAS+ YA+ VFD  
Sbjct: 31  SRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEI 90

Query: 68  PHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGNGVGIVE 127
           P P+ F +NT+I+A+A S          F  ++ + +  PN+Y+F F  K       +  
Sbjct: 91  PKPNSFAWNTLIRAYA-SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSL 150

Query: 128 GEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVR 187
           G+ +   AVK    +++FV N+LI  Y + G +  A KVF     +D+ SWN M++G+V+
Sbjct: 151 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 210

Query: 188 LGKMDEARELF------------------------------------------------- 247
            G  D+A ELF                                                 
Sbjct: 211 KGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTL 270

Query: 248 ----------------------------------------------------DEMPERDV 307
                                                               + MP++D+
Sbjct: 271 ANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDI 330

Query: 308 VSWTTMIAGCVQVGHFMEALDIFHKM-LERGVSPNEYTLASALAACANLVALDQGRWMHV 367
           V+W  +I+   Q G   EAL +FH++ L++ +  N+ TL S L+ACA + AL+ GRW+H 
Sbjct: 331 VAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 390

Query: 368 YIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKS 427
           YI+K+ I++N  + + LI MY+KCG+LE + ++FNS +  KR V+ W+AMIGG A HG  
Sbjct: 391 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE--KRDVFVWSAMIGGLAMHGCG 450

Query: 428 KEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGC 487
            EA+++F KM+   V PN VTF  +  ACSH   V+E    F  M   YG+ PE +HY C
Sbjct: 451 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 510

Query: 488 MVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDH 547
           +VD+LGR+G L++A + I +M + P  ++WGALL ACKIH ++ +AE     + EL+  +
Sbjct: 511 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 570

Query: 548 LGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQT 607
            G HVLL+NIY+  G W     LR+ + V+G KK PGCSSIE++G  H+FL GD +HP +
Sbjct: 571 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 630

Query: 608 KQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGT 617
           +++Y  L E+  KLK  GY PE  +VL  I++ E +E +L  HSEKLAI +GL++T    
Sbjct: 631 EKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPK 690

BLAST of Tan0003857 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 1.9e-142
Identity = 250/572 (43.71%), Postives = 378/572 (66.08%), Query Frame = 0

Query: 55  ASISYAQMVFDHFPHPDL--FLYNTIIKA--HALSATSSADFFTRFRSLIRDERLVPNQY 114
           A I+YA  +F H  H  L  FL+N II+A  H +S+       + +  + R+ R+ P+ +
Sbjct: 6   AIIAYANPIF-HIRHLKLESFLWNIIIRAIVHNVSSPQRHSPISVYLRM-RNHRVSPDFH 65

Query: 115 SFAFAFKGCGNGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWS 174
           +F F      N + +  G++     +  G + + FV  +L+ MY + G +  A++VFD S
Sbjct: 66  TFPFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDS 125

Query: 175 PNRDMYSWNIMLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFH 234
            ++D+ +WN +++ Y + G +D+AR+LFDEMPER+V+SW+ +I G V  G + EALD+F 
Sbjct: 126 GSKDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFR 185

Query: 235 KML-----ERGVSPNEYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDM 294
           +M      E  V PNE+T+++ L+AC  L AL+QG+W+H YI K  ++++  L   LIDM
Sbjct: 186 EMQLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDM 245

Query: 295 YAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKV-EKVSPNK 354
           YAKCG LE A ++FN+    K+ V  ++AMI   A +G + E  ++F +M   + ++PN 
Sbjct: 246 YAKCGSLERAKRVFNALGS-KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNS 305

Query: 355 VTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIIS 414
           VTFV +L AC H   + EG+ YF+ M   +G+ P ++HYGCMVDL GR+G +KEAE  I+
Sbjct: 306 VTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIA 365

Query: 415 SMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNE 474
           SM + PD+ IWG+LLS  ++  DI+  E   K + ELD  + G +VLL+N+Y+ TG W E
Sbjct: 366 SMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWME 425

Query: 475 ARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGY 534
            + +R ++ V G  K PGCS +E+ G  H+F+VGD S  +++++Y  LDE+  +L+ AGY
Sbjct: 426 VKCIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGY 485

Query: 535 VPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIK 594
           V ++ EVLLD+++ +D+E AL  HSEKLAIAF LM T PGTP+RI+KNLR+CGDCH  +K
Sbjct: 486 VTDTKEVLLDLNE-KDKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMK 545

Query: 595 FISKVYDREIIVRDRIRYHHFKDGTCSCNDYW 617
            ISK++ REI+VRD  R+HHF+DG+CSC D+W
Sbjct: 546 MISKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573

BLAST of Tan0003857 vs. NCBI nr
Match: XP_022927711.1 (pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata] >XP_022927721.1 pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata])

HSP 1 Score: 1172.1 bits (3031), Expect = 0.0e+00
Identity = 565/616 (91.72%), Postives = 586/616 (95.13%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIR DFTSKLFFLLD CKSIHQIKQ HAQLITTGL+LHPIATNKLLKLLSFSRFASISYA
Sbjct: 1   MIRYDFTSKLFFLLDFCKSIHQIKQTHAQLITTGLVLHPIATNKLLKLLSFSRFASISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
           QMVFDHFP PDLFLYNTIIKAHALSATSSAD FTRFRSLIRD RLVPNQYSFAFAFKGCG
Sbjct: 61  QMVFDHFPQPDLFLYNTIIKAHALSATSSADSFTRFRSLIRDGRLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           N VG++EGEQVR HAVKLG ENNLFV NALIGMYVNLG+V DARKVFDWS  RDMYSWNI
Sbjct: 121 NAVGVLEGEQVRAHAVKLGLENNLFVMNALIGMYVNLGVVGDARKVFDWSTIRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY +LGKMD+ARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKML++GV PN
Sbjct: 181 MLSGYAKLGKMDDARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLQKGVGPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYIRKN+I LNDRLLAGLIDMY KCGELEFASKLFN
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIRKNEIPLNDRLLAGLIDMYVKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+   RKVWPWNAMIGGFA HGKSKEAIE+FE+MKVEKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERMSIRKVWPWNAMIGGFAMHGKSKEAIELFEQMKVEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGRGYFESM G +GVEPELEHYGCMVDLLGR+GRLKEAEEIISSM L PD+AIWGALLS
Sbjct: 361 EEGRGYFESMAGRFGVEPELEHYGCMVDLLGRSGRLKEAEEIISSMPLAPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACK HKDIEM ERIGKIVRELDSDHLGCHVLLAN+YSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKTHKDIEMGERIGKIVRELDSDHLGCHVLLANMYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNGTFHQFLVGDRSHPQTK+LY+FLDEMTTKLK+AGY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGTFHQFLVGDRSHPQTKELYMFLDEMTTKLKMAGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAP TPIRIVKNLRVCGDCH AIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPKTPIRIVKNLRVCGDCHLAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDG CSCNDYW
Sbjct: 601 RYHHFKDGACSCNDYW 616

BLAST of Tan0003857 vs. NCBI nr
Match: XP_023529533.1 (pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1168.7 bits (3022), Expect = 0.0e+00
Identity = 564/616 (91.56%), Postives = 586/616 (95.13%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIR DFTSKLFFLLD CKSIHQIKQ HAQLITTGL+LHPIATNKLLKLLSFSRFASISYA
Sbjct: 1   MIRYDFTSKLFFLLDFCKSIHQIKQTHAQLITTGLVLHPIATNKLLKLLSFSRFASISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
           QMVFDHFP PDLFLYNTIIKAHA+SATSSAD FTRFRSLIRD RLVPNQYSFAFAFKGCG
Sbjct: 61  QMVFDHFPQPDLFLYNTIIKAHAISATSSADSFTRFRSLIRDGRLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           N VG++EGEQVRVHAVKLG ENNLFV NALIGMYVNLG V  ARKVFDWS  RDMYSWNI
Sbjct: 121 NAVGVLEGEQVRVHAVKLGLENNLFVMNALIGMYVNLGFVGYARKVFDWSTIRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY +LGKMD+ARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKML++GV+PN
Sbjct: 181 MLSGYAKLGKMDDARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLQKGVNPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYIRKN+I LNDRLLAGLIDMY KCGELEFASKLFN
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIRKNEIPLNDRLLAGLIDMYVKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+   RKVWPWNAMIGGFA HGKSKEAIEVFE+MKVEKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERMSIRKVWPWNAMIGGFAMHGKSKEAIEVFEQMKVEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGR YF+SM G YGVEPELEHYGCMVDLLGR+GRLKEAEEIISSM +TPD+AIWGALLS
Sbjct: 361 EEGRRYFKSMAGRYGVEPELEHYGCMVDLLGRSGRLKEAEEIISSMPMTPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACK HKDIEM ERIGKIVRELDSDHLGCHVLLAN+YSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKTHKDIEMGERIGKIVRELDSDHLGCHVLLANMYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNGTFHQFLVGDRSHPQTK+LY+FLDEMTTKLK+AGY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGTFHQFLVGDRSHPQTKELYMFLDEMTTKLKMAGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAP TPIRIVKNLRVCGDCH AIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPKTPIRIVKNLRVCGDCHLAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDG CSCNDYW
Sbjct: 601 RYHHFKDGACSCNDYW 616

BLAST of Tan0003857 vs. NCBI nr
Match: XP_038878435.1 (pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida])

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 555/616 (90.10%), Postives = 583/616 (94.64%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIRCDFTSKLFFLLDSCKSIHQIKQ HAQLITTGLI+HPI TNKLLKL+S SRFA ISYA
Sbjct: 1   MIRCDFTSKLFFLLDSCKSIHQIKQVHAQLITTGLIVHPIPTNKLLKLISSSRFAPISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
            MVFDH P PDLFLYNTIIKA ALS TSSAD FTRFRSLIR+ERLVPNQYSFAF FKGCG
Sbjct: 61  HMVFDHCPQPDLFLYNTIIKALALSTTSSADSFTRFRSLIREERLVPNQYSFAFVFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           NGVG++EGEQVRVHAVKLG ENNLFV NALIGMYVNLG VVDARKVFDWSPNRDMYSWNI
Sbjct: 121 NGVGVLEGEQVRVHAVKLGLENNLFVMNALIGMYVNLGFVVDARKVFDWSPNRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY RLGKMDEAR+LFDEMPERDVVSWTTMI+GC+QVGHFMEALDIFH MLE G SPN
Sbjct: 181 MLSGYARLGKMDEARQLFDEMPERDVVSWTTMISGCLQVGHFMEALDIFHNMLENGASPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYI+KNDIQ+N+RLLAGLIDMYAKCGELEFASKLF 
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIKKNDIQMNERLLAGLIDMYAKCGELEFASKLFK 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+QLKRKVWPWNAM+GGFA HGKSKEAIEVFE+MK E+VSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERQLKRKVWPWNAMVGGFAMHGKSKEAIEVFEQMKRERVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEG+ YFESM   YG+EPELEHYGC+VDLLGRAGRLKEAEEIIS+M LTPD+ IWGALLS
Sbjct: 361 EEGKCYFESMASHYGLEPELEHYGCLVDLLGRAGRLKEAEEIISNMPLTPDVVIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
            CKIHKD+EM ERIGKIV+ELD +HLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 GCKIHKDVEMGERIGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSS+ELNGTFHQFL+GDRSHPQTKQLYLFLDEM  KLKI+GY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSVELNGTFHQFLIGDRSHPQTKQLYLFLDEMAAKLKISGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCH AIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHLAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDGTCSCNDYW
Sbjct: 601 RYHHFKDGTCSCNDYW 616

BLAST of Tan0003857 vs. NCBI nr
Match: XP_022967088.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima])

HSP 1 Score: 1156.7 bits (2991), Expect = 0.0e+00
Identity = 559/616 (90.75%), Postives = 580/616 (94.16%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIR DFTSKLFFLLD CKSIHQIKQ HAQLITTGL+LHPIATNKLLKLLSFSRF SISYA
Sbjct: 1   MIRYDFTSKLFFLLDFCKSIHQIKQTHAQLITTGLVLHPIATNKLLKLLSFSRFGSISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
           QMVFDHFP PDLFLYNTIIKAHA+SATSSAD FTRFRSLIRD  LVPNQYSFAFAFKGCG
Sbjct: 61  QMVFDHFPQPDLFLYNTIIKAHAISATSSADSFTRFRSLIRDGSLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           N VG++EGEQVRVHAVKLG ENNLFV NALIGMYVNLG V DARKVFDWS  RDMYSWNI
Sbjct: 121 NAVGVLEGEQVRVHAVKLGLENNLFVMNALIGMYVNLGFVGDARKVFDWSTIRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY +LGKMD+ARELFDEMPERDVVSWTTMIAGCVQVGHFM ALDIFHKML++GV  N
Sbjct: 181 MLSGYAKLGKMDDARELFDEMPERDVVSWTTMIAGCVQVGHFMGALDIFHKMLQKGVGLN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYIRKN+IQLNDRLLAGLIDMY KCGELEFA KLFN
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIRKNEIQLNDRLLAGLIDMYVKCGELEFALKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+   RKVWPWNAMIGGFA HGKSKEAIEVFE+MKVEKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERMSIRKVWPWNAMIGGFAMHGKSKEAIEVFEQMKVEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGR YFESM G YGVEPELEHYGCMVDLLGR+GRLKEAEEIISSM +T D+AIWGALLS
Sbjct: 361 EEGRRYFESMAGLYGVEPELEHYGCMVDLLGRSGRLKEAEEIISSMPMTADVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACK HKDIEM ERIGKIV ELDSDHLGCHVLLAN+YSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKTHKDIEMGERIGKIVTELDSDHLGCHVLLANMYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNGTFHQFLVGDRSHPQTK+LY+ LDEMTTKLK+AGY+PESGEVLLDIDD+ED
Sbjct: 481 PGCSSIELNGTFHQFLVGDRSHPQTKELYMLLDEMTTKLKMAGYIPESGEVLLDIDDDED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAP TPIRIVKNLRVCGDCHQAIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPRTPIRIVKNLRVCGDCHQAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDG CSCNDYW
Sbjct: 601 RYHHFKDGACSCNDYW 616

BLAST of Tan0003857 vs. NCBI nr
Match: XP_004139110.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucumis sativus] >KGN66465.1 hypothetical protein Csa_007004 [Cucumis sativus])

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 539/616 (87.50%), Postives = 569/616 (92.37%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           M+RCD      FLL SCKS  QIKQ HA+LITTGLILHPI TNKLLK LS S FA ISYA
Sbjct: 1   MVRCD------FLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLS-SIFAPISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
            MVFDHFP PDLFLYNTIIK  A S TSSAD FT+FRSLIR+ERLVPNQYSFAFAFKGCG
Sbjct: 61  HMVFDHFPQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           +GVG++EGEQVRVHA+KLG ENNLFVTNALIGMYVNL  VVDARKVFDWSPNRDMYSWNI
Sbjct: 121 SGVGVLEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY RLGKMDEAR+LFDEMPE+DVVSWTTMI+GC+QVG+FMEALDIFH ML +G+SPN
Sbjct: 181 MLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLAS+LAACANLVALDQGRWMHVYI+KN+IQ+N+RLLAGLIDMYAKCGELEFASKLFN
Sbjct: 241 EYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           S  +LKRKVWPWNAMIGGFA HGKSKEAIEVFE+MK+EKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGR YFESM   Y V+PELEHYGC+VDLLGRAGRLKEAEEIISSM LTPD+AIWGALLS
Sbjct: 361 EEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACKIHKD EM ER+GKIV+ELD +HLGCHVLLANIYSLTGNWNEARTLREKIA SGKKKT
Sbjct: 421 ACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNG FHQFLVGDRSHPQTKQLYLFLDEM TKLKIAGY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNT P TPIRIVKNLRVC DCH AIKFISKVYDREIIVRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDGTCSCNDYW
Sbjct: 601 RYHHFKDGTCSCNDYW 609

BLAST of Tan0003857 vs. ExPASy TrEMBL
Match: A0A6J1EIF4 (pentatricopeptide repeat-containing protein At3g62890-like OS=Cucurbita moschata OX=3662 GN=LOC111434526 PE=3 SV=1)

HSP 1 Score: 1172.1 bits (3031), Expect = 0.0e+00
Identity = 565/616 (91.72%), Postives = 586/616 (95.13%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIR DFTSKLFFLLD CKSIHQIKQ HAQLITTGL+LHPIATNKLLKLLSFSRFASISYA
Sbjct: 1   MIRYDFTSKLFFLLDFCKSIHQIKQTHAQLITTGLVLHPIATNKLLKLLSFSRFASISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
           QMVFDHFP PDLFLYNTIIKAHALSATSSAD FTRFRSLIRD RLVPNQYSFAFAFKGCG
Sbjct: 61  QMVFDHFPQPDLFLYNTIIKAHALSATSSADSFTRFRSLIRDGRLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           N VG++EGEQVR HAVKLG ENNLFV NALIGMYVNLG+V DARKVFDWS  RDMYSWNI
Sbjct: 121 NAVGVLEGEQVRAHAVKLGLENNLFVMNALIGMYVNLGVVGDARKVFDWSTIRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY +LGKMD+ARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKML++GV PN
Sbjct: 181 MLSGYAKLGKMDDARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLQKGVGPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYIRKN+I LNDRLLAGLIDMY KCGELEFASKLFN
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIRKNEIPLNDRLLAGLIDMYVKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+   RKVWPWNAMIGGFA HGKSKEAIE+FE+MKVEKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERMSIRKVWPWNAMIGGFAMHGKSKEAIELFEQMKVEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGRGYFESM G +GVEPELEHYGCMVDLLGR+GRLKEAEEIISSM L PD+AIWGALLS
Sbjct: 361 EEGRGYFESMAGRFGVEPELEHYGCMVDLLGRSGRLKEAEEIISSMPLAPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACK HKDIEM ERIGKIVRELDSDHLGCHVLLAN+YSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKTHKDIEMGERIGKIVRELDSDHLGCHVLLANMYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNGTFHQFLVGDRSHPQTK+LY+FLDEMTTKLK+AGY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGTFHQFLVGDRSHPQTKELYMFLDEMTTKLKMAGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAP TPIRIVKNLRVCGDCH AIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPKTPIRIVKNLRVCGDCHLAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDG CSCNDYW
Sbjct: 601 RYHHFKDGACSCNDYW 616

BLAST of Tan0003857 vs. ExPASy TrEMBL
Match: A0A6J1HU41 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima OX=3661 GN=LOC111466592 PE=3 SV=1)

HSP 1 Score: 1156.7 bits (2991), Expect = 0.0e+00
Identity = 559/616 (90.75%), Postives = 580/616 (94.16%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           MIR DFTSKLFFLLD CKSIHQIKQ HAQLITTGL+LHPIATNKLLKLLSFSRF SISYA
Sbjct: 1   MIRYDFTSKLFFLLDFCKSIHQIKQTHAQLITTGLVLHPIATNKLLKLLSFSRFGSISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
           QMVFDHFP PDLFLYNTIIKAHA+SATSSAD FTRFRSLIRD  LVPNQYSFAFAFKGCG
Sbjct: 61  QMVFDHFPQPDLFLYNTIIKAHAISATSSADSFTRFRSLIRDGSLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           N VG++EGEQVRVHAVKLG ENNLFV NALIGMYVNLG V DARKVFDWS  RDMYSWNI
Sbjct: 121 NAVGVLEGEQVRVHAVKLGLENNLFVMNALIGMYVNLGFVGDARKVFDWSTIRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY +LGKMD+ARELFDEMPERDVVSWTTMIAGCVQVGHFM ALDIFHKML++GV  N
Sbjct: 181 MLSGYAKLGKMDDARELFDEMPERDVVSWTTMIAGCVQVGHFMGALDIFHKMLQKGVGLN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLASALAACANLVALDQGRWMHVYIRKN+IQLNDRLLAGLIDMY KCGELEFA KLFN
Sbjct: 241 EYTLASALAACANLVALDQGRWMHVYIRKNEIQLNDRLLAGLIDMYVKCGELEFALKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           SE+   RKVWPWNAMIGGFA HGKSKEAIEVFE+MKVEKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SERMSIRKVWPWNAMIGGFAMHGKSKEAIEVFEQMKVEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGR YFESM G YGVEPELEHYGCMVDLLGR+GRLKEAEEIISSM +T D+AIWGALLS
Sbjct: 361 EEGRRYFESMAGLYGVEPELEHYGCMVDLLGRSGRLKEAEEIISSMPMTADVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACK HKDIEM ERIGKIV ELDSDHLGCHVLLAN+YSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKTHKDIEMGERIGKIVTELDSDHLGCHVLLANMYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNGTFHQFLVGDRSHPQTK+LY+ LDEMTTKLK+AGY+PESGEVLLDIDD+ED
Sbjct: 481 PGCSSIELNGTFHQFLVGDRSHPQTKELYMLLDEMTTKLKMAGYIPESGEVLLDIDDDED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNTAP TPIRIVKNLRVCGDCHQAIKFISKVYDREI+VRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTAPRTPIRIVKNLRVCGDCHQAIKFISKVYDREIVVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDG CSCNDYW
Sbjct: 601 RYHHFKDGACSCNDYW 616

BLAST of Tan0003857 vs. ExPASy TrEMBL
Match: A0A0A0LX83 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G612890 PE=3 SV=1)

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 539/616 (87.50%), Postives = 569/616 (92.37%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           M+RCD      FLL SCKS  QIKQ HA+LITTGLILHPI TNKLLK LS S FA ISYA
Sbjct: 1   MVRCD------FLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLS-SIFAPISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
            MVFDHFP PDLFLYNTIIK  A S TSSAD FT+FRSLIR+ERLVPNQYSFAFAFKGCG
Sbjct: 61  HMVFDHFPQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           +GVG++EGEQVRVHA+KLG ENNLFVTNALIGMYVNL  VVDARKVFDWSPNRDMYSWNI
Sbjct: 121 SGVGVLEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY RLGKMDEAR+LFDEMPE+DVVSWTTMI+GC+QVG+FMEALDIFH ML +G+SPN
Sbjct: 181 MLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           EYTLAS+LAACANLVALDQGRWMHVYI+KN+IQ+N+RLLAGLIDMYAKCGELEFASKLFN
Sbjct: 241 EYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           S  +LKRKVWPWNAMIGGFA HGKSKEAIEVFE+MK+EKVSPNKVTFVALLNACSHGNRV
Sbjct: 301 SNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           EEGR YFESM   Y V+PELEHYGC+VDLLGRAGRLKEAEEIISSM LTPD+AIWGALLS
Sbjct: 361 EEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACKIHKD EM ER+GKIV+ELD +HLGCHVLLANIYSLTGNWNEARTLREKIA SGKKKT
Sbjct: 421 ACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNG FHQFLVGDRSHPQTKQLYLFLDEM TKLKIAGY+PESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNT P TPIRIVKNLRVC DCH AIKFISKVYDREIIVRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDGTCSCNDYW
Sbjct: 601 RYHHFKDGTCSCNDYW 609

BLAST of Tan0003857 vs. ExPASy TrEMBL
Match: A0A1S3BNM1 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=3656 GN=LOC103492046 PE=3 SV=1)

HSP 1 Score: 1109.4 bits (2868), Expect = 0.0e+00
Identity = 538/616 (87.34%), Postives = 568/616 (92.21%), Query Frame = 0

Query: 1   MIRCDFTSKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYA 60
           M+RCD      FLL SCKS  QIKQ HAQLIT+GLILHPI TNKLLK LS S FA ISYA
Sbjct: 1   MVRCD------FLLGSCKSFRQIKQVHAQLITSGLILHPIPTNKLLKQLS-SIFAPISYA 60

Query: 61  QMVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCG 120
            MVFDHFP PDLFLYNTIIK  A S TSSAD FTRFRSLIR+ERLVPNQYSFAFAFK CG
Sbjct: 61  HMVFDHFPQPDLFLYNTIIKVLAFSTTSSADSFTRFRSLIREERLVPNQYSFAFAFKACG 120

Query: 121 NGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNI 180
           +GVG++EGEQVRVHA+KLG ENNLFVTNALIGMYVNL  VVDARKVF+WSP RDMYSWNI
Sbjct: 121 SGVGVLEGEQVRVHALKLGLENNLFVTNALIGMYVNLDFVVDARKVFEWSPYRDMYSWNI 180

Query: 181 MLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPN 240
           MLSGY RLGKMDEAR+LFDEMPERDVVSWTTMI+GC+QVGHFMEA+DIFH ML +G+SPN
Sbjct: 181 MLSGYARLGKMDEARQLFDEMPERDVVSWTTMISGCLQVGHFMEAVDIFHNMLAKGMSPN 240

Query: 241 EYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFN 300
           E+TLASAL+ACANLVALDQGRWMHVYI+KN+IQ+N+RLLAGLIDMYAKCGELEFASKLFN
Sbjct: 241 EHTLASALSACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFN 300

Query: 301 SEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRV 360
           S  QL RKVWPWNAMIGGFA HGKSKEAIEVFE+MK+EKVSPNKVTFV+LLNACSHGNRV
Sbjct: 301 SNPQLMRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVSLLNACSHGNRV 360

Query: 361 EEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLS 420
           +EGR YFESM   YGV+P LEHYGC+VDLLGRAGRLKEAEEIISSM LTPD+AIWGALLS
Sbjct: 361 KEGRYYFESMASHYGVKPVLEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLS 420

Query: 421 ACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480
           ACKIHKD+EM ERIGKIV+ELD +HLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT
Sbjct: 421 ACKIHKDVEMGERIGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKT 480

Query: 481 PGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNED 540
           PGCSSIELNG FHQFLVGDRSHPQTKQLYLFLDEM TKLKIAGYVPESGEVLLDIDDNED
Sbjct: 481 PGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYVPESGEVLLDIDDNED 540

Query: 541 RETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRI 600
           RETALLKHSEKLAIAFGLMNT P TPIRIVKNLRVC DCH AIKFISKVYDREIIVRDRI
Sbjct: 541 RETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCNDCHLAIKFISKVYDREIIVRDRI 600

Query: 601 RYHHFKDGTCSCNDYW 617
           RYHHFKDGTCSCNDYW
Sbjct: 601 RYHHFKDGTCSCNDYW 609

BLAST of Tan0003857 vs. ExPASy TrEMBL
Match: A0A5A7UA76 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold773G00170 PE=3 SV=1)

HSP 1 Score: 1041.2 bits (2691), Expect = 1.7e-300
Identity = 497/555 (89.55%), Postives = 523/555 (94.23%), Query Frame = 0

Query: 62  MVFDHFPHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGN 121
           MVFDHFP PDLFLYNTIIK  A S TSSAD FTRFRSLIR+ERLVPNQYSFAFAFK CG+
Sbjct: 1   MVFDHFPQPDLFLYNTIIKVLAFSTTSSADSFTRFRSLIREERLVPNQYSFAFAFKACGS 60

Query: 122 GVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIM 181
           GVG++EGEQVRVHA+KLG ENNLFVTNALIGMYVNL  VVDARKVF+WSP RDMYSWNIM
Sbjct: 61  GVGVLEGEQVRVHALKLGLENNLFVTNALIGMYVNLDFVVDARKVFEWSPYRDMYSWNIM 120

Query: 182 LSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPNE 241
           LSGY RLGKMDEAR+LFDEMPERDVVSWTTMI+GC+QVGHFMEALDIFH ML +G+SPNE
Sbjct: 121 LSGYARLGKMDEARQLFDEMPERDVVSWTTMISGCLQVGHFMEALDIFHNMLAKGMSPNE 180

Query: 242 YTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNS 301
           +TLASALAACANLVALDQGRWMHVYI+KN+IQ+N+RLLAGLIDMYAKCGELEFASKLFNS
Sbjct: 181 HTLASALAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNS 240

Query: 302 EQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVE 361
             QL RKVWPWNAMIGGFA HGKSKEAIEVFE+MK+EKVSPNKVTFV+LLNACSHGNRV+
Sbjct: 241 NPQLMRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVSLLNACSHGNRVK 300

Query: 362 EGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSA 421
           EGR YFESM   YGV+P LEHYGC+VDLLGRAGRLKEAEEIISSM LTPD+AIWGALLSA
Sbjct: 301 EGRYYFESMVSHYGVKPVLEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSA 360

Query: 422 CKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTP 481
           CKIHKD+EM ERIGKIV+ELD +HLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTP
Sbjct: 361 CKIHKDVEMGERIGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTP 420

Query: 482 GCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDR 541
           GCSSIELNG FHQFLVGDRSHPQTKQLYLFLDEM TKLKIAGYVPESGEVLLDIDDNEDR
Sbjct: 421 GCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYVPESGEVLLDIDDNEDR 480

Query: 542 ETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRIR 601
           ETALLKHSEKLAIAFGLMNT P TPIRIVKNLRVC DCH AIKFISKVYDREIIVRDRIR
Sbjct: 481 ETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCNDCHLAIKFISKVYDREIIVRDRIR 540

Query: 602 YHHFKDGTCSCNDYW 617
           YHHFKDGTCSCNDYW
Sbjct: 541 YHHFKDGTCSCNDYW 555

BLAST of Tan0003857 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 548.5 bits (1412), Expect = 6.8e-156
Identity = 277/609 (45.48%), Postives = 382/609 (62.73%), Query Frame = 0

Query: 14  LDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKL-LSFSRFASISYAQMVFDHFPHPDL 73
           L  C    ++KQ HA+++ TGL+    A  K L   +S +    + YAQ+VFD F  PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 74  FLYNTIIKAHALSATSSADFFTRFRSLIRDERLV-----PNQYSFAFAFKGCGNGVGIVE 133
           FL+N +I+  + S           RSL+  +R++      N Y+F    K C N     E
Sbjct: 81  FLWNLMIRGFSCSDEPE-------RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEE 140

Query: 134 GEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVR 193
             Q+     KLG EN+++  N+LI  Y   G    A  +FD  P  D  SWN ++ GYV+
Sbjct: 141 TTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVK 200

Query: 194 LGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFHKMLERGVSPNEYTLASA 253
            GKMD A  LF +M E++ +SWTTMI+G VQ     EAL +FH+M    V P+  +LA+A
Sbjct: 201 AGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANA 260

Query: 254 LAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKR 313
           L+ACA L AL+QG+W+H Y+ K  I+++  L   LIDMYAKCGE+E A ++F + +  K+
Sbjct: 261 LSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK--KK 320

Query: 314 KVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYF 373
            V  W A+I G+A HG  +EAI  F +M+   + PN +TF A+L ACS+   VEEG+  F
Sbjct: 321 SVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIF 380

Query: 374 ESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKD 433
            SM   Y ++P +EHYGC+VDLLGRAG L EA+  I  M L P+  IWGALL AC+IHK+
Sbjct: 381 YSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKN 440

Query: 434 IEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIE 493
           IE+ E IG+I+  +D  H G +V  ANI+++   W++A   R  +   G  K PGCS+I 
Sbjct: 441 IELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTIS 500

Query: 494 LNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLK 553
           L GT H+FL GDRSHP+ +++      M  KL+  GYVPE  E+LLD+ D+++RE  + +
Sbjct: 501 LEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQ 560

Query: 554 HSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVYDREIIVRDRIRYHHFKD 613
           HSEKLAI +GL+ T PGT IRI+KNLRVC DCH+  K ISK+Y R+I++RDR R+HHF+D
Sbjct: 561 HSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRD 620

Query: 614 GTCSCNDYW 617
           G CSC DYW
Sbjct: 621 GKCSCGDYW 620

BLAST of Tan0003857 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 531.2 bits (1367), Expect = 1.1e-150
Identity = 279/710 (39.30%), Postives = 403/710 (56.76%), Query Frame = 0

Query: 13  LLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFS-RFASISYAQMVFDHFPHPD 72
           LL +CK++  ++  HAQ+I  GL     A +KL++    S  F  + YA  VF     P+
Sbjct: 39  LLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPN 98

Query: 73  LFLYNTIIKAHALSA--TSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGNGVGIVEGE 132
           L ++NT+ + HALS+   S+   +    SL     L+PN Y+F F  K C       EG+
Sbjct: 99  LLIWNTMFRGHALSSDPVSALKLYVCMISL----GLLPNSYTFPFVLKSCAKSKAFKEGQ 158

Query: 133 QVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVRLG 192
           Q+  H +KLG + +L+V  +LI MYV  G + DA KVFD SP+RD+ S+  ++ GY   G
Sbjct: 159 QIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG 218

Query: 193 KMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALD---------------------- 252
            ++ A++LFDE+P +DVVSW  MI+G  + G++ EAL+                      
Sbjct: 219 YIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVS 278

Query: 253 ------------------------------------------------------------ 312
                                                                       
Sbjct: 279 ACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVIS 338

Query: 313 -------------------IFHKMLERGVSPNEYTLASALAACANLVALDQGRWMHVYI- 372
                              +F +ML  G +PN+ T+ S L ACA+L A+D GRW+HVYI 
Sbjct: 339 WNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYID 398

Query: 373 -RKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSK 432
            R   +     L   LIDMYAKCG++E A ++FNS   L + +  WNAMI GFA HG++ 
Sbjct: 399 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS--ILHKSLSSWNAMIFGFAMHGRAD 458

Query: 433 EAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCM 492
            + ++F +M+   + P+ +TFV LL+ACSH   ++ GR  F +MT  Y + P+LEHYGCM
Sbjct: 459 ASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCM 518

Query: 493 VDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHL 552
           +DLLG +G  KEAEE+I+ M + PD  IW +LL ACK+H ++E+ E   + + +++ ++ 
Sbjct: 519 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 578

Query: 553 GCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTK 612
           G +VLL+NIY+  G WNE    R  +   G KK PGCSSIE++   H+F++GD+ HP+ +
Sbjct: 579 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNR 638

Query: 613 QLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTP 617
           ++Y  L+EM   L+ AG+VP++ EVL ++++ E +E AL  HSEKLAIAFGL++T PGT 
Sbjct: 639 EIYGMLEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTK 698

BLAST of Tan0003857 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 528.1 bits (1359), Expect = 9.5e-150
Identity = 266/626 (42.49%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 8   SKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFS--RFASISYAQMVFD 67
           S LF  +++C++I  + Q HA  I +G +   +A  ++L+  + S      + YA  +F+
Sbjct: 24  SSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFN 83

Query: 68  HFPHPDLFLYNTIIKAHALSATSSADF-FTRFRSLIRDERLVPNQYSFAFAFKGCGNGVG 127
             P  + F +NTII+  + S    A    T F  ++ DE + PN+++F    K C     
Sbjct: 84  QMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGK 143

Query: 128 IVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVF--------------DWS 187
           I EG+Q+   A+K G   + FV + L+ MYV  G + DAR +F                 
Sbjct: 144 IQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRK 203

Query: 188 PNRDMYSWNIMLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFH 247
            + ++  WN+M+ GY+RLG    AR LFD+M +R VVSW TMI+G    G F +A+++F 
Sbjct: 204 RDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFR 263

Query: 248 KMLERGVSPNEYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDMYAKCG 307
           +M +  + PN  TL S L A + L +L+ G W+H+Y   + I+++D L + LIDMY+KCG
Sbjct: 264 EMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCG 323

Query: 308 ELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKVEKVSPNKVTFVAL 367
            +E A  +F  E+  +  V  W+AMI GFA HG++ +AI+ F KM+   V P+ V ++ L
Sbjct: 324 IIEKAIHVF--ERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINL 383

Query: 368 LNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIISSMALTP 427
           L ACSHG  VEEGR YF  M    G+EP +EHYGCMVDLLGR+G L EAEE I +M + P
Sbjct: 384 LTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKP 443

Query: 428 DIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNEARTLRE 487
           D  IW ALL AC++  ++EM +R+  I+ ++     G +V L+N+Y+  GNW+E   +R 
Sbjct: 444 DDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRL 503

Query: 488 KIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGYVPESGE 547
           ++     +K PGCS I+++G  H+F+V D SHP+ K++   L E++ KL++AGY P + +
Sbjct: 504 RMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQ 563

Query: 548 VLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIKFISKVY 607
           VLL++++ ED+E  L  HSEK+A AFGL++T+PG PIRIVKNLR+C DCH +IK ISKVY
Sbjct: 564 VLLNLEE-EDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 623

Query: 608 DREIIVRDRIRYHHFKDGTCSCNDYW 617
            R+I VRDR R+HHF+DG+CSC DYW
Sbjct: 624 KRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Tan0003857 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 511.5 bits (1316), Expect = 9.2e-145
Identity = 268/711 (37.69%), Postives = 393/711 (55.27%), Query Frame = 0

Query: 8   SKLFFLLDSCKSIHQIKQAHAQLITTGLILHPIATNKLLKLLSFSRFASISYAQMVFDHF 67
           S+   L++ C S+ Q+KQ H  +I TG    P + +KL  + + S FAS+ YA+ VFD  
Sbjct: 31  SRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEI 90

Query: 68  PHPDLFLYNTIIKAHALSATSSADFFTRFRSLIRDERLVPNQYSFAFAFKGCGNGVGIVE 127
           P P+ F +NT+I+A+A S          F  ++ + +  PN+Y+F F  K       +  
Sbjct: 91  PKPNSFAWNTLIRAYA-SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSL 150

Query: 128 GEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWSPNRDMYSWNIMLSGYVR 187
           G+ +   AVK    +++FV N+LI  Y + G +  A KVF     +D+ SWN M++G+V+
Sbjct: 151 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 210

Query: 188 LGKMDEARELF------------------------------------------------- 247
            G  D+A ELF                                                 
Sbjct: 211 KGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTL 270

Query: 248 ----------------------------------------------------DEMPERDV 307
                                                               + MP++D+
Sbjct: 271 ANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDI 330

Query: 308 VSWTTMIAGCVQVGHFMEALDIFHKM-LERGVSPNEYTLASALAACANLVALDQGRWMHV 367
           V+W  +I+   Q G   EAL +FH++ L++ +  N+ TL S L+ACA + AL+ GRW+H 
Sbjct: 331 VAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 390

Query: 368 YIRKNDIQLNDRLLAGLIDMYAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKS 427
           YI+K+ I++N  + + LI MY+KCG+LE + ++FNS +  KR V+ W+AMIGG A HG  
Sbjct: 391 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE--KRDVFVWSAMIGGLAMHGCG 450

Query: 428 KEAIEVFEKMKVEKVSPNKVTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGC 487
            EA+++F KM+   V PN VTF  +  ACSH   V+E    F  M   YG+ PE +HY C
Sbjct: 451 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 510

Query: 488 MVDLLGRAGRLKEAEEIISSMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDH 547
           +VD+LGR+G L++A + I +M + P  ++WGALL ACKIH ++ +AE     + EL+  +
Sbjct: 511 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 570

Query: 548 LGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQT 607
            G HVLL+NIY+  G W     LR+ + V+G KK PGCSSIE++G  H+FL GD +HP +
Sbjct: 571 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 630

Query: 608 KQLYLFLDEMTTKLKIAGYVPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGT 617
           +++Y  L E+  KLK  GY PE  +VL  I++ E +E +L  HSEKLAI +GL++T    
Sbjct: 631 EKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPK 690

BLAST of Tan0003857 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 507.7 bits (1306), Expect = 1.3e-143
Identity = 250/572 (43.71%), Postives = 378/572 (66.08%), Query Frame = 0

Query: 55  ASISYAQMVFDHFPHPDL--FLYNTIIKA--HALSATSSADFFTRFRSLIRDERLVPNQY 114
           A I+YA  +F H  H  L  FL+N II+A  H +S+       + +  + R+ R+ P+ +
Sbjct: 6   AIIAYANPIF-HIRHLKLESFLWNIIIRAIVHNVSSPQRHSPISVYLRM-RNHRVSPDFH 65

Query: 115 SFAFAFKGCGNGVGIVEGEQVRVHAVKLGQENNLFVTNALIGMYVNLGLVVDARKVFDWS 174
           +F F      N + +  G++     +  G + + FV  +L+ MY + G +  A++VFD S
Sbjct: 66  TFPFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDS 125

Query: 175 PNRDMYSWNIMLSGYVRLGKMDEARELFDEMPERDVVSWTTMIAGCVQVGHFMEALDIFH 234
            ++D+ +WN +++ Y + G +D+AR+LFDEMPER+V+SW+ +I G V  G + EALD+F 
Sbjct: 126 GSKDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFR 185

Query: 235 KML-----ERGVSPNEYTLASALAACANLVALDQGRWMHVYIRKNDIQLNDRLLAGLIDM 294
           +M      E  V PNE+T+++ L+AC  L AL+QG+W+H YI K  ++++  L   LIDM
Sbjct: 186 EMQLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDM 245

Query: 295 YAKCGELEFASKLFNSEQQLKRKVWPWNAMIGGFATHGKSKEAIEVFEKMKV-EKVSPNK 354
           YAKCG LE A ++FN+    K+ V  ++AMI   A +G + E  ++F +M   + ++PN 
Sbjct: 246 YAKCGSLERAKRVFNALGS-KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNS 305

Query: 355 VTFVALLNACSHGNRVEEGRGYFESMTGCYGVEPELEHYGCMVDLLGRAGRLKEAEEIIS 414
           VTFV +L AC H   + EG+ YF+ M   +G+ P ++HYGCMVDL GR+G +KEAE  I+
Sbjct: 306 VTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIA 365

Query: 415 SMALTPDIAIWGALLSACKIHKDIEMAERIGKIVRELDSDHLGCHVLLANIYSLTGNWNE 474
           SM + PD+ IWG+LLS  ++  DI+  E   K + ELD  + G +VLL+N+Y+ TG W E
Sbjct: 366 SMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWME 425

Query: 475 ARTLREKIAVSGKKKTPGCSSIELNGTFHQFLVGDRSHPQTKQLYLFLDEMTTKLKIAGY 534
            + +R ++ V G  K PGCS +E+ G  H+F+VGD S  +++++Y  LDE+  +L+ AGY
Sbjct: 426 VKCIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGY 485

Query: 535 VPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTAPGTPIRIVKNLRVCGDCHQAIK 594
           V ++ EVLLD+++ +D+E AL  HSEKLAIAF LM T PGTP+RI+KNLR+CGDCH  +K
Sbjct: 486 VTDTKEVLLDLNE-KDKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMK 545

Query: 595 FISKVYDREIIVRDRIRYHHFKDGTCSCNDYW 617
            ISK++ REI+VRD  R+HHF+DG+CSC D+W
Sbjct: 546 MISKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FJY79.6e-15545.48Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9LN011.6e-14939.30Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9FI801.3e-14842.49Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
O823801.3e-14337.69Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q683I91.9e-14243.71Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_022927711.10.0e+0091.72pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata] ... [more]
XP_023529533.10.0e+0091.56pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita pepo subsp... [more]
XP_038878435.10.0e+0090.10pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida][more]
XP_022967088.10.0e+0090.75pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima][more]
XP_004139110.10.0e+0087.50pentatricopeptide repeat-containing protein At5g66520 [Cucumis sativus] >KGN6646... [more]
Match NameE-valueIdentityDescription
A0A6J1EIF40.0e+0091.72pentatricopeptide repeat-containing protein At3g62890-like OS=Cucurbita moschata... [more]
A0A6J1HU410.0e+0090.75pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima O... [more]
A0A0A0LX830.0e+0087.50DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G6128... [more]
A0A1S3BNM10.0e+0087.34pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=36... [more]
A0A5A7UA761.7e-30089.55Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT5G66520.16.8e-15645.48Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.11.1e-15039.30Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.19.5e-15042.49Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.19.2e-14537.69Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G62890.11.3e-14343.71Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 257..370
e-value: 2.7E-24
score: 88.2
coord: 371..495
e-value: 5.2E-11
score: 44.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 132..256
e-value: 2.3E-34
score: 120.3
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 185..470
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 148..168
e-value: 1.3
score: 9.5
coord: 282..300
e-value: 0.47
score: 10.8
coord: 382..406
e-value: 0.015
score: 15.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 207..241
e-value: 2.6E-10
score: 37.8
coord: 176..207
e-value: 1.0E-9
score: 36.0
coord: 312..343
e-value: 7.8E-7
score: 26.9
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 173..201
e-value: 1.9E-6
score: 27.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 204..252
e-value: 1.1E-12
score: 47.9
coord: 312..355
e-value: 1.1E-12
score: 47.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 308..342
score: 10.961357
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 209..239
score: 9.580234
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 174..208
score: 13.318037
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 481..606
e-value: 1.5E-39
score: 134.8
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 17..590
NoneNo IPR availablePANTHERPTHR47928:SF65TRANSCRIPT PROCESSING PROTEIN, PUTATIVE-RELATEDcoord: 17..590

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003857.1Tan0003857.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding