Tan0021031 (gene) Snake gourd v1

Overview
Name: Tan0021031
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG11: 4196051 .. 4199677 (+)
RNA-Seq Expression: Tan0021031
Synteny: Tan0021031
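The Location field packs the linkage group, start, end and strand into one string. A minimal Python sketch for unpacking it follows; it assumes the coordinates are 1-based and inclusive (the page does not state this), and the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval as shown in the Overview (assumed 1-based, inclusive)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points belong to the feature.
        return self.end - self.start + 1


# Matches strings such as "LG11: 4196051 .. 4199677 (+)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into its four components."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG11: 4196051 .. 4199677 (+)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3627 bp on the plus strand of LG11.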
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCGACGAGTATATGAAAGCCAAAATCTAAATCAACTATTTTTGCTCTCGTCTCAGCGCACTCTCTTTTGATGATCCGAAAGAGAATAAATGGCTCCAACCGCTTTCGACGGTCTACCTTATGGCAGAAATGCACCAGCTTTCGTGCTTTGAAGCAAGTTCATGCTTTTCTTGTCGTCAATGGCTTTAATTCAAGCCCATCTGCCCTCAGACAACTCATTTTCGTCAGTGCTATAGCTGTTTCTGGGACAATGGACTATGCCCACCAAGTGTTCGCTCAAATTACTGAACCGGATATCTTCATGTGGAACACCATGATCAGGGGTTCGGCTCAGAGCTTGAAGCCTGCAAGTGCTGTTTCTCTTTATGCCCAGATGGAAAATCGTGGGGTTAGACCTGATAAATTTACCTTCTCGTTTGTACTCAAGGCCTGTACTAAGCTTTCTTGGGTTCAATTGGGATTTGGGATTCATGGGAAGGTTGTGAAGTTTGGGTTTCAATCCAATACATTTGTAAGGAATACTCTTATTTATTTCCATGCTAATTGTGGCGATTTGGGCACTGCCAGAGCACTTTTCGATGCTTCTGCCAAAAGGGATGTTGTGCCTTGGTCAGCTTTGACGGCAGGCTATGCAAGAAGAGGGGAATTGGATGTTGCACGACAGCTGTTTGATGAAATGCCAATCAAAGACTTGGTCTCGTGGAATGTGATAATAACAGCGTATGCAAAGCTGGGAGAGATGGAGAAGGCAAGGACACTGTTTGATGACGCTCCAAAGAAAGATGTCGTGACTTGGAATGCGATGATTGCAGGATATGTGCTTTCTGGGTTGAACAAGGAAGCTCTGGAGATGTTTGATGCAATGAGAGATGCGGGACAAAGGCCGGATGATGTGACAATGTTGAGTATCTTGTCTGCTTCTGCTGATTTGGGAGAGTTGGAAGTTGGAAAAAAGATACACCATTCCATTTTCGACATGTGCTGCGGACATTTAAGTGTGCTTCTTGGTAATGCACTTATAGACATGTATGCCAAGTGTGGAAGTATTGAGAATGCTATGGACGTTTTTCGAGGGATGAGAGACAAAGATACCTCCTCATGGAATTCAATAATAGGAGGATTGGCTTTTCATGGACATGCCAAGGAATCGATAAATCTGTTTCAAGAAATGCTCAGGTTGAAAATGAGGCCAAATGAAATCACTTTTGTTGGTGTGTTGGTTGCTTGTAGTCATGCTGGGAAAGTACAAGAAGGGCGTATGTATTTTAATCTTATGAGAAACTTGTACAAAATTGAGCCCAATATCAAGCATTACGGATGTATGGTAGACATCTTGGGGCGTGCTGGGCTATTGAGTGAAGCATTTGATTTTATAGACACAATGGAGATTGAACCTAATGCCATCATTTGGAGAACACTACTTGGGGCCTGTAGAGTTCACGGAGATGTCGAGTTGGGAAGGCGTGCAAATGAGCAATTACTCAAAATGAGGAAGGATGAGAGTGGGGATTATGTACTCCTATCTAACATATATGCATCACAAGGTGAGTGGGATGGCGTCGAGAAAGTACGGAGGTTGATGGATGATGGTGGGGTGAAGAAGGAGGCTGGTCGTAGCCTGATTGACGCAGATAATACCTTTCTAATGCATTTTTTGTTTGACTCAAAGCCAAAGTTTGTAGAAGAAGGCAGTTAATCTGCGTGTTACTATGTTCTCATCTTTCATCTATTTTTTCTCTTGGTGATGAGGACCCAACGCACAATGTGCATTCTGTTCATTTTACTGCAACAACCACAGGGCTGCCTCGAAGTTTTAGAATGTTTTAGGATTTTTCCTCTGCCTATGGCCACAGAAACAAGGTCTCAGACCCAGCTCTAGTCAAAGGCGTAGCTATGGACCATGGCTATAGGCTCTCTGAGTTGCATGTCCGAATCCAGTCATCATGACTGCATCAGTATATCTTCAACTTCTGGTTGTCCAACATAAAGCACAGTTTTCAT
GTTAGAGATTTTTTGTGCTTTTTGTATTACTGTGTTTACAGATCGAATCCCCAAAGTAGAGGGTTATAATGGTTGTGCTCACTCGGGAAGTCCCAACTAAATGATTGTGTAAAATTACGGCAATTGACCTGTAAGAATTGGTCCTAGCTTCTTTTGTTATACCTTAATAAACCACACGAATTGTTTCATTAGCCTTATTTTCTTGGCTGCCAGATTCTTTTGGACTTGTTGGATAAATCTGAATAGAAATAGGGCCATTAAATACCCATGGAGCATACTTTCCATTGTTCTCTACATGCTTTTTTCTATTTTTGCAGTATTTTTTACACATAAATAGGAGTATTTTTTTACCATGTTTTCGTGTTGAAGTCTATCTGCATTCTTGGGCTAGCTTAGAGATATAATAGTAGTCATACAGTTTTCATTGCAAACTTAAGCTTAATGCTTACATCGACTTGGAAGAAATAACAATACATCACACACACACTGACAAGTTGGACCTCTAGTCCCTTTTTTAACACACTCAAATTCATATATGCATTATGATTTGCAGATAAAAAGACCCTTATGTTGTTCTCACAAACTGGCAAGGCTGTGATAATGATAAACTACTTTCAACGACAACGAGACAGTGCAGCACCTGAGAGGAAGATCAAAATCATAAACAGAACAATTGCTGCAAAAGATCCAATCAAAGAGCCCGAGATTCGCTTGCAGAAGGAGTTAAATTGTTGGCAGACGGCTAACCAATTTGTCTTGGCATTTCCGTTGTGTGCTAAGTACTCAATGGCTGCTGCTGCAGAAGCCCCCGATGTCAAAAGAGCCATCATTCCCTGTTCATAATTGTATAAAACACACAAGTAAGCTAATTCCTCGAGTTTCTTTCTTTCTTTCTTTTAATTACTGCCAAAGATTACTTGTATATGTATTGGTGGTTTTTTTAAGCTGCTGGCTTGATGGAAAATGACTGAACTTCATACAGATAAAGGAAGTACCGCGTCTAAGAAGACCAACAGAACCCTTGAGCCATGTGCTCTGCTCCTCAGGATGTGGAATATAGAAAGAGGCAGAGAAAGAACCAAGTAGCCACTTACAATGGCATTGGCTACAACAAAGAAGCTGCTCAAACACTCAAAAGAGGTTAGTAAATTCATGAGGAGTTAGTTTTTGAAATATAAATCAAATGCTTGGTTATGTTAATTTATGACATACGTAAACATAGGGAGATCATTATACTCGGCTCTAAACCGAATAAATTGTGTGACAAATGGAAGAGTCTGGTTGGTGGTTCCCATAGCTATTGCACTCCCTAATGTTCCAATAATGGCAGCCAACCTGAGGATGAAGTCGAGTATCGATAAACCTCGGTTCATCCCTGGTTTGGATTTGGAACTCTTTAGCTCACTCGACTCACCTGTAGCTGCTTTCATCTTTCTTTGAAGGTTTATGTTTGGGTTATGAAGCTTTCAGAAGTGATTTTCTTTAGGATGCTCAATGTGCATGGCTGAGCAGTGTGCACCAAGGAGCTATTAATATGCAATACATAAGCTAGGCAATGTTGTTTTGGTGAAGTTTAGTGGCCAAAGAGCCCACAGGAGAGCATAACAAATCCACTCCAAAAACAGGG

mRNA sequence

GCGACGAGTATATGAAAGCCAAAATCTAAATCAACTATTTTTGCTCTCGTCTCAGCGCACTCTCTTTTGATGATCCGAAAGAGAATAAATGGCTCCAACCGCTTTCGACGGTCTACCTTATGGCAGAAATGCACCAGCTTTCGTGCTTTGAAGCAAGTTCATGCTTTTCTTGTCGTCAATGGCTTTAATTCAAGCCCATCTGCCCTCAGACAACTCATTTTCGTCAGTGCTATAGCTGTTTCTGGGACAATGGACTATGCCCACCAAGTGTTCGCTCAAATTACTGAACCGGATATCTTCATGTGGAACACCATGATCAGGGGTTCGGCTCAGAGCTTGAAGCCTGCAAGTGCTGTTTCTCTTTATGCCCAGATGGAAAATCGTGGGGTTAGACCTGATAAATTTACCTTCTCGTTTGTACTCAAGGCCTGTACTAAGCTTTCTTGGGTTCAATTGGGATTTGGGATTCATGGGAAGGTTGTGAAGTTTGGGTTTCAATCCAATACATTTGTAAGGAATACTCTTATTTATTTCCATGCTAATTGTGGCGATTTGGGCACTGCCAGAGCACTTTTCGATGCTTCTGCCAAAAGGGATGTTGTGCCTTGGTCAGCTTTGACGGCAGGCTATGCAAGAAGAGGGGAATTGGATGTTGCACGACAGCTGTTTGATGAAATGCCAATCAAAGACTTGGTCTCGTGGAATGTGATAATAACAGCGTATGCAAAGCTGGGAGAGATGGAGAAGGCAAGGACACTGTTTGATGACGCTCCAAAGAAAGATGTCGTGACTTGGAATGCGATGATTGCAGGATATGTGCTTTCTGGGTTGAACAAGGAAGCTCTGGAGATGTTTGATGCAATGAGAGATGCGGGACAAAGGCCGGATGATGTGACAATGTTGAGTATCTTGTCTGCTTCTGCTGATTTGGGAGAGTTGGAAGTTGGAAAAAAGATACACCATTCCATTTTCGACATGTGCTGCGGACATTTAAGTGTGCTTCTTGGTAATGCACTTATAGACATGTATGCCAAGTGTGGAAGTATTGAGAATGCTATGGACGTTTTTCGAGGGATGAGAGACAAAGATACCTCCTCATGGAATTCAATAATAGGAGGATTGGCTTTTCATGGACATGCCAAGGAATCGATAAATCTGTTTCAAGAAATGCTCAGGTTGAAAATGAGGCCAAATGAAATCACTTTTGTTGGTGTGTTGGTTGCTTGTAGTCATGCTGGGAAAGTACAAGAAGGGCGTATGTATTTTAATCTTATGAGAAACTTGTACAAAATTGAGCCCAATATCAAGCATTACGGATGTATGGTAGACATCTTGGGGCGTGCTGGGCTATTGAGTGAAGCATTTGATTTTATAGACACAATGGAGATTGAACCTAATGCCATCATTTGGAGAACACTACTTGGGGCCTGTAGAGTTCACGGAGATGTCGAGTTGGGAAGGCGTGCAAATGAGCAATTACTCAAAATGAGGAAGGATGAGAGTGGGGATTATGTACTCCTATCTAACATATATGCATCACAAGGTGAGTGGGATGGCGTCGAGAAAGTACGGAGGTTGATGGATGATGGTGGGGTGAAGAAGGAGGCTGGTCGTAGCCTGATTGACGCAGATAATACCTTTCTAATGCATTTTTTGTTTGACTCAAAGCCAAAGTTTGTAGAAGAAGGCAGTTAATCTGCGTGTTACTATGTTCTCATCTTTCATCTATTTTTTCTCTTGGTGATGAGGACCCAACGCACAATGTGCATTCTGTTCATTTTACTGCAACAACCACAGGGCTGCCTCGAAGTTTTAGAATGTTTTAGGATTTTTCCTCTGCCTATGGCCACAGAAACAAGGTCTCAGACCCAGCTCTAGTCAAAGGCGTAGCTATGGACCATGGCTATAGGCTCTCTGAGTTGCATGTCCGAATCCAGTCATCATGACTGCATCAGTATATCTTCAACTTCTGGTTGTCCAACATAAAGCACAGTTTTCAT
GTTAGAGATTTTTTGTGCTTTTTGTATTACTGTGTTTACAGATCGAATCCCCAAAGTAGAGGGTTATAATGGTTGTGCTCACTCGGGAAGTCCCAACTAAATGATTGTGTAAAATTACGGCAATTGACCTATAAAAAGACCCTTATGTTGTTCTCACAAACTGGCAAGGCTGTGATAATGATAAACTACTTTCAACGACAACGAGACAGTGCAGCACCTGAGAGGAAGATCAAAATCATAAACAGAACAATTGCTGCAAAAGATCCAATCAAAGAGCCCGAGATTCGCTTGCAGAAGGAGTTAAATTGTTGGCAGACGGCTAACCAATTTGTCTTGGCATTTCCGTTGTGTGCTAAGTACTCAATGGCTGCTGCTGCAGAAGCCCCCGATGTCAAAAGAGCCATCATTCCCTGTTCATAATTGTATAAAACACACAAATAAAGGAAGTACCGCGTCTAAGAAGACCAACAGAACCCTTGAGCCATGTGCTCTGCTCCTCAGGATGTGGAATATAGAAAGAGGCAGAGAAAGAACCAAGTAGCCACTTACAATGGCATTGGCTACAACAAAGAAGCTGCTCAAACACTCAAAAGAGGTTAGTAAATTCATGAGGAGTTAGTTTTTGAAATATAAATCAAATGCTTGGTTATGTTAATTTATGACATACGTAAACATAGGGAGATCATTATACTCGGCTCTAAACCGAATAAATTGTGTGACAAATGGAAGAGTCTGGTTGGTGGTTCCCATAGCTATTGCACTCCCTAATGTTCCAATAATGGCAGCCAACCTGAGGATGAAGTCGAGTATCGATAAACCTCGGTTCATCCCTGGTTTGGATTTGGAACTCTTTAGCTCACTCGACTCACCTGTAGCTGCTTTCATCTTTCTTTGAAGGTTTATGTTTGGGTTATGAAGCTTTCAGAAGTGATTTTCTTTAGGATGCTCAATGTGCATGGCTGAGCAGTGTGCACCAAGGAGCTATTAATATGCAATACATAAGCTAGGCAATGTTGTTTTGGTGAAGTTTAGTGGCCAAAGAGCCCACAGGAGAGCATAACAAATCCACTCCAAAAACAGGG

Coding sequence (CDS)

ATGATCCGAAAGAGAATAAATGGCTCCAACCGCTTTCGACGGTCTACCTTATGGCAGAAATGCACCAGCTTTCGTGCTTTGAAGCAAGTTCATGCTTTTCTTGTCGTCAATGGCTTTAATTCAAGCCCATCTGCCCTCAGACAACTCATTTTCGTCAGTGCTATAGCTGTTTCTGGGACAATGGACTATGCCCACCAAGTGTTCGCTCAAATTACTGAACCGGATATCTTCATGTGGAACACCATGATCAGGGGTTCGGCTCAGAGCTTGAAGCCTGCAAGTGCTGTTTCTCTTTATGCCCAGATGGAAAATCGTGGGGTTAGACCTGATAAATTTACCTTCTCGTTTGTACTCAAGGCCTGTACTAAGCTTTCTTGGGTTCAATTGGGATTTGGGATTCATGGGAAGGTTGTGAAGTTTGGGTTTCAATCCAATACATTTGTAAGGAATACTCTTATTTATTTCCATGCTAATTGTGGCGATTTGGGCACTGCCAGAGCACTTTTCGATGCTTCTGCCAAAAGGGATGTTGTGCCTTGGTCAGCTTTGACGGCAGGCTATGCAAGAAGAGGGGAATTGGATGTTGCACGACAGCTGTTTGATGAAATGCCAATCAAAGACTTGGTCTCGTGGAATGTGATAATAACAGCGTATGCAAAGCTGGGAGAGATGGAGAAGGCAAGGACACTGTTTGATGACGCTCCAAAGAAAGATGTCGTGACTTGGAATGCGATGATTGCAGGATATGTGCTTTCTGGGTTGAACAAGGAAGCTCTGGAGATGTTTGATGCAATGAGAGATGCGGGACAAAGGCCGGATGATGTGACAATGTTGAGTATCTTGTCTGCTTCTGCTGATTTGGGAGAGTTGGAAGTTGGAAAAAAGATACACCATTCCATTTTCGACATGTGCTGCGGACATTTAAGTGTGCTTCTTGGTAATGCACTTATAGACATGTATGCCAAGTGTGGAAGTATTGAGAATGCTATGGACGTTTTTCGAGGGATGAGAGACAAAGATACCTCCTCATGGAATTCAATAATAGGAGGATTGGCTTTTCATGGACATGCCAAGGAATCGATAAATCTGTTTCAAGAAATGCTCAGGTTGAAAATGAGGCCAAATGAAATCACTTTTGTTGGTGTGTTGGTTGCTTGTAGTCATGCTGGGAAAGTACAAGAAGGGCGTATGTATTTTAATCTTATGAGAAACTTGTACAAAATTGAGCCCAATATCAAGCATTACGGATGTATGGTAGACATCTTGGGGCGTGCTGGGCTATTGAGTGAAGCATTTGATTTTATAGACACAATGGAGATTGAACCTAATGCCATCATTTGGAGAACACTACTTGGGGCCTGTAGAGTTCACGGAGATGTCGAGTTGGGAAGGCGTGCAAATGAGCAATTACTCAAAATGAGGAAGGATGAGAGTGGGGATTATGTACTCCTATCTAACATATATGCATCACAAGGTGAGTGGGATGGCGTCGAGAAAGTACGGAGGTTGATGGATGATGGTGGGGTGAAGAAGGAGGCTGGTCGTAGCCTGATTGACGCAGATAATACCTTTCTAATGCATTTTTTGTTTGACTCAAAGCCAAAGTTTGTAGAAGAAGGCAGTTAA

Protein sequence

MIRKRINGSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVLKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKDVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAKESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVEEGS
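The protein sequence corresponds to the CDS read codon by codon. The Python sketch below shows that relationship, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not the tool the database used.

```python
# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGATCCGAAAGAGAATAAATGGC"))
# Last six codons of the CDS, including the TAA stop.
print(translate("GTAGAAGAAGGCAGTTAA"))
```

The two fragments translate to the first eight residues (MIRKRING) and the last five residues (VEEGS) of the protein sequence shown above.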
Homology
BLAST of Tan0021031 vs. ExPASy Swiss-Prot
Match: Q9LXF2 (Pentatricopeptide repeat-containing protein At5g15300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E40 PE=2 SV=2)

HSP 1 Score: 665.6 bits (1716), Expect = 4.7e-190
Identity = 322/547 (58.87%), Positives = 420/547 (76.78%), Query Frame = 0
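Each HSP summary reports identities and positives as counts over the number of aligned columns, followed by the same value as a percentage. As a hedged sketch, the Python snippet below parses such a line and recomputes the percentages; `parse_hsp_stats` is a hypothetical helper, and its pattern accepts both "Positives" and the variant spelling "Postives".

```python
import re

# Accepts "Positives" as well as the variant spelling "Postives".
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, length2 = map(int, match.groups())
    if length != length2:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identities": identities,
        "positives": positives,
        "aligned_columns": length,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 322/547 (58.87%), Positives = 420/547 (76.78%), Query Frame = 0"
)
print(stats)
```

For the Q9LXF2 hit this reproduces the reported values: 322 of 547 columns are identical (58.87%) and 420 of 547 are conservative matches (76.78%).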

Query: 1   MIRKRING--SNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIR++ N   +NR RR  LWQ C + R LKQ+HA +VVNG  S+ S + +LI+ ++++V 
Sbjct: 1   MIRRQTNDRTTNR-RRPKLWQNCKNIRTLKQIHASMVVNGLMSNLSVVGELIYSASLSVP 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           G + YAH++F +I +PD+ + N ++RGSAQS+KP   VSLY +ME RGV PD++TF+FVL
Sbjct: 61  GALKYAHKLFDEIPKPDVSICNHVLRGSAQSMKPEKTVSLYTEMEKRGVSPDRYTFTFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KAC+KL W   GF  HGKVV+ GF  N +V+N LI FHANCGDLG A  LFD SAK   V
Sbjct: 121 KACSKLEWRSNGFAFHGKVVRHGFVLNEYVKNALILFHANCGDLGIASELFDDSAKAHKV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
            WS++T+GYA+RG++D A +LFDEMP KD V+WNV+IT   K  EM+ AR LFD   +KD
Sbjct: 181 AWSSMTSGYAKRGKIDEAMRLFDEMPYKDQVAWNVMITGCLKCKEMDSARELFDRFTEKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMI+GYV  G  KEAL +F  MRDAG+ PD VT+LS+LSA A LG+LE GK++H 
Sbjct: 241 VVTWNAMISGYVNCGYPKEALGIFKEMRDAGEHPDVVTILSLLSACAVLGDLETGKRLHI 300

Query: 301 SIFDMCCGHLSVLLG----NALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFH 360
            I +      S+ +G    NALIDMYAKCGSI+ A++VFRG++D+D S+WN++I GLA H
Sbjct: 301 YILETASVSSSIYVGTPIWNALIDMYAKCGSIDRAIEVFRGVKDRDLSTWNTLIVGLALH 360

Query: 361 GHAKESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKH 420
            HA+ SI +F+EM RLK+ PNE+TF+GV++ACSH+G+V EGR YF+LMR++Y IEPNIKH
Sbjct: 361 -HAEGSIEMFEEMQRLKVWPNEVTFIGVILACSHSGRVDEGRKYFSLMRDMYNIEPNIKH 420

Query: 421 YGCMVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMR 480
           YGCMVD+LGRAG L EAF F+++M+IEPNAI+WRTLLGAC+++G+VELG+ ANE+LL MR
Sbjct: 421 YGCMVDMLGRAGQLEEAFMFVESMKIEPNAIVWRTLLGACKIYGNVELGKYANEKLLSMR 480

Query: 481 KDESGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDA-DNTFLMHFLFDSK 540
           KDESGDYVLLSNIYAS G+WDGV+KVR++ DD  VKK  G SLI+  D+  +M +L  S+
Sbjct: 481 KDESGDYVLLSNIYASTGQWDGVQKVRKMFDDTRVKKPTGVSLIEEDDDKLMMRYLLSSE 540
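In the alignment blocks, the middle line repeats a residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise; "-" marks a gap. The identity count in the HSP summary can be recovered by comparing the Query and Sbjct rows column by column. A minimal Python sketch, with `alignment_identity` as a hypothetical helper name:

```python
def alignment_identity(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows.

    Both rows must already be aligned, i.e. have equal length with '-'
    marking gaps. Gap columns count toward the total but never as identities.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# First 16 columns of the Q9LXF2 alignment above.
identical, columns = alignment_identity("MIRKRING--SNRFRR", "MIRRQTNDRTTNR-RR")
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
```

Summing the per-block counts over all blocks of an HSP gives the numerator and denominator printed on its Identity line.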

BLAST of Tan0021031 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 422.5 bits (1085), Expect = 7.0e-117
Identity = 232/579 (40.07%), Positives = 324/579 (55.96%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLI-FVSAIAVSGTMDYAHQVFAQITEP 75
           +L   C + ++L+ +HA ++  G +++  AL +LI F         + YA  VF  I EP
Sbjct: 38  SLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEP 97

Query: 76  DIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVLKACTKLSWVQLGFGIH 135
           ++ +WNTM RG A S  P SA+ LY  M + G+ P+ +TF FVLK+C K    + G  IH
Sbjct: 98  NLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIH 157

Query: 136 GKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGELD 195
           G V+K G   + +V  +LI  +   G L  A  +FD S  RDVV ++AL  GYA RG ++
Sbjct: 158 GHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIE 217

Query: 196 VARQLFDEMPIKDLVSWNVIITAYA----------------------------------- 255
            A++LFDE+P+KD+VSWN +I+ YA                                   
Sbjct: 218 NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACA 277

Query: 256 -----------------------------------KLGEMEKARTLFDDAPKKDVVTWNA 315
                                              K GE+E A  LF+  P KDV++WN 
Sbjct: 278 QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNT 337

Query: 316 MIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFDMC 375
           +I GY    L KEAL +F  M  +G+ P+DVTMLSIL A A LG +++G+ IH  I    
Sbjct: 338 LIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 397

Query: 376 CGHLSV-LLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAKESINL 435
            G  +   L  +LIDMYAKCG IE A  VF  +  K  SSWN++I G A HG A  S +L
Sbjct: 398 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 457

Query: 436 FQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDILG 495
           F  M ++ ++P++ITFVG+L ACSH+G +  GR  F  M   YK+ P ++HYGCM+D+LG
Sbjct: 458 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 517

Query: 496 RAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYVL 523
            +GL  EA + I+ ME+EP+ +IW +LL AC++HG+VELG    E L+K+  +  G YVL
Sbjct: 518 HSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVL 577

BLAST of Tan0021031 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 2.5e-114
Identity = 213/552 (38.59%), Positives = 325/552 (58.88%), Query Frame = 0

Query: 20  KCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPDIFMW 79
           KC +   +KQ+HA ++    +       +LI  SA+++    + A +VF Q+ EP++ + 
Sbjct: 28  KCANLNQVKQLHAQIIRRNLHEDLHIAPKLI--SALSLCRQTNLAVRVFNQVQEPNVHLC 87

Query: 80  NTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVLKACTKLSWVQLGFGIHGKVVK 139
           N++IR  AQ+ +P  A  ++++M+  G+  D FT+ F+LKAC+  SW+ +   +H  + K
Sbjct: 88  NSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEK 147

Query: 140 FGFQSNTFVRNTLIYFHANCGDLGT--ARALFDASAKRDVVPWSALTAGYARRGELDVAR 199
            G  S+ +V N LI  ++ CG LG   A  LF+  ++RD V W+++  G  + GEL  AR
Sbjct: 148 LGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDAR 207

Query: 200 QLFDEMPIKDLVSWNVIITAYA-------------------------------KLGEMEK 259
           +LFDEMP +DL+SWN ++  YA                               K G+ME 
Sbjct: 208 RLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGDMEM 267

Query: 260 ARTLFD--DAPKKDVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSAS 319
           AR +FD    P K+VVTW  +IAGY   GL KEA  + D M  +G + D   ++SIL+A 
Sbjct: 268 ARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISILAAC 327

Query: 320 ADLGELEVGKKIHHSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSW 379
            + G L +G +IH  +     G  + +L NAL+DMYAKCG+++ A DVF  +  KD  SW
Sbjct: 328 TESGLLSLGMRIHSILKRSNLGSNAYVL-NALLDMYAKCGNLKKAFDVFNDIPKKDLVSW 387

Query: 380 NSIIGGLAFHGHAKESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRN 439
           N+++ GL  HGH KE+I LF  M R  +RP+++TF+ VL +C+HAG + EG  YF  M  
Sbjct: 388 NTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEK 447

Query: 440 LYKIEPNIKHYGCMVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGR 499
           +Y + P ++HYGC+VD+LGR G L EA   + TM +EPN +IW  LLGACR+H +V++ +
Sbjct: 448 VYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRMHNEVDIAK 507

Query: 500 RANEQLLKMRKDESGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTF 536
              + L+K+   + G+Y LLSNIYA+  +W+GV  +R  M   GV+K +G S ++ ++  
Sbjct: 508 EVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKPSGASSVELEDGI 567

BLAST of Tan0021031 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 1.6e-113
Identity = 204/525 (38.86%), Positives = 320/525 (60.95%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPD 75
           +L   C + RAL Q+H   +  G ++      +LI   AI++S  + YA ++     EPD
Sbjct: 10  SLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPD 69

Query: 76  IFMWNTMIRGSAQSLKPASAVSLYAQMENRG-VRPDKFTFSFVLKACTKLSWVQLGFGIH 135
            FM+NT++RG ++S +P ++V+++ +M  +G V PD F+F+FV+KA      ++ GF +H
Sbjct: 70  AFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMH 129

Query: 136 GKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGELD 195
            + +K G +S+ FV  TLI  +  CG +  AR +FD   + ++V W+A+     R  ++ 
Sbjct: 130 CQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVA 189

Query: 196 VARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKDVVTWNAMIAGYVLSGL 255
            AR++FD+M +++  SWNV++  Y K GE+E A+ +F + P +D V+W+ MI G   +G 
Sbjct: 190 GAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGS 249

Query: 256 NKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFDMCCGHLSVLLGN 315
             E+   F  ++ AG  P++V++  +LSA +  G  E G KI H   +       V + N
Sbjct: 250 FNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFG-KILHGFVEKAGYSWIVSVNN 309

Query: 316 ALIDMYAKCGSIENAMDVFRGMRDKD-TSSWNSIIGGLAFHGHAKESINLFQEMLRLKMR 375
           ALIDMY++CG++  A  VF GM++K    SW S+I GLA HG  +E++ LF EM    + 
Sbjct: 310 ALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVT 369

Query: 376 PNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDILGRAGLLSEAFD 435
           P+ I+F+ +L ACSHAG ++EG  YF+ M+ +Y IEP I+HYGCMVD+ GR+G L +A+D
Sbjct: 370 PDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYD 429

Query: 436 FIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYVLLSNIYASQGE 495
           FI  M I P AI+WRTLLGAC  HG++EL  +  ++L ++  + SGD VLLSN YA+ G+
Sbjct: 430 FICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATAGK 489

Query: 496 WDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 539
           W  V  +R+ M    +KK    SL++   T +  F    K K ++
Sbjct: 490 WKDVASIRKSMIVQRIKKTTAWSLVEVGKT-MYKFTAGEKKKGID 532

BLAST of Tan0021031 vs. ExPASy Swiss-Prot
Match: Q9SJZ3 (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 1.1e-109
Identity = 214/593 (36.09%), Positives = 323/593 (54.47%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPD 75
           +L +KC     LKQ+ A +++NG    P A  +LI   A++ S  +DY+ ++   I  P+
Sbjct: 58  SLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIENPN 117

Query: 76  IFMWNTMIRGSAQSLKPASAVSLYAQMENRGV---RPDKFTFSFVLKACTKLSWVQLGFG 135
           IF WN  IRG ++S  P  +  LY QM   G    RPD FT+  + K C  L    LG  
Sbjct: 118 IFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGHM 177

Query: 136 IHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGE 195
           I G V+K   +  + V N  I+  A+CGD+  AR +FD S  RD+V W+ L  GY + GE
Sbjct: 178 ILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKIGE 237

Query: 196 ------------------------------------------------------------ 255
                                                                       
Sbjct: 238 AEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLVNA 297

Query: 256 ----------LDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKDVVTW 315
                     +  AR++FD +  + +VSW  +I+ YA+ G ++ +R LFDD  +KDVV W
Sbjct: 298 LMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVVLW 357

Query: 316 NAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFD 375
           NAMI G V +   ++AL +F  M+ +  +PD++TM+  LSA + LG L+VG  IH  I +
Sbjct: 358 NAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI-E 417

Query: 376 MCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAKESIN 435
                L+V LG +L+DMYAKCG+I  A+ VF G++ +++ ++ +IIGGLA HG A  +I+
Sbjct: 418 KYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTAIS 477

Query: 436 LFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDIL 495
            F EM+   + P+EITF+G+L AC H G +Q GR YF+ M++ + + P +KHY  MVD+L
Sbjct: 478 YFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVDLL 537

Query: 496 GRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYV 536
           GRAGLL EA   +++M +E +A +W  LL  CR+HG+VELG +A ++LL++   +SG YV
Sbjct: 538 GRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSGIYV 597

BLAST of Tan0021031 vs. NCBI nr
Match: XP_023544416.1 (pentatricopeptide repeat-containing protein At5g15300 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1021.9 bits (2641), Expect = 2.0e-294
Identity = 500/544 (91.91%), Positives = 525/544 (96.51%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FRALKQVHAFLV+NGFNSSPSALR+LIF+SAIAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRALKQVHAFLVINGFNSSPSALRELIFLSAIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFMWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDL TARALFDASAKRDV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKRDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENALDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
           KESINLFQEM+RLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLMR+ YKIEPNIKHYGC
Sbjct: 361 KESINLFQEMMRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMRDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLM+FLFDSKPKFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMNFLFDSKPKFV 540

Query: 541 EEGS 542
           EE S
Sbjct: 541 EESS 544

BLAST of Tan0021031 vs. NCBI nr
Match: KAG6602506.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1019.2 bits (2634), Expect = 1.3e-293
Identity = 500/544 (91.91%), Positives = 523/544 (96.14%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FRALKQVHAFLVVNGFNSSPSALR+LIF+SAIAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRALKQVHAFLVVNGFNSSPSALRELIFLSAIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFMWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDL TARALFDASAKRDV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKRDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENALDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
            ESINLFQEMLRLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLM + YKIEPNIKHYGC
Sbjct: 361 NESINLFQEMLRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMIDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLMHFLFDSKPKFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMHFLFDSKPKFV 540

Query: 541 EEGS 542
           +E S
Sbjct: 541 KESS 544

BLAST of Tan0021031 vs. NCBI nr
Match: KAG7033178.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1018.1 bits (2631), Expect = 2.9e-293
Identity = 499/544 (91.73%), Positives = 523/544 (96.14%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FRALKQVHAFLVVNGFNSSPSALR+LIF+SAIAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRALKQVHAFLVVNGFNSSPSALRELIFLSAIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFMWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKV+KFGFQSNTFVRNTLIYFHANCGDL TARALFDASAKRDV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVLKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKRDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENALDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
            ESINLFQEMLRLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLM + YKIEPNIKHYGC
Sbjct: 361 NESINLFQEMLRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMIDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLMHFLFDSKPKFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMHFLFDSKPKFV 540

Query: 541 EEGS 542
           +E S
Sbjct: 541 KESS 544

BLAST of Tan0021031 vs. NCBI nr
Match: XP_022954064.1 (pentatricopeptide repeat-containing protein At5g15300 [Cucurbita moschata])

HSP 1 Score: 1015.8 bits (2625), Expect = 1.4e-292
Identity = 497/544 (91.36%), Positives = 523/544 (96.14%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FRALKQVHAFLVVNGFNSSPSALR+LIF+S+IAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRALKQVHAFLVVNGFNSSPSALRELIFLSSIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIF+WNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFIWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDL TARALFDASAK DV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKTDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENAVDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
            ESINLFQEMLRLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLMR+ YKIEPNIKHYGC
Sbjct: 361 NESINLFQEMLRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMRDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDV+LGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVDLGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLMHFLFDSKPKFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMHFLFDSKPKFV 540

Query: 541 EEGS 542
           +E S
Sbjct: 541 KESS 544

BLAST of Tan0021031 vs. NCBI nr
Match: XP_022133508.1 (pentatricopeptide repeat-containing protein At5g15300 [Momordica charantia])

HSP 1 Score: 1015.4 bits (2624), Expect = 1.9e-292
Identity = 496/543 (91.34%), Positives = 522/543 (96.13%), Query Frame = 0

Query: 1   MIRKRIN--GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIRKR N  G+NRF+RS+LWQKCTSFRALKQVHAFLVVNGFNSS SALR+LIFV AIA+S
Sbjct: 1   MIRKRTNDSGANRFQRSSLWQKCTSFRALKQVHAFLVVNGFNSSTSALRELIFVGAIAIS 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKP +AVS+YAQMENRGVRPDKFTFSFVL
Sbjct: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPVNAVSVYAQMENRGVRPDKFTFSFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KACTKLSWV LGFGIHGKVVKFGFQSN+FVRNTLIYFHANCGDL TARALFDASAKRDVV
Sbjct: 121 KACTKLSWVNLGFGIHGKVVKFGFQSNSFVRNTLIYFHANCGDLATARALFDASAKRDVV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
           PWSALTAGYARRGELDVAR LFDEMP+KDLVSWNV+ITAYAKLGEMEKAR LFDDAP+KD
Sbjct: 181 PWSALTAGYARRGELDVARMLFDEMPVKDLVSWNVMITAYAKLGEMEKARKLFDDAPQKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMIAGYVL+GLNK+ALEMFDAM D GQRPDDVTMLSILSASADLG+LEVGK IH 
Sbjct: 241 VVTWNAMIAGYVLAGLNKQALEMFDAMIDQGQRPDDVTMLSILSASADLGDLEVGKNIHR 300

Query: 301 SIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAK 360
           SIF+MCCG L VLLGNALIDMYAKCGSIENA++VF+GMRDKDTSSWNSIIGGLAFHGHA+
Sbjct: 301 SIFNMCCGDLGVLLGNALIDMYAKCGSIENALEVFKGMRDKDTSSWNSIIGGLAFHGHAE 360

Query: 361 ESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420
           +SINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM
Sbjct: 361 KSINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420

Query: 421 VDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480
           VDILGRAG L EAFDFIDTMEI+PNAIIWRTLLGACRVHGDVELGRRANEQLL+MRKDES
Sbjct: 421 VDILGRAGQLIEAFDFIDTMEIKPNAIIWRTLLGACRVHGDVELGRRANEQLLEMRKDES 480

Query: 481 GDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 540
           GDYVLLSNIYASQGEW GVEKVR+LMD+GGVKKEAG SLIDADN+FLM+FLFDSKPKFVE
Sbjct: 481 GDYVLLSNIYASQGEWGGVEKVRKLMDEGGVKKEAGCSLIDADNSFLMNFLFDSKPKFVE 540

Query: 541 EGS 542
            GS
Sbjct: 541 GGS 543

BLAST of Tan0021031 vs. ExPASy TrEMBL
Match: A0A6J1GRS9 (pentatricopeptide repeat-containing protein At5g15300 OS=Cucurbita moschata OX=3662 GN=LOC111456437 PE=4 SV=1)

HSP 1 Score: 1015.8 bits (2625), Expect = 6.9e-293
Identity = 497/544 (91.36%), Positives = 523/544 (96.14%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FRALKQVHAFLVVNGFNSSPSALR+LIF+S+IAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRALKQVHAFLVVNGFNSSPSALRELIFLSSIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIF+WNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFIWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDL TARALFDASAK DV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKTDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENAVDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
            ESINLFQEMLRLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLMR+ YKIEPNIKHYGC
Sbjct: 361 NESINLFQEMLRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMRDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDV+LGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVDLGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLMHFLFDSKPKFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMHFLFDSKPKFV 540

Query: 541 EEGS 542
           +E S
Sbjct: 541 KESS 544

BLAST of Tan0021031 vs. ExPASy TrEMBL
Match: A0A6J1BZB3 (pentatricopeptide repeat-containing protein At5g15300 OS=Momordica charantia OX=3673 GN=LOC111006070 PE=4 SV=1)

HSP 1 Score: 1015.4 bits (2624), Expect = 9.0e-293
Identity = 496/543 (91.34%), Positives = 522/543 (96.13%), Query Frame = 0

Query: 1   MIRKRIN--GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIRKR N  G+NRF+RS+LWQKCTSFRALKQVHAFLVVNGFNSS SALR+LIFV AIA+S
Sbjct: 1   MIRKRTNDSGANRFQRSSLWQKCTSFRALKQVHAFLVVNGFNSSTSALRELIFVGAIAIS 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKP +AVS+YAQMENRGVRPDKFTFSFVL
Sbjct: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPVNAVSVYAQMENRGVRPDKFTFSFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KACTKLSWV LGFGIHGKVVKFGFQSN+FVRNTLIYFHANCGDL TARALFDASAKRDVV
Sbjct: 121 KACTKLSWVNLGFGIHGKVVKFGFQSNSFVRNTLIYFHANCGDLATARALFDASAKRDVV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
           PWSALTAGYARRGELDVAR LFDEMP+KDLVSWNV+ITAYAKLGEMEKAR LFDDAP+KD
Sbjct: 181 PWSALTAGYARRGELDVARMLFDEMPVKDLVSWNVMITAYAKLGEMEKARKLFDDAPQKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMIAGYVL+GLNK+ALEMFDAM D GQRPDDVTMLSILSASADLG+LEVGK IH 
Sbjct: 241 VVTWNAMIAGYVLAGLNKQALEMFDAMIDQGQRPDDVTMLSILSASADLGDLEVGKNIHR 300

Query: 301 SIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAK 360
           SIF+MCCG L VLLGNALIDMYAKCGSIENA++VF+GMRDKDTSSWNSIIGGLAFHGHA+
Sbjct: 301 SIFNMCCGDLGVLLGNALIDMYAKCGSIENALEVFKGMRDKDTSSWNSIIGGLAFHGHAE 360

Query: 361 ESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420
           +SINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM
Sbjct: 361 KSINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420

Query: 421 VDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480
           VDILGRAG L EAFDFIDTMEI+PNAIIWRTLLGACRVHGDVELGRRANEQLL+MRKDES
Sbjct: 421 VDILGRAGQLIEAFDFIDTMEIKPNAIIWRTLLGACRVHGDVELGRRANEQLLEMRKDES 480

Query: 481 GDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 540
           GDYVLLSNIYASQGEW GVEKVR+LMD+GGVKKEAG SLIDADN+FLM+FLFDSKPKFVE
Sbjct: 481 GDYVLLSNIYASQGEWGGVEKVRKLMDEGGVKKEAGCSLIDADNSFLMNFLFDSKPKFVE 540

Query: 541 EGS 542
            GS
Sbjct: 541 GGS 543

BLAST of Tan0021031 vs. ExPASy TrEMBL
Match: A0A6J1JPF2 (pentatricopeptide repeat-containing protein At5g15300 OS=Cucurbita maxima OX=3661 GN=LOC111487155 PE=4 SV=1)

HSP 1 Score: 1015.0 bits (2623), Expect = 1.2e-292
Identity = 498/544 (91.54%), Positives = 523/544 (96.14%), Query Frame = 0

Query: 1   MIRKRIN---GSNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAV 60
           MIRKR N    SNRF+RS+LWQKCT+FR LKQ+HAFLVVNGFNSSPSALR+LIF+SAIAV
Sbjct: 1   MIRKRTNDNVSSNRFQRSSLWQKCTNFRTLKQLHAFLVVNGFNSSPSALRELIFLSAIAV 60

Query: 61  SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFV 120
           SGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PASAVSLYAQMENRGV+PDKFTFSFV
Sbjct: 61  SGTMHYAHQVFAQITEPDIFMWNTMIRGSAQSLAPASAVSLYAQMENRGVKPDKFTFSFV 120

Query: 121 LKACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDV 180
           LKACTKLSWV+LGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDL TARALFDASAKRDV
Sbjct: 121 LKACTKLSWVKLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLSTARALFDASAKRDV 180

Query: 181 VPWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKK 240
           VPWSALTAGYARRGELDVARQLFDEMPI+DLVSWNV+ITAYAKLG MEKAR LFD+AP K
Sbjct: 181 VPWSALTAGYARRGELDVARQLFDEMPIRDLVSWNVMITAYAKLGAMEKARKLFDEAPNK 240

Query: 241 DVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIH 300
           DVVTWNAMIAGYVLSGLN+EALEMFDAMRD GQRPDDVTMLSILSA+ADLG+LEVGKKI+
Sbjct: 241 DVVTWNAMIAGYVLSGLNREALEMFDAMRDVGQRPDDVTMLSILSATADLGDLEVGKKIY 300

Query: 301 HSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHA 360
            SIFDM CG +SVLLGNALIDMYAKCGSIENA+DVFR MRDKDTSSWNSIIGGLAFHGHA
Sbjct: 301 RSIFDMYCGDISVLLGNALIDMYAKCGSIENALDVFRAMRDKDTSSWNSIIGGLAFHGHA 360

Query: 361 KESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGC 420
           KESINLFQEMLRLK+RPNEITFVGVLVACSHAGKVQEGRMYFNLMR+ YKIEPNIKHYGC
Sbjct: 361 KESINLFQEMLRLKIRPNEITFVGVLVACSHAGKVQEGRMYFNLMRDSYKIEPNIKHYGC 420

Query: 421 MVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480
           MVDILGRAGLL EAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE
Sbjct: 421 MVDILGRAGLLIEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDE 480

Query: 481 SGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFV 540
           SGDYVLLSNIYAS+GEWDGVEKVR+LMDDGGVKKEAGRS+IDADN+FLM+FLFDSK KFV
Sbjct: 481 SGDYVLLSNIYASKGEWDGVEKVRKLMDDGGVKKEAGRSMIDADNSFLMNFLFDSKLKFV 540

Query: 541 EEGS 542
           +E S
Sbjct: 541 KESS 544

BLAST of Tan0021031 vs. ExPASy TrEMBL
Match: A0A5D3BN20 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G002760 PE=4 SV=1)

HSP 1 Score: 971.1 bits (2509), Expect = 1.9e-279
Identity = 472/540 (87.41%), Positives = 506/540 (93.70%), Query Frame = 0

Query: 1   MIRKRINGS--NRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIRKR N +  NRF  S+LWQKCT+FRALKQ+HAFL+VNG NS+ S LR+LIFVSA+ VS
Sbjct: 1   MIRKRTNDNRFNRFHHSSLWQKCTNFRALKQLHAFLIVNGLNSTNSVLRELIFVSAMVVS 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           GTMDYAHQ+FAQIT+PDIFMWNTMIRGS QSLKPA+AVSLY QM+NRGVRPDKFTFSFVL
Sbjct: 61  GTMDYAHQLFAQITQPDIFMWNTMIRGSTQSLKPATAVSLYTQMDNRGVRPDKFTFSFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KACTKLSW +LG  IHGK++K GFQSNTFVRNTLIYFHANCGDL  ARALFD SAKRDVV
Sbjct: 121 KACTKLSWDKLGIVIHGKILKSGFQSNTFVRNTLIYFHANCGDLAIARALFDDSAKRDVV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
           PWSA+TAGYARRG+LDVARQLFDEMP+KDLVSWNV+ITAYAKLGEMEKAR LFD+APKKD
Sbjct: 181 PWSAMTAGYARRGKLDVARQLFDEMPVKDLVSWNVMITAYAKLGEMEKARKLFDEAPKKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMIAGYVLS LNKEALEMFDAMR  GQRPDDVTMLSILSASADLG+LE+GKKIH 
Sbjct: 241 VVTWNAMIAGYVLSRLNKEALEMFDAMRAMGQRPDDVTMLSILSASADLGDLEIGKKIHR 300

Query: 301 SIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAK 360
           SIFDM CG LSVLLGNALIDMYAKCGSI NAM+VF+GMR KDT+SWNSIIGGLA HGHA+
Sbjct: 301 SIFDMRCGDLSVLLGNALIDMYAKCGSIGNAMEVFQGMRRKDTASWNSIIGGLALHGHAE 360

Query: 361 ESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420
           ESIN FQEMLRLKM+PN+ITFVGVLVACSHAGKVQEGR YFNLMRN+YKIEPNIKHYGCM
Sbjct: 361 ESINQFQEMLRLKMKPNDITFVGVLVACSHAGKVQEGRTYFNLMRNMYKIEPNIKHYGCM 420

Query: 421 VDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480
           VDILGRAGLL EAF FIDTME+EPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES
Sbjct: 421 VDILGRAGLLIEAFKFIDTMEVEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480

Query: 481 GDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 539
           GDYVLLSNIYASQGEWDGV+KVR+LMDDGGVKK+ GRSLIDADN+FLMHFLFDSKPKFVE
Sbjct: 481 GDYVLLSNIYASQGEWDGVQKVRKLMDDGGVKKKVGRSLIDADNSFLMHFLFDSKPKFVE 540

BLAST of Tan0021031 vs. ExPASy TrEMBL
Match: A0A1S3CAF6 (pentatricopeptide repeat-containing protein At5g15300 OS=Cucumis melo OX=3656 GN=LOC103498496 PE=4 SV=1)

HSP 1 Score: 969.1 bits (2504), Expect = 7.4e-279
Identity = 471/540 (87.22%), Positives = 506/540 (93.70%), Query Frame = 0

Query: 1   MIRKRINGS--NRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIRKR N +  NRF  S+LWQKCT+FRALKQ+HAFL+VNG NS+ S LR+LIFVSA+ VS
Sbjct: 1   MIRKRTNDNRFNRFHHSSLWQKCTNFRALKQLHAFLIVNGLNSTNSVLRELIFVSAMVVS 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           GTMDYAHQ+FAQIT+PDIFMWNTMIRGS QSLKPA+AVSLY QM+NRGVRPDKFTFSFVL
Sbjct: 61  GTMDYAHQLFAQITQPDIFMWNTMIRGSTQSLKPATAVSLYTQMDNRGVRPDKFTFSFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KACTKLSW +LG  IHGK++K GFQSNTFVRNTLIYFHANCGDL  ARALFD SAKRDVV
Sbjct: 121 KACTKLSWDKLGIVIHGKILKSGFQSNTFVRNTLIYFHANCGDLAIARALFDDSAKRDVV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
           PWSA+TAGYARRG+LDVARQLFDEMP+KDLVSWNV+ITAYAKLGEMEKAR LFD+APKKD
Sbjct: 181 PWSAMTAGYARRGKLDVARQLFDEMPVKDLVSWNVMITAYAKLGEMEKARKLFDEAPKKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMIAGYVLS LNKEALEMFDAMR  GQRPDDVTMLSILSASADLG+LE+GKKIH 
Sbjct: 241 VVTWNAMIAGYVLSRLNKEALEMFDAMRAMGQRPDDVTMLSILSASADLGDLEIGKKIHR 300

Query: 301 SIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAK 360
           SIFDM CG LSVLLGNALIDMYAKCGSI NAM+VF+GMR KDT+SWNSIIGGLA HGHA+
Sbjct: 301 SIFDMRCGDLSVLLGNALIDMYAKCGSIGNAMEVFQGMRRKDTASWNSIIGGLALHGHAE 360

Query: 361 ESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCM 420
           ESIN FQEMLRLKM+PN+ITFVGVLVACSHAGKVQEGR YF+LMRN+YKIEPNIKHYGCM
Sbjct: 361 ESINQFQEMLRLKMKPNDITFVGVLVACSHAGKVQEGRTYFSLMRNMYKIEPNIKHYGCM 420

Query: 421 VDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480
           VDILGRAGLL EAF FIDTME+EPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES
Sbjct: 421 VDILGRAGLLIEAFKFIDTMEVEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDES 480

Query: 481 GDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 539
           GDYVLLSNIYASQGEWDGV+KVR+LMDDGGVKK+ GRSLIDADN+FLMHFLFDSKPKFVE
Sbjct: 481 GDYVLLSNIYASQGEWDGVQKVRKLMDDGGVKKKVGRSLIDADNSFLMHFLFDSKPKFVE 540

BLAST of Tan0021031 vs. TAIR 10
Match: AT5G15300.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 665.6 bits (1716), Expect = 3.4e-191
Identity = 322/547 (58.87%), Positives = 420/547 (76.78%), Query Frame = 0

Query: 1   MIRKRING--SNRFRRSTLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVS 60
           MIR++ N   +NR RR  LWQ C + R LKQ+HA +VVNG  S+ S + +LI+ ++++V 
Sbjct: 1   MIRRQTNDRTTNR-RRPKLWQNCKNIRTLKQIHASMVVNGLMSNLSVVGELIYSASLSVP 60

Query: 61  GTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVL 120
           G + YAH++F +I +PD+ + N ++RGSAQS+KP   VSLY +ME RGV PD++TF+FVL
Sbjct: 61  GALKYAHKLFDEIPKPDVSICNHVLRGSAQSMKPEKTVSLYTEMEKRGVSPDRYTFTFVL 120

Query: 121 KACTKLSWVQLGFGIHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVV 180
           KAC+KL W   GF  HGKVV+ GF  N +V+N LI FHANCGDLG A  LFD SAK   V
Sbjct: 121 KACSKLEWRSNGFAFHGKVVRHGFVLNEYVKNALILFHANCGDLGIASELFDDSAKAHKV 180

Query: 181 PWSALTAGYARRGELDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKD 240
            WS++T+GYA+RG++D A +LFDEMP KD V+WNV+IT   K  EM+ AR LFD   +KD
Sbjct: 181 AWSSMTSGYAKRGKIDEAMRLFDEMPYKDQVAWNVMITGCLKCKEMDSARELFDRFTEKD 240

Query: 241 VVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHH 300
           VVTWNAMI+GYV  G  KEAL +F  MRDAG+ PD VT+LS+LSA A LG+LE GK++H 
Sbjct: 241 VVTWNAMISGYVNCGYPKEALGIFKEMRDAGEHPDVVTILSLLSACAVLGDLETGKRLHI 300

Query: 301 SIFDMCCGHLSVLLG----NALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFH 360
            I +      S+ +G    NALIDMYAKCGSI+ A++VFRG++D+D S+WN++I GLA H
Sbjct: 301 YILETASVSSSIYVGTPIWNALIDMYAKCGSIDRAIEVFRGVKDRDLSTWNTLIVGLALH 360

Query: 361 GHAKESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKH 420
            HA+ SI +F+EM RLK+ PNE+TF+GV++ACSH+G+V EGR YF+LMR++Y IEPNIKH
Sbjct: 361 -HAEGSIEMFEEMQRLKVWPNEVTFIGVILACSHSGRVDEGRKYFSLMRDMYNIEPNIKH 420

Query: 421 YGCMVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMR 480
           YGCMVD+LGRAG L EAF F+++M+IEPNAI+WRTLLGAC+++G+VELG+ ANE+LL MR
Sbjct: 421 YGCMVDMLGRAGQLEEAFMFVESMKIEPNAIVWRTLLGACKIYGNVELGKYANEKLLSMR 480

Query: 481 KDESGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDA-DNTFLMHFLFDSK 540
           KDESGDYVLLSNIYAS G+WDGV+KVR++ DD  VKK  G SLI+  D+  +M +L  S+
Sbjct: 481 KDESGDYVLLSNIYASTGQWDGVQKVRKMFDDTRVKKPTGVSLIEEDDDKLMMRYLLSSE 540

BLAST of Tan0021031 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 422.5 bits (1085), Expect = 4.9e-118
Identity = 232/579 (40.07%), Positives = 324/579 (55.96%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLI-FVSAIAVSGTMDYAHQVFAQITEP 75
           +L   C + ++L+ +HA ++  G +++  AL +LI F         + YA  VF  I EP
Sbjct: 38  SLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEP 97

Query: 76  DIFMWNTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVLKACTKLSWVQLGFGIH 135
           ++ +WNTM RG A S  P SA+ LY  M + G+ P+ +TF FVLK+C K    + G  IH
Sbjct: 98  NLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIH 157

Query: 136 GKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGELD 195
           G V+K G   + +V  +LI  +   G L  A  +FD S  RDVV ++AL  GYA RG ++
Sbjct: 158 GHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIE 217

Query: 196 VARQLFDEMPIKDLVSWNVIITAYA----------------------------------- 255
            A++LFDE+P+KD+VSWN +I+ YA                                   
Sbjct: 218 NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACA 277

Query: 256 -----------------------------------KLGEMEKARTLFDDAPKKDVVTWNA 315
                                              K GE+E A  LF+  P KDV++WN 
Sbjct: 278 QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNT 337

Query: 316 MIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFDMC 375
           +I GY    L KEAL +F  M  +G+ P+DVTMLSIL A A LG +++G+ IH  I    
Sbjct: 338 LIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 397

Query: 376 CGHLSV-LLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAKESINL 435
            G  +   L  +LIDMYAKCG IE A  VF  +  K  SSWN++I G A HG A  S +L
Sbjct: 398 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 457

Query: 436 FQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDILG 495
           F  M ++ ++P++ITFVG+L ACSH+G +  GR  F  M   YK+ P ++HYGCM+D+LG
Sbjct: 458 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 517

Query: 496 RAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYVL 523
            +GL  EA + I+ ME+EP+ +IW +LL AC++HG+VELG    E L+K+  +  G YVL
Sbjct: 518 HSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVL 577

BLAST of Tan0021031 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 414.1 bits (1063), Expect = 1.8e-115
Identity = 213/552 (38.59%), Positives = 325/552 (58.88%), Query Frame = 0

Query: 20  KCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPDIFMW 79
           KC +   +KQ+HA ++    +       +LI  SA+++    + A +VF Q+ EP++ + 
Sbjct: 28  KCANLNQVKQLHAQIIRRNLHEDLHIAPKLI--SALSLCRQTNLAVRVFNQVQEPNVHLC 87

Query: 80  NTMIRGSAQSLKPASAVSLYAQMENRGVRPDKFTFSFVLKACTKLSWVQLGFGIHGKVVK 139
           N++IR  AQ+ +P  A  ++++M+  G+  D FT+ F+LKAC+  SW+ +   +H  + K
Sbjct: 88  NSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEK 147

Query: 140 FGFQSNTFVRNTLIYFHANCGDLGT--ARALFDASAKRDVVPWSALTAGYARRGELDVAR 199
            G  S+ +V N LI  ++ CG LG   A  LF+  ++RD V W+++  G  + GEL  AR
Sbjct: 148 LGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDAR 207

Query: 200 QLFDEMPIKDLVSWNVIITAYA-------------------------------KLGEMEK 259
           +LFDEMP +DL+SWN ++  YA                               K G+ME 
Sbjct: 208 RLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGDMEM 267

Query: 260 ARTLFD--DAPKKDVVTWNAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSAS 319
           AR +FD    P K+VVTW  +IAGY   GL KEA  + D M  +G + D   ++SIL+A 
Sbjct: 268 ARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISILAAC 327

Query: 320 ADLGELEVGKKIHHSIFDMCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSW 379
            + G L +G +IH  +     G  + +L NAL+DMYAKCG+++ A DVF  +  KD  SW
Sbjct: 328 TESGLLSLGMRIHSILKRSNLGSNAYVL-NALLDMYAKCGNLKKAFDVFNDIPKKDLVSW 387

Query: 380 NSIIGGLAFHGHAKESINLFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRN 439
           N+++ GL  HGH KE+I LF  M R  +RP+++TF+ VL +C+HAG + EG  YF  M  
Sbjct: 388 NTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGIDYFYSMEK 447

Query: 440 LYKIEPNIKHYGCMVDILGRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGR 499
           +Y + P ++HYGC+VD+LGR G L EA   + TM +EPN +IW  LLGACR+H +V++ +
Sbjct: 448 VYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRMHNEVDIAK 507

Query: 500 RANEQLLKMRKDESGDYVLLSNIYASQGEWDGVEKVRRLMDDGGVKKEAGRSLIDADNTF 536
              + L+K+   + G+Y LLSNIYA+  +W+GV  +R  M   GV+K +G S ++ ++  
Sbjct: 508 EVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKPSGASSVELEDGI 567

BLAST of Tan0021031 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 411.4 bits (1056), Expect = 1.1e-114
Identity = 204/525 (38.86%), Positives = 320/525 (60.95%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPD 75
           +L   C + RAL Q+H   +  G ++      +LI   AI++S  + YA ++     EPD
Sbjct: 10  SLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPD 69

Query: 76  IFMWNTMIRGSAQSLKPASAVSLYAQMENRG-VRPDKFTFSFVLKACTKLSWVQLGFGIH 135
            FM+NT++RG ++S +P ++V+++ +M  +G V PD F+F+FV+KA      ++ GF +H
Sbjct: 70  AFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMH 129

Query: 136 GKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGELD 195
            + +K G +S+ FV  TLI  +  CG +  AR +FD   + ++V W+A+     R  ++ 
Sbjct: 130 CQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVA 189

Query: 196 VARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKDVVTWNAMIAGYVLSGL 255
            AR++FD+M +++  SWNV++  Y K GE+E A+ +F + P +D V+W+ MI G   +G 
Sbjct: 190 GAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGS 249

Query: 256 NKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFDMCCGHLSVLLGN 315
             E+   F  ++ AG  P++V++  +LSA +  G  E G KI H   +       V + N
Sbjct: 250 FNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFG-KILHGFVEKAGYSWIVSVNN 309

Query: 316 ALIDMYAKCGSIENAMDVFRGMRDKD-TSSWNSIIGGLAFHGHAKESINLFQEMLRLKMR 375
           ALIDMY++CG++  A  VF GM++K    SW S+I GLA HG  +E++ LF EM    + 
Sbjct: 310 ALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVT 369

Query: 376 PNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDILGRAGLLSEAFD 435
           P+ I+F+ +L ACSHAG ++EG  YF+ M+ +Y IEP I+HYGCMVD+ GR+G L +A+D
Sbjct: 370 PDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYD 429

Query: 436 FIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYVLLSNIYASQGE 495
           FI  M I P AI+WRTLLGAC  HG++EL  +  ++L ++  + SGD VLLSN YA+ G+
Sbjct: 430 FICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATAGK 489

Query: 496 WDGVEKVRRLMDDGGVKKEAGRSLIDADNTFLMHFLFDSKPKFVE 539
           W  V  +R+ M    +KK    SL++   T +  F    K K ++
Sbjct: 490 WKDVASIRKSMIVQRIKKTTAWSLVEVGKT-MYKFTAGEKKKGID 532

BLAST of Tan0021031 vs. TAIR 10
Match: AT2G22410.1 (SLOW GROWTH 1 )

HSP 1 Score: 398.7 bits (1023), Expect = 7.6e-111
Identity = 214/593 (36.09%), Positives = 323/593 (54.47%), Query Frame = 0

Query: 16  TLWQKCTSFRALKQVHAFLVVNGFNSSPSALRQLIFVSAIAVSGTMDYAHQVFAQITEPD 75
           +L +KC     LKQ+ A +++NG    P A  +LI   A++ S  +DY+ ++   I  P+
Sbjct: 58  SLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIENPN 117

Query: 76  IFMWNTMIRGSAQSLKPASAVSLYAQMENRGV---RPDKFTFSFVLKACTKLSWVQLGFG 135
           IF WN  IRG ++S  P  +  LY QM   G    RPD FT+  + K C  L    LG  
Sbjct: 118 IFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGHM 177

Query: 136 IHGKVVKFGFQSNTFVRNTLIYFHANCGDLGTARALFDASAKRDVVPWSALTAGYARRGE 195
           I G V+K   +  + V N  I+  A+CGD+  AR +FD S  RD+V W+ L  GY + GE
Sbjct: 178 ILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKIGE 237

Query: 196 ------------------------------------------------------------ 255
                                                                       
Sbjct: 238 AEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLVNA 297

Query: 256 ----------LDVARQLFDEMPIKDLVSWNVIITAYAKLGEMEKARTLFDDAPKKDVVTW 315
                     +  AR++FD +  + +VSW  +I+ YA+ G ++ +R LFDD  +KDVV W
Sbjct: 298 LMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVVLW 357

Query: 316 NAMIAGYVLSGLNKEALEMFDAMRDAGQRPDDVTMLSILSASADLGELEVGKKIHHSIFD 375
           NAMI G V +   ++AL +F  M+ +  +PD++TM+  LSA + LG L+VG  IH  I +
Sbjct: 358 NAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI-E 417

Query: 376 MCCGHLSVLLGNALIDMYAKCGSIENAMDVFRGMRDKDTSSWNSIIGGLAFHGHAKESIN 435
                L+V LG +L+DMYAKCG+I  A+ VF G++ +++ ++ +IIGGLA HG A  +I+
Sbjct: 418 KYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTAIS 477

Query: 436 LFQEMLRLKMRPNEITFVGVLVACSHAGKVQEGRMYFNLMRNLYKIEPNIKHYGCMVDIL 495
            F EM+   + P+EITF+G+L AC H G +Q GR YF+ M++ + + P +KHY  MVD+L
Sbjct: 478 YFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVDLL 537

Query: 496 GRAGLLSEAFDFIDTMEIEPNAIIWRTLLGACRVHGDVELGRRANEQLLKMRKDESGDYV 536
           GRAGLL EA   +++M +E +A +W  LL  CR+HG+VELG +A ++LL++   +SG YV
Sbjct: 538 GRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSGIYV 597

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9LXF2 | 4.7e-190 | 58.87 | Pentatricopeptide repeat-containing protein At5g15300 OS=Arabidopsis thaliana OX...
Q9LN01 | 7.0e-117 | 40.07 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9LS72 | 2.5e-114 | 38.59 | Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX...
Q9CA54 | 1.6e-113 | 38.86 | Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX...
Q9SJZ3 | 1.1e-109 | 36.09 | Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
Match Name | E-value | Identity (%) | Description
XP_023544416.1 | 2.0e-294 | 91.91 | pentatricopeptide repeat-containing protein At5g15300 [Cucurbita pepo subsp. pep...
KAG6602506.1 | 1.3e-293 | 91.91 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG7033178.1 | 2.9e-293 | 91.73 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022954064.1 | 1.4e-292 | 91.36 | pentatricopeptide repeat-containing protein At5g15300 [Cucurbita moschata]
XP_022133508.1 | 1.9e-292 | 91.34 | pentatricopeptide repeat-containing protein At5g15300 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A6J1GRS9 | 6.9e-293 | 91.36 | pentatricopeptide repeat-containing protein At5g15300 OS=Cucurbita moschata OX=3...
A0A6J1BZB3 | 9.0e-293 | 91.34 | pentatricopeptide repeat-containing protein At5g15300 OS=Momordica charantia OX=...
A0A6J1JPF2 | 1.2e-292 | 91.54 | pentatricopeptide repeat-containing protein At5g15300 OS=Cucurbita maxima OX=366...
A0A5D3BN20 | 1.9e-279 | 87.41 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CAF6 | 7.4e-279 | 87.22 | pentatricopeptide repeat-containing protein At5g15300 OS=Cucumis melo OX=3656 GN...
Match Name | E-value | Identity (%) | Description
AT5G15300.1 | 3.4e-191 | 58.87 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 4.9e-118 | 40.07 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G29230.1 | 1.8e-115 | 38.59 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74630.1 | 1.1e-114 | 38.86 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22410.1 | 7.6e-111 | 36.09 | SLOW GROWTH 1
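
The HSP statistics lines in the BLAST reports on this page follow a fixed pattern (bit score, raw score, Expect, then identity and positive counts), so the percentages can be re-derived from the counts. The following is a minimal Python sketch of such a check, not part of the database output. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration, and the regular expressions assume the line layout seen on this page.

```python
import re

# Assumed layout, taken from the reports on this page, e.g.:
#   HSP 1 Score: 1015.4 bits (2624), Expect = 1.9e-292
#   Identity = 496/543 (91.34%), Positives = 522/543 (96.13%), Query Frame = 0
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
FRACTION_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(text):
    """Extract score, Expect, and identity/positive counts from HSP header text."""
    stats = {}
    match = SCORE_RE.search(text)
    if match:
        stats["bit_score"] = float(match.group(1))
        stats["raw_score"] = int(match.group(2))
        stats["expect"] = float(match.group(3))
    for label, num, den, pct in FRACTION_RE.findall(text):
        key = "identity" if label == "Identity" else "positives"
        # Store (matching positions, alignment length, reported percentage).
        stats[key] = (int(num), int(den), float(pct))
    return stats


def percent(num, den):
    """Recompute a percentage from counts, rounded to two decimals."""
    return round(100.0 * num / den, 2)
```

Applied to the Momordica charantia hit (496 identical positions over an alignment length of 543), `percent(496, 543)` reproduces the reported 91.34.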
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 237..283; e-value: 1.3E-10; score: 41.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 339..386; e-value: 4.7E-8; score: 33.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 74..122; e-value: 4.2E-12; score: 46.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 343..376; e-value: 0.0022; score: 16.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 78..110; e-value: 1.3E-4; score: 19.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 240..273; e-value: 9.4E-6; score: 23.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 180..203; e-value: 0.0024; score: 15.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 209..239; e-value: 1.0E-5; score: 23.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 314..340; e-value: 2.7E-4; score: 18.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 179..204; e-value: 0.002; score: 18.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 415..439; e-value: 0.053; score: 13.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 75..109; score: 11.016164
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 238..272; score: 12.463056
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 176..210; score: 10.13926
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 340..374; score: 10.950397
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 14..123; e-value: 4.7E-12; score: 47.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 310..528; e-value: 3.6E-42; score: 146.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 207..303; e-value: 4.4E-23; score: 83.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 124..206; e-value: 9.6E-11; score: 43.7
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 221..520
None | No IPR available | PANTHER | PTHR47928:SF151 | OS03G0168550 PROTEIN | coord: 221..520
None | No IPR available | PANTHER | PTHR47928:SF151 | OS03G0168550 PROTEIN | coord: 9..188
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 9..188
None | No IPR available | PANTHER | PTHR47928:SF151 | OS03G0168550 PROTEIN | coord: 189..219
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 189..219

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0021031.1 | Tan0021031.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding