Tan0021631 (gene) Snake gourd v1

Overview
NameTan0021631
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG08: 15061909 .. 15065564 (-)
RNA-Seq ExpressionTan0021631
SyntenyTan0021631
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCATGACAAGAAAAATTTGGAAATGTCAAAGGAGAAAGCCATTGTCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAAATCCACGCCCATGTTATTGTAAGCGGCCTCCGCCATCACATCGCCATTGGTAACAAGCTTTTGAACTTCTGTGCCATCTCAGTTTCAGGTTCCCTTGCTTATGCACAGCTTCTCTTCCATCAAATGGAATGCCCACAAACCGAAGCCTGGAACTCCATCATCCGAGGCTTCGCTCAGAGCTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGGCGCTCTTTCTCTTCCCCTGACACATTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGACTCAAGGCTGAGCGCAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGCACCAATCTTGTCAAATGCTATGGGGCAATGGGGTCTGTTTGTATTGCCCAACGGGTGTTTGACAAAATGCCTGCGAGAGACTTGGTGGCTTGGAATGCTATGATTTCGTGCTTTTCTCAACAGGGTTTGCACCAAGAGGCATTGCAGATATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACACTTGTTGGGTTGATTTCTTCATGTGCTCATCTTGGAGCGTTGAATATTGGGGTTCAGATGCATAGACTTGCTCGTGAAAAGGGTCTTGTGGAGAGTCTTTATGTTGGAAATGCATTGATAGATATGTATGCTAAATGTGGTAGTTTGGATCAAGCCATTCGCATATTTGATAGAATGCAGAGGAAGGATATTTTCACTTGGAACTCAATGATTGTTGGGTATGGAGTGCACGGTCGTGGCAGTGAAGCTATATGTTGCTTCCAACAGATGTTAGAAGCGAGAATGCAACCAAACTCTGTCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCAATTTGATGAGCTTCATGTTTAGGCTAAGACCTGAAGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAAAAGGCACTTGAAACTATATCAAATTCATCAACAAATGATCCGGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAAATTCATAAAAATGTGGGTATAGGAAAAATTGCCATGAACAATCTCTCTGAGCTTGGAGCTACAAATGCAGGAGATTGTATATTGCTGGCTACCATCTATGCTGGAGTGAAGGATACAGCTGGTGTTGCAAGTATGAGAAAAATGATCAAGAGCCAAGGGATAAAGACCACCCCAGGTTGGAGTTGGATTGAAATTGGGGAACAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCGATTGAAGTTCATGAAAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATATAGGAGATGAGTCTGTTTCGTCACTGGATGTGTTTTCCACCACACAGACTTTAGAGACTTCCTGCACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTGCAGATGGGACACAGATACGAATTGTTAAAAACCTTCGAGTTTGCAGGGATTGTCATTCATTCATAAAAGCTGTCTCGGTGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGGTTCCACCATTTCAAGGGTGGCCGATGTTCCTGCAATGACTACTGGTGAAAAGTGAAAAATGTATTTTCACTTCATTCAAGGCTGCAGTAGCTCCCTCTCGTTGTAATGGAGTTTTGGCTCTTGTGCTTTCTTTTTATTAGCATAGTAGAGTCGTCCCAAACAACATGTAAGGAGAAAAAACTGAGCTTCAAGTTATTATGCTCTTCTTTTATATGTAACTTCTAACCTACTGCAGCATGATGGGTTTATGTAATTACTTAGGCATTCTTGGAAAAAGATACGGGCTTCTCAATAACCATGAAGACCACATAGCAAGCACAAACTTACCACAATCAGAACTTCTTTCTGGGATACTTCACTCATAATCTTACTTTCCATACTAGGGGGTCAGTCAAAGAAATGTACTTGAGACAACTTAAATTATCCTGATGAAGTCCAAAGATGCAGTATATATATAGAGTGAACTAAAGGTATGTTCTTTAAGCTTTCATTTATAGGTTCTTCTTAGTGAATTCATATGTCTATTTTGTTGAAAACATTTGTGAGCTCATTTCTAAACTGGTTTCATTCATCCCACTTGAAAACTATCCTTCCTGCTCGTGCAATTTGAAAGCATAATCAAAACGAAGAGCTTTGAGCCTTTTTAATAAGTTTCAAAGAAATGCAAAGATTTTGTGAGAAGATACGATTTAGTTGCAGTTTACGGAGGTTTTTAATAAGAAGATACAATTTAGTTGCAGTTTACAGAGATTTTTAATAGTTTTTTGACTTAGTTTTATCACTTCTAGTTGGTGTGCTCTAAATTATTCTACTTGTCCTTTCACTATTGCCAGCTGGAGTTCTGTGTTTTGAACTCCCTTGGTTGGAGGGGCTTTTCTCCTCCTTGGTGCCTGCAACTTTTCGCTATAGTTTGCAGAAATTATGTGACTGTAAAAATTTTTAAAACAAAAAGATTTCTATGATCTTTAAATAACTGGAGTTGGGAGTTGGGCCAAACCTTGCCACGAACTACATTACCTAATGCCTTTTTATTGTGGTGTTGGTAATGAAAGTTGAAGGAAAGTGGAGAATGTGCATAACTCAATCAAAATGCATATACTTGACCAAGAGAAGACTTCATCACTAGCCTAGCCTTGTATTGTTATATAGTAATATCTTCAGGTTGAAGAATGCAGGTGCAACATATTAGAGAATGGTCAGTAAGATGTTAGAATTGATCCAGAAGGATATGACAGTCTGCCTCAATGACATACTAATGAAACATAAGAGCTATCCATCATGAAGTCAACCTAGCCGAAGCTTTTAAGACTCTGAGAAAGTACCAGGGAAATTGTAAAAATTAGCCCCTTTGAATTTTAAATTTGGATCTTAGACTCCTATTTTAAAAATTGATTTTTTTTGGACCCCTAGACAGATGAAATTACCATCATGCCATTTATATTAGAAAAAACATTTCAAAAGTCTCTTCCCTTTTCCCGTTTTCCCGTTCGTTTTCTCTTCCCGTTCTTATCCCTTCTTCCTCTGAAAAAGTGGTAATTTGCTGTGGATTTCAGAAATGAGCCCTCTAAGAAAGAATATCTCAAGTATGTATTTTTATTCATTTTGAAACTGTTTTCTAGGGTTTAGGATTTTTTTTAAACTGTGATGTTATGTGTTGAATCTATTAGATTACTTGTGATTCAAGTTGTTTTTTGGCCATAGAAATTCTTTAAACGTGTTTTCGACGTGTTCTGTGGTGTTAGATTATGTTAGAATCATTTTTTTATGCATTTTTTATCGTTTAGTCAATGGTTTTGTCGAAAATGAAGATGAATCCTTGTTTAGATGGAAAAACTTCATGATCAACGAACTCTGGTGATCTTTTCCACTACGACACGTGATCTAATGGATGCACAAGACTCGCATATGTATCCAACCACACTTCAACATGGTATGTTTTGGC

mRNA sequence

CCATGACAAGAAAAATTTGGAAATGTCAAAGGAGAAAGCCATTGTCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAAATCCACGCCCATGTTATTGTAAGCGGCCTCCGCCATCACATCGCCATTGGTAACAAGCTTTTGAACTTCTGTGCCATCTCAGTTTCAGGTTCCCTTGCTTATGCACAGCTTCTCTTCCATCAAATGGAATGCCCACAAACCGAAGCCTGGAACTCCATCATCCGAGGCTTCGCTCAGAGCTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGGCGCTCTTTCTCTTCCCCTGACACATTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGACTCAAGGCTGAGCGCAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGCACCAATCTTGTCAAATGCTATGGGGCAATGGGGTCTGTTTGTATTGCCCAACGGGTGTTTGACAAAATGCCTGCGAGAGACTTGGTGGCTTGGAATGCTATGATTTCGTGCTTTTCTCAACAGGGTTTGCACCAAGAGGCATTGCAGATATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACACTTGTTGGGTTGATTTCTTCATGTGCTCATCTTGGAGCGTTGAATATTGGGGTTCAGATGCATAGACTTGCTCGTGAAAAGGGTCTTGTGGAGAGTCTTTATGTTGGAAATGCATTGATAGATATGTATGCTAAATGTGGTAGTTTGGATCAAGCCATTCGCATATTTGATAGAATGCAGAGGAAGGATATTTTCACTTGGAACTCAATGATTGTTGGGTATGGAGTGCACGGTCGTGGCAGTGAAGCTATATGTTGCTTCCAACAGATGTTAGAAGCGAGAATGCAACCAAACTCTGTCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCAATTTGATGAGCTTCATGTTTAGGCTAAGACCTGAAGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAAAAGGCACTTGAAACTATATCAAATTCATCAACAAATGATCCGGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAAATTCATAAAAATGTGGGTATAGGAAAAATTGCCATGAACAATCTCTCTGAGCTTGGAGCTACAAATGCAGGAGATTGTATATTGCTGGCTACCATCTATGCTGGAGTGAAGGATACAGCTGGTGTTGCAAGTATGAGAAAAATGATCAAGAGCCAAGGGATAAAGACCACCCCAGGTTGGAGTTGGATTGAAATTGGGGAACAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCGATTGAAGTTCATGAAAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATATAGGAGATGAGTCTGTTTCGTCACTGGATGTGTTTTCCACCACACAGACTTTAGAGACTTCCTGCACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTGCAGATGGGACACAGATACGAATTGTTAAAAACCTTCGAGTTTGCAGGGATTGTCATTCATTCATAAAAGCTGTCTCGGTGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGGTTCCACCATTTCAAGGGTGGCCGATGTTCCTGCAATGACTACTGGTGAAAAGTGAAAAATGTATTTTCACTTCATTCAAGGCTGCAGTAGCTCCCTCTCGTTGTAATGGAGTTTTGGCTCTTGTGCTTTCTTTTTATTAGCATAGTAGAGTCGTCCCAAACAACATGCATTCTTGGAAAAAGATACGGGCTTCTCAATAACCATGAAGACCACATAGCAAGCACAAACTTACCACAATCAGAACTTCTTTCTGGGATACTTCACTCATAATCTTACTTTCCATACTAGGGGGTCAGTCAAAGAAATGTACTTGAGACAACTTAAATTATCCTGATGAAGTCCAAAGATGCAGTATATATATAGAGTGAACTAAAGATGGAAAAACTTCATGATCAACGAACTCTGGTGATCTTTTCCACTACGACACGTGATCTAATGGATGCACAAGACTCGCATATGTATCCAACCACACTTCAACATGGTATGTTTTGGC

Coding sequence (CDS)

ATGTCAAAGGAGAAAGCCATTGTCACTCTCTTGCAAGGCTGCAACAGCCTCAACAAGCTTCGAAAAATCCACGCCCATGTTATTGTAAGCGGCCTCCGCCATCACATCGCCATTGGTAACAAGCTTTTGAACTTCTGTGCCATCTCAGTTTCAGGTTCCCTTGCTTATGCACAGCTTCTCTTCCATCAAATGGAATGCCCACAAACCGAAGCCTGGAACTCCATCATCCGAGGCTTCGCTCAGAGCTCATCTCCCATTGACGCCGTTGTTTATTACAATCAAATGGTTTGGCGCTCTTTCTCTTCCCCTGACACATTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGACTCAAGGCTGAGCGCAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGCTGTGGTTATGATGGGGATGTTATTATTTGCACCAATCTTGTCAAATGCTATGGGGCAATGGGGTCTGTTTGTATTGCCCAACGGGTGTTTGACAAAATGCCTGCGAGAGACTTGGTGGCTTGGAATGCTATGATTTCGTGCTTTTCTCAACAGGGTTTGCACCAAGAGGCATTGCAGATATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGGTTTACACTTGTTGGGTTGATTTCTTCATGTGCTCATCTTGGAGCGTTGAATATTGGGGTTCAGATGCATAGACTTGCTCGTGAAAAGGGTCTTGTGGAGAGTCTTTATGTTGGAAATGCATTGATAGATATGTATGCTAAATGTGGTAGTTTGGATCAAGCCATTCGCATATTTGATAGAATGCAGAGGAAGGATATTTTCACTTGGAACTCAATGATTGTTGGGTATGGAGTGCACGGTCGTGGCAGTGAAGCTATATGTTGCTTCCAACAGATGTTAGAAGCGAGAATGCAACCAAACTCTGTCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCAATTTGATGAGCTTCATGTTTAGGCTAAGACCTGAAGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAAAAGGCACTTGAAACTATATCAAATTCATCAACAAATGATCCGGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAAATTCATAAAAATGTGGGTATAGGAAAAATTGCCATGAACAATCTCTCTGAGCTTGGAGCTACAAATGCAGGAGATTGTATATTGCTGGCTACCATCTATGCTGGAGTGAAGGATACAGCTGGTGTTGCAAGTATGAGAAAAATGATCAAGAGCCAAGGGATAAAGACCACCCCAGGTTGGAGTTGGATTGAAATTGGGGAACAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCGATTGAAGTTCATGAAAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATATAGGAGATGAGTCTGTTTCGTCACTGGATGTGTTTTCCACCACACAGACTTTAGAGACTTCCTGCACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTGCAGATGGGACACAGATACGAATTGTTAAAAACCTTCGAGTTTGCAGGGATTGTCATTCATTCATAAAAGCTGTCTCGGTGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGGTTCCACCATTTCAAGGGTGGCCGATGTTCCTGCAATGACTACTGGTGA

Protein sequence

MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLLFHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCFSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEKALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAGVKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCRDCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW
Homology
BLAST of Tan0021631 vs. ExPASy Swiss-Prot
Match: Q9LXY5 (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 716.5 bits (1848), Expect = 2.5e-205
Identity = 343/579 (59.24%), Postives = 443/579 (76.51%), Query Frame = 0

Query: 3   KEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLLFH 62
           K + IV +LQGCNS+ KLRKIH+HVI++GL+HH +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-PQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAE 122
             +  P T  WN +IRGF+ SSSP++++++YN+M+  S S PD FTF+F LK+CER+K+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D I+ T+LV+CY A GSV IA +VFD+MP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESL 242
           S  GLH +AL +Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR+A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEA 302
           +V NALIDMYAKCGSL+ AI +F+ M+++D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEK 362
            ++PN++TFLGLL GCSHQGLV+EGV++F +MS  F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALETISNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 422
           +LE I  SS + DPVLWR LLGSCKIH+N+ +G++AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 482
             D    ASMRK+I+S  ++T PGWSWIEIG+QVHKFVVDDK H  S  ++ +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 542
           A L GY  ++S  +    S  + L ++ T HSEKLAIA+GL RT  GT +RI KNLRVCR
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of Tan0021631 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 449.5 bits (1155), Expect = 5.7e-125
Identity = 232/579 (40.07%), Postives = 355/579 (61.31%), Query Frame = 0

Query: 8   VTLLQ--GCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           + LLQ  G +S+ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MECP-QTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAER 127
           +E P     WN++IRG+A+  + I A   Y +M       PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + +  +L+  Y   G V  A +VFDKMP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLY 247
           + G  +EAL +Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H    + GL  +L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEAR 307
             N L+D+YA+CG +++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -MQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  +TF+G+L  CSH G+V+EG +YF  M   +++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALETISNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 427
           A E I +     + V+WR LLG+C +H +  + + A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 487
            +  + V  +RK +   G+K  PG S +E+G +VH+F++ DKSH  S  ++ KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 547
               GY+    +S++ V    +  E +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHFK G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0021631 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 429.9 bits (1104), Expect = 4.7e-119
Identity = 221/574 (38.50%), Postives = 347/574 (60.45%), Query Frame = 0

Query: 8   VTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAIS-VSGSLAYAQLLFHQMEC 67
           + L+  CNSL +L +I A+ I S +   ++   KL+NFC  S    S++YA+ LF  M  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHI-EDVSFVAKLINFCTESPTESSMSYARHLFEAMSE 92

Query: 68  PQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAERKCKE 127
           P    +NS+ RG+++ ++P++    + +++      PD +TF  +LKAC   KA  + ++
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEIL-EDGILPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 128 VHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCFSQQGL 187
           +H   ++ G D +V +C  L+  Y     V  A+ VFD++    +V +NAMI+ ++++  
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 188 HQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLYVGNA 247
             EAL ++ +M+ + +  +  TL+ ++SSCA LG+L++G  +H+ A++    + + V  A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 248 LIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEARMQPN 307
           LIDM+AKCGSLD A+ IF++M+ KD   W++MIV Y  HG+  +++  F++M    +QP+
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 308 SVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEKALETI 367
            +TFLGLL  CSH G V+EG KYF+ M   F + P +KHYG +VDL  RAG LE A E I
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 368 SNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAGVKDTA 427
                +  P+LWRILL +C  H N+ + +     + EL  ++ GD ++L+ +YA  K   
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 428 GVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQASLFG 487
            V S+RK++K +     PG S IE+   VH+F   D     + ++H  L E++ +  L G
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSG 512

Query: 488 YIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCRDCHSF 547
           Y+ D S+  +      Q  E +  YHSEKLAI FGL  T  GT IR+VKNLRVCRDCH+ 
Sbjct: 513 YVPDTSM-VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

Query: 548 IKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
            K +S+ F R++++RD  RFHHF+ G+CSC D+W
Sbjct: 573 AKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0021631 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 2.2e-116
Identity = 226/630 (35.87%), Postives = 358/630 (56.83%), Query Frame = 0

Query: 2   SKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAIS--VSGSLAYAQL 61
           S   ++   +  C ++  L +IHA  I SG         ++L FCA S      L YA  
Sbjct: 21  SHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHK 80

Query: 62  LFHQMECPQTEAWNSIIRGFAQS--SSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACER 121
           +F+QM      +WN+IIRGF++S     + A+  + +M+   F  P+ FTF  VLKAC +
Sbjct: 81  IFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAK 140

Query: 122 LKAERKCKEVHGSVIRCGYDGDVIICTNLVKC---------------------------- 181
               ++ K++HG  ++ G+ GD  + +NLV+                             
Sbjct: 141 TGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTD 200

Query: 182 -----------------YGAMGSVCIAQRVFDKMPARDLVAWNAMISCFSQQGLHQEALQ 241
                            Y  +G    A+ +FDKM  R +V+WN MIS +S  G  ++A++
Sbjct: 201 RRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVE 260

Query: 242 IYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLYVGNALIDMYA 301
           ++ +M+  ++  +  TLV ++ + + LG+L +G  +H  A + G+     +G+ALIDMY+
Sbjct: 261 VFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYS 320

Query: 302 KCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEARMQPNSVTFLG 361
           KCG +++AI +F+R+ R+++ TW++MI G+ +HG+  +AI CF +M +A ++P+ V ++ 
Sbjct: 321 KCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYIN 380

Query: 362 LLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEKALETISNSSTN 421
           LL  CSH GLV+EG +YF+ M  +  L P ++HYGC+VDL GR+G L++A E I N    
Sbjct: 381 LLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIK 440

Query: 422 -DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAGVKDTAGVASMR 481
            D V+W+ LLG+C++  NV +GK   N L ++   ++G  + L+ +YA   + + V+ MR
Sbjct: 441 PDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMR 500

Query: 482 KMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQASLFGY--IGD 541
             +K + I+  PG S I+I   +H+FVV+D SH  + E++  L E+  +  L GY  I  
Sbjct: 501 LRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITT 560

Query: 542 ESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCRDCHSFIKAV 580
           + + +L+     +  E    YHSEK+A AFGL  T+ G  IRIVKNLR+C DCHS IK +
Sbjct: 561 QVLLNLE----EEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLI 620

BLAST of Tan0021631 vs. ExPASy Swiss-Prot
Match: Q9C6T2 (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 4.1e-115
Identity = 215/581 (37.01%), Postives = 355/581 (61.10%), Query Frame = 0

Query: 3   KEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAI-GNKLLNFCAIS-VSGSLAYAQLL 62
           KE+  + LL+ C+++++ +++HA  I   L +  +   + +L  CA S    S+ YA  +
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 63  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 122
           F  ++ P T  +N++IRG+    S  +A+ +YN+M+ R  + PD FT+  +LKAC RLK+
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRG-NEPDNFTYPCLLKACTRLKS 148

Query: 123 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 182
            R+ K++HG V + G + DV +  +L+  YG  G + ++  VF+K+ ++   +W++M+S 
Sbjct: 149 IREGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSA 208

Query: 183 FSQQGLHQEALQIYNQMRSE-NVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVE 242
            +  G+  E L ++  M SE N+  +   +V  + +CA+ GALN+G+ +H          
Sbjct: 209 RAGMGMWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISEL 268

Query: 243 SLYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQML 302
           ++ V  +L+DMY KCG LD+A+ IF +M++++  T+++MI G  +HG G  A+  F +M+
Sbjct: 269 NIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMI 328

Query: 303 EARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKL 362
           +  ++P+ V ++ +L  CSH GLV+EG + F  M    ++ P  +HYGCLVDL GRAG L
Sbjct: 329 KEGLEPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLL 388

Query: 363 EKALETI-SNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIY 422
           E+ALETI S     + V+WR  L  C++ +N+ +G+IA   L +L + N GD +L++ +Y
Sbjct: 389 EEALETIQSIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLY 448

Query: 423 AGVKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVI 482
           +  +    VA  R  I  +G+K TPG+S +E+  + H+FV  D+SH    E+++ L ++ 
Sbjct: 449 SQGQMWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQME 508

Query: 483 HQASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRV 542
            Q    GY  D +   L+V    +  +     HS+K+AIAFGL  T  G+ I+I +NLR+
Sbjct: 509 WQLKFEGYSPDLTQILLNV--DEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRM 568

Query: 543 CRDCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           C DCH++ K +S+ + REI+VRDR RFH FKGG CSC DYW
Sbjct: 569 CSDCHTYTKKISMIYEREIVVRDRNRFHLFKGGTCSCKDYW 606

BLAST of Tan0021631 vs. NCBI nr
Match: XP_038890323.1 (pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890324.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890325.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida])

HSP 1 Score: 1110.5 bits (2871), Expect = 0.0e+00
Identity = 536/579 (92.57%), Postives = 558/579 (96.37%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSLN+LRKIHAHVIVSGLRHH+AIGNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLNRLRKIHAHVIVSGLRHHVAIGNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPIDA+++YNQMVW SFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIIFYNQMVWASFSSPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKC EVHGSVIRCGYDGDVI+CTNLVKCY AMGS+CIAQ+VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCNEVHGSVIRCGYDGDVIVCTNLVKCYSAMGSICIAQQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHR AREKGLV+S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLD+AI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQ+MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDEAIFIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQKMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE + NSS NDPVLWRILLGSCKIHKN+ IG+IAM +LSELGATNAGDCILLATIYAG
Sbjct: 361 KALEIVLNSSQNDPVLWRILLGSCKIHKNMKIGEIAMKSLSELGATNAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
            KDT GVA MRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEV+EKLREVIHQ
Sbjct: 421 EKDTVGVARMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           ASLFGY+GD SVSSLDV STT+TL+TSCTYHSEKLAIAFGLART DGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYVGDASVSSLDVLSTTETLKTSCTYHSEKLAIAFGLARTTDGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFIKAVSEAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of Tan0021631 vs. NCBI nr
Match: XP_004152881.1 (pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_011648994.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_031737318.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus])

HSP 1 Score: 1078.2 bits (2787), Expect = 0.0e+00
Identity = 522/579 (90.16%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+ LLQGCNSL +LRKIHAHVIVSGL HH+ I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPIDA+V+YNQMV  SFS PDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVI+CTNLVKCY AMGSVCIA++VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHR ARE GL +S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN VTFLGLLCGCSHQGLVQEGVKYFNLMS  FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE +SNSS ND VLWRILLGSCKIHKNV IG+IAMN LSELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
            KD AGVA MRKMIKSQG KTTPGWSWIEIGEQVHKFVVDDKSHRYS+EV+EKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           AS FGY+GDES+SSLD+ ST +TL+TSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVSVAFNREIIVRDRVRFHHFKGG CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYW 579

BLAST of Tan0021631 vs. NCBI nr
Match: XP_022149932.1 (pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149933.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149934.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149935.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia])

HSP 1 Score: 1072.8 bits (2773), Expect = 8.9e-310
Identity = 516/579 (89.12%), Postives = 549/579 (94.82%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+TLLQGCNSLNKLRKIHAHVI+SGLRHH AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           F  M+CPQTEAWNSIIRGFAQS+SPI+AVVYYNQMVW S S PDTFTFSFVLKACERLKA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVIICTNL+KCY AMG +C+AQ+VFDKMP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           +SQQGLHQEAL+ YNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHR AREKGLVES
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKD+FTWNSMIVGYGVHGRG+EAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+SVTFLGLLCGCSHQGLVQEGVK+FNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE I NSS NDPVLWRILLGSCKIHKNVGIG+IAMNNLS+LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
           VK+T+GV  MRKMI+SQGIKTTPGWSWIEIGEQVHKFVVDDKSHRY IEV+EKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           ASLFGYIGD   S+ DVFST++ LETSC+YHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF+KAVS+AFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of Tan0021631 vs. NCBI nr
Match: XP_023537237.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1071.6 bits (2770), Expect = 2.2e-309
Identity = 522/579 (90.16%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSLNKLRKIHAHV+VSGLRHH+AI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPIDAVVYYNQMV  SFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVIICTNLVKCY AMGSVCIAQ+VFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLH EALQ+YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHR AREKGLVES
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRM RKDIFTWNSMIVGYGVHGRG+EAI CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNS+TFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALETI NSS NDPVLWRILLGSCKIHKNVG+G+IAMNNLSELGATNAGDCILLATIYAG
Sbjct: 361 KALETIQNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLSELGATNAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
           V DTAGVASMRK IKSQGIKT+PGWSWIEIGEQVHKFVVDDKSHR SIEV+EKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           ASLFGY+ D            +TL+TS TYHSEKLAIAFGLARTADGT IRIVKNLRVCR
Sbjct: 481 ASLFGYVRD-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF+KAVSVAF+REIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of Tan0021631 vs. NCBI nr
Match: XP_016899519.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899520.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899521.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899522.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899523.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >KAA0047714.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08368.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1070.8 bits (2768), Expect = 4.0e-309
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSL +LRKIHAHVIVSGL HH+AI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPIDA+V+YNQMVW SFS  DTFTFSFVLKACER+KA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVI+CTNLVKCY AMGSV IA++VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHR ARE GL +S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNS+TFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE +SNSS ND VLWR LLGSCKIHKNV IG+IAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
             D AGV+ MRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKS+RYSIEV+EKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           AS FGY+GDESVSSLDV ST +TL+TSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVSVAFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of Tan0021631 vs. ExPASy TrEMBL
Match: A0A0A0LH20 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074120 PE=3 SV=1)

HSP 1 Score: 1078.2 bits (2787), Expect = 0.0e+00
Identity = 522/579 (90.16%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+ LLQGCNSL +LRKIHAHVIVSGL HH+ I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPIDA+V+YNQMV  SFS PDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVI+CTNLVKCY AMGSVCIA++VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHR ARE GL +S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN VTFLGLLCGCSHQGLVQEGVKYFNLMS  FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE +SNSS ND VLWRILLGSCKIHKNV IG+IAMN LSELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
            KD AGVA MRKMIKSQG KTTPGWSWIEIGEQVHKFVVDDKSHRYS+EV+EKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           AS FGY+GDES+SSLD+ ST +TL+TSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVSVAFNREIIVRDRVRFHHFKGG CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYW 579

BLAST of Tan0021631 vs. ExPASy TrEMBL
Match: A0A6J1D832 (pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=3673 GN=LOC111018226 PE=3 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 4.3e-310
Identity = 516/579 (89.12%), Postives = 549/579 (94.82%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+TLLQGCNSLNKLRKIHAHVI+SGLRHH AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           F  M+CPQTEAWNSIIRGFAQS+SPI+AVVYYNQMVW S S PDTFTFSFVLKACERLKA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVIICTNL+KCY AMG +C+AQ+VFDKMP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           +SQQGLHQEAL+ YNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHR AREKGLVES
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKD+FTWNSMIVGYGVHGRG+EAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+SVTFLGLLCGCSHQGLVQEGVK+FNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE I NSS NDPVLWRILLGSCKIHKNVGIG+IAMNNLS+LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
           VK+T+GV  MRKMI+SQGIKTTPGWSWIEIGEQVHKFVVDDKSHRY IEV+EKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           ASLFGYIGD   S+ DVFST++ LETSC+YHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF+KAVS+AFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of Tan0021631 vs. ExPASy TrEMBL
Match: A0A1S4DU66 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN=LOC103485901 PE=3 SV=1)

HSP 1 Score: 1070.8 bits (2768), Expect = 1.9e-309
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSL +LRKIHAHVIVSGL HH+AI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPIDA+V+YNQMVW SFS  DTFTFSFVLKACER+KA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVI+CTNLVKCY AMGSV IA++VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHR ARE GL +S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNS+TFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE +SNSS ND VLWR LLGSCKIHKNV IG+IAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
             D AGV+ MRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKS+RYSIEV+EKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           AS FGY+GDESVSSLDV ST +TL+TSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVSVAFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of Tan0021631 vs. ExPASy TrEMBL
Match: A0A5A7TXJ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G001800 PE=3 SV=1)

HSP 1 Score: 1070.8 bits (2768), Expect = 1.9e-309
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCNSL +LRKIHAHVIVSGL HH+AI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPIDA+V+YNQMVW SFS  DTFTFSFVLKACER+KA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVI+CTNLVKCY AMGSV IA++VFDKMPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLHQEALQ YNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHR ARE GL +S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRMQRKDIFTWNSMIVGYGVHGRGSEAI CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNS+TFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALE +SNSS ND VLWR LLGSCKIHKNV IG+IAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
             D AGV+ MRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKS+RYSIEV+EKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           AS FGY+GDESVSSLDV ST +TL+TSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSFIKAVSVAFNREIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of Tan0021631 vs. ExPASy TrEMBL
Match: A0A6J1FBZ0 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3662 GN=LOC111444025 PE=3 SV=1)

HSP 1 Score: 1068.1 bits (2761), Expect = 1.2e-308
Identity = 522/579 (90.16%), Postives = 545/579 (94.13%), Query Frame = 0

Query: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIVTLLQGCNSLNKLRKIHAHV+VSGLRHH+AI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPIDAVVYYNQMV  SFSSPDTFTFSFVLKACERLKA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVIICTNLVKCY AMGSVCIA +VFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVES 240
           FSQQGLH EALQ+YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHR AREKGLVES
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRM RKDIFTWNSMIVGYGVHGRG+EAI CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMS  FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALETISNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 420
           KALETI NSS NDPVLWRILLGSCKIHKNVG+G+IAMNNL+ELGATNAGDCILLATIYAG
Sbjct: 361 KALETIRNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLNELGATNAGDCILLATIYAG 420

Query: 421 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 480
           V DTAGVASMRK IKSQGIKT+PGWSWIEIGEQVHKFVVDDKSHR SIEV+EKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540
           ASLFGY+ D            +TL+TS TYHSEKLAIAFGLARTADGT IRIVKNLRVCR
Sbjct: 481 ASLFGYVID-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF+KAVSVAF+REIIVRDRVRFHHFKGG+CSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of Tan0021631 vs. TAIR 10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 716.5 bits (1848), Expect = 1.8e-206
Identity = 343/579 (59.24%), Postives = 443/579 (76.51%), Query Frame = 0

Query: 3   KEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSGSLAYAQLLFH 62
           K + IV +LQGCNS+ KLRKIH+HVI++GL+HH +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-PQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAE 122
             +  P T  WN +IRGF+ SSSP++++++YN+M+  S S PD FTF+F LK+CER+K+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D I+ T+LV+CY A GSV IA +VFD+MP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESL 242
           S  GLH +AL +Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR+A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEA 302
           +V NALIDMYAKCGSL+ AI +F+ M+++D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEK 362
            ++PN++TFLGLL GCSHQGLV+EGV++F +MS  F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALETISNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 422
           +LE I  SS + DPVLWR LLGSCKIH+N+ +G++AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 482
             D    ASMRK+I+S  ++T PGWSWIEIG+QVHKFVVDDK H  S  ++ +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 542
           A L GY  ++S  +    S  + L ++ T HSEKLAIA+GL RT  GT +RI KNLRVCR
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of Tan0021631 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 449.5 bits (1155), Expect = 4.0e-126
Identity = 232/579 (40.07%), Postives = 355/579 (61.31%), Query Frame = 0

Query: 8   VTLLQ--GCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           + LLQ  G +S+ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MECP-QTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAER 127
           +E P     WN++IRG+A+  + I A   Y +M       PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + +  +L+  Y   G V  A +VFDKMP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLY 247
           + G  +EAL +Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H    + GL  +L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEAR 307
             N L+D+YA+CG +++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -MQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  +TF+G+L  CSH G+V+EG +YF  M   +++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALETISNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAG 427
           A E I +     + V+WR LLG+C +H +  + + A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 VKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQ 487
            +  + V  +RK +   G+K  PG S +E+G +VH+F++ DKSH  S  ++ KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 547
               GY+    +S++ V    +  E +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHFK G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0021631 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 429.9 bits (1104), Expect = 3.3e-120
Identity = 221/574 (38.50%), Postives = 347/574 (60.45%), Query Frame = 0

Query: 8   VTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAIS-VSGSLAYAQLLFHQMEC 67
           + L+  CNSL +L +I A+ I S +   ++   KL+NFC  S    S++YA+ LF  M  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHI-EDVSFVAKLINFCTESPTESSMSYARHLFEAMSE 92

Query: 68  PQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKAERKCKE 127
           P    +NS+ RG+++ ++P++    + +++      PD +TF  +LKAC   KA  + ++
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEIL-EDGILPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 128 VHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISCFSQQGL 187
           +H   ++ G D +V +C  L+  Y     V  A+ VFD++    +V +NAMI+ ++++  
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 188 HQEALQIYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLYVGNA 247
             EAL ++ +M+ + +  +  TL+ ++SSCA LG+L++G  +H+ A++    + + V  A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 248 LIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEARMQPN 307
           LIDM+AKCGSLD A+ IF++M+ KD   W++MIV Y  HG+  +++  F++M    +QP+
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 308 SVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEKALETI 367
            +TFLGLL  CSH G V+EG KYF+ M   F + P +KHYG +VDL  RAG LE A E I
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 368 SNSSTN-DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAGVKDTA 427
                +  P+LWRILL +C  H N+ + +     + EL  ++ GD ++L+ +YA  K   
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 428 GVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQASLFG 487
            V S+RK++K +     PG S IE+   VH+F   D     + ++H  L E++ +  L G
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSG 512

Query: 488 YIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCRDCHSF 547
           Y+ D S+  +      Q  E +  YHSEKLAI FGL  T  GT IR+VKNLRVCRDCH+ 
Sbjct: 513 YVPDTSM-VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

Query: 548 IKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
            K +S+ F R++++RD  RFHHF+ G+CSC D+W
Sbjct: 573 AKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0021631 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 421.0 bits (1081), Expect = 1.5e-117
Identity = 226/630 (35.87%), Postives = 358/630 (56.83%), Query Frame = 0

Query: 2   SKEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAIGNKLLNFCAIS--VSGSLAYAQL 61
           S   ++   +  C ++  L +IHA  I SG         ++L FCA S      L YA  
Sbjct: 21  SHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHK 80

Query: 62  LFHQMECPQTEAWNSIIRGFAQS--SSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACER 121
           +F+QM      +WN+IIRGF++S     + A+  + +M+   F  P+ FTF  VLKAC +
Sbjct: 81  IFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAK 140

Query: 122 LKAERKCKEVHGSVIRCGYDGDVIICTNLVKC---------------------------- 181
               ++ K++HG  ++ G+ GD  + +NLV+                             
Sbjct: 141 TGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTD 200

Query: 182 -----------------YGAMGSVCIAQRVFDKMPARDLVAWNAMISCFSQQGLHQEALQ 241
                            Y  +G    A+ +FDKM  R +V+WN MIS +S  G  ++A++
Sbjct: 201 RRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVE 260

Query: 242 IYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVESLYVGNALIDMYA 301
           ++ +M+  ++  +  TLV ++ + + LG+L +G  +H  A + G+     +G+ALIDMY+
Sbjct: 261 VFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYS 320

Query: 302 KCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQMLEARMQPNSVTFLG 361
           KCG +++AI +F+R+ R+++ TW++MI G+ +HG+  +AI CF +M +A ++P+ V ++ 
Sbjct: 321 KCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYIN 380

Query: 362 LLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKLEKALETISNSSTN 421
           LL  CSH GLV+EG +YF+ M  +  L P ++HYGC+VDL GR+G L++A E I N    
Sbjct: 381 LLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIK 440

Query: 422 -DPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIYAGVKDTAGVASMR 481
            D V+W+ LLG+C++  NV +GK   N L ++   ++G  + L+ +YA   + + V+ MR
Sbjct: 441 PDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMR 500

Query: 482 KMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVIHQASLFGY--IGD 541
             +K + I+  PG S I+I   +H+FVV+D SH  + E++  L E+  +  L GY  I  
Sbjct: 501 LRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITT 560

Query: 542 ESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCRDCHSFIKAV 580
           + + +L+     +  E    YHSEK+A AFGL  T+ G  IRIVKNLR+C DCHS IK +
Sbjct: 561 QVLLNLE----EEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLI 620

BLAST of Tan0021631 vs. TAIR 10
Match: AT1G31920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 416.8 bits (1070), Expect = 2.9e-116
Identity = 215/581 (37.01%), Postives = 355/581 (61.10%), Query Frame = 0

Query: 3   KEKAIVTLLQGCNSLNKLRKIHAHVIVSGLRHHIAI-GNKLLNFCAIS-VSGSLAYAQLL 62
           KE+  + LL+ C+++++ +++HA  I   L +  +   + +L  CA S    S+ YA  +
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 63  FHQMECPQTEAWNSIIRGFAQSSSPIDAVVYYNQMVWRSFSSPDTFTFSFVLKACERLKA 122
           F  ++ P T  +N++IRG+    S  +A+ +YN+M+ R  + PD FT+  +LKAC RLK+
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRG-NEPDNFTYPCLLKACTRLKS 148

Query: 123 ERKCKEVHGSVIRCGYDGDVIICTNLVKCYGAMGSVCIAQRVFDKMPARDLVAWNAMISC 182
            R+ K++HG V + G + DV +  +L+  YG  G + ++  VF+K+ ++   +W++M+S 
Sbjct: 149 IREGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSA 208

Query: 183 FSQQGLHQEALQIYNQMRSE-NVDVDGFTLVGLISSCAHLGALNIGVQMHRLAREKGLVE 242
            +  G+  E L ++  M SE N+  +   +V  + +CA+ GALN+G+ +H          
Sbjct: 209 RAGMGMWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISEL 268

Query: 243 SLYVGNALIDMYAKCGSLDQAIRIFDRMQRKDIFTWNSMIVGYGVHGRGSEAICCFQQML 302
           ++ V  +L+DMY KCG LD+A+ IF +M++++  T+++MI G  +HG G  A+  F +M+
Sbjct: 269 NIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMI 328

Query: 303 EARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSFMFRLRPEVKHYGCLVDLYGRAGKL 362
           +  ++P+ V ++ +L  CSH GLV+EG + F  M    ++ P  +HYGCLVDL GRAG L
Sbjct: 329 KEGLEPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLL 388

Query: 363 EKALETI-SNSSTNDPVLWRILLGSCKIHKNVGIGKIAMNNLSELGATNAGDCILLATIY 422
           E+ALETI S     + V+WR  L  C++ +N+ +G+IA   L +L + N GD +L++ +Y
Sbjct: 389 EEALETIQSIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLY 448

Query: 423 AGVKDTAGVASMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVHEKLREVI 482
           +  +    VA  R  I  +G+K TPG+S +E+  + H+FV  D+SH    E+++ L ++ 
Sbjct: 449 SQGQMWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQME 508

Query: 483 HQASLFGYIGDESVSSLDVFSTTQTLETSCTYHSEKLAIAFGLARTADGTQIRIVKNLRV 542
            Q    GY  D +   L+V    +  +     HS+K+AIAFGL  T  G+ I+I +NLR+
Sbjct: 509 WQLKFEGYSPDLTQILLNV--DEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRM 568

Query: 543 CRDCHSFIKAVSVAFNREIIVRDRVRFHHFKGGRCSCNDYW 580
           C DCH++ K +S+ + REI+VRDR RFH FKGG CSC DYW
Sbjct: 569 CSDCHTYTKKISMIYEREIVVRDRNRFHLFKGGTCSCKDYW 606

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LXY52.5e-20559.24Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX... [more]
A8MQA35.7e-12540.07Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK934.7e-11938.50Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9FI802.2e-11635.87Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q9C6T24.1e-11537.01Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038890323.10.0e+0092.57pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_03... [more]
XP_004152881.10.0e+0090.16pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativu... [more]
XP_022149932.18.9e-31089.12pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_... [more]
XP_023537237.12.2e-30990.16pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pep... [more]
XP_016899519.14.0e-30989.98PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] ... [more]
Match NameE-valueIdentityDescription
A0A0A0LH200.0e+0090.16DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0741... [more]
A0A6J1D8324.3e-31089.12pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=... [more]
A0A1S4DU661.9e-30989.98pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7TXJ91.9e-30989.98Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1FBZ01.2e-30890.16pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT3G56550.11.8e-20659.24Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.14.0e-12640.07Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.13.3e-12038.50Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G48910.11.5e-11735.87Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G31920.12.9e-11637.01Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 224..338
e-value: 2.5E-25
score: 91.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 3..125
e-value: 1.1E-12
score: 49.6
coord: 126..223
e-value: 1.1E-18
score: 69.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 273..306
e-value: 3.1E-7
score: 28.2
coord: 245..272
e-value: 1.5E-5
score: 22.9
coord: 172..205
e-value: 7.0E-8
score: 30.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 346..364
e-value: 0.085
score: 13.1
coord: 71..98
e-value: 0.0098
score: 16.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 270..318
e-value: 5.5E-10
score: 39.3
coord: 170..217
e-value: 8.1E-10
score: 38.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 11.246351
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 11.586152
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 240..270
score: 9.602157
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 443..569
e-value: 1.1E-32
score: 112.6
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 10..552
NoneNo IPR availablePANTHERPTHR47928:SF8SUBFAMILY NOT NAMEDcoord: 10..552
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 173..374

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021631.1Tan0021631.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding