Tan0017390 (gene) Snake gourd v1

Overview
NameTan0017390
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG02: 95446069 .. 95448751 (+)
RNA-Seq ExpressionTan0017390
SyntenyTan0017390
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCGAAACTGCTGAATGAGGATTTGCAGAAGAATTCGTTTTTACGTTCTTGGTTAAATGCAATCTCAACTTACAAAATCCAAGGTATTGCGTATAATCAACAATGCATCAGCTTCCAGACCCAACCCTCGTGCGGCGGAGCAGAACTGCTTAGCGTTGCTTCAGGCCTGCAACGCGCTACCGAAGCTCACCCAAATCCACGCCCACATCCTCAAGTTGGGCCTTCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCCTCTGTTATCAATGCCACCGATTACGCTGCCTCTTTCTTGTTCTCTGCCGATGCCGATACTCGGCTTTACGATGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGGTCACTCGAAGGCCAAAGCATTGTCTTTGTATTGTATAATGCTTCACGATGGGATTTTGCCTAATAAGTTCACGTACCCATTTGTCCTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGTCAATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATATTCATGTTCAGAACACTATGGTTCATATGTACTCTTGCTGCGCTGGTGGGATAAATTTTGCCCGGAAGGTGTTTGATGAAATGCCCAAGTCAGATTCTGTGACTTGGAGTGCGATGATTGGAGGGTATGCTCGTGCAGGGCGCTCCACTGAAGCAGTGGCCTTATTTAGGGAGATGCAAATGGCGGAGGTTTGTCCAGATGAGATAACCATGGTCTCAATTCTTTCTGCTTGTACTGATTTGGGTGCCCTTGAACTTGGGAAGTGGATCGAAGCTTATATAGCTTTACATGGAATTCAGAAACCAGTAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAGTGTGGCGATATTAGTAAAGCTTTGAAATTGTTCAGAACTATGAGTGAGAAAACAATAGTTTCTTGGACTTCTGTTATCGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCTATTTGTTTATTTGAAGAGATGATAGGTTCTGGTGTTGCTCCGGACGATGTTGCCTTTATCGGTTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTTGAAAGAGGTAGGGAATATTTCAGTTCTATGATGAAGAAATATAAGCATGTTCCTAAGATAGAACACTATGGATGCATGGTGGACATGTATTGCAGGACTGGACTTGTGAAGGAGGCTCTCGAGTTCGTCCATAACATGCCGATCGAGCCAAATCCGGTAATTTTACGAACACTAGTCAGTGCTTGCCGTGGCCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGTTAATGAGACATGAGCCTATGCATGGGTCAAACTATGTCTTGCTTTCCAACATTTATGCGAAAATGTTCAGTTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTAAAAGGAATGAAAAAGGTTCCAGGGAGCACTATGATTGAGATTGATAATGAAATCTATGAATTTGTTGCTGGAGATAAATCTCATAAACAGTTCAAGGAAATCTACGAGATGGTGGACGAGATGGGGAGAGAGATGAAGAAATCTGGATATCGTCCTTCGACATCGGAAGTTTTGCTCGACATCAACGAAGAAGACAAAGAAGATACTTTGAATAGGCATAGTGAAAAACTGGCTATTGCATTTGGTCTTCTTAATACTCCACCTGGAACTCCGATTCGAATCGTAAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGGCCGTGAAATCATAATGAGGGACCGTAATCGGTTTCACCACTTCAAGGCTGGGCTGTGCTCGTGTGGAGATTTCTGGTGAAGTATATAATAACACCACCAACATATCTTCCAGAATCCCTGGAGAAACTATGGATCTTCGACGAAATGAGTACTCAGCAGGCAATCGATAGCGATTCGGTTCATCAATGCCTTGCATCCATTCTGCAGCTGAGGCATTACATGTTAAAACTTGGAAGAATATTCCAAAGATGCAATTGATAATGTTATATATCTCTTTCATTTGAGACAAAGCTCGACCACATTTGGCGGCCATATACTTGAGATTTGAGAGCTTCAACTTCAACTGAAAAAAAATCAGTTGTTTTTGCTTGGAAGTTTTCTTTTATCAGGCTATGCTGCAAGTAGAATGAGACAGCTAGCATCTTCGGCGAACAGCTTGATCTCGACGGCGATCAAACTAACAACTCGGTAACGTTCAACTTATCTTCAACTCTATCGCAAAAAATCTTTTAAGTCATCATAGTTTATGCTTAAAGATTCTTTAATTGGATTCTTCCAAGTCATGAACTTTTTAATTCAATCTGAATGGCAGGCATGTCAAATTTTAGAAAATCACTCGGCGCCCCAAACAGCCCTTAAGAAATTCCGTACACAGCATAGGATGCAGAGATCGTACTTGAGATTATAGAAAATCATTAAGGTAGGCTTAGCCAGGTAATATCATATGATTGTTACTTATGGATAGAATAATTGATTTACTATAAAGTCTGGATTAAAGATATGAACCTTCCTTTTTTATACTTCTATGTTTATAGGAAGTCTTTAAAGGTAGCTTTAACTTAACTAGCCAATCATCCCCTAAATTATTAAACAAATTTCTGAAATAAT

mRNA sequence

CTCGAAACTGCTGAATGAGGATTTGCAGAAGAATTCGTTTTTACGTTCTTGGTTAAATGCAATCTCAACTTACAAAATCCAAGGTATTGCGTATAATCAACAATGCATCAGCTTCCAGACCCAACCCTCGTGCGGCGGAGCAGAACTGCTTAGCGTTGCTTCAGGCCTGCAACGCGCTACCGAAGCTCACCCAAATCCACGCCCACATCCTCAAGTTGGGCCTTCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCCTCTGTTATCAATGCCACCGATTACGCTGCCTCTTTCTTGTTCTCTGCCGATGCCGATACTCGGCTTTACGATGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGGTCACTCGAAGGCCAAAGCATTGTCTTTGTATTGTATAATGCTTCACGATGGGATTTTGCCTAATAAGTTCACGTACCCATTTGTCCTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGTCAATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATATTCATGTTCAGAACACTATGGTTCATATGTACTCTTGCTGCGCTGGTGGGATAAATTTTGCCCGGAAGGTGTTTGATGAAATGCCCAAGTCAGATTCTGTGACTTGGAGTGCGATGATTGGAGGGTATGCTCGTGCAGGGCGCTCCACTGAAGCAGTGGCCTTATTTAGGGAGATGCAAATGGCGGAGGTTTGTCCAGATGAGATAACCATGGTCTCAATTCTTTCTGCTTGTACTGATTTGGGTGCCCTTGAACTTGGGAAGTGGATCGAAGCTTATATAGCTTTACATGGAATTCAGAAACCAGTAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAGTGTGGCGATATTAGTAAAGCTTTGAAATTGTTCAGAACTATGAGTGAGAAAACAATAGTTTCTTGGACTTCTGTTATCGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCTATTTGTTTATTTGAAGAGATGATAGGTTCTGGTGTTGCTCCGGACGATGTTGCCTTTATCGGTTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTTGAAAGAGGTAGGGAATATTTCAGTTCTATGATGAAGAAATATAAGCATGTTCCTAAGATAGAACACTATGGATGCATGGTGGACATGTATTGCAGGACTGGACTTGTGAAGGAGGCTCTCGAGTTCGTCCATAACATGCCGATCGAGCCAAATCCGGTAATTTTACGAACACTAGTCAGTGCTTGCCGTGGCCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGTTAATGAGACATGAGCCTATGCATGGGTCAAACTATGTCTTGCTTTCCAACATTTATGCGAAAATGTTCAGTTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTAAAAGGAATGAAAAAGGTTCCAGGGAGCACTATGATTGAGATTGATAATGAAATCTATGAATTTGTTGCTGGAGATAAATCTCATAAACAGTTCAAGGAAATCTACGAGATGGTGGACGAGATGGGGAGAGAGATGAAGAAATCTGGATATCGTCCTTCGACATCGGAAGTTTTGCTCGACATCAACGAAGAAGACAAAGAAGATACTTTGAATAGGCATAGTGAAAAACTGGCTATTGCATTTGGTCTTCTTAATACTCCACCTGGAACTCCGATTCGAATCGTAAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGGCCGTGAAATCATAATGAGGGACCGTAATCGGTTTCACCACTTCAAGGCTGGGCTGTGCTCGTGTGGAGATTTCTGGTGAAGTATATAATAACACCACCAACATATCTTCCAGAATCCCTGGAGAAACTATGGATCTTCGACGAAATGAGTACTCAGCAGGCAATCGATAGCGATTCGGTTCATCAATGCCTTGCATCCATTCTGCAGCTGAGGCATTACATGTTAAAACTTGGAAGAATATTCCAAAGATGCAATTGATAATGTTATATATCTCTTTCATTTGAGACAAAGCTCGACCACATTTGGCGGCCATATACTTGAGATTTGAGAGCTTCAACTTCAACTGAAAAAAAATCAGTTGTTTTTGCTTGGAAGTTTTCTTTTATCAGGCTATGCTGCAAGTAGAATGAGACAGCTAGCATCTTCGGCGAACAGCTTGATCTCGACGGCGATCAAACTAACAACTCGGCATGTCAAATTTTAGAAAATCACTCGGCGCCCCAAACAGCCCTTAAGAAATTCCGTACACAGCATAGGATGCAGAGATCGTACTTGAGATTATAGAAAATCATTAAGGTAGGCTTAGCCAGGTAATATCATATGATTGTTACTTATGGATAGAATAATTGATTTACTATAAAGTCTGGATTAAAGATATGAACCTTCCTTTTTTATACTTCTATGTTTATAGGAAGTCTTTAAAGGTAGCTTTAACTTAACTAGCCAATCATCCCCTAAATTATTAAACAAATTTCTGAAATAAT

Coding sequence (CDS)

ATGCAATCTCAACTTACAAAATCCAAGGTATTGCGTATAATCAACAATGCATCAGCTTCCAGACCCAACCCTCGTGCGGCGGAGCAGAACTGCTTAGCGTTGCTTCAGGCCTGCAACGCGCTACCGAAGCTCACCCAAATCCACGCCCACATCCTCAAGTTGGGCCTTCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCCTCTGTTATCAATGCCACCGATTACGCTGCCTCTTTCTTGTTCTCTGCCGATGCCGATACTCGGCTTTACGATGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGGTCACTCGAAGGCCAAAGCATTGTCTTTGTATTGTATAATGCTTCACGATGGGATTTTGCCTAATAAGTTCACGTACCCATTTGTCCTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGTCAATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATATTCATGTTCAGAACACTATGGTTCATATGTACTCTTGCTGCGCTGGTGGGATAAATTTTGCCCGGAAGGTGTTTGATGAAATGCCCAAGTCAGATTCTGTGACTTGGAGTGCGATGATTGGAGGGTATGCTCGTGCAGGGCGCTCCACTGAAGCAGTGGCCTTATTTAGGGAGATGCAAATGGCGGAGGTTTGTCCAGATGAGATAACCATGGTCTCAATTCTTTCTGCTTGTACTGATTTGGGTGCCCTTGAACTTGGGAAGTGGATCGAAGCTTATATAGCTTTACATGGAATTCAGAAACCAGTAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAGTGTGGCGATATTAGTAAAGCTTTGAAATTGTTCAGAACTATGAGTGAGAAAACAATAGTTTCTTGGACTTCTGTTATCGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCTATTTGTTTATTTGAAGAGATGATAGGTTCTGGTGTTGCTCCGGACGATGTTGCCTTTATCGGTTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTTGAAAGAGGTAGGGAATATTTCAGTTCTATGATGAAGAAATATAAGCATGTTCCTAAGATAGAACACTATGGATGCATGGTGGACATGTATTGCAGGACTGGACTTGTGAAGGAGGCTCTCGAGTTCGTCCATAACATGCCGATCGAGCCAAATCCGGTAATTTTACGAACACTAGTCAGTGCTTGCCGTGGCCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGTTAATGAGACATGAGCCTATGCATGGGTCAAACTATGTCTTGCTTTCCAACATTTATGCGAAAATGTTCAGTTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTAAAAGGAATGAAAAAGGTTCCAGGGAGCACTATGATTGAGATTGATAATGAAATCTATGAATTTGTTGCTGGAGATAAATCTCATAAACAGTTCAAGGAAATCTACGAGATGGTGGACGAGATGGGGAGAGAGATGAAGAAATCTGGATATCGTCCTTCGACATCGGAAGTTTTGCTCGACATCAACGAAGAAGACAAAGAAGATACTTTGAATAGGCATAGTGAAAAACTGGCTATTGCATTTGGTCTTCTTAATACTCCACCTGGAACTCCGATTCGAATCGTAAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGGCCGTGAAATCATAATGAGGGACCGTAATCGGTTTCACCACTTCAAGGCTGGGCTGTGCTCGTGTGGAGATTTCTGGTGA

Protein sequence

MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCSCGDFW
Homology
BLAST of Tan0017390 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 509.2 bits (1310), Expect = 6.3e-143
Identity = 257/580 (44.31%), Postives = 384/580 (66.21%), Query Frame = 0

Query: 29  QNCLALLQACNALPKLTQIHAHILKLGLHNNPLV--LTKFASISSVINATDYAASFLFSA 88
           QN + L+  CN+L +L QI A+ +K  + +   V  L  F + S   ++  Y A  LF A
Sbjct: 30  QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSY-ARHLFEA 89

Query: 89  DADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFTYPFVLKACAGLEV 148
            ++    D  +FN++ R Y++   +  +  SL+  +L DGILP+ +T+P +LKACA  + 
Sbjct: 90  MSEP---DIVIFNSMARGYSRF-TNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKA 149

Query: 149 LNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAMIG 208
           L  G+ +H   +K G D +++V  T+++MY+ C   ++ AR VFD + +   V ++AMI 
Sbjct: 150 LEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE-DVDSARCVFDRIVEPCVVCYNAMIT 209

Query: 209 GYARAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGIQK 268
           GYAR  R  EA++LFREMQ   + P+EIT++S+LS+C  LG+L+LGKWI  Y   H   K
Sbjct: 210 GYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCK 269

Query: 269 PVEVSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 328
            V+V+ ALIDMFAKCG +  A+ +F  M  K   +W+++IV  A HG+ ++++ +FE M 
Sbjct: 270 YVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMR 329

Query: 329 GSGVAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRTGLV 388
              V PD++ F+GLL+ACSH+G VE GR+YFS M+ K+  VP I+HYG MVD+  R G +
Sbjct: 330 SENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNL 389

Query: 389 KEALEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLSNIY 448
           ++A EF+  +PI P P++ R L++AC  H    L EK+++ +   +  HG +YV+LSN+Y
Sbjct: 390 EDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLY 449

Query: 449 AKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 508
           A+   WE    +R+VM+ +   KVPG + IE++N ++EF +GD       +++  +DEM 
Sbjct: 450 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 509

Query: 509 REMKKSGYRPSTSEVL-LDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLRVC 568
           +E+K SGY P TS V+  ++N+++KE TL  HSEKLAI FGLLNTPPGT IR+VKNLRVC
Sbjct: 510 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 569

Query: 569 SDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCSCGDFW 606
            DCH+A+K IS I+GR++++RD  RFHHF+ G CSCGDFW
Sbjct: 570 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0017390 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 504.2 bits (1297), Expect = 2.0e-141
Identity = 258/582 (44.33%), Postives = 381/582 (65.46%), Query Frame = 0

Query: 29  QNCLALLQ--ACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINATDYAASFLFSA 88
           + C+ LLQ    +++ KL QIHA  ++ G+  +   L K      V   +    S+    
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 89  DAD-TRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGIL-PNKFTYPFVLKACAGL 148
            +   +  + F++NTLIR YA+ G+S   A SLY  M   G++ P+  TYPF++KA   +
Sbjct: 76  FSKIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 135

Query: 149 EVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAM 208
             + LG+++H  V++ GF   I+VQN+++H+Y+ C G +  A KVFD+MP+ D V W+++
Sbjct: 136 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWNSV 195

Query: 209 IGGYARAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGI 268
           I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+   G+
Sbjct: 196 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 255

Query: 269 QKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEE 328
            + +  SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EAI LF+ 
Sbjct: 256 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 315

Query: 329 MIGS-GVAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRT 388
           M  + G+ P ++ F+G+L ACSH G+V+ G EYF  M ++YK  P+IEH+GCMVD+  R 
Sbjct: 316 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 375

Query: 389 GLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLS 448
           G VK+A E++ +MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVLLS
Sbjct: 376 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 435

Query: 449 NIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVD 508
           N+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  + 
Sbjct: 436 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 495

Query: 509 EMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLR 568
           EM   ++  GY P  S V +D+ EE+KE+ +  HSEK+AIAF L++TP  +PI +VKNLR
Sbjct: 496 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 555

Query: 569 VCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCSCGDFW 606
           VC+DCH A K +SK+Y REI++RDR+RFHHFK G CSC D+W
Sbjct: 556 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0017390 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 6.1e-138
Identity = 278/725 (38.34%), Postives = 384/725 (52.97%), Query Frame = 0

Query: 17  ASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNPLVLTK---FASISSVI 76
           +S+  P         L+LL  C  L  L  IHA ++K+GLHN    L+K   F  +S   
Sbjct: 22  SSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHF 81

Query: 77  NATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFT 136
               YA S +F    +  L    ++NT+ R +A +      AL LY  M+  G+LPN +T
Sbjct: 82  EGLPYAIS-VFKTIQEPNL---LIWNTMFRGHALSS-DPVSALKLYVCMISLGLLPNSYT 141

Query: 137 YPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMY----------------- 196
           +PFVLK+CA  +    GQ +HG V+K G D D++V  +++ MY                 
Sbjct: 142 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 201

Query: 197 -------------SCCAGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFRE 256
                            G I  A+K+FDE+P  D V+W+AMI GYA  G   EA+ LF++
Sbjct: 202 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 261

Query: 257 MQMAEVCPDE-------------------------------------------------- 316
           M    V PDE                                                  
Sbjct: 262 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 321

Query: 317 ---------------------------------------------------ITMVSILSA 376
                                                              +TM+SIL A
Sbjct: 322 LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPA 381

Query: 377 CTDLGALELGKWIEAYI--ALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 436
           C  LGA+++G+WI  YI   L G+     +  +LIDM+AKCGDI  A ++F ++  K++ 
Sbjct: 382 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 441

Query: 437 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSM 496
           SW ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M
Sbjct: 442 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 501

Query: 497 MKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKL 556
            + YK  PK+EHYGCM+D+   +GL KEA E ++ M +EP+ VI  +L+ AC+ HG  +L
Sbjct: 502 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 561

Query: 557 GEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 606
           GE   + L++ EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+
Sbjct: 562 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 621

BLAST of Tan0017390 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 5.9e-133
Identity = 240/606 (39.60%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 32  LALLQACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINATDYAASFLFSADADTR 91
           ++ LQ C+   +L QIHA +LK GL  +   +TKF S      ++D+        D   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  LYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFTYPFVLKACAGLEVLNLGQ 151
             D FL+N +IR ++ +   + ++L LY  ML      N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFSCSDEPE-RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 152 SVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAMIGGYARA 211
            +H  + K G++ D++  N++++ Y+   G    A  +FD +P+ D V+W+++I GY +A
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 212 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSIL 271
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 272 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 331
           SAC  LGALE GKWI +Y+    I+    +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 332 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSM 391
           +WT++I G A HG G+EAI  F EM   G+ P+ + F  +L+ACS++GLVE G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 392 MKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKL 451
            + Y   P IEHYGC+VD+  R GL+ EA  F+  MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 452 GEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 511
           GE+I ++L+  +P HG  YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 512 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLD-INEEDKEDTLNRHSE 571
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD ++++++E  +++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 572 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 606
           KLAI +GL+ T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617

BLAST of Tan0017390 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 471.5 bits (1212), Expect = 1.5e-131
Identity = 249/636 (39.15%), Postives = 377/636 (59.28%), Query Frame = 0

Query: 16  NASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINA 75
           N+ AS  +P +   +    +  C  +  L+QIHA  +K G   + L   +     +  + 
Sbjct: 13  NSPAS--SPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDL 72

Query: 76  TDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAK-ALSLYCIMLHDGIL-PNKFT 135
                 +           + F +NT+IR ++++   KA  A++L+  M+ D  + PN+FT
Sbjct: 73  HHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFT 132

Query: 136 YPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA------------- 195
           +P VLKACA    +  G+ +HG  +K+GF  D  V + +V MY  C              
Sbjct: 133 FPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNI 192

Query: 196 -------------------------------GGINFARKVFDEMPKSDSVTWSAMIGGYA 255
                                          G    AR +FD+M +   V+W+ MI GY+
Sbjct: 193 IEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYS 252

Query: 256 RAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGIQKPVE 315
             G   +AV +FREM+  ++ P+ +T+VS+L A + LG+LELG+W+  Y    GI+    
Sbjct: 253 LNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDV 312

Query: 316 VSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEEMIGSG 375
           + +ALIDM++KCG I KA+ +F  +  + +++W+++I G A+HG+  +AI  F +M  +G
Sbjct: 313 LGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAG 372

Query: 376 VAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRTGLVKEA 435
           V P DVA+I LL+ACSH GLVE GR YFS M+      P+IEHYGCMVD+  R+GL+ EA
Sbjct: 373 VRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEA 432

Query: 436 LEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLSNIYAKM 495
            EF+ NMPI+P+ VI + L+ ACR  G  ++G+++  +LM   P     YV LSN+YA  
Sbjct: 433 EEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQ 492

Query: 496 FSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMGREM 555
            +W + +++R  M+ K ++K PG ++I+ID  ++EFV  D SH + KEI  M+ E+  ++
Sbjct: 493 GNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKL 552

Query: 556 KKSGYRPSTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLRVCSDCH 606
           + +GYRP T++VLL++ EEDKE+ L+ HSEK+A AFGL++T PG PIRIVKNLR+C DCH
Sbjct: 553 RLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCH 612

BLAST of Tan0017390 vs. NCBI nr
Match: XP_038884201.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1144.8 bits (2960), Expect = 0.0e+00
Identity = 564/605 (93.22%), Postives = 582/605 (96.20%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ TK+K+LR INN  AS  NPRAAEQNCLALLQACNALPKLTQIH HILKLGLHNNP
Sbjct: 1   MQSQFTKTKLLRAINNVVASTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKFASISS+I+ATDYAASFLFSA+ADTRLYDAFLFNTLIRAYAQTGHSK KALSLY 
Sbjct: 61  LVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALSLYS 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTM+HMYSCCA
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMIHMYSCCA 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
           GGIN ARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVSIL
Sbjct: 181 GGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSIL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GI KPVEVSNALIDMFAKCGDI+KALKLFR ++EKTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIHKPVEVSNALIDMFAKCGDINKALKLFRALNEKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSM 360
           SWTSVIVGMAMHGRGQEAICLFEEMI SGVAPDDV+FIGLLSACSHSGLVERGREYFSSM
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIVSGVAPDDVSFIGLLSACSHSGLVERGREYFSSM 360

Query: 361 MKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKL 420
           MKKYK  PKIEHYGCMVDMYCRTGLVKEAL+FVHNMP+EPNPVILRTLVSACRGHGEFKL
Sbjct: 361 MKKYKLAPKIEHYGCMVDMYCRTGLVKEALQFVHNMPVEPNPVILRTLVSACRGHGEFKL 420

Query: 421 GEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 480
           GEKITKLLMRHEP+H SNYVLLSNIYAKM SWEKKTKIREVMEVKGMKK+PGSTMIEIDN
Sbjct: 421 GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKIPGSTMIEIDN 480

Query: 481 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEK 540
           EIYEFVAGDKSHKQ+KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEK
Sbjct: 481 EIYEFVAGDKSHKQYKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEK 540

Query: 541 LAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCS 600
           LAIAFGLL+TPPGTPIRIVKNLRVCSDCHSASK+IS IY REIIMRDRNRFHHFK+GLCS
Sbjct: 541 LAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKYISNIYNREIIMRDRNRFHHFKSGLCS 600

Query: 601 CGDFW 606
           CGDFW
Sbjct: 601 CGDFW 605

BLAST of Tan0017390 vs. NCBI nr
Match: XP_023537014.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo] >XP_023537016.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo] >XP_023537017.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo] >XP_023537018.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1136.7 bits (2939), Expect = 0.0e+00
Identity = 567/606 (93.56%), Postives = 581/606 (95.87%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ     + R+INNA+ASR NPRAAEQNCLALLQACN+LPKLTQIHAHI KLGLHNNP
Sbjct: 1   MQSQF----LSRVINNAAASRSNPRAAEQNCLALLQACNSLPKLTQIHAHIFKLGLHNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKFASISSVINATDYAASFLFSA+ADTRLYDAFLFNTLIRA+AQTGHSKA+ALSLY 
Sbjct: 61  LVLTKFASISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVL+LGQSVHGSVVKFGFD D+HVQNTMVHMYSCC+
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCS 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
           GGI FARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVS+L
Sbjct: 181 GGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GIQKPVEVSNALIDMFAKCGDI KALKLFR MS+KTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRAMSDKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           SWTSVIVGMAMHGRGQEAI LFEEMIG SGVAPDDVAFIGLLSACSHSGLVERGREYF+S
Sbjct: 301 SWTSVIVGMAMHGRGQEAISLFEEMIGSSGVAPDDVAFIGLLSACSHSGLVERGREYFNS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEPMH SNYVLLSNIYAKMF+WEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIY REIIMRDRNRFHHFK GLC
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKGGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Tan0017390 vs. NCBI nr
Match: XP_023002279.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima] >XP_023002280.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima] >XP_023002281.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima])

HSP 1 Score: 1133.6 bits (2931), Expect = 0.0e+00
Identity = 568/606 (93.73%), Postives = 578/606 (95.38%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ     VLR+INNA+ASR NPRAAEQNCLALLQACN LPKLTQIHAHI KLGLHNNP
Sbjct: 1   MQSQF----VLRVINNATASRSNPRAAEQNCLALLQACNLLPKLTQIHAHIFKLGLHNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKF SISSVINATDYAASFLFSA+ADTRLYDAFLFNTLIRA+AQTGHSKA+ALSLY 
Sbjct: 61  LVLTKFVSISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVL+LGQSVHGSVVKFGFD D+HVQNTMVHMYSCCA
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCA 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
            GI FARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEV PDEITMVS+L
Sbjct: 181 DGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVFPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GIQKPVEVSNALIDMFAKCGDI KALKLFR MSEKTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRVMSEKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           SWTSVIVGMAMHGRGQEAICLFEEMIG SGVAPDDVAFIGLLSACSHSGLVERGREYFSS
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPN VILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNTVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEPMH SNYVLLSNIYAKMF+WEKK KIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKAKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVC+DCHSASKFISKIY REIIMRDRNRFHHFKAGLC
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKAGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Tan0017390 vs. NCBI nr
Match: KAG6585464.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1132.1 bits (2927), Expect = 0.0e+00
Identity = 563/606 (92.90%), Postives = 580/606 (95.71%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ     +LR+INNA+ASR NPRAAEQNCLALLQACN+LPKLTQIHAHI KLGL NNP
Sbjct: 1   MQSQF----LLRVINNAAASRSNPRAAEQNCLALLQACNSLPKLTQIHAHIFKLGLRNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKFASISSVINATDYAASFLFSA+ADTRLYDAFLFNTLIRA+AQTGHSKA+ALSLY 
Sbjct: 61  LVLTKFASISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVL+LGQSVHGSVVKFGFD D+HVQNTMVHMYSCC+
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCS 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
           GGI FARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVS+L
Sbjct: 181 GGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GIQKPVEVSNALIDMFAKCGDI KALKLFR MS+KTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRAMSDKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           SWTSVIVGMAMHGRGQEAICLFEEMIG S VAPDDVAFIGLLSACSHSGLVERGREYF+S
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSSSVAPDDVAFIGLLSACSHSGLVERGREYFNS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFVHNMP +PNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPFKPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEPMH SNYVLLSNIYAKMF+WEKKTKIREVMEVKG+KKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKTKIREVMEVKGLKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVC+DCHSASKFISKIY REIIMRDRNRFHHFK GLC
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKGGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Tan0017390 vs. NCBI nr
Match: XP_022131416.1 (pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131419.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131420.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia])

HSP 1 Score: 1130.5 bits (2923), Expect = 0.0e+00
Identity = 561/606 (92.57%), Postives = 580/606 (95.71%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASA-SRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNN 60
           MQSQ +K+K+L  INNA   SR NPRAAEQ+CLALLQACNALPKL QIHAHILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLY 120
           PLVLTKFASISSVI+ATDYAASFLFSA ADTRLYDAFLFNTLIRAYAQTGHSK KAL+LY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 CIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCC 180
            +ML DGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRD+HV+NTMVHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGINFARKVFDEMPKSDSVTWSAMIGGYAR GR TEAV+LFREMQ+AEVCPDEITMVSI
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTI 300
           LSACTDLGALELGKW+EAYI   GIQKP EVSNALIDMFAKCGDISKALKLF+TMSEKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           VSWTSVIVGMAMHGRGQ+AICLFEEMIGSGVAPDDVAFIGLLSACSHSG+VERGREYFSS
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYK VPKIEHYGCMVDM+CRTGLVKEALEFVH+MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LMRHEPMH SNYVLLSNIYAKM SWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIY REIIMRDRNRFHHFKAG+C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Tan0017390 vs. ExPASy TrEMBL
Match: A0A6J1KQ01 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima OX=3661 GN=LOC111496170 PE=3 SV=1)

HSP 1 Score: 1133.6 bits (2931), Expect = 0.0e+00
Identity = 568/606 (93.73%), Postives = 578/606 (95.38%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ     VLR+INNA+ASR NPRAAEQNCLALLQACN LPKLTQIHAHI KLGLHNNP
Sbjct: 1   MQSQF----VLRVINNATASRSNPRAAEQNCLALLQACNLLPKLTQIHAHIFKLGLHNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKF SISSVINATDYAASFLFSA+ADTRLYDAFLFNTLIRA+AQTGHSKA+ALSLY 
Sbjct: 61  LVLTKFVSISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVL+LGQSVHGSVVKFGFD D+HVQNTMVHMYSCCA
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCA 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
            GI FARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEV PDEITMVS+L
Sbjct: 181 DGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVFPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GIQKPVEVSNALIDMFAKCGDI KALKLFR MSEKTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRVMSEKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           SWTSVIVGMAMHGRGQEAICLFEEMIG SGVAPDDVAFIGLLSACSHSGLVERGREYFSS
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPN VILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNTVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEPMH SNYVLLSNIYAKMF+WEKK KIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKAKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVC+DCHSASKFISKIY REIIMRDRNRFHHFKAGLC
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKAGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Tan0017390 vs. ExPASy TrEMBL
Match: A0A6J1BQ70 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111004636 PE=3 SV=1)

HSP 1 Score: 1130.5 bits (2923), Expect = 0.0e+00
Identity = 561/606 (92.57%), Postives = 580/606 (95.71%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASA-SRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNN 60
           MQSQ +K+K+L  INNA   SR NPRAAEQ+CLALLQACNALPKL QIHAHILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLY 120
           PLVLTKFASISSVI+ATDYAASFLFSA ADTRLYDAFLFNTLIRAYAQTGHSK KAL+LY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 CIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCC 180
            +ML DGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRD+HV+NTMVHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGINFARKVFDEMPKSDSVTWSAMIGGYAR GR TEAV+LFREMQ+AEVCPDEITMVSI
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTI 300
           LSACTDLGALELGKW+EAYI   GIQKP EVSNALIDMFAKCGDISKALKLF+TMSEKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           VSWTSVIVGMAMHGRGQ+AICLFEEMIGSGVAPDDVAFIGLLSACSHSG+VERGREYFSS
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYK VPKIEHYGCMVDM+CRTGLVKEALEFVH+MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LMRHEPMH SNYVLLSNIYAKM SWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIY REIIMRDRNRFHHFKAG+C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Tan0017390 vs. ExPASy TrEMBL
Match: A0A6J1GHH7 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata OX=3662 GN=LOC111454235 PE=3 SV=1)

HSP 1 Score: 1130.2 bits (2922), Expect = 0.0e+00
Identity = 563/606 (92.90%), Postives = 579/606 (95.54%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNP 60
           MQSQ     +LR+I+NA+ASR NPRAAEQNCLALLQACN+LPKLTQIHAHI KLGL NNP
Sbjct: 1   MQSQF----LLRVISNAAASRSNPRAAEQNCLALLQACNSLPKLTQIHAHIFKLGLRNNP 60

Query: 61  LVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYC 120
           LVLTKFASISSVINATDYAASFLFSA+ADTRLYDAFLFNTLIRA+AQTGHSKA+ALSLY 
Sbjct: 61  LVLTKFASISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA 180
           IMLHDGILPNKFTYPFVLKACAGLEVL+LGQSVHGSVVKFGFD D+HVQNTMVHMYSCC+
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCS 180

Query: 181 GGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSIL 240
           GGI FARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVS+L
Sbjct: 181 GGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 300
           SACTDLGALELGKWIEAYI   GIQKPVEVSNALIDMFAKCGDI KALKLFR MS+KTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRAMSDKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           SWTSVIVGMAMHGRG EAICLFEEMIG S VAPDDVAFIGLLSACSHSGLVERGREYF+S
Sbjct: 301 SWTSVIVGMAMHGRGLEAICLFEEMIGSSSVAPDDVAFIGLLSACSHSGLVERGREYFNS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFVHNMP EPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPFEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEPMH SNYVLLSNIYAKMF+WEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLLNTPPGTPIRIVKNLRVC+DCHSASKFISKIY REIIMRDRNRFHHFK GLC
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKGGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Tan0017390 vs. ExPASy TrEMBL
Match: A0A5A7V9A4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G002500 PE=3 SV=1)

HSP 1 Score: 1128.2 bits (2917), Expect = 0.0e+00
Identity = 562/606 (92.74%), Postives = 577/606 (95.21%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNA-SASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNN 60
           MQSQ TK K+LR INN  ++S  NPRAAEQNCLALLQACNALPKLTQIH HILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLY 120
           PLVLTKFASISS+I+ATDYAASFLFSA+ADTRLYDAFLFNTLIRAYAQTGHSK KAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 CIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCC 180
            IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFD DIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGIN ARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTI 300
           LSACTDLGALELGKWIEAYI  HGI KPVEVSNALIDMFAKCGDISKALKLFR M+EKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVERGREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFV NMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEP+H SNYVLLSNIYAK  SWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIY REIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Tan0017390 vs. ExPASy TrEMBL
Match: A0A1S3BC37 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103488302 PE=3 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 561/606 (92.57%), Postives = 577/606 (95.21%), Query Frame = 0

Query: 1   MQSQLTKSKVLRIINNA-SASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNN 60
           MQSQ TK K+LR INN  ++S  NPRAAEQNCLALLQACNALPKLTQIH HI+KLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHIVKLGLHNN 60

Query: 61  PLVLTKFASISSVINATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLY 120
           PLVLTKFASISS+I+ATDYAASFLFSA+ADTRLYDAFLFNTLIRAYAQTGHSK KAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 CIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCC 180
            IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFD DIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGIN ARKVFDEMPKSDSVTWSAMIGGYAR GRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTI 300
           LSACTDLGALELGKWIEAYI  HGI KPVEVSNALIDMFAKCGDISKALKLFR M+EKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVERGREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYK VPKIEHYGCMVDMYCRTGLVKEALEFV NMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEP+H SNYVLLSNIYAK  SWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIY REIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Tan0017390 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 509.2 bits (1310), Expect = 4.5e-144
Identity = 257/580 (44.31%), Postives = 384/580 (66.21%), Query Frame = 0

Query: 29  QNCLALLQACNALPKLTQIHAHILKLGLHNNPLV--LTKFASISSVINATDYAASFLFSA 88
           QN + L+  CN+L +L QI A+ +K  + +   V  L  F + S   ++  Y A  LF A
Sbjct: 30  QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSY-ARHLFEA 89

Query: 89  DADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFTYPFVLKACAGLEV 148
            ++    D  +FN++ R Y++   +  +  SL+  +L DGILP+ +T+P +LKACA  + 
Sbjct: 90  MSEP---DIVIFNSMARGYSRF-TNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKA 149

Query: 149 LNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAMIG 208
           L  G+ +H   +K G D +++V  T+++MY+ C   ++ AR VFD + +   V ++AMI 
Sbjct: 150 LEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE-DVDSARCVFDRIVEPCVVCYNAMIT 209

Query: 209 GYARAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGIQK 268
           GYAR  R  EA++LFREMQ   + P+EIT++S+LS+C  LG+L+LGKWI  Y   H   K
Sbjct: 210 GYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCK 269

Query: 269 PVEVSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 328
            V+V+ ALIDMFAKCG +  A+ +F  M  K   +W+++IV  A HG+ ++++ +FE M 
Sbjct: 270 YVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMR 329

Query: 329 GSGVAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRTGLV 388
              V PD++ F+GLL+ACSH+G VE GR+YFS M+ K+  VP I+HYG MVD+  R G +
Sbjct: 330 SENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNL 389

Query: 389 KEALEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLSNIY 448
           ++A EF+  +PI P P++ R L++AC  H    L EK+++ +   +  HG +YV+LSN+Y
Sbjct: 390 EDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLY 449

Query: 449 AKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 508
           A+   WE    +R+VM+ +   KVPG + IE++N ++EF +GD       +++  +DEM 
Sbjct: 450 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 509

Query: 509 REMKKSGYRPSTSEVL-LDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLRVC 568
           +E+K SGY P TS V+  ++N+++KE TL  HSEKLAI FGLLNTPPGT IR+VKNLRVC
Sbjct: 510 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 569

Query: 569 SDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCSCGDFW 606
            DCH+A+K IS I+GR++++RD  RFHHF+ G CSCGDFW
Sbjct: 570 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0017390 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 504.2 bits (1297), Expect = 1.4e-142
Identity = 258/582 (44.33%), Postives = 381/582 (65.46%), Query Frame = 0

Query: 29  QNCLALLQ--ACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINATDYAASFLFSA 88
           + C+ LLQ    +++ KL QIHA  ++ G+  +   L K      V   +    S+    
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 89  DAD-TRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGIL-PNKFTYPFVLKACAGL 148
            +   +  + F++NTLIR YA+ G+S   A SLY  M   G++ P+  TYPF++KA   +
Sbjct: 76  FSKIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 135

Query: 149 EVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAM 208
             + LG+++H  V++ GF   I+VQN+++H+Y+ C G +  A KVFD+MP+ D V W+++
Sbjct: 136 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWNSV 195

Query: 209 IGGYARAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGI 268
           I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+   G+
Sbjct: 196 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 255

Query: 269 QKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEE 328
            + +  SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EAI LF+ 
Sbjct: 256 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 315

Query: 329 MIGS-GVAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRT 388
           M  + G+ P ++ F+G+L ACSH G+V+ G EYF  M ++YK  P+IEH+GCMVD+  R 
Sbjct: 316 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 375

Query: 389 GLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLS 448
           G VK+A E++ +MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVLLS
Sbjct: 376 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 435

Query: 449 NIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVD 508
           N+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  + 
Sbjct: 436 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 495

Query: 509 EMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLR 568
           EM   ++  GY P  S V +D+ EE+KE+ +  HSEK+AIAF L++TP  +PI +VKNLR
Sbjct: 496 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 555

Query: 569 VCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLCSCGDFW 606
           VC+DCH A K +SK+Y REI++RDR+RFHHFK G CSC D+W
Sbjct: 556 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Tan0017390 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 492.7 bits (1267), Expect = 4.3e-139
Identity = 278/725 (38.34%), Postives = 384/725 (52.97%), Query Frame = 0

Query: 17  ASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNPLVLTK---FASISSVI 76
           +S+  P         L+LL  C  L  L  IHA ++K+GLHN    L+K   F  +S   
Sbjct: 22  SSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHF 81

Query: 77  NATDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFT 136
               YA S +F    +  L    ++NT+ R +A +      AL LY  M+  G+LPN +T
Sbjct: 82  EGLPYAIS-VFKTIQEPNL---LIWNTMFRGHALSS-DPVSALKLYVCMISLGLLPNSYT 141

Query: 137 YPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMY----------------- 196
           +PFVLK+CA  +    GQ +HG V+K G D D++V  +++ MY                 
Sbjct: 142 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 201

Query: 197 -------------SCCAGGINFARKVFDEMPKSDSVTWSAMIGGYARAGRSTEAVALFRE 256
                            G I  A+K+FDE+P  D V+W+AMI GYA  G   EA+ LF++
Sbjct: 202 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 261

Query: 257 MQMAEVCPDE-------------------------------------------------- 316
           M    V PDE                                                  
Sbjct: 262 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 321

Query: 317 ---------------------------------------------------ITMVSILSA 376
                                                              +TM+SIL A
Sbjct: 322 LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPA 381

Query: 377 CTDLGALELGKWIEAYI--ALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 436
           C  LGA+++G+WI  YI   L G+     +  +LIDM+AKCGDI  A ++F ++  K++ 
Sbjct: 382 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 441

Query: 437 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSM 496
           SW ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M
Sbjct: 442 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 501

Query: 497 MKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKL 556
            + YK  PK+EHYGCM+D+   +GL KEA E ++ M +EP+ VI  +L+ AC+ HG  +L
Sbjct: 502 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 561

Query: 557 GEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 606
           GE   + L++ EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+
Sbjct: 562 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 621

BLAST of Tan0017390 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 476.1 bits (1224), Expect = 4.2e-134
Identity = 240/606 (39.60%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 32  LALLQACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINATDYAASFLFSADADTR 91
           ++ LQ C+   +L QIHA +LK GL  +   +TKF S      ++D+        D   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  LYDAFLFNTLIRAYAQTGHSKAKALSLYCIMLHDGILPNKFTYPFVLKACAGLEVLNLGQ 151
             D FL+N +IR ++ +   + ++L LY  ML      N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFSCSDEPE-RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 152 SVHGSVVKFGFDRDIHVQNTMVHMYSCCAGGINFARKVFDEMPKSDSVTWSAMIGGYARA 211
            +H  + K G++ D++  N++++ Y+   G    A  +FD +P+ D V+W+++I GY +A
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 212 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSIL 271
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 272 SACTDLGALELGKWIEAYIALHGIQKPVEVSNALIDMFAKCGDISKALKLFRTMSEKTIV 331
           SAC  LGALE GKWI +Y+    I+    +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 332 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVERGREYFSSM 391
           +WT++I G A HG G+EAI  F EM   G+ P+ + F  +L+ACS++GLVE G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 392 MKKYKHVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNPVILRTLVSACRGHGEFKL 451
            + Y   P IEHYGC+VD+  R GL+ EA  F+  MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 452 GEKITKLLMRHEPMHGSNYVLLSNIYAKMFSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 511
           GE+I ++L+  +P HG  YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 512 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLD-INEEDKEDTLNRHSE 571
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD ++++++E  +++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 572 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYGREIIMRDRNRFHHFKAGLC 606
           KLAI +GL+ T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617

BLAST of Tan0017390 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 471.5 bits (1212), Expect = 1.0e-132
Identity = 249/636 (39.15%), Postives = 377/636 (59.28%), Query Frame = 0

Query: 16  NASASRPNPRAAEQNCLALLQACNALPKLTQIHAHILKLGLHNNPLVLTKFASISSVINA 75
           N+ AS  +P +   +    +  C  +  L+QIHA  +K G   + L   +     +  + 
Sbjct: 13  NSPAS--SPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDL 72

Query: 76  TDYAASFLFSADADTRLYDAFLFNTLIRAYAQTGHSKAK-ALSLYCIMLHDGIL-PNKFT 135
                 +           + F +NT+IR ++++   KA  A++L+  M+ D  + PN+FT
Sbjct: 73  HHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFT 132

Query: 136 YPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMVHMYSCCA------------- 195
           +P VLKACA    +  G+ +HG  +K+GF  D  V + +V MY  C              
Sbjct: 133 FPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNI 192

Query: 196 -------------------------------GGINFARKVFDEMPKSDSVTWSAMIGGYA 255
                                          G    AR +FD+M +   V+W+ MI GY+
Sbjct: 193 IEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYS 252

Query: 256 RAGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIALHGIQKPVE 315
             G   +AV +FREM+  ++ P+ +T+VS+L A + LG+LELG+W+  Y    GI+    
Sbjct: 253 LNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDV 312

Query: 316 VSNALIDMFAKCGDISKALKLFRTMSEKTIVSWTSVIVGMAMHGRGQEAICLFEEMIGSG 375
           + +ALIDM++KCG I KA+ +F  +  + +++W+++I G A+HG+  +AI  F +M  +G
Sbjct: 313 LGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAG 372

Query: 376 VAPDDVAFIGLLSACSHSGLVERGREYFSSMMKKYKHVPKIEHYGCMVDMYCRTGLVKEA 435
           V P DVA+I LL+ACSH GLVE GR YFS M+      P+IEHYGCMVD+  R+GL+ EA
Sbjct: 373 VRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEA 432

Query: 436 LEFVHNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPMHGSNYVLLSNIYAKM 495
            EF+ NMPI+P+ VI + L+ ACR  G  ++G+++  +LM   P     YV LSN+YA  
Sbjct: 433 EEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQ 492

Query: 496 FSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMGREM 555
            +W + +++R  M+ K ++K PG ++I+ID  ++EFV  D SH + KEI  M+ E+  ++
Sbjct: 493 GNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKL 552

Query: 556 KKSGYRPSTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLNTPPGTPIRIVKNLRVCSDCH 606
           + +GYRP T++VLL++ EEDKE+ L+ HSEK+A AFGL++T PG PIRIVKNLR+C DCH
Sbjct: 553 RLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCH 612

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8LK936.3e-14344.31Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
A8MQA32.0e-14144.33Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9LN016.1e-13838.34Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9FJY75.9e-13339.60Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9FI801.5e-13139.15Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038884201.10.0e+0093.22pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida][more]
XP_023537014.10.0e+0093.56pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp... [more]
XP_023002279.10.0e+0093.73pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima] >X... [more]
KAG6585464.10.0e+0092.90Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022131416.10.0e+0092.57pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia]... [more]
Match NameE-valueIdentityDescription
A0A6J1KQ010.0e+0093.73pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima O... [more]
A0A6J1BQ700.0e+0092.57pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti... [more]
A0A6J1GHH70.0e+0092.90pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata... [more]
A0A5A7V9A40.0e+0092.74Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BC370.0e+0092.57pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36... [more]
Match NameE-valueIdentityDescription
AT2G02980.14.5e-14444.31Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.11.4e-14244.33Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.14.3e-13938.34Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.14.2e-13439.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.11.0e-13239.15Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 471..595
e-value: 2.3E-41
score: 140.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 272..297
e-value: 7.9E-5
score: 22.6
coord: 372..396
e-value: 0.0014
score: 18.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 272..297
e-value: 1.8E-4
score: 19.4
coord: 373..396
e-value: 0.0019
score: 16.3
coord: 199..233
e-value: 7.3E-9
score: 33.3
coord: 300..333
e-value: 2.8E-6
score: 25.1
coord: 97..130
e-value: 1.2E-4
score: 20.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 94..142
e-value: 1.0E-7
score: 32.0
coord: 197..243
e-value: 5.1E-10
score: 39.4
coord: 298..345
e-value: 1.1E-7
score: 32.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 94..129
score: 10.720209
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 197..231
score: 12.539784
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 267..297
score: 8.725252
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 298..332
score: 10.731171
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 148..254
e-value: 1.3E-22
score: 82.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 6..147
e-value: 5.9E-8
score: 34.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 267..482
e-value: 6.1E-42
score: 146.0
NoneNo IPR availablePANTHERPTHR47926:SF239SUBFAMILY NOT NAMEDcoord: 29..591
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 29..591

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0017390.1Tan0017390.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding