Tan0020698 (gene) Snake gourd v1

Overview
Name: Tan0020698
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG01: 2598776 .. 2600545 (-)
RNA-Seq Expression: Tan0020698
Synteny: Tan0020698
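
The Location field gives a start and an end position on LG01, so the span length can be checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, a common genome-browser convention that this page does not state; the function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: LG01: 2598776 .. 2600545 (-)
length_nt = span_length(2598776, 2600545)
codons = length_nt // 3  # whole codons, counting a terminal stop codon
print(length_nt, codons)  # prints "1770 590"
```

Under that assumption the feature spans 1770 nt, which would correspond to 590 codons (589 residues plus a stop codon) if the whole span is a single uninterrupted coding exon.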
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCAATCTAAAATGGGTTTTATTAGATTGTATCAGAGATTGTAAGAACTTAAGGATCTTTAGGCAAATTCATGCTCAGTTGGTAACATCTGGACTAGTTTACGATGGCTTTGTCACAAACAAAGTGGTGGAATTCTTTGCCAATTTTGTTGAGTTTAGTGACTATGCCTGTGATTACTTGAAACAAAACAACTCTCGCCTAGGTTCATTTCCTTGTAACTCGCTGATTAATGGGTATGTTGCCGGTGACTTGCCACTAATGGCGGTTTCAGTTTATAGAAGGATGGTGGGAGAGGGGTTTGTGCCTGATTTGTTTACTTTTCCAGTGGTTTTGAAAGCATGCTCTAACTTTTCAGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTGGGGGTTTTTGAGTGATCTTTATGTGCAAAACTCGCTGGTTCGTTGTTATGGAGCTTGTGGGGATTTTTCTAGTGCGGGTAAGCTGTTTGATGAAATGCTTGTTAGAGATGTTGTTTCGTGGAACAGTTTGATATCCGGGTTCATGAAGGCGGGGCATTTTGATGAGGCCATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTGCTGCTTGTGCAAGAAAGGGAGACTTGTGTATGGGGAAGGGATTTCATGGTATGATCGAGAGAAGGTTTAAGTTGGATTTATTGCTAGGCAATGCAATGCTTGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGATGAGCTCCCAACAAGAGATATTGTGTCATGGACTATCATGATCACTGGATTGGTGCAGAGTAACCATCCAAAAGAGTCCTTGGAACTGTTTTCAACGATGCGAACCATGGGCATTAATCCTGATGGGATTATCTTAACTAGTGTTCTCTCTGCTTGTGCTAGTCTAGGAACTCTCGACTTTGGCAGATGGGTCCATGAGTACATAGATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCAATAGTTGACATGTATGCCAAATGTGGCTGTATCGAAATGGCACTGCAAATTTTTAATAATATGGCTCAAAGAAATACCTTCACTTGGAACGTCTTGCTATGCGGTCTGGCAATGCATGGACTTGTGCGTGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAATGAGGTAACATTTCTAGCAATTTTGACAGCTTGCTGCCATTCTGGTCTGGTCAATGAAGGGCGCAAGTATTTTGATAACATGAGTAGTCAACAGTACAATTTATCGCCGAAGTTGGAGCACTACGGATGCATGGTTGATTTGTTCTGTCGAGCTGGACTCTTGGAGGAAGCTGTGGAGTTGGCAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGCTACTAAATGCTTGCAAAACTGTTGGAAATGTTGAGCTCTCTCAACACATTCAAGATTACATCTTGGAACTTGATCCAGAGGACAGTGGAATCTTCGTGCTGCTGTCCAATATATCTGCTACTAATGAAAGATGGTCCGATGTGACTCGATTAAGGAGGTTGATGAAAGATAGAGGTGTGAAAAAATCACCTGGATCAAGTGTCATTGAGGTGGATGGTAAGGCTCACGAGTTCGTGGTTGGAGATATTAGCCACTTCCAAACTGAAGAAATCTACAAGCTGTTAAACCTCATTAACTCTGTCTTCCATGAAAGTCATTTGATGCATCCATTGTAG

mRNA sequence

ATGTTCAATCTAAAATGGGTTTTATTAGATTGTATCAGAGATTGTAAGAACTTAAGGATCTTTAGGCAAATTCATGCTCAGTTGGTAACATCTGGACTAGTTTACGATGGCTTTGTCACAAACAAAGTGGTGGAATTCTTTGCCAATTTTGTTGAGTTTAGTGACTATGCCTGTGATTACTTGAAACAAAACAACTCTCGCCTAGGTTCATTTCCTTGTAACTCGCTGATTAATGGGTATGTTGCCGGTGACTTGCCACTAATGGCGGTTTCAGTTTATAGAAGGATGGTGGGAGAGGGGTTTGTGCCTGATTTGTTTACTTTTCCAGTGGTTTTGAAAGCATGCTCTAACTTTTCAGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTGGGGGTTTTTGAGTGATCTTTATGTGCAAAACTCGCTGGTTCGTTGTTATGGAGCTTGTGGGGATTTTTCTAGTGCGGGTAAGCTGTTTGATGAAATGCTTGTTAGAGATGTTGTTTCGTGGAACAGTTTGATATCCGGGTTCATGAAGGCGGGGCATTTTGATGAGGCCATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTGCTGCTTGTGCAAGAAAGGGAGACTTGTGTATGGGGAAGGGATTTCATGGTATGATCGAGAGAAGGTTTAAGTTGGATTTATTGCTAGGCAATGCAATGCTTGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGATGAGCTCCCAACAAGAGATATTGTGTCATGGACTATCATGATCACTGGATTGGTGCAGAGTAACCATCCAAAAGAGTCCTTGGAACTGTTTTCAACGATGCGAACCATGGGCATTAATCCTGATGGGATTATCTTAACTAGTGTTCTCTCTGCTTGTGCTAGTCTAGGAACTCTCGACTTTGGCAGATGGGTCCATGAGTACATAGATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCAATAGTTGACATGTATGCCAAATGTGGCTGTATCGAAATGGCACTGCAAATTTTTAATAATATGGCTCAAAGAAATACCTTCACTTGGAACGTCTTGCTATGCGGTCTGGCAATGCATGGACTTGTGCGTGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAATGAGGTAACATTTCTAGCAATTTTGACAGCTTGCTGCCATTCTGGTCTGGTCAATGAAGGGCGCAAGTATTTTGATAACATGAGTAGTCAACAGTACAATTTATCGCCGAAGTTGGAGCACTACGGATGCATGGTTGATTTGTTCTGTCGAGCTGGACTCTTGGAGGAAGCTGTGGAGTTGGCAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGCTACTAAATGCTTGCAAAACTGTTGGAAATGTTGAGCTCTCTCAACACATTCAAGATTACATCTTGGAACTTGATCCAGAGGACAGTGGAATCTTCGTGCTGCTGTCCAATATATCTGCTACTAATGAAAGATGGTCCGATGTGACTCGATTAAGGAGGTTGATGAAAGATAGAGGTGTGAAAAAATCACCTGGATCAAGTGTCATTGAGGTGGATGGTAAGGCTCACGAGTTCGTGGTTGGAGATATTAGCCACTTCCAAACTGAAGAAATCTACAAGCTGTTAAACCTCATTAACTCTGTCTTCCATGAAAGTCATTTGATGCATCCATTGTAG

Coding sequence (CDS)

ATGTTCAATCTAAAATGGGTTTTATTAGATTGTATCAGAGATTGTAAGAACTTAAGGATCTTTAGGCAAATTCATGCTCAGTTGGTAACATCTGGACTAGTTTACGATGGCTTTGTCACAAACAAAGTGGTGGAATTCTTTGCCAATTTTGTTGAGTTTAGTGACTATGCCTGTGATTACTTGAAACAAAACAACTCTCGCCTAGGTTCATTTCCTTGTAACTCGCTGATTAATGGGTATGTTGCCGGTGACTTGCCACTAATGGCGGTTTCAGTTTATAGAAGGATGGTGGGAGAGGGGTTTGTGCCTGATTTGTTTACTTTTCCAGTGGTTTTGAAAGCATGCTCTAACTTTTCAGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTGGGGGTTTTTGAGTGATCTTTATGTGCAAAACTCGCTGGTTCGTTGTTATGGAGCTTGTGGGGATTTTTCTAGTGCGGGTAAGCTGTTTGATGAAATGCTTGTTAGAGATGTTGTTTCGTGGAACAGTTTGATATCCGGGTTCATGAAGGCGGGGCATTTTGATGAGGCCATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTGCTGCTTGTGCAAGAAAGGGAGACTTGTGTATGGGGAAGGGATTTCATGGTATGATCGAGAGAAGGTTTAAGTTGGATTTATTGCTAGGCAATGCAATGCTTGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGATGAGCTCCCAACAAGAGATATTGTGTCATGGACTATCATGATCACTGGATTGGTGCAGAGTAACCATCCAAAAGAGTCCTTGGAACTGTTTTCAACGATGCGAACCATGGGCATTAATCCTGATGGGATTATCTTAACTAGTGTTCTCTCTGCTTGTGCTAGTCTAGGAACTCTCGACTTTGGCAGATGGGTCCATGAGTACATAGATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCAATAGTTGACATGTATGCCAAATGTGGCTGTATCGAAATGGCACTGCAAATTTTTAATAATATGGCTCAAAGAAATACCTTCACTTGGAACGTCTTGCTATGCGGTCTGGCAATGCATGGACTTGTGCGTGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAATGAGGTAACATTTCTAGCAATTTTGACAGCTTGCTGCCATTCTGGTCTGGTCAATGAAGGGCGCAAGTATTTTGATAACATGAGTAGTCAACAGTACAATTTATCGCCGAAGTTGGAGCACTACGGATGCATGGTTGATTTGTTCTGTCGAGCTGGACTCTTGGAGGAAGCTGTGGAGTTGGCAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGCTACTAAATGCTTGCAAAACTGTTGGAAATGTTGAGCTCTCTCAACACATTCAAGATTACATCTTGGAACTTGATCCAGAGGACAGTGGAATCTTCGTGCTGCTGTCCAATATATCTGCTACTAATGAAAGATGGTCCGATGTGACTCGATTAAGGAGGTTGATGAAAGATAGAGGTGTGAAAAAATCACCTGGATCAAGTGTCATTGAGGTGGATGGTAAGGCTCACGAGTTCGTGGTTGGAGATATTAGCCACTTCCAAACTGAAGAAATCTACAAGCTGTTAAACCTCATTAACTCTGTCTTCCATGAAAGTCATTTGATGCATCCATTGTAG

Protein sequence

MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDYLKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLGNAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHPL
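
The protein sequence is the conceptual translation of the CDS. As a minimal sketch of that relationship, the snippet below translates the first 30 nucleotides of the CDS with the standard genetic code. It assumes the standard nuclear code applies to this gene; the helper `translate` and the variable names are illustrative and are not part of the database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS shown above.
cds_prefix = "ATGTTCAATCTAAAATGGGTTTTATTAGAT"
print(translate(cds_prefix))  # prints "MFNLKWVLLD"
```

The output matches the first ten residues of the protein sequence (MFNLKWVLLD). Passing the full CDS string to the same function would be the natural way to check the rest of the record.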
Homology
BLAST of Tan0020698 vs. ExPASy Swiss-Prot
Match: Q9SZK1 (Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E45 PE=3 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 6.5e-177
Identity = 308/547 (56.31%), Positives = 384/547 (70.20%), Query Frame = 0

Query: 5   KWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDYLKQN 64
           K VLL+ I  C +LR+F+QI  QL+T  L+ D  + NKVV F     +F+ Y+   L   
Sbjct: 6   KSVLLELISRCSSLRVFKQIQTQLITRDLLRDDLIINKVVTFLGKSADFASYSSVILHSI 65

Query: 65  NSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREG 124
            S L SF  N+L++ Y   D P + +  Y+  V  GF PD+FTFP V KAC  FSG REG
Sbjct: 66  RSVLSSFSYNTLLSSYAVCDKPRVTIFAYKTFVSNGFSPDMFTFPPVFKACGKFSGIREG 125

Query: 125 RQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMKA 184
           +Q+HG+V K GF  D+YVQNSLV  YG CG+  +A K+F EM VRDVVSW  +I+GF + 
Sbjct: 126 KQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACKVFGEMPVRDVVSWTGIITGFTRT 185

Query: 185 GHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKL-DLLLGNAM 244
           G + EA+  F +MDVEP++AT V VL +  R G L +GKG HG+I +R  L  L  GNA+
Sbjct: 186 GLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSLGKGIHGLILKRASLISLETGNAL 245

Query: 245 LDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRT-MGINPD 304
           +DMYVK   L +A ++F EL  +D VSW  MI+GLV     KE+++LFS M+T  GI PD
Sbjct: 246 IDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVHCERSKEAIDLFSLMQTSSGIKPD 305

Query: 305 GIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFN 364
           G ILTSVLSACASLG +D GRWVHEYI   GIKWD HIGTAIVDMYAKCG IE AL+IFN
Sbjct: 306 GHILTSVLSACASLGAVDHGRWVHEYILTAGIKWDTHIGTAIVDMYAKCGYIETALEIFN 365

Query: 365 NMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLVNE 424
            +  +N FTWN LL GLA+HG   E+L  FE M+  G KPN VTFLA L ACCH+GLV+E
Sbjct: 366 GIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKLGFKPNLVTFLAALNACCHTGLVDE 425

Query: 425 GRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNA 484
           GR+YF  M S++YNL PKLEHYGCM+DL CRAGLL+EA+EL + MP+KPDV I G +L+A
Sbjct: 426 GRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLDEALELVKAMPVKPDVRICGAILSA 485

Query: 485 CKTVGN-VELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKKS 544
           CK  G  +EL + I D  L+++ EDSG++VLLSNI A N RW DV R+RRLMK +G+ K 
Sbjct: 486 CKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIFAANRRWDDVARIRRLMKVKGISKV 545

Query: 545 PGSSVIE 549
           PGSS IE
Sbjct: 546 PGSSYIE 552
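
The percentages in an HSP summary line follow directly from the counts next to them: matches divided by alignment length. The sketch below recomputes the figures reported for the Q9SZK1 hit; the function name `percent` is hypothetical, and the alignment length of 547 is taken as printed in the summary line.

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned, 2)


# Counts copied from the HSP summary line for Q9SZK1.
identity = percent(308, 547)   # identical residues
positives = percent(384, 547)  # identical or conservatively substituted residues
print(f"{identity:.2f} {positives:.2f}")  # prints "56.31 70.20"
```

The same calculation applies to every hit listed on this page, since each summary line reports its own counts and alignment length.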

BLAST of Tan0020698 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 7.6e-125
Identity = 233/604 (38.58%), Positives = 359/604 (59.44%), Query Frame = 0

Query: 9   LDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEF--FANFVEFSDYACDYLKQNNS 68
           L  + +CK L+  R IHAQ++  GL    +  +K++EF   +   E   YA    K    
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFK-TIQ 96

Query: 69  RLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREGRQ 128
                  N++  G+     P+ A+ +Y  M+  G +P+ +TFP VLK+C+     +EG+Q
Sbjct: 97  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 156

Query: 129 VHGVVVKWGFLSDLYVQNS-------------------------------LVRCYGACGD 188
           +HG V+K G   DLYV  S                               L++ Y + G 
Sbjct: 157 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 216

Query: 189 FSSAGKLFDEMLVRDVVSWNSLISGFMKAGHFDEAISLF---FRMDVEPSIATLVSVLAA 248
             +A KLFDE+ V+DVVSWN++ISG+ + G++ EA+ LF    + +V P  +T+V+V++A
Sbjct: 217 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 276

Query: 249 CARKGDLCMGKGFHGMI-ERRFKLDLLLGNAMLDMYVKNGCLYEAKKIFDELPTRDIVSW 308
           CA+ G + +G+  H  I +  F  +L + NA++D+Y K G L  A  +F+ LP +D++SW
Sbjct: 277 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 336

Query: 309 TIMITGLVQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEYIDQ 368
             +I G    N  KE+L LF  M   G  P+ + + S+L ACA LG +D GRW+H YID+
Sbjct: 337 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 396

Query: 369 R--GIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREAL 428
           R  G+     + T+++DMYAKCG IE A Q+FN++  ++  +WN ++ G AMHG    + 
Sbjct: 397 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 456

Query: 429 NLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMVD 488
           +LF  M   G++P+++TF+ +L+AC HSG+++ GR  F  M +Q Y ++PKLEHYGCM+D
Sbjct: 457 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM-TQDYKMTPKLEHYGCMID 516

Query: 489 LFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSGI 548
           L   +GL +EA E+   M M+PD +IW  LL ACK  GNVEL +   + +++++PE+ G 
Sbjct: 517 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 576

Query: 549 FVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQTEEI 574
           +VLLSNI A+  RW++V + R L+ D+G+KK PG S IE+D   HEF++GD  H +  EI
Sbjct: 577 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 636

BLAST of Tan0020698 vs. ExPASy Swiss-Prot
Match: Q9SJZ3 (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 2.4e-123
Identity = 235/608 (38.65%), Positives = 356/608 (58.55%), Query Frame = 0

Query: 8   LLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFA-NFVEFSDYACDYLKQNNS 67
           LL  +  CK L   +QI AQ++ +GL+ D F +++++ F A +   + DY+   LK    
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILK-GIE 115

Query: 68  RLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFV---PDLFTFPVVLKACSNFSGSRE 127
               F  N  I G+   + P  +  +Y++M+  G     PD FT+PV+ K C++   S  
Sbjct: 116 NPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSL 175

Query: 128 GRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMK 187
           G  + G V+K       +V N+ +  + +CGD  +A K+FDE  VRD+VSWN LI+G+ K
Sbjct: 176 GHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKK 235

Query: 188 AGHFDEAISLFFRMD---VEPSIATLVSVLAACARKGDLCMGKGFHGMI-ERRFKLDLLL 247
            G  ++AI ++  M+   V+P   T++ ++++C+  GDL  GK F+  + E   ++ + L
Sbjct: 236 IGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPL 295

Query: 248 GNAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGL--------------------- 307
            NA++DM+ K G ++EA++IFD L  R IVSWT MI+G                      
Sbjct: 296 VNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDV 355

Query: 308 ----------VQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEY 367
                     VQ+   +++L LF  M+T    PD I +   LSAC+ LG LD G W+H Y
Sbjct: 356 VLWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 415

Query: 368 IDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREA 427
           I++  +  ++ +GT++VDMYAKCG I  AL +F+ +  RN+ T+  ++ GLA+HG    A
Sbjct: 416 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 475

Query: 428 LNLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMV 487
           ++ F  MI +G+ P+E+TF+ +L+ACCH G++  GR YF  M S ++NL+P+L+HY  MV
Sbjct: 476 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKS-RFNLNPQLKHYSIMV 535

Query: 488 DLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSG 547
           DL  RAGLLEEA  L  +MPM+ D  +WG LL  C+  GNVEL +     +LELDP DSG
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 548 IFVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQTEE 577
           I+VLL  +      W D  R RR+M +RGV+K PG S IEV+G   EF+V D S  ++E+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Tan0020698 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 427.9 bits (1099), Expect = 1.8e-118
Identity = 228/608 (37.50%), Positives = 355/608 (58.39%), Query Frame = 0

Query: 9   LDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKV-----VEFFANFVEFSDYACDYLKQ 68
           +  I  C +LR  +Q H  ++ +G   D +  +K+     +  FA+ +E++    D + +
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFAS-LEYARKVFDEIPK 93

Query: 69  NNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEG-FVPDLFTFPVVLKACSNFSGSR 128
            N    SF  N+LI  Y +G  P++++  +  MV E    P+ +TFP ++KA +  S   
Sbjct: 94  PN----SFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLS 153

Query: 129 EGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFM 188
            G+ +HG+ VK    SD++V NSL+ CY +CGD  SA K+F  +  +DVVSWNS+I+GF+
Sbjct: 154 LGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFV 213

Query: 189 KAGHFDEAISLFFRM---DVEPSIATLVSVLAACARKGDLCMGKGFHGMI-ERRFKLDLL 248
           + G  D+A+ LF +M   DV+ S  T+V VL+ACA+  +L  G+     I E R  ++L 
Sbjct: 214 QKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 273

Query: 249 LGNAMLDMYVKNGCLYEAKKIFD-------------------------------ELPTRD 308
           L NAMLDMY K G + +AK++FD                                +P +D
Sbjct: 274 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 333

Query: 309 IVSWTIMITGLVQSNHPKESLELFSTMRTM-GINPDGIILTSVLSACASLGTLDFGRWVH 368
           IV+W  +I+   Q+  P E+L +F  ++    +  + I L S LSACA +G L+ GRW+H
Sbjct: 334 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 393

Query: 369 EYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVR 428
            YI + GI+ + H+ +A++ MY+KCG +E + ++FN++ +R+ F W+ ++ GLAMHG   
Sbjct: 394 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 453

Query: 429 EALNLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGC 488
           EA+++F  M  + VKPN VTF  +  AC H+GLV+E    F  M S  Y + P+ +HY C
Sbjct: 454 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMES-NYGIVPEEKHYAC 513

Query: 489 MVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPED 548
           +VD+  R+G LE+AV+    MP+ P   +WG LL ACK   N+ L++     +LEL+P +
Sbjct: 514 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 573

Query: 549 SGIFVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQT 575
            G  VLLSNI A   +W +V+ LR+ M+  G+KK PG S IE+DG  HEF+ GD +H  +
Sbjct: 574 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 633

BLAST of Tan0020698 vs. ExPASy Swiss-Prot
Match: Q9C866 (Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E55 PE=2 SV=1)

HSP 1 Score: 420.6 bits (1080), Expect = 2.9e-116
Identity = 217/541 (40.11%), Positives = 326/541 (60.26%), Query Frame = 0

Query: 74  NSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREGRQVHGVVVK 133
           N ++     G      ++++  + G+G  PD FT PVVLK+        EG +VHG  VK
Sbjct: 15  NKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAVK 74

Query: 134 WGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMKAGHFDEAISL 193
            G   D YV NSL+  Y + G      K+FDEM  RDVVSWN LIS ++  G F++AI +
Sbjct: 75  AGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIGV 134

Query: 194 FFRMDVEPSI----ATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLGNAMLDMYVK 253
           F RM  E ++     T+VS L+AC+   +L +G+  +  +   F++ + +GNA++DM+ K
Sbjct: 135 FKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFCK 194

Query: 254 NGCLYEAKKIFDEL-------------------------------PTRDIVSWTIMITGL 313
            GCL +A+ +FD +                               P +D+V WT M+ G 
Sbjct: 195 CGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVVLWTAMMNGY 254

Query: 314 VQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDI 373
           VQ N   E+LELF  M+T GI PD  +L S+L+ CA  G L+ G+W+H YI++  +  D 
Sbjct: 255 VQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVDK 314

Query: 374 HIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIIS 433
            +GTA+VDMYAKCGCIE AL++F  + +R+T +W  L+ GLAM+G+   AL+L+  M   
Sbjct: 315 VVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMENV 374

Query: 434 GVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLE 493
           GV+ + +TF+A+LTAC H G V EGRK F +M ++++N+ PK EH  C++DL CRAGLL+
Sbjct: 375 GVRLDAITFVAVLTACNHGGFVAEGRKIFHSM-TERHNVQPKSEHCSCLIDLLCRAGLLD 434

Query: 494 EAVELARTMPMKPD---VLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSGIFVLLSN 553
           EA EL   M  + D   V ++  LL+A +  GNV++++ + + + +++  DS    LL++
Sbjct: 435 EAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSAHTLLAS 494

Query: 554 ISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGD--ISHFQTEEIYKLL 575
           + A+  RW DVT +RR MKD G++K PG S IE+DG  HEF+VGD  +SH + +EI  +L
Sbjct: 495 VYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMDEINSML 554

BLAST of Tan0020698 vs. NCBI nr
Match: XP_022968019.1 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1082.4 bits (2798), Expect = 0.0e+00
Identity = 523/588 (88.95%), Positives = 551/588 (93.71%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLVTSGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVTSGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N+RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNTRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVV+K G LSDL+VQNSLVRCYGAC DFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVIKLGILSDLFVQNSLVRCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKGDL MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGDLYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL +G WVHEYI+QRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLKYGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FN+M QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVK NEVTFLAILTACCHSGLV
Sbjct: 361 FNSMPQRNTFTWNALLCGLAMHGLAHEALYLFEVMIISGVKTNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ+YNLSPKLEHYGCM+DLFCRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQRYNLSPKLEHYGCMIDLFCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGNVELSQHIQ+YILELDPEDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNVELSQHIQEYILELDPEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHP 589
           SPGSSVIEVDGKAHEFV GDIS+ QTEEIYK+L LINSVFHESHLMHP
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLQTEEIYKVLTLINSVFHESHLMHP 588

BLAST of Tan0020698 vs. NCBI nr
Match: KAG7013201.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1073.9 bits (2776), Expect = 4.5e-310
Identity = 520/589 (88.29%), Positives = 546/589 (92.70%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLV SGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVASGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNMRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
            REGRQVHGVVVK G LSDL+VQNSLVRCYGAC DFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 RREGRQVHGVVVKLGILSDLFVQNSLVRCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKG+L MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGELYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL++G WVHEYIDQRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FNNM QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVKPNEVTFLAILTACCHSGLV
Sbjct: 361 FNNMPQRNTFTWNALLCGLAMHGLAHEALYLFEVMIISGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ YNLSPKLEHYGCM+DL CRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQTYNLSPKLEHYGCMIDLLCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGNVELSQHIQ+YILELD EDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNVELSQHIQEYILELDTEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHPL 590
           SPGSSVIEVDGKAHEFV GDIS+ + EEIYK+L LINSV HESHLMHPL
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLEIEEIYKVLTLINSVLHESHLMHPL 589

BLAST of Tan0020698 vs. NCBI nr
Match: KAG6574147.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1072.4 bits (2772), Expect = 1.4e-309
Identity = 519/589 (88.12%), Positives = 546/589 (92.70%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLV SGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVASGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNMRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
            REGRQVHGVVVK G LSDL+VQNSLVRCYGAC DFS AG++FDEMLVRDVVSWNSLISG
Sbjct: 121 RREGRQVHGVVVKLGILSDLFVQNSLVRCYGACEDFSCAGEVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKG+L MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGELYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL++G WVHEYIDQRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FNNM QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVKPNEVTFLAILTACCHSGLV
Sbjct: 361 FNNMPQRNTFTWNALLCGLAMHGLAHEALYLFEVMIISGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ YNLSPKLEHYGCM+DL CRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQTYNLSPKLEHYGCMIDLLCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGNVELSQHIQ+YILELD EDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNVELSQHIQEYILELDTEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHPL 590
           SPGSSVIEVDGKAHEFV GDIS+ + EEIYK+L LINSV HESHLMHPL
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLEIEEIYKVLTLINSVLHESHLMHPL 589

BLAST of Tan0020698 vs. NCBI nr
Match: XP_022945046.1 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1071.6 bits (2770), Expect = 2.3e-309
Identity = 519/589 (88.12%), Positives = 546/589 (92.70%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLV SGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVASGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNIRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVK G LSDL+VQNSLV CYGAC DFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGILSDLFVQNSLVCCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKG+L MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGELYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL++G WVHEYIDQRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FNNM QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVKPNEVTFLAILTACCHSGLV
Sbjct: 361 FNNMPQRNTFTWNALLCGLAMHGLAHEALFLFEVMIISGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ YNLSPKLEHYGCM+DL CRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQTYNLSPKLEHYGCMIDLLCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGN+ELSQHIQ+YILELD EDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNIELSQHIQEYILELDTEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHPL 590
           SPGSSVIEVDGKAHEFV GDIS+ + EEIYK+L LINSV HESHLMHPL
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLEIEEIYKVLTLINSVLHESHLMHPL 589

BLAST of Tan0020698 vs. NCBI nr
Match: XP_038889932.1 (pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889934.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889935.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889936.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889937.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889938.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889939.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889940.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_038889941.1 pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida])

HSP 1 Score: 1067.0 bits (2758), Expect = 5.8e-308
Identity = 515/584 (88.18%), Positives = 551/584 (94.35%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLL  + DCKNLRIFRQIHAQLVTSGLV D FVT+KV+EFFANFVE+ DYACDY
Sbjct: 1   MFNLKWVLLHSVEDCKNLRIFRQIHAQLVTSGLVCDDFVTSKVIEFFANFVEYGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ ++RLGSFP NSLINGYV G+LP +AVSVYRRMV +GFVPD+FTFPV+LKACSNFSG
Sbjct: 61  LKQGSTRLGSFPFNSLINGYV-GELPQLAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQ+HGVVVK G L+D YVQNSL+ CYGACGDFS AG++FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQIHGVVVKLGLLADHYVQNSLICCYGACGDFSCAGRVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMK GHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKG HG+IERRFKL+L+LG
Sbjct: 181 FMKVGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGIHGVIERRFKLNLILG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQS+HPKESLELFS MRT+GIN
Sbjct: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIN 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSAC+SLGTLDFG WVHEYI+QR IKWD+HIGTAIVDMYAKCGCIEMALQI
Sbjct: 301 PDAIILTSVLSACSSLGTLDFGTWVHEYINQRRIKWDVHIGTAIVDMYAKCGCIEMALQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           F NM ++NTFTWN LLCGLAMHGLV EALNLFEVMI SGVKPNEVTFLAILTACCHSGLV
Sbjct: 361 FYNMPKKNTFTWNALLCGLAMHGLVYEALNLFEVMIKSGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           NEGRKYF+NMSSQ YNL PKLEHYGCM+DLFCRAGLLEEAVELARTMPMKPDVLIWGVLL
Sbjct: 421 NEGRKYFNNMSSQLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGNV+LS HI+DYILELDPEDSG+FVLLSNISATNERWSDV RLRRLMKDRGV+K
Sbjct: 481 NACKTVGNVDLSHHIRDYILELDPEDSGVFVLLSNISATNERWSDVIRLRRLMKDRGVEK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESH 585
           +PGSSVIEVDGKAHEFV GDISH QT+EIYKLL+LINSV+HESH
Sbjct: 541 APGSSVIEVDGKAHEFVAGDISHLQTKEIYKLLSLINSVYHESH 583

BLAST of Tan0020698 vs. ExPASy TrEMBL
Match: A0A6J1HYD9 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467390 PE=4 SV=1)

HSP 1 Score: 1082.4 bits (2798), Expect = 0.0e+00
Identity = 523/588 (88.95%), Positives = 551/588 (93.71%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLVTSGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVTSGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N+RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNTRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVV+K G LSDL+VQNSLVRCYGAC DFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVIKLGILSDLFVQNSLVRCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKGDL MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGDLYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL +G WVHEYI+QRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLKYGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FN+M QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVK NEVTFLAILTACCHSGLV
Sbjct: 361 FNSMPQRNTFTWNALLCGLAMHGLAHEALYLFEVMIISGVKTNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ+YNLSPKLEHYGCM+DLFCRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQRYNLSPKLEHYGCMIDLFCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGNVELSQHIQ+YILELDPEDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNVELSQHIQEYILELDPEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHP 589
           SPGSSVIEVDGKAHEFV GDIS+ QTEEIYK+L LINSVFHESHLMHP
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLQTEEIYKVLTLINSVFHESHLMHP 588
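
The HSP summary lines above each alignment can be checked mechanically. The following is a minimal Python sketch, not part of the database output, that parses one summary block and recomputes the percentages from the raw counts. The function names parse_hsp and recomputed_pct are hypothetical helpers introduced here, and the regular expressions assume the exact line layout printed in this record.

```python
import re

# Assumed layout: "HSP 1 Score: 1082.4 bits (2798), Expect = 0.0e+00"
SCORE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
MATCH = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(text):
    """Parse one HSP summary block into a dict (hypothetical helper)."""
    score = SCORE.search(text)
    match = MATCH.search(text)
    if score is None or match is None:
        raise ValueError("not an HSP summary block")
    ident, aln_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        # (matching residues, alignment length, reported percentage)
        "identity": (int(ident), int(aln_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def recomputed_pct(count, length):
    """Percentage recomputed from raw counts, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Summary block of the A0A6J1HYD9 (Cucurbita maxima) hit, copied from this record.
block = """HSP 1 Score: 1082.4 bits (2798), Expect = 0.0e+00
Identity = 523/588 (88.95%), Positives = 551/588 (93.71%), Query Frame = 0"""

hsp = parse_hsp(block)
count, length, reported = hsp["identity"]
print(reported, recomputed_pct(count, length))
```

For this hit, 523 identical positions over an alignment length of 588 recompute to 88.95%, which agrees with the reported value.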

BLAST of Tan0020698 vs. ExPASy TrEMBL
Match: A0A6J1FZR9 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449406 PE=4 SV=1)

HSP 1 Score: 1071.6 bits (2770), Expect = 1.1e-309
Identity = 519/589 (88.12%), Positives = 546/589 (92.70%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIF++IHAQLV SGLVYD FVTNKVVEFFANFVEF DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVASGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           LKQ N RLGSFP NSLINGY  G+ P MAVSVYRRM  +GFVPDLFTFPV+ KACSNFSG
Sbjct: 61  LKQVNIRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVK G LSDL+VQNSLV CYGAC DFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGILSDLFVQNSLVCCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVLAACARKG+L MGKG HGMI+RRFKLDL+LG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGELYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQSNHPKESLELF  MR +GI+
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDGIILTSVLSACASLGTL++G WVHEYIDQRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           FNNM QRNTFTWN LLCGLAMHGL  EAL LFEVMIISGVKPNEVTFLAILTACCHSGLV
Sbjct: 361 FNNMPQRNTFTWNALLCGLAMHGLAHEALFLFEVMIISGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNMSSQ YNLSPKLEHYGCM+DL CRAGLLEEAVEL RTMPMKPDVLIWGVLL
Sbjct: 421 DEGRKYFDNMSSQTYNLSPKLEHYGCMIDLLCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NACKTVGN+ELSQHIQ+YILELD EDSG+FVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNIELSQHIQEYILELDTEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESHLMHPL 590
           SPGSSVIEVDGKAHEFV GDIS+ + EEIYK+L LINSV HESHLMHPL
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLEIEEIYKVLTLINSVLHESHLMHPL 589

BLAST of Tan0020698 vs. ExPASy TrEMBL
Match: A0A0A0LF19 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G004700 PE=4 SV=1)

HSP 1 Score: 1058.5 bits (2736), Expect = 1.0e-305
Identity = 508/584 (86.99%), Positives = 549/584 (94.01%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIFRQIHAQLVTSGLVYD FVT+KV+EFFANFVE+ DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           L+Q N+RLGSFP NSLINGYV G+ P MAVSVYRRMV +GFVPD+FTFPV+LKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQMAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVK G L+D YVQNSL+RCYGACGDFS AGK+FDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAGHFDEAIS+FFRMDVEPS+ TLVSVLAACAR GDLC GKG HG+IERRFK++L+LG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSMTTLVSVLAACARNGDLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMYVKNGC YEAK IFDELPTRDIVSWTIMITGLVQS+HPK+SLELFS MRT+GI+
Sbjct: 241 NAMLDMYVKNGCFYEAKNIFDELPTRDIVSWTIMITGLVQSDHPKQSLELFSMMRTLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSACASLGTLDFG WVHEYI+QRGIKWDIHIGTAIVDMYAKCGCIEMAL+I
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALKI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           F +M+QRNTFTWN LLCGLAMHGLV EALNLFEVMIISGVKPNE+TFLAILTACCH GLV
Sbjct: 361 FYSMSQRNTFTWNALLCGLAMHGLVHEALNLFEVMIISGVKPNEITFLAILTACCHCGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNM S+ YNL PKLEHYGCM+DLFCRAGLLEEAVELARTMPMKPDVLIWG+LL
Sbjct: 421 DEGRKYFDNM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDVLIWGLLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGN+ELS  IQDYILELD +DSG+FVLLSNISA N+RWS+VTRLRRLMKDRGV+K
Sbjct: 481 NACTTVGNIELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVRK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESH 585
           +PGSSVIEVDGKAHEFVVGDISH QTEEIYK+LNLINSV+HESH
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Tan0020698 vs. ExPASy TrEMBL
Match: A0A5D3CGR6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold332G00240 PE=4 SV=1)

HSP 1 Score: 1055.0 bits (2727), Expect = 1.1e-304
Identity = 508/584 (86.99%), Positives = 548/584 (93.84%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIFRQIHAQLVTSGLVYD FVT+KV+EFFANFVE+ DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           L+Q N+RLGSFP NSLINGYV G+ P  AVSVYRRMV +GFVPD+FTFPV+LKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQTAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVK G L+DLYVQNSL+RCYGACGD S AGK+FDEM+VRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADLYVQNSLIRCYGACGDLSCAGKVFDEMVVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAGHFDEAIS+FFRMDVEPSIATLVSVLAACAR G+LC GKG HG+IERRFK++L+LG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSIATLVSVLAACARNGNLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMYVKNGC YEAKK+FDELPTRDIVSWTIMITGLVQS+HPKESLELFS MRT+GI+
Sbjct: 241 NAMLDMYVKNGCFYEAKKMFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSACASLGTLDFG WVHEYI+QRGIKWDIH GTAIVDMYAKCGCIEMALQI
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHTGTAIVDMYAKCGCIEMALQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           F +M QRNTFTWN LLCGLAMHGLV EAL+LFEVM ISGV+PNE+TFLAILTACCHSGLV
Sbjct: 361 FYSMPQRNTFTWNALLCGLAMHGLVHEALDLFEVMTISGVEPNEITFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYFDNM S+ YNL PKLEHYGCM+DLFCRAGLLEEAVELARTMPMKPD+LIWGVLL
Sbjct: 421 DEGRKYFDNM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDMLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGNVELS  IQDYILELD +DSG+FVLLSNISA N+RWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACTTVGNVELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESH 585
           +PGSSVIEVDGKAHEFVVGDISH QTEEIYK+LNLINSV+HESH
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Tan0020698 vs. ExPASy TrEMBL
Match: A0A1S3C7P6 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497763 PE=4 SV=1)

HSP 1 Score: 1053.5 bits (2723), Expect = 3.2e-304
Identity = 507/584 (86.82%), Positives = 548/584 (93.84%), Query Frame = 0

Query: 1   MFNLKWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDY 60
           MFNLKWVLLD I+DCKNLRIFRQIHAQLVTSGLVYD FVT+KV+EFFANFVE+ DYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQNNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSG 120
           L+Q N+RLGSFP NSLINGYV G+ P  AVSVYRRMV +GFVPD+FTFPV+LKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQTAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVK G L+DLYVQNSL+RCYGACGD S AGK+FDEM+VRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADLYVQNSLIRCYGACGDLSCAGKVFDEMVVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLG 240
           FMKAGHFDEAIS+FFRMDVEPSIATLVSVLAACAR G+LC GKG HG+IERRFK++L+LG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSIATLVSVLAACARNGNLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRTMGIN 300
           NAMLDMYVKNGC YEAKK+FDELPTRDIVSWTIMITGLVQS+HPKESLELFS MRT+GI+
Sbjct: 241 NAMLDMYVKNGCFYEAKKMFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300

Query: 301 PDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSACASLGTLDFG WVHEYI+QRGIKWDIH GTAIVDMYAKCGCIEMALQI
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHTGTAIVDMYAKCGCIEMALQI 360

Query: 361 FNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLV 420
           F +M QRNTFTWN LLCGLAMHGLV EAL+LFEVM ISGV+PNE+TFLAILTACCHSGLV
Sbjct: 361 FYSMPQRNTFTWNALLCGLAMHGLVHEALDLFEVMTISGVEPNEITFLAILTACCHSGLV 420

Query: 421 NEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLL 480
           +EGRKYF+NM S+ YNL PKLEHYGCM+DLFCRAGLLEEAVELARTMPMKPD+LIWGVLL
Sbjct: 421 DEGRKYFENM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDMLIWGVLL 480

Query: 481 NACKTVGNVELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGNVELS  IQDYILELD +DSG+FVLLSNISA N+RWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACTTVGNVELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVKK 540

Query: 541 SPGSSVIEVDGKAHEFVVGDISHFQTEEIYKLLNLINSVFHESH 585
           +PGSSVIEVDGKAHEFVVGDISH QTEEIYK+LNLINSV+HESH
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Tan0020698 vs. TAIR 10
Match: AT4G38010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 622.1 bits (1603), Expect = 4.6e-178
Identity = 308/547 (56.31%), Positives = 384/547 (70.20%), Query Frame = 0

Query: 5   KWVLLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFANFVEFSDYACDYLKQN 64
           K VLL+ I  C +LR+F+QI  QL+T  L+ D  + NKVV F     +F+ Y+   L   
Sbjct: 6   KSVLLELISRCSSLRVFKQIQTQLITRDLLRDDLIINKVVTFLGKSADFASYSSVILHSI 65

Query: 65  NSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREG 124
            S L SF  N+L++ Y   D P + +  Y+  V  GF PD+FTFP V KAC  FSG REG
Sbjct: 66  RSVLSSFSYNTLLSSYAVCDKPRVTIFAYKTFVSNGFSPDMFTFPPVFKACGKFSGIREG 125

Query: 125 RQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMKA 184
           +Q+HG+V K GF  D+YVQNSLV  YG CG+  +A K+F EM VRDVVSW  +I+GF + 
Sbjct: 126 KQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACKVFGEMPVRDVVSWTGIITGFTRT 185

Query: 185 GHFDEAISLFFRMDVEPSIATLVSVLAACARKGDLCMGKGFHGMIERRFKL-DLLLGNAM 244
           G + EA+  F +MDVEP++AT V VL +  R G L +GKG HG+I +R  L  L  GNA+
Sbjct: 186 GLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSLGKGIHGLILKRASLISLETGNAL 245

Query: 245 LDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSNHPKESLELFSTMRT-MGINPD 304
           +DMYVK   L +A ++F EL  +D VSW  MI+GLV     KE+++LFS M+T  GI PD
Sbjct: 246 IDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVHCERSKEAIDLFSLMQTSSGIKPD 305

Query: 305 GIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFN 364
           G ILTSVLSACASLG +D GRWVHEYI   GIKWD HIGTAIVDMYAKCG IE AL+IFN
Sbjct: 306 GHILTSVLSACASLGAVDHGRWVHEYILTAGIKWDTHIGTAIVDMYAKCGYIETALEIFN 365

Query: 365 NMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIISGVKPNEVTFLAILTACCHSGLVNE 424
            +  +N FTWN LL GLA+HG   E+L  FE M+  G KPN VTFLA L ACCH+GLV+E
Sbjct: 366 GIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKLGFKPNLVTFLAALNACCHTGLVDE 425

Query: 425 GRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNA 484
           GR+YF  M S++YNL PKLEHYGCM+DL CRAGLL+EA+EL + MP+KPDV I G +L+A
Sbjct: 426 GRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLDEALELVKAMPVKPDVRICGAILSA 485

Query: 485 CKTVGN-VELSQHIQDYILELDPEDSGIFVLLSNISATNERWSDVTRLRRLMKDRGVKKS 544
           CK  G  +EL + I D  L+++ EDSG++VLLSNI A N RW DV R+RRLMK +G+ K 
Sbjct: 486 CKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIFAANRRWDDVARIRRLMKVKGISKV 545

Query: 545 PGSSVIE 549
           PGSS IE
Sbjct: 546 PGSSYIE 552

BLAST of Tan0020698 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 449.1 bits (1154), Expect = 5.4e-126
Identity = 233/604 (38.58%), Positives = 359/604 (59.44%), Query Frame = 0

Query: 9   LDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEF--FANFVEFSDYACDYLKQNNS 68
           L  + +CK L+  R IHAQ++  GL    +  +K++EF   +   E   YA    K    
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFK-TIQ 96

Query: 69  RLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREGRQ 128
                  N++  G+     P+ A+ +Y  M+  G +P+ +TFP VLK+C+     +EG+Q
Sbjct: 97  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 156

Query: 129 VHGVVVKWGFLSDLYVQNS-------------------------------LVRCYGACGD 188
           +HG V+K G   DLYV  S                               L++ Y + G 
Sbjct: 157 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 216

Query: 189 FSSAGKLFDEMLVRDVVSWNSLISGFMKAGHFDEAISLF---FRMDVEPSIATLVSVLAA 248
             +A KLFDE+ V+DVVSWN++ISG+ + G++ EA+ LF    + +V P  +T+V+V++A
Sbjct: 217 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 276

Query: 249 CARKGDLCMGKGFHGMI-ERRFKLDLLLGNAMLDMYVKNGCLYEAKKIFDELPTRDIVSW 308
           CA+ G + +G+  H  I +  F  +L + NA++D+Y K G L  A  +F+ LP +D++SW
Sbjct: 277 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 336

Query: 309 TIMITGLVQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEYIDQ 368
             +I G    N  KE+L LF  M   G  P+ + + S+L ACA LG +D GRW+H YID+
Sbjct: 337 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 396

Query: 369 R--GIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREAL 428
           R  G+     + T+++DMYAKCG IE A Q+FN++  ++  +WN ++ G AMHG    + 
Sbjct: 397 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 456

Query: 429 NLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMVD 488
           +LF  M   G++P+++TF+ +L+AC HSG+++ GR  F  M +Q Y ++PKLEHYGCM+D
Sbjct: 457 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM-TQDYKMTPKLEHYGCMID 516

Query: 489 LFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSGI 548
           L   +GL +EA E+   M M+PD +IW  LL ACK  GNVEL +   + +++++PE+ G 
Sbjct: 517 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 576

Query: 549 FVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQTEEI 574
           +VLLSNI A+  RW++V + R L+ D+G+KK PG S IE+D   HEF++GD  H +  EI
Sbjct: 577 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 636

BLAST of Tan0020698 vs. TAIR 10
Match: AT2G22410.1 (SLOW GROWTH 1 )

HSP 1 Score: 444.1 bits (1141), Expect = 1.7e-124
Identity = 235/608 (38.65%), Positives = 356/608 (58.55%), Query Frame = 0

Query: 8   LLDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKVVEFFA-NFVEFSDYACDYLKQNNS 67
           LL  +  CK L   +QI AQ++ +GL+ D F +++++ F A +   + DY+   LK    
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILK-GIE 115

Query: 68  RLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEGFV---PDLFTFPVVLKACSNFSGSRE 127
               F  N  I G+   + P  +  +Y++M+  G     PD FT+PV+ K C++   S  
Sbjct: 116 NPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSL 175

Query: 128 GRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMK 187
           G  + G V+K       +V N+ +  + +CGD  +A K+FDE  VRD+VSWN LI+G+ K
Sbjct: 176 GHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKK 235

Query: 188 AGHFDEAISLFFRMD---VEPSIATLVSVLAACARKGDLCMGKGFHGMI-ERRFKLDLLL 247
            G  ++AI ++  M+   V+P   T++ ++++C+  GDL  GK F+  + E   ++ + L
Sbjct: 236 IGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPL 295

Query: 248 GNAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGL--------------------- 307
            NA++DM+ K G ++EA++IFD L  R IVSWT MI+G                      
Sbjct: 296 VNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDV 355

Query: 308 ----------VQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEY 367
                     VQ+   +++L LF  M+T    PD I +   LSAC+ LG LD G W+H Y
Sbjct: 356 VLWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 415

Query: 368 IDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREA 427
           I++  +  ++ +GT++VDMYAKCG I  AL +F+ +  RN+ T+  ++ GLA+HG    A
Sbjct: 416 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 475

Query: 428 LNLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMV 487
           ++ F  MI +G+ P+E+TF+ +L+ACCH G++  GR YF  M S ++NL+P+L+HY  MV
Sbjct: 476 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKS-RFNLNPQLKHYSIMV 535

Query: 488 DLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSG 547
           DL  RAGLLEEA  L  +MPM+ D  +WG LL  C+  GNVEL +     +LELDP DSG
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 548 IFVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQTEE 577
           I+VLL  +      W D  R RR+M +RGV+K PG S IEV+G   EF+V D S  ++E+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Tan0020698 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 427.9 bits (1099), Expect = 1.3e-119
Identity = 228/608 (37.50%), Positives = 355/608 (58.39%), Query Frame = 0

Query: 9   LDCIRDCKNLRIFRQIHAQLVTSGLVYDGFVTNKV-----VEFFANFVEFSDYACDYLKQ 68
           +  I  C +LR  +Q H  ++ +G   D +  +K+     +  FA+ +E++    D + +
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFAS-LEYARKVFDEIPK 93

Query: 69  NNSRLGSFPCNSLINGYVAGDLPLMAVSVYRRMVGEG-FVPDLFTFPVVLKACSNFSGSR 128
            N    SF  N+LI  Y +G  P++++  +  MV E    P+ +TFP ++KA +  S   
Sbjct: 94  PN----SFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLS 153

Query: 129 EGRQVHGVVVKWGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFM 188
            G+ +HG+ VK    SD++V NSL+ CY +CGD  SA K+F  +  +DVVSWNS+I+GF+
Sbjct: 154 LGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFV 213

Query: 189 KAGHFDEAISLFFRM---DVEPSIATLVSVLAACARKGDLCMGKGFHGMI-ERRFKLDLL 248
           + G  D+A+ LF +M   DV+ S  T+V VL+ACA+  +L  G+     I E R  ++L 
Sbjct: 214 QKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 273

Query: 249 LGNAMLDMYVKNGCLYEAKKIFD-------------------------------ELPTRD 308
           L NAMLDMY K G + +AK++FD                                +P +D
Sbjct: 274 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 333

Query: 309 IVSWTIMITGLVQSNHPKESLELFSTMRTM-GINPDGIILTSVLSACASLGTLDFGRWVH 368
           IV+W  +I+   Q+  P E+L +F  ++    +  + I L S LSACA +G L+ GRW+H
Sbjct: 334 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 393

Query: 369 EYIDQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVR 428
            YI + GI+ + H+ +A++ MY+KCG +E + ++FN++ +R+ F W+ ++ GLAMHG   
Sbjct: 394 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 453

Query: 429 EALNLFEVMIISGVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGC 488
           EA+++F  M  + VKPN VTF  +  AC H+GLV+E    F  M S  Y + P+ +HY C
Sbjct: 454 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMES-NYGIVPEEKHYAC 513

Query: 489 MVDLFCRAGLLEEAVELARTMPMKPDVLIWGVLLNACKTVGNVELSQHIQDYILELDPED 548
           +VD+  R+G LE+AV+    MP+ P   +WG LL ACK   N+ L++     +LEL+P +
Sbjct: 514 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 573

Query: 549 SGIFVLLSNISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGDISHFQT 575
            G  VLLSNI A   +W +V+ LR+ M+  G+KK PG S IE+DG  HEF+ GD +H  +
Sbjct: 574 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 633

BLAST of Tan0020698 vs. TAIR 10
Match: AT1G31430.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 420.6 bits (1080), Expect = 2.0e-117
Identity = 217/541 (40.11%), Positives = 326/541 (60.26%), Query Frame = 0

Query: 74  NSLINGYVAGDLPLMAVSVYRRMVGEGFVPDLFTFPVVLKACSNFSGSREGRQVHGVVVK 133
           N ++     G      ++++  + G+G  PD FT PVVLK+        EG +VHG  VK
Sbjct: 15  NKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAVK 74

Query: 134 WGFLSDLYVQNSLVRCYGACGDFSSAGKLFDEMLVRDVVSWNSLISGFMKAGHFDEAISL 193
            G   D YV NSL+  Y + G      K+FDEM  RDVVSWN LIS ++  G F++AI +
Sbjct: 75  AGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIGV 134

Query: 194 FFRMDVEPSI----ATLVSVLAACARKGDLCMGKGFHGMIERRFKLDLLLGNAMLDMYVK 253
           F RM  E ++     T+VS L+AC+   +L +G+  +  +   F++ + +GNA++DM+ K
Sbjct: 135 FKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFCK 194

Query: 254 NGCLYEAKKIFDEL-------------------------------PTRDIVSWTIMITGL 313
            GCL +A+ +FD +                               P +D+V WT M+ G 
Sbjct: 195 CGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVVLWTAMMNGY 254

Query: 314 VQSNHPKESLELFSTMRTMGINPDGIILTSVLSACASLGTLDFGRWVHEYIDQRGIKWDI 373
           VQ N   E+LELF  M+T GI PD  +L S+L+ CA  G L+ G+W+H YI++  +  D 
Sbjct: 255 VQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVDK 314

Query: 374 HIGTAIVDMYAKCGCIEMALQIFNNMAQRNTFTWNVLLCGLAMHGLVREALNLFEVMIIS 433
            +GTA+VDMYAKCGCIE AL++F  + +R+T +W  L+ GLAM+G+   AL+L+  M   
Sbjct: 315 VVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMENV 374

Query: 434 GVKPNEVTFLAILTACCHSGLVNEGRKYFDNMSSQQYNLSPKLEHYGCMVDLFCRAGLLE 493
           GV+ + +TF+A+LTAC H G V EGRK F +M ++++N+ PK EH  C++DL CRAGLL+
Sbjct: 375 GVRLDAITFVAVLTACNHGGFVAEGRKIFHSM-TERHNVQPKSEHCSCLIDLLCRAGLLD 434

Query: 494 EAVELARTMPMKPD---VLIWGVLLNACKTVGNVELSQHIQDYILELDPEDSGIFVLLSN 553
           EA EL   M  + D   V ++  LL+A +  GNV++++ + + + +++  DS    LL++
Sbjct: 435 EAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSAHTLLAS 494

Query: 554 ISATNERWSDVTRLRRLMKDRGVKKSPGSSVIEVDGKAHEFVVGD--ISHFQTEEIYKLL 575
           + A+  RW DVT +RR MKD G++K PG S IE+DG  HEF+VGD  +SH + +EI  +L
Sbjct: 495 VYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMDEINSML 554

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9SZK1 | 6.5e-177 | 56.31 | Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX...
Q9LN01 | 7.6e-125 | 38.58 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9SJZ3 | 2.4e-123 | 38.65 | Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
O82380 | 1.8e-118 | 37.50 | Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9C866 | 2.9e-116 | 40.11 | Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity | Description
XP_022968019.1 | 0.0e+00 | 88.95 | pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita maxi...
KAG7013201.1 | 4.5e-310 | 88.29 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG6574147.1 | 1.4e-309 | 88.12 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022945046.1 | 2.3e-309 | 88.12 | pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita mosc...
XP_038889932.1 | 5.8e-308 | 88.18 | pentatricopeptide repeat-containing protein At4g38010 [Benincasa hispida] >XP_03...

Match Name | E-value | Identity | Description
A0A6J1HYD9 | 0.0e+00 | 88.95 | pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucurbita ma...
A0A6J1FZR9 | 1.1e-309 | 88.12 | pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucurbita mo...
A0A0A0LF19 | 1.0e-305 | 86.99 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G004700 PE=4 SV=1
A0A5D3CGR6 | 1.1e-304 | 86.99 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C7P6 | 3.2e-304 | 86.82 | pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucumis melo...

Match Name | E-value | Identity | Description
AT4G38010.1 | 4.6e-178 | 56.31 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G08070.1 | 5.4e-126 | 38.58 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22410.1 | 1.7e-124 | 38.65 | SLOW GROWTH 1
AT2G29760.1 | 1.3e-119 | 37.50 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G31430.1 | 2.0e-117 | 40.11 | Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 73..101, e-value: 7.3E-4, score: 19.6
    coord: 443..467, e-value: 0.0037, score: 17.4
    coord: 241..266, e-value: 4.5E-4, score: 20.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 269..302, e-value: 5.8E-6, score: 24.1
    coord: 142..172, e-value: 1.7E-4, score: 19.6
    coord: 370..404, e-value: 4.5E-7, score: 27.6
    coord: 73..104, e-value: 1.3E-5, score: 23.1
    coord: 172..197, e-value: 9.4E-7, score: 26.6
    coord: 405..433, e-value: 0.0031, score: 15.6
    coord: 241..269, e-value: 0.0019, score: 16.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 169..215, e-value: 4.1E-8, score: 33.3
    coord: 367..416, e-value: 3.3E-12, score: 46.4
    coord: 267..314, e-value: 3.1E-7, score: 30.5
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 267..301, score: 11.531345
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 368..402, score: 12.035565
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 403..437, score: 8.999285
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 170..200, score: 10.588674
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 69..103, score: 9.163705
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 139..169, score: 8.527949
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 324..418, e-value: 1.4E-26, score: 94.9
    coord: 224..323, e-value: 1.2E-22, score: 82.2
    coord: 123..223, e-value: 5.8E-21, score: 76.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 419..553, e-value: 2.8E-14, score: 54.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 7..122, e-value: 1.3E-8, score: 36.3
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 188..253
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 2..187
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 251..578
None | No IPR available | PANTHER | PTHR47928:SF62 | OS04G0488200 PROTEIN | coord: 251..578
None | No IPR available | PANTHER | PTHR47928:SF62 | OS04G0488200 PROTEIN | coord: 188..253
    coord: 2..187
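
The alignment coordinates in the InterPro listing can be turned into hit lengths with a few lines of Python. This is a minimal sketch rather than anything the database itself provides: the helper name hit_spans is hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends, which this record does not state explicitly. The example input is the six PROSITE PS51375 coordinate entries copied from the listing.

```python
import re

# Matches entries such as "coord: 267..301" anywhere in a line.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def hit_spans(text):
    """Return (start, end, length) for every coord entry, sorted by start.

    Assumption: both coordinates are inclusive, so length = end - start + 1.
    """
    spans = []
    for m in COORD.finditer(text):
        start, end = int(m.group(1)), int(m.group(2))
        spans.append((start, end, end - start + 1))
    return sorted(spans)


# PROSITE PS51375 (PPR) hits, copied from the InterPro listing.
prosite_ppr = """
coord: 267..301
coord: 368..402
coord: 403..437
coord: 170..200
coord: 69..103
coord: 139..169
"""

for start, end, length in hit_spans(prosite_ppr):
    print(f"{start}..{end}: {length} aa")
```

Under the inclusive-coordinate assumption, four of the six PROSITE hits span 35 residues, the canonical length of a PPR motif, and the two hits at 139..169 and 170..200 span 31 residues each.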

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0020698.1 | Tan0020698.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding