Tan0021130 (gene) Snake gourd v1

Overview
Name: Tan0021130
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG03: 69157063 .. 69158796 (+)
RNA-Seq Expression: Tan0021130
Synteny: Tan0021130
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGTCTCTGTCTGATCTTTTCCCATCCTTTGATCACTGTGCTCGTCTCATATCACAATGCATCAAACTCAAACACTTAAAGGTTGGCATGTCCTTGCACTCCCACCTTATCAAAACCGCACTTTCATTTGACCTCTTCCTTGCAAACCGTCTTATTGACATGTATTCCAAATGTAATTCTATTGAGAATGCACAGAAGGCATTTGATGATTTGCCCATTAGAAATATTCACTCTTGGAATACCATTCTTGCTTCCTACTCATGTGTTGGATTTTTGAGTCAAGCTCGTAAGGTCTTTGATGAAATGCCTCATCCAAATATTGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATGAATATCTTTCGACAAATGCAACAAGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATAGCGGGTACTTGTGCCTGTTTGGGTGCTCTAGAATCGTTGCGTCAGGTTCATGGAGCAGCTGTTGTCATTGGATTGGAGTTTAATATGATTGCTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGCGAACCGGATGCGTCATATTCTATTTTCAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTATGCTCAGACATCCAAGTTAGATGATGCTTTTCGAGTGTTCAGTTGTATGCCGGAAAAAGGTGTCCACACTTGGACTGCTTTGATTAATGCTTTTGTGAAAAACAAGTATAGCAATGAGGCCGTAGATCTGTTTCAACAAATGCTGGAGGAAAAAATTTCTCCTAATGCTTTCACATTTGTAGGAGTTTTAAGTGCTTGTGCAGATCTTGCTTTGATAGCAAAAGGCAAAGAGATTCATGGACTCATAACCAGAAGAAGCAGTGGCCTTAATTTTCCAAACGTATATATGTGTAATTCTTTGATTGATCTGTACAGTAAGAGTGGTGACGTGAAATCAGCTAGGACGTTGTTTAACTTGGTTTTTGAAAAGGATGTAGTGTCGTGGAATTCATTAATAACTGGGTTTGCACAAAATGGGCTTGGAAGGGAAGCACTTCTTGCCTTTAGGAGGATGATAGAAGTAGGGATAAGGCCTAATAAAGTGACGTTTCTTGGTGTGCTGTCTGCCTGTGCCCATACTGGTTTATCATCTGAAGGATTGTATATTCTGGAGTTAATGGAGAAGTGTTATGGCATTAAGCCTAGTTTAGATCATTATGCAGTCTTGATTGATATGTTTGGAAGAAAAAATAGACTTTCCGAAGCATTGGATATAATATCCGGGGCACCCAATGGATCAAAGCATGTCGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATACATGAAAATTTGGACCTAGCTGTAAGAGCTGCAGAAACTTTGTTTGAGATGGAGCCAGATAATGCTGGAAGATATGTAATGTTATCTAATGTATTTGCTGCAGCAAATAGATGGATGGATGCCCATAACGTGAGAAAACTCATGGAGGAAAGAGGTTTCAAGAAGGAAGTTGCATATAGCTGCATAGAAATAAGAAATATAAGGCATAAATTTGTGGCAAGAGATAATTCCCATAGTCAGATGGGTGACATATATGAGCTAATTTCTATACTACTAGACCATATGAAAAATTTTGGTTACATGCCTTTTGACAATGGCATTTACTTTTACGATGGATATAATACTTGA

mRNA sequence

ATGGTGTCTCTGTCTGATCTTTTCCCATCCTTTGATCACTGTGCTCGTCTCATATCACAATGCATCAAACTCAAACACTTAAAGGTTGGCATGTCCTTGCACTCCCACCTTATCAAAACCGCACTTTCATTTGACCTCTTCCTTGCAAACCGTCTTATTGACATGTATTCCAAATGTAATTCTATTGAGAATGCACAGAAGGCATTTGATGATTTGCCCATTAGAAATATTCACTCTTGGAATACCATTCTTGCTTCCTACTCATGTGTTGGATTTTTGAGTCAAGCTCGTAAGGTCTTTGATGAAATGCCTCATCCAAATATTGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATGAATATCTTTCGACAAATGCAACAAGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATAGCGGGTACTTGTGCCTGTTTGGGTGCTCTAGAATCGTTGCGTCAGGTTCATGGAGCAGCTGTTGTCATTGGATTGGAGTTTAATATGATTGCTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGCGAACCGGATGCGTCATATTCTATTTTCAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTATGCTCAGACATCCAAGTTAGATGATGCTTTTCGAGTGTTCAGTTGTATGCCGGAAAAAGGTGTCCACACTTGGACTGCTTTGATTAATGCTTTTGTGAAAAACAAGTATAGCAATGAGGCCGTAGATCTGTTTCAACAAATGCTGGAGGAAAAAATTTCTCCTAATGCTTTCACATTTGTAGGAGTTTTAAGTGCTTGTGCAGATCTTGCTTTGATAGCAAAAGGCAAAGAGATTCATGGACTCATAACCAGAAGAAGCAGTGGCCTTAATTTTCCAAACGTATATATGTGTAATTCTTTGATTGATCTGTACAGTAAGAGTGGTGACGTGAAATCAGCTAGGACGTTGTTTAACTTGGTTTTTGAAAAGGATGTAGTGTCGTGGAATTCATTAATAACTGGGTTTGCACAAAATGGGCTTGGAAGGGAAGCACTTCTTGCCTTTAGGAGGATGATAGAAGTAGGGATAAGGCCTAATAAAGTGACGTTTCTTGGTGTGCTGTCTGCCTGTGCCCATACTGGTTTATCATCTGAAGGATTGTATATTCTGGAGTTAATGGAGAAGTGTTATGGCATTAAGCCTAGTTTAGATCATTATGCAGTCTTGATTGATATGTTTGGAAGAAAAAATAGACTTTCCGAAGCATTGGATATAATATCCGGGGCACCCAATGGATCAAAGCATGTCGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATACATGAAAATTTGGACCTAGCTGTAAGAGCTGCAGAAACTTTGTTTGAGATGGAGCCAGATAATGCTGGAAGATATGTAATGTTATCTAATGTATTTGCTGCAGCAAATAGATGGATGGATGCCCATAACGTGAGAAAACTCATGGAGGAAAGAGGTTTCAAGAAGGAAGTTGCATATAGCTGCATAGAAATAAGAAATATAAGGCATAAATTTGTGGCAAGAGATAATTCCCATAGTCAGATGGGTGACATATATGAGCTAATTTCTATACTACTAGACCATATGAAAAATTTTGGTTACATGCCTTTTGACAATGGCATTTACTTTTACGATGGATATAATACTTGA

Coding sequence (CDS)

ATGGTGTCTCTGTCTGATCTTTTCCCATCCTTTGATCACTGTGCTCGTCTCATATCACAATGCATCAAACTCAAACACTTAAAGGTTGGCATGTCCTTGCACTCCCACCTTATCAAAACCGCACTTTCATTTGACCTCTTCCTTGCAAACCGTCTTATTGACATGTATTCCAAATGTAATTCTATTGAGAATGCACAGAAGGCATTTGATGATTTGCCCATTAGAAATATTCACTCTTGGAATACCATTCTTGCTTCCTACTCATGTGTTGGATTTTTGAGTCAAGCTCGTAAGGTCTTTGATGAAATGCCTCATCCAAATATTGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATGAATATCTTTCGACAAATGCAACAAGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATAGCGGGTACTTGTGCCTGTTTGGGTGCTCTAGAATCGTTGCGTCAGGTTCATGGAGCAGCTGTTGTCATTGGATTGGAGTTTAATATGATTGCTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGCGAACCGGATGCGTCATATTCTATTTTCAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTATGCTCAGACATCCAAGTTAGATGATGCTTTTCGAGTGTTCAGTTGTATGCCGGAAAAAGGTGTCCACACTTGGACTGCTTTGATTAATGCTTTTGTGAAAAACAAGTATAGCAATGAGGCCGTAGATCTGTTTCAACAAATGCTGGAGGAAAAAATTTCTCCTAATGCTTTCACATTTGTAGGAGTTTTAAGTGCTTGTGCAGATCTTGCTTTGATAGCAAAAGGCAAAGAGATTCATGGACTCATAACCAGAAGAAGCAGTGGCCTTAATTTTCCAAACGTATATATGTGTAATTCTTTGATTGATCTGTACAGTAAGAGTGGTGACGTGAAATCAGCTAGGACGTTGTTTAACTTGGTTTTTGAAAAGGATGTAGTGTCGTGGAATTCATTAATAACTGGGTTTGCACAAAATGGGCTTGGAAGGGAAGCACTTCTTGCCTTTAGGAGGATGATAGAAGTAGGGATAAGGCCTAATAAAGTGACGTTTCTTGGTGTGCTGTCTGCCTGTGCCCATACTGGTTTATCATCTGAAGGATTGTATATTCTGGAGTTAATGGAGAAGTGTTATGGCATTAAGCCTAGTTTAGATCATTATGCAGTCTTGATTGATATGTTTGGAAGAAAAAATAGACTTTCCGAAGCATTGGATATAATATCCGGGGCACCCAATGGATCAAAGCATGTCGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATACATGAAAATTTGGACCTAGCTGTAAGAGCTGCAGAAACTTTGTTTGAGATGGAGCCAGATAATGCTGGAAGATATGTAATGTTATCTAATGTATTTGCTGCAGCAAATAGATGGATGGATGCCCATAACGTGAGAAAACTCATGGAGGAAAGAGGTTTCAAGAAGGAAGTTGCATATAGCTGCATAGAAATAAGAAATATAAGGCATAAATTTGTGGCAAGAGATAATTCCCATAGTCAGATGGGTGACATATATGAGCTAATTTCTATACTACTAGACCATATGAAAAATTTTGGTTACATGCCTTTTGACAATGGCATTTACTTTTACGATGGATATAATACTTGA

Protein sequence

MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGYNT
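As a quick consistency check on the Location field, the Python sketch below computes the feature length from the start and end positions and the number of residues that length could encode. It assumes 1-based inclusive coordinates and a single uninterrupted coding exon ending in one stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def residues_encoded(cds_length: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Coordinates taken from the Location field (LG03, + strand).
gene_length = span_length(69157063, 69158796)
protein_length = residues_encoded(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the span is 1734 bp, which corresponds to 577 residues, the same total that appears as the denominator in the full-length alignments reported under Homology (for example 517/577).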
Homology
BLAST of Tan0021130 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 7.5e-101
Identity = 214/652 (32.82%), Positives = 329/652 (50.46%), Query Frame = 0

Query: 15  ARLISQCIKLKHLKVGMS-LHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLP 74
           A+L+  CIK K   + +  +H+ +IK+  S ++F+ NRLID YSKC S+E+ ++ FD +P
Sbjct: 23  AKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 82

Query: 75  IRNIHSWNTILASYSCVGFLSQ-------------------------------------- 134
            RNI++WN+++   + +GFL +                                      
Sbjct: 83  QRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAM 142

Query: 135 ------------------------------------------------------------ 194
                                                                       
Sbjct: 143 MHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN 202

Query: 195 ---ARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEFTLVSIA 254
              A++VFDEM   N+VS+N+LI+ F ++G  VE++++F+ M +    +  DE TL S+ 
Sbjct: 203 VNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE--SRVEPDEVTLASVI 262

Query: 255 GTCACLGALESLRQVHGAAVVIG-LEFNMIACNAIIDAYGKCGEPDASYSIFSRMQERDV 314
             CA L A++  ++VHG  V    L  ++I  NA +D Y KC     +  IF  M  R+V
Sbjct: 263 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 322

Query: 315 VTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDLFQQMLEE 374
           +  TSM+  YA  +    A  +F+ M E+ V +W ALI  + +N  + EA+ LF  +  E
Sbjct: 323 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 382

Query: 375 KISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNF-----PNVYMCNSLIDLYS 434
            + P  ++F  +L ACADLA +  G + H  + +   G  F      ++++ NSLID+Y 
Sbjct: 383 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKH--GFKFQSGEEDDIFVGNSLIDMYV 442

Query: 435 KSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVTFLG 494
           K G V+    +F  + E+D VSWN++I GFAQNG G EAL  FR M+E G +P+ +T +G
Sbjct: 443 KCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIG 502

Query: 495 VLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGAPNG 554
           VLSAC H G   EG +    M + +G+ P  DHY  ++D+ GR   L EA  +I   P  
Sbjct: 503 VLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQ 562

Query: 555 SKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDAHNV 559
              V IWG++L AC++H N+ L    AE L E+EP N+G YV+LSN++A   +W D  NV
Sbjct: 563 PDSV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNV 622
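The percentages in an HSP summary line are simply the match counts divided by the alignment length. The Python sketch below, which is only an illustration and not the tool that produced this page, parses such a line and recomputes the percentages; the helper name `parse_hsp_stats` and the regular expression are assumptions made for this example.

```python
import re

# Matches "label = count/total" pairs such as "Identity = 214/652".
PAIR = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line: str) -> dict:
    """Return {label: (count, total, percent)} for each 'label = n/m' pair."""
    stats = {}
    for label, count, total in PAIR.findall(line):
        n, m = int(count), int(total)
        # Percent is recomputed from the counts, rounded to two decimals.
        stats[label] = (n, m, round(100.0 * n / m, 2))
    return stats


line = "Identity = 214/652 (32.82%), Positives = 329/652 (50.46%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

The "Query Frame = 0" field is skipped because it has no count/total pair. Comparing the recomputed values with the percentages printed in parentheses is a cheap way to spot copy or extraction errors in a summary line.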

BLAST of Tan0021130 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 7.5e-101
Identity = 197/534 (36.89%), Positives = 308/534 (57.68%), Query Frame = 0

Query: 11  FDHCARLISQCIKLKHLKVGMSLHSHLIKTALSF-DLFLANRLIDMYSKCNSIENAQKAF 70
           FD  A L+ QC   K LK G  +H HL  T     +  L+N LI MY KC    +A K F
Sbjct: 46  FDLLASLLQQCGDTKSLKQGKWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVF 105

Query: 71  DDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMN 130
           D + +RN++SWN +++ Y   G L +AR VFD MP  ++VS+NT++  + + G   E++ 
Sbjct: 106 DQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALW 165

Query: 131 IFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDA 190
            +++ ++    +  +EF+   +   C     L+  RQ HG  +V G   N++   +IIDA
Sbjct: 166 FYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDA 225

Query: 191 YGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALI 250
           Y KCG+ +++   F  M  +D+  WT+++  YA+   ++ A ++F  MPEK   +WTALI
Sbjct: 226 YAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALI 285

Query: 251 NAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSG 310
             +V+    N A+DLF++M+   + P  FTF   L A A +A +  GKEIHG + R +  
Sbjct: 286 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 345

Query: 311 LNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEK-DVVSWNSLITGFAQNGLGREALLA 370
              PN  + +SLID+YSKSG ++++  +F +  +K D V WN++I+  AQ+GLG +AL  
Sbjct: 346 ---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRM 405

Query: 371 FRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFG 430
              MI+  ++PN+ T + +L+AC+H+GL  EGL   E M   +GI P  +HYA LID+ G
Sbjct: 406 LDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLG 465

Query: 431 RKNRLSEALDIISGAP-NGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRY 490
           R     E +  I   P    KH  IW A+LG CRIH N +L  +AA+ L +++P+++  Y
Sbjct: 466 RAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKAADELIKLDPESSAPY 525

Query: 491 VMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHS 542
           ++LS+++A   +W     +R +M++R   KE A S IEI      F   D SH+
Sbjct: 526 ILLSSIYADHGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKVEAFTVSDGSHA 572

BLAST of Tan0021130 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 2.7e-98
Identity = 187/541 (34.57%), Positives = 318/541 (58.78%), Query Frame = 0

Query: 27  LKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLPIRNIHSWNTILAS 86
           ++ G  +HS ++K  L  ++ ++N L++MY+KC     A+  FD + +R+I SWN ++A 
Sbjct: 162 METGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIAL 221

Query: 87  YSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEF 146
           +  VG +  A   F++M   +IV++N++IS F + G  + +++IF +M +D  LL  D F
Sbjct: 222 HMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD-SLLSPDRF 281

Query: 147 TLVSIAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDAYGKCGEPDASYSIFSRM 206
           TL S+   CA L  L   +Q+H   V  G + + I  NA+I  Y +CG  + +  +  + 
Sbjct: 282 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 341

Query: 207 QERD--VVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDL 266
             +D  +  +T+++  Y +   ++ A  +F  + ++ V  WTA+I  + ++    EA++L
Sbjct: 342 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 401

Query: 267 FQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNFPNVYMCNSLIDL 326
           F+ M+     PN++T   +LS  + LA ++ GK+IHG   +  SG    +V + N+LI +
Sbjct: 402 FRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVK--SG-EIYSVSVSNALITM 461

Query: 327 YSKSGDVKSARTLFNLV-FEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVT 386
           Y+K+G++ SA   F+L+  E+D VSW S+I   AQ+G   EAL  F  M+  G+RP+ +T
Sbjct: 462 YAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHIT 521

Query: 387 FLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGA 446
           ++GV SAC H GL ++G    ++M+    I P+L HYA ++D+FGR   L EA + I   
Sbjct: 522 YVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKM 581

Query: 447 PNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDA 506
           P     V  WG++L ACR+H+N+DL   AAE L  +EP+N+G Y  L+N+++A  +W +A
Sbjct: 582 PI-EPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEA 641

Query: 507 HNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGDIYELISILLDHMKNFGYM 565
             +RK M++   KKE  +S IE+++  H F   D +H +  +IY  +  + D +K  GY+
Sbjct: 642 AKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 697

BLAST of Tan0021130 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 1.4e-94
Identity = 189/564 (33.51%), Positives = 311/564 (55.14%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+SL  L P+      ++  C K K  K G  +H H++K     DL++   LI MY +  
Sbjct: 125 MISLG-LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNG 184

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
            +E+A K FD  P R++ S+  ++  Y+  G++  A+K+FDE+P  ++VS+N +IS +  
Sbjct: 185 RLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAE 244

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
            G Y E++ +F+ M +    +  DE T+V++   CA  G++E  RQVH      G   N+
Sbjct: 245 TGNYKEALELFKDMMK--TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 304

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
              NA+ID Y KCGE + +  +F R+  +DV++W +++  Y                   
Sbjct: 305 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTH----------------- 364

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
                         N Y  EA+ LFQ+ML    +PN  T + +L ACA L  I  G+ IH
Sbjct: 365 -------------MNLY-KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIH 424

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
             I +R  G+   +  +  SLID+Y+K GD+++A  +FN +  K + SWN++I GFA +G
Sbjct: 425 VYIDKRLKGVTNAS-SLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHG 484

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
               +   F RM ++GI+P+ +TF+G+LSAC+H+G+   G +I   M + Y + P L+HY
Sbjct: 485 RADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHY 544

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
             +ID+ G      EA ++I+        V IW ++L AC++H N++L    AE L ++E
Sbjct: 545 GCMIDLLGHSGLFKEAEEMINMMEMEPDGV-IWCSLLKACKMHGNVELGESFAENLIKIE 604

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           P+N G YV+LSN++A+A RW +    R L+ ++G KK    S IEI ++ H+F+  D  H
Sbjct: 605 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 652

Query: 541 SQMGDIYELISILLDHMKNFGYMP 565
            +  +IY ++  +   ++  G++P
Sbjct: 665 PRNREIYGMLEEMEVLLEKAGFVP 652

BLAST of Tan0021130 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 4.0e-94
Identity = 194/565 (34.34%), Positives = 305/565 (53.98%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVS S  +P+      LI    ++  L +G SLH   +K+A+  D+F+AN LI  Y  C 
Sbjct: 121 MVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSC- 180

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
                                         G L  A KVF  +   ++VS+N++I+ F +
Sbjct: 181 ------------------------------GDLDSACKVFTTIKEKDVVSWNSMINGFVQ 240

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
            G   +++ +F++M+ +   +     T+V +   CA +  LE  RQV        +  N+
Sbjct: 241 KGSPDKALELFKKMESED--VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNL 300

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
              NA++D Y KCG  + +  +F  M+E+D VTWT+M+  YA +   + A  V + MP+K
Sbjct: 301 TLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQK 360

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQM-LEEKISPNAFTFVGVLSACADLALIAKGKEI 300
            +  W ALI+A+ +N   NEA+ +F ++ L++ +  N  T V  LSACA +  +  G+ I
Sbjct: 361 DIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWI 420

Query: 301 HGLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQN 360
           H  I +    +NF   ++ ++LI +YSK GD++ +R +FN V ++DV  W+++I G A +
Sbjct: 421 HSYIKKHGIRMNF---HVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMH 480

Query: 361 GLGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDH 420
           G G EA+  F +M E  ++PN VTF  V  AC+HTGL  E   +   ME  YGI P   H
Sbjct: 481 GCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKH 540

Query: 421 YAVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEM 480
           YA ++D+ GR   L +A+  I   P       +WGA+LGAC+IH NL+LA  A   L E+
Sbjct: 541 YACIVDVLGRSGYLEKAVKFIEAMPI-PPSTSVWGALLGACKIHANLNLAEMACTRLLEL 600

Query: 481 EPDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNS 540
           EP N G +V+LSN++A   +W +   +RK M   G KKE   S IEI  + H+F++ DN+
Sbjct: 601 EPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNA 648

Query: 541 HSQMGDIYELISILLDHMKNFGYMP 565
           H     +Y  +  +++ +K+ GY P
Sbjct: 661 HPMSEKVYGKLHEVMEKLKSNGYEP 648

BLAST of Tan0021130 vs. NCBI nr
Match: XP_031745241.1 (pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Cucumis sativus] >KAE8645941.1 hypothetical protein Csa_021389 [Cucumis sativus])

HSP 1 Score: 1053.1 bits (2722), Expect = 8.5e-304
Identity = 517/577 (89.60%), Positives = 545/577 (94.45%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MV LSDLFPSFDHCARL S+CI+ KHL+VGMSLHSHLIKTALSFDLFLANRLIDMYSKCN
Sbjct: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLPIRNIHSWNTILASYS  GF SQARKVFDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVESMNIFRQMQQDFDLL LDE TLVSIAGTCACLGALE LRQVHGAA+VIGLEFNM
Sbjct: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNAI+DAYGKCG+PDASYSIFSRM+ERDVVTWTSMVVAY QTS+LDDAFRVFSCMP K
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINA VKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           GLI RRSS LNFPNVY+CN+LIDLYSKSGDVKSAR LFNL+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREALLAFR+M EVGIRPNKVTFL VLSAC+HTGLSSEGL ILELMEK Y I+PSL+HY
Sbjct: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AV+IDMFGR+NRL+EALD+IS APNGSKHVGIWGAVLGACRIHENLDLA+RAAETLFEME
Sbjct: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGYNT 578
           SQMG+IYEL+ ILL+HM   GYM  D+GIYFYDGY+T
Sbjct: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577

BLAST of Tan0021130 vs. NCBI nr
Match: XP_038882958.1 (pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Benincasa hispida])

HSP 1 Score: 1045.0 bits (2701), Expect = 2.3e-301
Identity = 512/577 (88.73%), Positives = 543/577 (94.11%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MV  SDLFPSFDHCARLIS+CIK KHLKVGMSLHSHLIKTALS DLFLANRLIDMYSKCN
Sbjct: 1   MVPFSDLFPSFDHCARLISKCIKHKHLKVGMSLHSHLIKTALSSDLFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFD+LPIRNIHSWN ILASYS  GF SQARKVFDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDELPIRNIHSWNIILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVES+NIFRQMQQDFD LVLDEFTLVSI GTCACLGALE LRQVHGAA+VIGLEFNM
Sbjct: 121 HGLYVESINIFRQMQQDFDHLVLDEFTLVSIVGTCACLGALELLRQVHGAAIVIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNAI+DAYGKCG+PDASYSIFSRM+ERDVVTWTSMVVAY QTS+LDDAFRVFSCMP K
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALIN   KNKYSNEA+DLFQQMLEEKISPN FTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINGLAKNKYSNEALDLFQQMLEEKISPNTFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           G I RRS+ LNFPNVY+CN+LIDLYSKSGD+KSARTLF+L+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GFIIRRSNDLNFPNVYICNALIDLYSKSGDMKSARTLFDLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREALLAFRRM EVGIRPNKVTFLG+LSAC+HTGLSSEGL+ILELME  Y IKPSLDHY
Sbjct: 361 LGREALLAFRRMTEVGIRPNKVTFLGLLSACSHTGLSSEGLHILELMETSYDIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+IS APNGSKHVGIWGAVLGACRIHENLDLA+RAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKE+AYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKELAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGYNT 578
           +QMG+I+EL+ ILL+HMK FG M  D+GIYFYDGY+T
Sbjct: 541 NQMGEIHELMFILLEHMKIFGCMALDDGIYFYDGYST 577

BLAST of Tan0021130 vs. NCBI nr
Match: XP_022132706.1 (pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia])

HSP 1 Score: 1028.5 bits (2658), Expect = 2.3e-296
Identity = 502/575 (87.30%), Positives = 542/575 (94.26%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MV L+D+FP+FDHCARLIS+CIK KHLKVGMSLHSHLIKTALS+DLFLANRLIDMYSKCN
Sbjct: 1   MVPLADIFPAFDHCARLISKCIKHKHLKVGMSLHSHLIKTALSYDLFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLPIRN+HSWNTILA Y+ +G LSQARK FDEMPHPNI+SYNTLI SFTR
Sbjct: 61  SMENAQKAFDDLPIRNVHSWNTILALYTRIGCLSQARKFFDEMPHPNIISYNTLIYSFTR 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVESMNIFR+MQQDFDLLVLDEFTLVSIAGTCACLGAL  LRQ+HGAA+VIGLEFN+
Sbjct: 121 HGLYVESMNIFRKMQQDFDLLVLDEFTLVSIAGTCACLGALALLRQIHGAAIVIGLEFNV 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I  NAIIDAYGKCGEPD SYSIFS+MQERDVVTWTSMVVAYAQTS+LDDAFRVFSCMP K
Sbjct: 181 IVSNAIIDAYGKCGEPDTSYSIFSQMQERDVVTWTSMVVAYAQTSRLDDAFRVFSCMPMK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAF KNKYSNEA+DLF+QMLEEKIS N+FTFVGVLSACADLALIAKGK+IH
Sbjct: 241 NVHTWTALINAFAKNKYSNEALDLFEQMLEEKISLNSFTFVGVLSACADLALIAKGKQIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           GLI R S  LNF NVY+ N+LID+YSKSGD+KSARTLFNL+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRSSCSLNFLNVYIYNALIDMYSKSGDMKSARTLFNLMPEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LG+EAL+AFRRMIEVGIRPNKVTFLGVLSAC+HTGL SEGLY+LELMEK +GIKPSLDHY
Sbjct: 361 LGKEALIAFRRMIEVGIRPNKVTFLGVLSACSHTGLLSEGLYLLELMEKFFGIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+I+ APN S HVGIWGAVLGACR+HENLDLA+ AAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLIARAPNRSNHVGIWGAVLGACRMHENLDLAMSAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVML+N+FAAA+RWMDAHNVRKLMEERGFKKEVAYSCIEIRN  HKFVARDNSH
Sbjct: 481 PDNAGRYVMLANIFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNRGHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGY 576
           SQMG+IYEL+ ILLDHMKNFG MPFDNGIYFYDGY
Sbjct: 541 SQMGEIYELMFILLDHMKNFGCMPFDNGIYFYDGY 575

BLAST of Tan0021130 vs. NCBI nr
Match: XP_022977857.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1012.7 bits (2617), Expect = 1.3e-291
Identity = 496/574 (86.41%), Positives = 531/574 (92.51%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+ LS  FPSFDH A LIS+CIK KHLKVGMSLHSHLIK+ALSFD FLAN LIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANHLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLP +NIHSWNTILASYS  GFLSQAR +FDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVE+MNIF QMQQDFD LVLDEFT VSI GTCACLGALE LRQ+HGAA+ IGLEFNM
Sbjct: 121 HGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQIHGAAIFIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNA+I+AYGKCGEP  SYS+FSRMQ+RDVVTWTSMVVAY QTSKLDDAFRVF  MP K
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAFVKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           G+I RRSS LNFPNVYMCN+L+DLYSKSGD+KSARTLFNLV +KDVVSWNSLITGFAQNG
Sbjct: 301 GIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREAL+A+RRMIEVGI+PN+VTFLGVLSAC+HTGLSSEGLYI+E MEK   IKPSLDHY
Sbjct: 361 LGREALIAYRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMESMEKSNDIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+IS APN SKH+GIWGAVLGACRIH+NLDLA+RAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDG 575
           SQMG+IYEL+ ILLDHMK FGYM  D+GIYFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKFGYMLLDDGIYFYDG 574

BLAST of Tan0021130 vs. NCBI nr
Match: KAG6604304.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1010.7 bits (2612), Expect = 4.9e-291
Identity = 495/574 (86.24%), Positives = 530/574 (92.33%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+ LS  FPSFDH A LIS+CIK KHLKVGMSLHSHLIK+ALSFD FLANRLIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLP +NIHSWNTILASYS  GFLSQAR +FDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVE+M+IF QMQQDFD LVLDEFT VSI GTCACLGALE LRQVHGAA+ IGLEFNM
Sbjct: 121 HGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNA+I+AYGKCGEP  SYS+FS MQ+RDVVTWTSMVVAY QTSKLDDAFRVF  MP K
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSSMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAFVKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
            +I RRSS LNFPNVYMCN+L+DLYSKSGD+KSARTLFNLV +KDVVSWNSLITGFAQNG
Sbjct: 301 AIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREAL+AFRRMIEVGI+PN+VTFLGVLSAC+HTGLSSEGLYI+ELM K   IKPSLDHY
Sbjct: 361 LGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMAKSNDIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+IS APN SKH+GIWGAVLGACRIH+NLDLA+RAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDG 575
           SQMG+IYEL+ ILLDHMK  GYMP D+G+YFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKIGYMPLDDGVYFYDG 574

BLAST of Tan0021130 vs. ExPASy TrEMBL
Match: A0A0A0KFI0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1)

HSP 1 Score: 1053.1 bits (2722), Expect = 4.1e-304
Identity = 517/577 (89.60%), Positives = 545/577 (94.45%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MV LSDLFPSFDHCARL S+CI+ KHL+VGMSLHSHLIKTALSFDLFLANRLIDMYSKCN
Sbjct: 1   MVPLSDLFPSFDHCARLFSKCIQHKHLRVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLPIRNIHSWNTILASYS  GF SQARKVFDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVESMNIFRQMQQDFDLL LDE TLVSIAGTCACLGALE LRQVHGAA+VIGLEFNM
Sbjct: 121 HGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNAI+DAYGKCG+PDASYSIFSRM+ERDVVTWTSMVVAY QTS+LDDAFRVFSCMP K
Sbjct: 181 IVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVFSCMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINA VKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           GLI RRSS LNFPNVY+CN+LIDLYSKSGDVKSAR LFNL+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREALLAFR+M EVGIRPNKVTFL VLSAC+HTGLSSEGL ILELMEK Y I+PSL+HY
Sbjct: 361 LGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIEPSLEHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AV+IDMFGR+NRL+EALD+IS APNGSKHVGIWGAVLGACRIHENLDLA+RAAETLFEME
Sbjct: 421 AVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGYNT 578
           SQMG+IYEL+ ILL+HM   GYM  D+GIYFYDGY+T
Sbjct: 541 SQMGEIYELMFILLEHMNIIGYMALDDGIYFYDGYST 577

BLAST of Tan0021130 vs. ExPASy TrEMBL
Match: A0A6J1BT15 (pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charantia OX=3673 GN=LOC111005503 PE=4 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 1.1e-296
Identity = 502/575 (87.30%), Positives = 542/575 (94.26%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MV L+D+FP+FDHCARLIS+CIK KHLKVGMSLHSHLIKTALS+DLFLANRLIDMYSKCN
Sbjct: 1   MVPLADIFPAFDHCARLISKCIKHKHLKVGMSLHSHLIKTALSYDLFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLPIRN+HSWNTILA Y+ +G LSQARK FDEMPHPNI+SYNTLI SFTR
Sbjct: 61  SMENAQKAFDDLPIRNVHSWNTILALYTRIGCLSQARKFFDEMPHPNIISYNTLIYSFTR 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVESMNIFR+MQQDFDLLVLDEFTLVSIAGTCACLGAL  LRQ+HGAA+VIGLEFN+
Sbjct: 121 HGLYVESMNIFRKMQQDFDLLVLDEFTLVSIAGTCACLGALALLRQIHGAAIVIGLEFNV 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I  NAIIDAYGKCGEPD SYSIFS+MQERDVVTWTSMVVAYAQTS+LDDAFRVFSCMP K
Sbjct: 181 IVSNAIIDAYGKCGEPDTSYSIFSQMQERDVVTWTSMVVAYAQTSRLDDAFRVFSCMPMK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAF KNKYSNEA+DLF+QMLEEKIS N+FTFVGVLSACADLALIAKGK+IH
Sbjct: 241 NVHTWTALINAFAKNKYSNEALDLFEQMLEEKISLNSFTFVGVLSACADLALIAKGKQIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           GLI R S  LNF NVY+ N+LID+YSKSGD+KSARTLFNL+ EKDVVSWNSLITGFAQNG
Sbjct: 301 GLIIRSSCSLNFLNVYIYNALIDMYSKSGDMKSARTLFNLMPEKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LG+EAL+AFRRMIEVGIRPNKVTFLGVLSAC+HTGL SEGLY+LELMEK +GIKPSLDHY
Sbjct: 361 LGKEALIAFRRMIEVGIRPNKVTFLGVLSACSHTGLLSEGLYLLELMEKFFGIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+I+ APN S HVGIWGAVLGACR+HENLDLA+ AAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLIARAPNRSNHVGIWGAVLGACRMHENLDLAMSAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVML+N+FAAA+RWMDAHNVRKLMEERGFKKEVAYSCIEIRN  HKFVARDNSH
Sbjct: 481 PDNAGRYVMLANIFAAASRWMDAHNVRKLMEERGFKKEVAYSCIEIRNRGHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDGY 576
           SQMG+IYEL+ ILLDHMKNFG MPFDNGIYFYDGY
Sbjct: 541 SQMGEIYELMFILLDHMKNFGCMPFDNGIYFYDGY 575

BLAST of Tan0021130 vs. ExPASy TrEMBL
Match: A0A6J1IJL9 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478028 PE=4 SV=1)

HSP 1 Score: 1012.7 bits (2617), Expect = 6.2e-292
Identity = 496/574 (86.41%), Positives = 531/574 (92.51%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+ LS  FPSFDH A LIS+CIK KHLKVGMSLHSHLIK+ALSFD FLAN LIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANHLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLP +NIHSWNTILASYS  GFLSQAR +FDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVE+MNIF QMQQDFD LVLDEFT VSI GTCACLGALE LRQ+HGAA+ IGLEFNM
Sbjct: 121 HGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQIHGAAIFIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNA+I+AYGKCGEP  SYS+FSRMQ+RDVVTWTSMVVAY QTSKLDDAFRVF  MP K
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAFVKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
           G+I RRSS LNFPNVYMCN+L+DLYSKSGD+KSARTLFNLV +KDVVSWNSLITGFAQNG
Sbjct: 301 GIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREAL+A+RRMIEVGI+PN+VTFLGVLSAC+HTGLSSEGLYI+E MEK   IKPSLDHY
Sbjct: 361 LGREALIAYRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMESMEKSNDIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+IS APN SKH+GIWGAVLGACRIH+NLDLA+RAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVA S IEIRN+RHKFVARDNSH
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNVRHKFVARDNSH 540

Query: 541 SQMGDIYELISILLDHMKNFGYMPFDNGIYFYDG 575
           SQMG+IYEL+ ILLDHMK FGYM  D+GIYFYDG
Sbjct: 541 SQMGEIYELMFILLDHMKKFGYMLLDDGIYFYDG 574

BLAST of Tan0021130 vs. ExPASy TrEMBL
Match: A0A5D3C8H5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G001990 PE=4 SV=1)

HSP 1 Score: 995.7 bits (2573), Expect = 7.8e-287
Identity = 488/547 (89.21%), Positives = 516/547 (94.33%), Query Frame = 0

Query: 31  MSLHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLPIRNIHSWNTILASYSCV 90
           MSLHSHLIKTALSFDLFLANRLIDMYSKCNS+ENAQKAFDD PIRNIHSWNTILASYS  
Sbjct: 1   MSLHSHLIKTALSFDLFLANRLIDMYSKCNSMENAQKAFDDSPIRNIHSWNTILASYSRA 60

Query: 91  GFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEFTLVS 150
           G  SQARKVFDEMPHPNIVSYNTLISSFT HGLY ESMNIFRQMQ+DFDLL LDE TLVS
Sbjct: 61  GSFSQARKVFDEMPHPNIVSYNTLISSFTHHGLYGESMNIFRQMQRDFDLLALDEITLVS 120

Query: 151 IAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDAYGKCGEPDASYSIFSRMQERD 210
           I G CACLGALE LRQVHGAA+VIGLEFN+I CNAI+DAYGKCG+PDASYSIFSRM+ERD
Sbjct: 121 IVGACACLGALELLRQVHGAAIVIGLEFNLIVCNAIVDAYGKCGDPDASYSIFSRMKERD 180

Query: 211 VVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDLFQQMLE 270
           VVTWTSMVVAY QTS+LDDAFRVFSCMP K VHTWTALINA VKNKYSNEA+DLFQQMLE
Sbjct: 181 VVTWTSMVVAYNQTSRLDDAFRVFSCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLE 240

Query: 271 EKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNFPNVYMCNSLIDLYSKSGD 330
           EK SPNAFTFVGVLSACADLALIAKGKEIHGLI RRSS LNFPNVY+CN+LIDLYSKSGD
Sbjct: 241 EKNSPNAFTFVGVLSACADLALIAKGKEIHGLIIRRSSDLNFPNVYVCNALIDLYSKSGD 300

Query: 331 VKSARTLFNLVFEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVTFLGVLSA 390
           +KSAR LFNL+ EKDVVSWNSLITGFAQNGLGREALLAF++M EVGIRPNKVTFLGVLSA
Sbjct: 301 MKSARMLFNLILEKDVVSWNSLITGFAQNGLGREALLAFQKMTEVGIRPNKVTFLGVLSA 360

Query: 391 CAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGAPNGSKHV 450
           C+HTGLSSEGLYILELMEK Y IKPSL+HYAV+IDMFGR+N+LSEALD+IS APNGSKHV
Sbjct: 361 CSHTGLSSEGLYILELMEKSYDIKPSLEHYAVMIDMFGRENKLSEALDLISRAPNGSKHV 420

Query: 451 GIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDAHNVRKLM 510
           GIWGAVLGACRIHENLDLA+RAAETLFEMEPDNAGRYVMLSNVFAAA+RWMDAHNVRKLM
Sbjct: 421 GIWGAVLGACRIHENLDLAIRAAETLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLM 480

Query: 511 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGDIYELISILLDHMKNFGYMPFDNGIY 570
           EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMG+IYEL+ ILL+HM  FGYM  D+GIY
Sbjct: 481 EERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGEIYELMFILLEHMNIFGYMALDDGIY 540

Query: 571 FYDGYNT 578
           FYDGY+T
Sbjct: 541 FYDGYST 547

BLAST of Tan0021130 vs. ExPASy TrEMBL
Match: A0A6J1EJM2 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111433199 PE=3 SV=1)

HSP 1 Score: 936.0 bits (2418), Expect = 7.4e-269
Identity = 464/541 (85.77%), Positives = 496/541 (91.68%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+ LS  FPSFDH A LIS+CIK KHLKVGMSLHSHLIK+ALSFD FLANRLIDMYSKCN
Sbjct: 1   MMPLSGFFPSFDHYAFLISKCIKHKHLKVGMSLHSHLIKSALSFDPFLANRLIDMYSKCN 60

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
           S+ENAQKAFDDLP +NIHSWNTILASYS  GFLSQAR +FDEMPHPNIVSYNTLISSFT 
Sbjct: 61  SMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTLISSFTH 120

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
           HGLYVE+M+IF QMQQDFD LVLDEFT VSI GTCACLGALE LRQVHGAA+ IGLEFNM
Sbjct: 121 HGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFIGLEFNM 180

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
           I CNA+I+AYGKCGEP  SYS+FSRMQ+RDVVTWTSMVVAY QTSKLDDAFRVF  MP K
Sbjct: 181 IVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVFRSMPVK 240

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
            VHTWTALINAFVKNKYSNEA+DLFQQMLEEK SPNAFTFVGVLSACADLALIAKGKEIH
Sbjct: 241 NVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIAKGKEIH 300

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
            +I RRSS LNFPNVYMCN+L+DLYSKSGD+KSARTLFNLV +KDVVSWNSLITGFAQNG
Sbjct: 301 AIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLITGFAQNG 360

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
           LGREAL+AFRRMIEVGI+PN+VTFLGVLSAC+HTGLSSEGLYI+ELMEK   IKPSLDHY
Sbjct: 361 LGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIKPSLDHY 420

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
           AVLIDMFGRKNRL+EALD+IS APN SKH+GIWGAVLGACRIH+NLDLA+RAAETLFEME
Sbjct: 421 AVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAETLFEME 480

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRN-----IRHKFVA 537
           PDNAGRYVMLSNVFAAA+RWMDAHNVRKLMEERGFKKEVA S IEIRN     +R+ F A
Sbjct: 481 PDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKLRNNFPA 540

BLAST of Tan0021130 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 369.4 bits (947), Expect = 5.3e-102
Identity = 197/534 (36.89%), Positives = 308/534 (57.68%), Query Frame = 0

Query: 11  FDHCARLISQCIKLKHLKVGMSLHSHLIKTALSF-DLFLANRLIDMYSKCNSIENAQKAF 70
           FD  A L+ QC   K LK G  +H HL  T     +  L+N LI MY KC    +A K F
Sbjct: 46  FDLLASLLQQCGDTKSLKQGKWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVF 105

Query: 71  DDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMN 130
           D + +RN++SWN +++ Y   G L +AR VFD MP  ++VS+NT++  + + G   E++ 
Sbjct: 106 DQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALW 165

Query: 131 IFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDA 190
            +++ ++    +  +EF+   +   C     L+  RQ HG  +V G   N++   +IIDA
Sbjct: 166 FYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDA 225

Query: 191 YGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALI 250
           Y KCG+ +++   F  M  +D+  WT+++  YA+   ++ A ++F  MPEK   +WTALI
Sbjct: 226 YAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALI 285

Query: 251 NAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSG 310
             +V+    N A+DLF++M+   + P  FTF   L A A +A +  GKEIHG + R +  
Sbjct: 286 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 345

Query: 311 LNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEK-DVVSWNSLITGFAQNGLGREALLA 370
              PN  + +SLID+YSKSG ++++  +F +  +K D V WN++I+  AQ+GLG +AL  
Sbjct: 346 ---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRM 405

Query: 371 FRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFG 430
              MI+  ++PN+ T + +L+AC+H+GL  EGL   E M   +GI P  +HYA LID+ G
Sbjct: 406 LDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLG 465

Query: 431 RKNRLSEALDIISGAP-NGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRY 490
           R     E +  I   P    KH  IW A+LG CRIH N +L  +AA+ L +++P+++  Y
Sbjct: 466 RAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKAADELIKLDPESSAPY 525

Query: 491 VMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHS 542
           ++LS+++A   +W     +R +M++R   KE A S IEI      F   D SH+
Sbjct: 526 ILLSSIYADHGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKVEAFTVSDGSHA 572

BLAST of Tan0021130 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 369.4 bits (947), Expect = 5.3e-102
Identity = 214/652 (32.82%), Positives = 329/652 (50.46%), Query Frame = 0

Query: 15  ARLISQCIKLKHLKVGMS-LHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLP 74
           A+L+  CIK K   + +  +H+ +IK+  S ++F+ NRLID YSKC S+E+ ++ FD +P
Sbjct: 23  AKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 82

Query: 75  IRNIHSWNTILASYSCVGFLSQ-------------------------------------- 134
            RNI++WN+++   + +GFL +                                      
Sbjct: 83  QRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAM 142

Query: 135 ------------------------------------------------------------ 194
                                                                       
Sbjct: 143 MHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN 202

Query: 195 ---ARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEFTLVSIA 254
              A++VFDEM   N+VS+N+LI+ F ++G  VE++++F+ M +    +  DE TL S+ 
Sbjct: 203 VNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE--SRVEPDEVTLASVI 262

Query: 255 GTCACLGALESLRQVHGAAVVIG-LEFNMIACNAIIDAYGKCGEPDASYSIFSRMQERDV 314
             CA L A++  ++VHG  V    L  ++I  NA +D Y KC     +  IF  M  R+V
Sbjct: 263 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 322

Query: 315 VTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDLFQQMLEE 374
           +  TSM+  YA  +    A  +F+ M E+ V +W ALI  + +N  + EA+ LF  +  E
Sbjct: 323 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 382

Query: 375 KISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNF-----PNVYMCNSLIDLYS 434
            + P  ++F  +L ACADLA +  G + H  + +   G  F      ++++ NSLID+Y 
Sbjct: 383 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKH--GFKFQSGEEDDIFVGNSLIDMYV 442

Query: 435 KSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVTFLG 494
           K G V+    +F  + E+D VSWN++I GFAQNG G EAL  FR M+E G +P+ +T +G
Sbjct: 443 KCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIG 502

Query: 495 VLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGAPNG 554
           VLSAC H G   EG +    M + +G+ P  DHY  ++D+ GR   L EA  +I   P  
Sbjct: 503 VLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQ 562

Query: 555 SKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDAHNV 559
              V IWG++L AC++H N+ L    AE L E+EP N+G YV+LSN++A   +W D  NV
Sbjct: 563 PDSV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNV 622

BLAST of Tan0021130 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 360.9 bits (925), Expect = 1.9e-99
Identity = 187/541 (34.57%), Positives = 318/541 (58.78%), Query Frame = 0

Query: 27  LKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCNSIENAQKAFDDLPIRNIHSWNTILAS 86
           ++ G  +HS ++K  L  ++ ++N L++MY+KC     A+  FD + +R+I SWN ++A 
Sbjct: 162 METGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIAL 221

Query: 87  YSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTRHGLYVESMNIFRQMQQDFDLLVLDEF 146
           +  VG +  A   F++M   +IV++N++IS F + G  + +++IF +M +D  LL  D F
Sbjct: 222 HMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD-SLLSPDRF 281

Query: 147 TLVSIAGTCACLGALESLRQVHGAAVVIGLEFNMIACNAIIDAYGKCGEPDASYSIFSRM 206
           TL S+   CA L  L   +Q+H   V  G + + I  NA+I  Y +CG  + +  +  + 
Sbjct: 282 TLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQR 341

Query: 207 QERD--VVTWTSMVVAYAQTSKLDDAFRVFSCMPEKGVHTWTALINAFVKNKYSNEAVDL 266
             +D  +  +T+++  Y +   ++ A  +F  + ++ V  WTA+I  + ++    EA++L
Sbjct: 342 GTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINL 401

Query: 267 FQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIHGLITRRSSGLNFPNVYMCNSLIDL 326
           F+ M+     PN++T   +LS  + LA ++ GK+IHG   +  SG    +V + N+LI +
Sbjct: 402 FRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVK--SG-EIYSVSVSNALITM 461

Query: 327 YSKSGDVKSARTLFNLV-FEKDVVSWNSLITGFAQNGLGREALLAFRRMIEVGIRPNKVT 386
           Y+K+G++ SA   F+L+  E+D VSW S+I   AQ+G   EAL  F  M+  G+RP+ +T
Sbjct: 462 YAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHIT 521

Query: 387 FLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHYAVLIDMFGRKNRLSEALDIISGA 446
           ++GV SAC H GL ++G    ++M+    I P+L HYA ++D+FGR   L EA + I   
Sbjct: 522 YVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKM 581

Query: 447 PNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEMEPDNAGRYVMLSNVFAAANRWMDA 506
           P     V  WG++L ACR+H+N+DL   AAE L  +EP+N+G Y  L+N+++A  +W +A
Sbjct: 582 PI-EPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEA 641

Query: 507 HNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSHSQMGDIYELISILLDHMKNFGYM 565
             +RK M++   KKE  +S IE+++  H F   D +H +  +IY  +  + D +K  GY+
Sbjct: 642 AKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 697

BLAST of Tan0021130 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 348.6 bits (893), Expect = 9.7e-96
Identity = 189/564 (33.51%), Positives = 311/564 (55.14%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           M+SL  L P+      ++  C K K  K G  +H H++K     DL++   LI MY +  
Sbjct: 125 MISLG-LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNG 184

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
            +E+A K FD  P R++ S+  ++  Y+  G++  A+K+FDE+P  ++VS+N +IS +  
Sbjct: 185 RLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAE 244

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
            G Y E++ +F+ M +    +  DE T+V++   CA  G++E  RQVH      G   N+
Sbjct: 245 TGNYKEALELFKDMMK--TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 304

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
              NA+ID Y KCGE + +  +F R+  +DV++W +++  Y                   
Sbjct: 305 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTH----------------- 364

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQMLEEKISPNAFTFVGVLSACADLALIAKGKEIH 300
                         N Y  EA+ LFQ+ML    +PN  T + +L ACA L  I  G+ IH
Sbjct: 365 -------------MNLY-KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIH 424

Query: 301 GLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQNG 360
             I +R  G+   +  +  SLID+Y+K GD+++A  +FN +  K + SWN++I GFA +G
Sbjct: 425 VYIDKRLKGVTNAS-SLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHG 484

Query: 361 LGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDHY 420
               +   F RM ++GI+P+ +TF+G+LSAC+H+G+   G +I   M + Y + P L+HY
Sbjct: 485 RADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHY 544

Query: 421 AVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEME 480
             +ID+ G      EA ++I+        V IW ++L AC++H N++L    AE L ++E
Sbjct: 545 GCMIDLLGHSGLFKEAEEMINMMEMEPDGV-IWCSLLKACKMHGNVELGESFAENLIKIE 604

Query: 481 PDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNSH 540
           P+N G YV+LSN++A+A RW +    R L+ ++G KK    S IEI ++ H+F+  D  H
Sbjct: 605 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 652

Query: 541 SQMGDIYELISILLDHMKNFGYMP 565
            +  +IY ++  +   ++  G++P
Sbjct: 665 PRNREIYGMLEEMEVLLEKAGFVP 652

BLAST of Tan0021130 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 347.1 bits (889), Expect = 2.8e-95
Identity = 194/565 (34.34%), Positives = 305/565 (53.98%), Query Frame = 0

Query: 1   MVSLSDLFPSFDHCARLISQCIKLKHLKVGMSLHSHLIKTALSFDLFLANRLIDMYSKCN 60
           MVS S  +P+      LI    ++  L +G SLH   +K+A+  D+F+AN LI  Y  C 
Sbjct: 121 MVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSC- 180

Query: 61  SIENAQKAFDDLPIRNIHSWNTILASYSCVGFLSQARKVFDEMPHPNIVSYNTLISSFTR 120
                                         G L  A KVF  +   ++VS+N++I+ F +
Sbjct: 181 ------------------------------GDLDSACKVFTTIKEKDVVSWNSMINGFVQ 240

Query: 121 HGLYVESMNIFRQMQQDFDLLVLDEFTLVSIAGTCACLGALESLRQVHGAAVVIGLEFNM 180
            G   +++ +F++M+ +   +     T+V +   CA +  LE  RQV        +  N+
Sbjct: 241 KGSPDKALELFKKMESED--VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNL 300

Query: 181 IACNAIIDAYGKCGEPDASYSIFSRMQERDVVTWTSMVVAYAQTSKLDDAFRVFSCMPEK 240
              NA++D Y KCG  + +  +F  M+E+D VTWT+M+  YA +   + A  V + MP+K
Sbjct: 301 TLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQK 360

Query: 241 GVHTWTALINAFVKNKYSNEAVDLFQQM-LEEKISPNAFTFVGVLSACADLALIAKGKEI 300
            +  W ALI+A+ +N   NEA+ +F ++ L++ +  N  T V  LSACA +  +  G+ I
Sbjct: 361 DIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWI 420

Query: 301 HGLITRRSSGLNFPNVYMCNSLIDLYSKSGDVKSARTLFNLVFEKDVVSWNSLITGFAQN 360
           H  I +    +NF   ++ ++LI +YSK GD++ +R +FN V ++DV  W+++I G A +
Sbjct: 421 HSYIKKHGIRMNF---HVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMH 480

Query: 361 GLGREALLAFRRMIEVGIRPNKVTFLGVLSACAHTGLSSEGLYILELMEKCYGIKPSLDH 420
           G G EA+  F +M E  ++PN VTF  V  AC+HTGL  E   +   ME  YGI P   H
Sbjct: 481 GCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKH 540

Query: 421 YAVLIDMFGRKNRLSEALDIISGAPNGSKHVGIWGAVLGACRIHENLDLAVRAAETLFEM 480
           YA ++D+ GR   L +A+  I   P       +WGA+LGAC+IH NL+LA  A   L E+
Sbjct: 541 YACIVDVLGRSGYLEKAVKFIEAMPI-PPSTSVWGALLGACKIHANLNLAEMACTRLLEL 600

Query: 481 EPDNAGRYVMLSNVFAAANRWMDAHNVRKLMEERGFKKEVAYSCIEIRNIRHKFVARDNS 540
           EP N G +V+LSN++A   +W +   +RK M   G KKE   S IEI  + H+F++ DN+
Sbjct: 601 EPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNA 648

Query: 541 HSQMGDIYELISILLDHMKNFGYMP 565
           H     +Y  +  +++ +K+ GY P
Sbjct: 661 HPMSEKVYGKLHEVMEKLKSNGYEP 648

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9SIT7 | 7.5e-101 | 32.82 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SKQ4 | 7.5e-101 | 36.89 | Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX...
Q9SHZ8 | 2.7e-98 | 34.57 | Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9LN01 | 1.4e-94 | 33.51 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380 | 4.0e-94 | 34.34 | Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
XP_031745241.1 | 8.5e-304 | 89.60 | pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Cucumis sativu...
XP_038882958.1 | 2.3e-301 | 88.73 | pentatricopeptide repeat-containing protein At2g21090 isoform X1 [Benincasa hisp...
XP_022132706.1 | 2.3e-296 | 87.30 | pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia]
XP_022977857.1 | 1.3e-291 | 86.41 | pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita...
KAG6604304.1 | 4.9e-291 | 86.24 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name | E-value | Identity | Description
A0A0A0KFI0 | 4.1e-304 | 89.60 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1
A0A6J1BT15 | 1.1e-296 | 87.30 | pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charanti...
A0A6J1IJL9 | 6.2e-292 | 86.41 | pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbi...
A0A5D3C8H5 | 7.8e-287 | 89.21 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1EJM2 | 7.4e-269 | 85.77 | pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbi...

Match Name | E-value | Identity | Description
AT2G21090.1 | 5.3e-102 | 36.89 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G13600.1 | 5.3e-102 | 32.82 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1 | 1.9e-99 | 34.57 | pentatricopeptide (PPR) repeat-containing protein
AT1G08070.1 | 9.7e-96 | 33.51 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1 | 2.8e-95 | 34.34 | Tetratricopeptide repeat (TPR)-like superfamily protein
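
The Identity and Positives percentages quoted in each HSP block follow directly from the residue counts on the same line. As a rough check, the sketch below parses one such statistics line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern only targets the line layout shown on this page.

```python
import re

# Pattern for the HSP statistics line as laid out on this page.
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return residue counts plus reported and recomputed percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentage = matching residues / alignment length, to two decimals.
        "identity_pct_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct_recomputed": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    line = "Identity = 517/577 (89.60%), Positives = 545/577 (94.45%), Query Frame = 0"
    for key, value in parse_hsp_stats(line).items():
        print(key, value)
```

For the Cucumis sativus TrEMBL hit (517 identical and 545 positive residues over 577 aligned positions), the recomputed values agree with the reported 89.60% and 94.45%.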
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
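IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 79..103; e-value: 0.0013; score: 18.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 317..343; e-value: 0.001; score: 19.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 182..210; e-value: 2.6E-5; score: 24.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 109..136; e-value: 4.3E-6; score: 26.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 344..392; e-value: 7.2E-10; score: 38.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 241..288; e-value: 1.1E-12; score: 47.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 182..212; e-value: 7.0E-6; score: 23.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 78..103; e-value: 0.0015; score: 16.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 243..277; e-value: 2.9E-9; score: 34.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 212..242; e-value: 0.0032; score: 15.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 347..380; e-value: 5.1E-7; score: 27.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 316..345; e-value: 5.9E-4; score: 17.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 109..136; e-value: 3.5E-5; score: 21.7
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 179..213; score: 10.446177
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 241..275; score: 11.279235
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 107..137; score: 9.952918
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 345..379; score: 12.287675
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 143..298; e-value: 2.5E-31; score: 110.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 299..398; e-value: 1.8E-23; score: 84.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 399..550; e-value: 3.2E-9; score: 38.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 4..136; e-value: 4.2E-19; score: 71.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 206..505
None | No IPR available | PANTHER | PTHR47926:SF232 | SUBFAMILY NOT NAMED | coord: 5..567
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 5..567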
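
The alignment coordinates in the InterPro table are not listed in sequence order. The sketch below shows one way to put the four PROSITE PS51375 (PPR) profile hits into N- to C-terminal order and report the length of each. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `summarize_hits` is a hypothetical helper name.

```python
# PROSITE PS51375 (PPR) profile hits copied from the InterPro table above.
# Assumption: coordinates are 1-based and inclusive at both ends.
PPR_PROFILE_HITS = [(179, 213), (241, 275), (107, 137), (345, 379)]


def summarize_hits(hits):
    """Sort (start, end) hits by start position and compute their lengths."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    return ordered, lengths


if __name__ == "__main__":
    ordered, lengths = summarize_hits(PPR_PROFILE_HITS)
    for (start, end), length in zip(ordered, lengths):
        print("%d..%d spans %d residues" % (start, end, length))
    print("total residues in PPR profile hits:", sum(lengths))
```

The same helper can be applied to the PFAM or TIGRFAM coordinate lists, although those hits overlap one another, so their summed lengths would count some residues more than once.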

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0021130.1 | Tan0021130.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding