Tan0012021 (gene) Snake gourd v1

Overview
NameTan0012021
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG05: 84807119 .. 84809422 (-)
RNA-Seq ExpressionTan0012021
SyntenyTan0012021
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CATAACAGTTCTTCTTCTTCTTGTTCTTCTTCGAGCTCTCTGCAGATCTCATCTATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCGCTAGCCGGCCGATCTTACCACACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCAGAGCTCGATGGCTGGGTTCAGGAGGGAAAGAAGGTCCGAGTCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGAAAACGATTTACACAAGCTCTTGAGGTTTGAATTCTTCTTTCTCGAAATGTGCTTTGTTTTGGTTTGAGTAGTAAGATGTGGAGGAAGAAGTGAACTGTGCAAAATGATGCATTGAGTGAGAACTAAGTGCCTTTTAGACGAGTTTAATTGAGGAAGTTTGCATAGCTTAATCGCTAACTTTATGTTTCCGGATTACGGAGAATGAGGTTGTATCAAATTCTCAATAGATTTAGCTGGGGGAAATGGAAGTCTCTGGATTACGGAGAATGTGGTTGTATCAAATTCTTAAGGTTTTTGATTCAGTGAATTTGTTTTAGTAACTTCATTAGTAATTTAGTTCACATAATATGGACTTTTCAGCGATGGAATTGTGTCTTTTCTCTCCTATTTACTCAAGAGTATGATTTAACACACTTTGTGGATAATGATGGAATGACTGAAACATCATATGAGCTTGAAATGTGTGGACTTAAACAATTTATTTGTGGGGTTCTTTGTAATTGATGATAGGTATAAATGCAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAAGGGAGCATGCGGTACAATTGGATCTGATTGGCCGAGTACGTGGATATCTCTCTGCTGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCTTGAATTGCTATGTTCGGCAGCGACAAGTTGAAAAATCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATAATGTGCTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAAAATGTGTCTCCTGACAACTTCAGCTACAGAATCTGCATCAATTCGTATGGTGTGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATTATCATGGACTGGAACACTTATGCAGTAGTTGCTAACTTCTTTATAAAAGCGGATCTTACTGATAAGGCGGTTGATGCCTTGAGAAAATCAGAAGAAAAACTGAAGAGTAAGGACAGAATTGGCCATAACCATCTAATCTCACTTTATGCGACCTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAAGTGATACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTGATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCGCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGCGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAATATGGATAAAGGGTGGAAGCCTAATCTTCGCGTGATCACAGGCATATTGAATTGGCTTGGTGAAAATGGCAGCATCGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAAGCTCATATAAGAGGTGGCAAAGAAGTTAATGAGGTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGCAAGAAACAACTGAAGGTAAGAACGTTGGCTGATTGTTTTCTTATTATATCTTATTATTGATGTTGTCCATCATCAGCCAAACGCATGTTGCTTGGGTTTCAATAATGCAAAATCTTACTTATCAAATTCTTCACATGAAAGGAGTAAAAGTAGCTAATGTCACTTCCAAATTTGCTGCTCTTGATGTACTAAGTCTTTGGTTTTAGTTTGGTGGGTTAATGAAAGTGTTGGTGCTTAAGAGAACTGTGTTGCATGCAGGGCTGATTTGGTATTTCGTGAAC

mRNA sequence

CATAACAGTTCTTCTTCTTCTTGTTCTTCTTCGAGCTCTCTGCAGATCTCATCTATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCGCTAGCCGGCCGATCTTACCACACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCAGAGCTCGATGGCTGGGTTCAGGAGGGAAAGAAGGTCCGAGTCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGAAAACGATTTACACAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAAGGGAGCATGCGGTACAATTGGATCTGATTGGCCGAGTACGTGGATATCTCTCTGCTGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCTTGAATTGCTATGTTCGGCAGCGACAAGTTGAAAAATCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATAATGTGCTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAAAATGTGTCTCCTGACAACTTCAGCTACAGAATCTGCATCAATTCGTATGGTGTGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATTATCATGGACTGGAACACTTATGCAGTAGTTGCTAACTTCTTTATAAAAGCGGATCTTACTGATAAGGCGGTTGATGCCTTGAGAAAATCAGAAGAAAAACTGAAGAGTAAGGACAGAATTGGCCATAACCATCTAATCTCACTTTATGCGACCTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAAGTGATACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTGATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCGCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGCGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAATATGGATAAAGGGTGGAAGCCTAATCTTCGCGTGATCACAGGCATATTGAATTGGCTTGGTGAAAATGGCAGCATCGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAAGCTCATATAAGAGGTGGCAAAGAAGTTAATGAGGTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGCAAGAAACAACTGAAGGTAAGAACGTTGGCTGATTGTTTTCTTATTATATCTTATTATTGATGTTGTCCATCATCAGCCAAACGCATGTTGCTTGGGTTTCAATAATGCAAAATCTTACTTATCAAATTCTTCACATGAAAGGAGTAAAAGTAGCTAATGTCACTTCCAAATTTGCTGCTCTTGATGTACTAAGTCTTTGGTTTTAGTTTGGTGGGTTAATGAAAGTGTTGGTGCTTAAGAGAACTGTGTTGCATGCAGGGCTGATTTGGTATTTCGTGAAC

Coding sequence (CDS)

ATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCGCTAGCCGGCCGATCTTACCACACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCAGAGCTCGATGGCTGGGTTCAGGAGGGAAAGAAGGTCCGAGTCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGAAAACGATTTACACAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAAGGGAGCATGCGGTACAATTGGATCTGATTGGCCGAGTACGTGGATATCTCTCTGCTGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCTTGAATTGCTATGTTCGGCAGCGACAAGTTGAAAAATCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATAATGTGCTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAAAATGTGTCTCCTGACAACTTCAGCTACAGAATCTGCATCAATTCGTATGGTGTGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATTATCATGGACTGGAACACTTATGCAGTAGTTGCTAACTTCTTTATAAAAGCGGATCTTACTGATAAGGCGGTTGATGCCTTGAGAAAATCAGAAGAAAAACTGAAGAGTAAGGACAGAATTGGCCATAACCATCTAATCTCACTTTATGCGACCTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAAGTGATACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTGATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCGCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGCGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAATATGGATAAAGGGTGGAAGCCTAATCTTCGCGTGATCACAGGCATATTGAATTGGCTTGGTGAAAATGGCAGCATCGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAAGCTCATATAAGAGGTGGCAAAGAAGTTAATGAGGTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGCAAGAAACAACTGAAGGTAAGAACGTTGGCTGA

Protein sequence

MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGILNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDEDEETKKILGTWQETTEGKNVG
Homology
BLAST of Tan0012021 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 612.5 bits (1578), Expect = 4.4e-174
Identity = 297/477 (62.26%), Postives = 374/477 (78.41%), Query Frame = 0

Query: 15  LAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 74
           +A R Y+TNR+KK TLY+KISPLGDP  SV PEL  WVQ GKKV VAEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 75  RFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 134
           RF  ALEVS+WM ++GVC+FSP EHAV LDLIGRV G+++AE YF  LKEQ + DKTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 135 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 194
           LLNCYVRQ+ VEKSL H +KMKEMGF TS LTYN+IMCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 195 VSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKA 254
           V+PDN+SYRICIN++G   DLE +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 255 VDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLES 314
           V+ L+ SE +L+ KD  G+NHLI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 315 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATT 374
           LV++  L EAE+VL EW+SSGNCYDFRVPNTVI GYI K M E+AEA+LEDL  +GKATT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 375 PNSWGAVAVQYMDRGETEKAVECMKAALSLNM-DKGWKPNLRVITGILNWLGENGSIEEV 434
           P SW  VA  Y ++G  E A +CMK AL + +  + W+P L ++T +L+W+G+ GS++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 435 EAFVGSLRSVIPVNREMYHALMKAHIR-GGKEVNEVLNQMKSDKIDEDEETKKILGT 490
           E+FV SLR+ I VN++MYHAL+KA IR GG+ ++ +L +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of Tan0012021 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 1.2e-78
Identity = 151/397 (38.04%), Postives = 246/397 (61.96%), Query Frame = 0

Query: 29  TLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKK 88
           TL  +++  GDPS S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 89  SGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 148
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 149 LSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
               Q+MKE+GF    L YN ++ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 209 YGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKS- 268
           Y V  D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ LRKSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 269 KDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKV 328
           K +  +  L+S Y   G KE+V RLW+L K +   F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 329 LKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMD 388
           ++EWE+  + +D R+P+ +I GY  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 389 RGETEKAVECMKAALSLNMDKGWKPNLRVITGILNWL 425
            G+ EKAVE  K A+ ++   GW+P+  V+   +++L
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYL 433

BLAST of Tan0012021 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 290.4 bits (742), Expect = 3.8e-77
Identity = 158/452 (34.96%), Postives = 258/452 (57.08%), Query Frame = 0

Query: 30  LYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKS 89
           +Y KIS +  P +     L+ W + G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 90  GVCI-FSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 149
           G     S  + A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EK+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 150 LSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 209
            + L  M++ G+A   L +N +M LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 210 YGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKSK 269
            G    +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DALRK E ++  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 270 DRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKVL 329
           +RI +++L+SLY +LGNK+++ R+W++ KS      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 330 KEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMDR 389
           +EW    + YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 390 GETEKAVECMKAALSLNMDKGWKPNLRVITGILNWLGENGSIEEVEAFVGSLRSVIPVNR 449
               +A+ C++ A S      W+P + +++G      E   +   EA +  LR    +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 450 EMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 481
           + Y AL+        + N  +N  + D  + D
Sbjct: 489 KSYLALIDV------DENRTVNNSEIDAHETD 514

BLAST of Tan0012021 vs. ExPASy Swiss-Prot
Match: Q93WC5 (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 5.7e-73
Identity = 154/473 (32.56%), Postives = 265/473 (56.03%), Query Frame = 0

Query: 16  AGRSYHTNRLKKATLYAKISPLGD-PSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 75
           A  S  T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK +
Sbjct: 27  AAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFR 86

Query: 76  RFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 135
           +  +ALE+ EWM++  +  F+  +HA++L+LI + +G  +AE+YFN L +  +   TYG+
Sbjct: 87  QPQRALEIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGS 146

Query: 136 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 195
           LLNCY  +++  K+ +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+
Sbjct: 147 LLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKS 206

Query: 196 VSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKA 255
           ++P + +Y + I S G  +DL+G+E VL EM+++   I  WNT+A +A  +IK  L  KA
Sbjct: 207 ITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKA 266

Query: 256 VDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLES 315
            +AL+  E  +    R  ++ LI+LY  + N  +V R+W+L K       N  Y+TML +
Sbjct: 267 EEALKSLENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRA 326

Query: 316 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATT 375
           L +L +++  +KV  EWES+   YD R+ N  I  Y+ + M E AEA+    M+K K   
Sbjct: 327 LSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQF 386

Query: 376 PNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGILNWLGENGSIEEVE 435
             +   + +  +   + + A++  +AA+ L+ DK W  +  +I+       E   ++  E
Sbjct: 387 SKARQLLMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAE 446

Query: 436 AFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDEDEETKKIL 488
            F  +L    P++ E Y  LMK ++  GK   ++  +++   I  DEE + +L
Sbjct: 447 EFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of Tan0012021 vs. ExPASy Swiss-Prot
Match: Q9SY07 (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 5.9e-70
Identity = 146/438 (33.33%), Postives = 251/438 (57.31%), Query Frame = 0

Query: 51  WVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVR 110
           W +EG  VR  EL RI+ +LRK KR+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 111 GYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDI 170
           G  SAE +F  + +Q +      +LL+ YV+ +  +K+ +  +KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 171 MCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENV-LKEMESQP 230
           + +Y + GQ +KVP ++ E+K +  SPD  +Y + + ++    D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELKIR-TSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 231 HIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKV 290
            +  DW TY+V+ N + K D  +KA  AL++ E+ +  K+R+ +  LISL+A LG+K+ V
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRVAYASLISLHANLGDKDGV 323

Query: 291 LRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVG 350
              W   KS   +  + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN ++  
Sbjct: 324 NLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILAE 383

Query: 351 YIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKG 410
           Y+++      E   E ++EKG   + ++W  +   Y+ R + EK ++C   A  ++  K 
Sbjct: 384 YMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVKK 443

Query: 411 WKPNLRVITGILNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVL 470
           W  N+R++ G    L E G+++  E  +  L+    VN ++Y++L++ + + G+    V 
Sbjct: 444 WTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIVE 503

Query: 471 NQMKSDKIDEDEETKKIL 488
            +M  D ++ DEETK+++
Sbjct: 504 ERMAKDNVELDEETKELI 516

BLAST of Tan0012021 vs. NCBI nr
Match: XP_023542644.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 952.2 bits (2460), Expect = 1.8e-273
Identity = 469/495 (94.75%), Postives = 486/495 (98.18%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALAGR YHTNRLKKATLYAKISPLGDPS+SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELT+ND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTFNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKN+SPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNISPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADLT+KAVDALRK+EE+LKSKDRIGHNHLISLYATLGNKEKVLRLWNLDK+D 
Sbjct: 241 VANFFIKADLTEKAVDALRKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDA 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPNSWGAVAVQYMDRGETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+E+LNQMKSDK+DED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDED 480

Query: 481 EETKKILGTWQETTE 496
           EETKKILGT QETTE
Sbjct: 481 EETKKILGTGQETTE 494

BLAST of Tan0012021 vs. NCBI nr
Match: XP_022994385.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita maxima])

HSP 1 Score: 950.3 bits (2455), Expect = 6.8e-273
Identity = 468/498 (93.98%), Postives = 485/498 (97.39%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALA R YHTNRLKKATLYAKISPLGDP++SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+EE+LKSKDRIGHNHLISLY TLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPNSWGAVAVQYMDRGETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEV+E+LNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWQETTEGKN 499
           EETKKILGT QETTEG++
Sbjct: 481 EETKKILGTGQETTEGRS 497

BLAST of Tan0012021 vs. NCBI nr
Match: KAG7012430.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 949.5 bits (2453), Expect = 1.2e-272
Identity = 469/495 (94.75%), Postives = 484/495 (97.78%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALAGR YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+EE+LKSKDRIGHNHLISLYATLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPNSWGAVAVQYMDR ETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+E+LNQMKSDK+DED
Sbjct: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDED 480

Query: 481 EETKKILGTWQETTE 496
           EETKKILGT QETTE
Sbjct: 481 EETKKILGTGQETTE 494

BLAST of Tan0012021 vs. NCBI nr
Match: XP_022954890.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 948.3 bits (2450), Expect = 2.6e-272
Identity = 469/496 (94.56%), Postives = 483/496 (97.38%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALAGR YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+EE+LKSKDRIGHNHLISLYATLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPN WGAVAVQYMDR ETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+E+LNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWQETTEG 497
           EETKKILGT QETTEG
Sbjct: 481 EETKKILGTGQETTEG 495

BLAST of Tan0012021 vs. NCBI nr
Match: KAG6573262.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 946.4 bits (2445), Expect = 9.8e-272
Identity = 468/497 (94.16%), Postives = 483/497 (97.18%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALAGR YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+E +LKSKDRIGHNHLISLYATLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEVRLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPN WGAVAVQYMDR ETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+E+LNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWQETTEGK 498
           EETKKILGT QETTEG+
Sbjct: 481 EETKKILGTGQETTEGR 496

BLAST of Tan0012021 vs. ExPASy TrEMBL
Match: A0A6J1K124 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490120 PE=4 SV=1)

HSP 1 Score: 950.3 bits (2455), Expect = 3.3e-273
Identity = 468/498 (93.98%), Postives = 485/498 (97.39%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALA R YHTNRLKKATLYAKISPLGDP++SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+EE+LKSKDRIGHNHLISLY TLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPNSWGAVAVQYMDRGETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEV+E+LNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWQETTEGKN 499
           EETKKILGT QETTEG++
Sbjct: 481 EETKKILGTGQETTEGRS 497

BLAST of Tan0012021 vs. ExPASy TrEMBL
Match: A0A6J1GTN5 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457015 PE=4 SV=1)

HSP 1 Score: 948.3 bits (2450), Expect = 1.2e-272
Identity = 469/496 (94.56%), Postives = 483/496 (97.38%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYALAGR YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP EHAVQLDLIGRVRGYLSAESYFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYG RRDLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKADL DKAVDAL+K+EE+LKSKDRIGHNHLISLYATLGNKEKVLRLWNLDK+DT
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLMEKGK TTPN WGAVAVQYMDR ETEK+VECMKAAL+LNMDKGWKPNLRVITGI
Sbjct: 361 ALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLGEN SIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+E+LNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWQETTEG 497
           EETKKILGT QETTEG
Sbjct: 481 EETKKILGTGQETTEG 495

BLAST of Tan0012021 vs. ExPASy TrEMBL
Match: A0A6J1CGU2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111010721 PE=4 SV=1)

HSP 1 Score: 931.8 bits (2407), Expect = 1.2e-267
Identity = 455/500 (91.00%), Postives = 483/500 (96.60%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ LFSKALTRYA+AGRSYHTNR+KKATLYAKISPLGDPSISV PELDGWVQEGKK+RV
Sbjct: 10  MDQNLFSKALTRYAMAGRSYHTNRMKKATLYAKISPLGDPSISVGPELDGWVQEGKKIRV 69

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMK+SGVCIFSP EHAVQLDLIGRVRGYLSAESYF+
Sbjct: 70  AELQRIIHDLRKRKRFTQALEVSEWMKQSGVCIFSPSEHAVQLDLIGRVRGYLSAESYFD 129

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLK+QD+T KTYGALLNCYVRQRQV+KSLSHLQKMKEMGFATSELTYND+MCLYTNVGQH
Sbjct: 130 QLKDQDKTGKTYGALLNCYVRQRQVDKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 189

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           DKVP+VLAEMKE  VSPDNFSYRICINSYG R DLEGME+VLKEMESQPHI+MDWNTYAV
Sbjct: 190 DKVPQVLAEMKENKVSPDNFSYRICINSYGTRCDLEGMESVLKEMESQPHIVMDWNTYAV 249

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIK  LTDKAVDALRKSEE+L SKDRIGHNHLISLYATLGNKE+VLRLW LDKSD+
Sbjct: 250 VANFFIKGGLTDKAVDALRKSEERLNSKDRIGHNHLISLYATLGNKEEVLRLWKLDKSDS 309

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 310 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 369

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
           ALLEDLM++GKATTPNSWGAVAVQY+DRGETEKAVECMK ALSL++DKGWKPNLRVITGI
Sbjct: 370 ALLEDLMKEGKATTPNSWGAVAVQYLDRGETEKAVECMKTALSLHIDKGWKPNLRVITGI 429

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNW+G+N S EEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEV+ +L+QMKSD+IDED
Sbjct: 430 LNWIGDNSSTEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHGLLSQMKSDQIDED 489

Query: 481 EETKKILGTWQETTEGKNVG 501
           EETKKILGTWQE TEGK++G
Sbjct: 490 EETKKILGTWQEATEGKSIG 509

BLAST of Tan0012021 vs. ExPASy TrEMBL
Match: A0A5A7UM45 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00740 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.5e-257
Identity = 440/499 (88.18%), Postives = 468/499 (93.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQKLFSKALT YALA RSYHT RLKKATLYAKISPLGDPSISVE ELDGWVQEGKKVRV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRII D RKR RF+QAL+VSEWMKKSG CIFSP EHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+KSLSHLQKMKE+GFATSELTYNDIMCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYG R+DLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKA LTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLR+WNLDK+ T
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
            LLE+L +  KATTPNSWGAVAV+Y+DRGETEKA+ECMKAALS+N DKGWKPN RVITG+
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLG+ G +EEVEAFV +LRSVIPVNREMYHAL+K +IR  KEVNEVLN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTWQETTEGKNV 500
           EETKKILGTW+ETTEGK++
Sbjct: 481 EETKKILGTWEETTEGKSI 499

BLAST of Tan0012021 vs. ExPASy TrEMBL
Match: A0A1S3B5N2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486296 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.5e-257
Identity = 440/499 (88.18%), Postives = 468/499 (93.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYALAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQKLFSKALT YALA RSYHT RLKKATLYAKISPLGDPSISVE ELDGWVQEGKKVRV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFN 120
           AELQRII D RKR RF+QAL+VSEWMKKSG CIFSP EHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+KSLSHLQKMKE+GFATSELTYNDIMCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYG R+DLEGMENVLKEMESQPHI+MDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDT 300
           VANFFIKA LTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLR+WNLDK+ T
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGI 420
            LLE+L +  KATTPNSWGAVAV+Y+DRGETEKA+ECMKAALS+N DKGWKPN RVITG+
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 480
           LNWLG+ G +EEVEAFV +LRSVIPVNREMYHAL+K +IR  KEVNEVLN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTWQETTEGKNV 500
           EETKKILGTW+ETTEGK++
Sbjct: 481 EETKKILGTWEETTEGKSI 499

BLAST of Tan0012021 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 612.5 bits (1578), Expect = 3.1e-175
Identity = 297/477 (62.26%), Postives = 374/477 (78.41%), Query Frame = 0

Query: 15  LAGRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 74
           +A R Y+TNR+KK TLY+KISPLGDP  SV PEL  WVQ GKKV VAEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 75  RFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 134
           RF  ALEVS+WM ++GVC+FSP EHAV LDLIGRV G+++AE YF  LKEQ + DKTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 135 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 194
           LLNCYVRQ+ VEKSL H +KMKEMGF TS LTYN+IMCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 195 VSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKA 254
           V+PDN+SYRICIN++G   DLE +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 255 VDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLES 314
           V+ L+ SE +L+ KD  G+NHLI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 315 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATT 374
           LV++  L EAE+VL EW+SSGNCYDFRVPNTVI GYI K M E+AEA+LEDL  +GKATT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 375 PNSWGAVAVQYMDRGETEKAVECMKAALSLNM-DKGWKPNLRVITGILNWLGENGSIEEV 434
           P SW  VA  Y ++G  E A +CMK AL + +  + W+P L ++T +L+W+G+ GS++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 435 EAFVGSLRSVIPVNREMYHALMKAHIR-GGKEVNEVLNQMKSDKIDEDEETKKILGT 490
           E+FV SLR+ I VN++MYHAL+KA IR GG+ ++ +L +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of Tan0012021 vs. TAIR 10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 295.4 bits (755), Expect = 8.4e-80
Identity = 151/397 (38.04%), Postives = 246/397 (61.96%), Query Frame = 0

Query: 29  TLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKK 88
           TL  +++  GDPS S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 89  SGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 148
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 149 LSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
               Q+MKE+GF    L YN ++ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 209 YGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKS- 268
           Y V  D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ LRKSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 269 KDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKV 328
           K +  +  L+S Y   G KE+V RLW+L K +   F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 329 LKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMD 388
           ++EWE+  + +D R+P+ +I GY  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 389 RGETEKAVECMKAALSLNMDKGWKPNLRVITGILNWL 425
            G+ EKAVE  K A+ ++   GW+P+  V+   +++L
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYL 433

BLAST of Tan0012021 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 290.4 bits (742), Expect = 2.7e-78
Identity = 158/452 (34.96%), Postives = 258/452 (57.08%), Query Frame = 0

Query: 30  LYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKS 89
           +Y KIS +  P +     L+ W + G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 90  GVCI-FSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 149
           G     S  + A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EK+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 150 LSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 209
            + L  M++ G+A   L +N +M LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 210 YGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKSK 269
            G    +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DALRK E ++  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 270 DRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKVL 329
           +RI +++L+SLY +LGNK+++ R+W++ KS      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 330 KEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMDR 389
           +EW    + YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 390 GETEKAVECMKAALSLNMDKGWKPNLRVITGILNWLGENGSIEEVEAFVGSLRSVIPVNR 449
               +A+ C++ A S      W+P + +++G      E   +   EA +  LR    +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 450 EMYHALMKAHIRGGKEVNEVLNQMKSDKIDED 481
           + Y AL+        + N  +N  + D  + D
Sbjct: 489 KSYLALIDV------DENRTVNNSEIDAHETD 514

BLAST of Tan0012021 vs. TAIR 10
Match: AT4G01990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 276.6 bits (706), Expect = 4.0e-74
Identity = 154/473 (32.56%), Postives = 265/473 (56.03%), Query Frame = 0

Query: 16  AGRSYHTNRLKKATLYAKISPLGD-PSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 75
           A  S  T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK +
Sbjct: 27  AAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFR 86

Query: 76  RFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 135
           +  +ALE+ EWM++  +  F+  +HA++L+LI + +G  +AE+YFN L +  +   TYG+
Sbjct: 87  QPQRALEIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGS 146

Query: 136 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 195
           LLNCY  +++  K+ +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+
Sbjct: 147 LLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKS 206

Query: 196 VSPDNFSYRICINSYGVRRDLEGMENVLKEMESQPHIIMDWNTYAVVANFFIKADLTDKA 255
           ++P + +Y + I S G  +DL+G+E VL EM+++   I  WNT+A +A  +IK  L  KA
Sbjct: 207 ITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKA 266

Query: 256 VDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKSDTTRFINRDYITMLES 315
            +AL+  E  +    R  ++ LI+LY  + N  +V R+W+L K       N  Y+TML +
Sbjct: 267 EEALKSLENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRA 326

Query: 316 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKATT 375
           L +L +++  +KV  EWES+   YD R+ N  I  Y+ + M E AEA+    M+K K   
Sbjct: 327 LSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQF 386

Query: 376 PNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKGWKPNLRVITGILNWLGENGSIEEVE 435
             +   + +  +   + + A++  +AA+ L+ DK W  +  +I+       E   ++  E
Sbjct: 387 SKARQLLMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAE 446

Query: 436 AFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVLNQMKSDKIDEDEETKKIL 488
            F  +L    P++ E Y  LMK ++  GK   ++  +++   I  DEE + +L
Sbjct: 447 EFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of Tan0012021 vs. TAIR 10
Match: AT4G02820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 266.5 bits (680), Expect = 4.2e-71
Identity = 146/438 (33.33%), Postives = 251/438 (57.31%), Query Frame = 0

Query: 51  WVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPREHAVQLDLIGRVR 110
           W +EG  VR  EL RI+ +LRK KR+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 111 GYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDI 170
           G  SAE +F  + +Q +      +LL+ YV+ +  +K+ +  +KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 171 MCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINSYGVRRDLEGMENV-LKEMESQP 230
           + +Y + GQ +KVP ++ E+K +  SPD  +Y + + ++    D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELKIR-TSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 231 HIIMDWNTYAVVANFFIKADLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKV 290
            +  DW TY+V+ N + K D  +KA  AL++ E+ +  K+R+ +  LISL+A LG+K+ V
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRVAYASLISLHANLGDKDGV 323

Query: 291 LRLWNLDKSDTTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVG 350
              W   KS   +  + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN ++  
Sbjct: 324 NLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILAE 383

Query: 351 YIDKGMCERAEALLEDLMEKGKATTPNSWGAVAVQYMDRGETEKAVECMKAALSLNMDKG 410
           Y+++      E   E ++EKG   + ++W  +   Y+ R + EK ++C   A  ++  K 
Sbjct: 384 YMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVKK 443

Query: 411 WKPNLRVITGILNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVNEVL 470
           W  N+R++ G    L E G+++  E  +  L+    VN ++Y++L++ + + G+    V 
Sbjct: 444 WTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIVE 503

Query: 471 NQMKSDKIDEDEETKKIL 488
            +M  D ++ DEETK+++
Sbjct: 504 ERMAKDNVELDEETKELI 516

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84JR34.4e-17462.26Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
Q9SKU61.2e-7838.04Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
Q8LPS63.8e-7734.96Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX... [more]
Q93WC55.7e-7332.56Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop... [more]
Q9SY075.9e-7033.33Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_023542644.11.8e-27394.75pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
XP_022994385.16.8e-27393.98pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
KAG7012430.11.2e-27294.75Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022954890.12.6e-27294.56pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
KAG6573262.19.8e-27294.16Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A0A6J1K1243.3e-27393.98pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
A0A6J1GTN51.2e-27294.56pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
A0A6J1CGU21.2e-26791.00pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordic... [more]
A0A5A7UM452.5e-25788.18Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B5N22.5e-25788.18pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT4G21705.13.1e-17562.26Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G20710.18.4e-8038.04Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G02150.12.7e-7834.96Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G01990.14.0e-7432.56Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G02820.14.2e-7133.33Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 308..338
e-value: 3.7E-4
score: 18.5
coord: 131..160
e-value: 5.1E-5
score: 21.2
coord: 166..198
e-value: 1.8E-5
score: 22.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 308..335
e-value: 0.0024
score: 18.0
coord: 131..160
e-value: 1.3E-4
score: 22.0
coord: 344..370
e-value: 0.06
score: 13.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 165..209
e-value: 1.4E-8
score: 34.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 128..162
score: 9.613118
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..197
score: 10.095415
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 50..187
e-value: 4.0E-14
score: 54.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 405..500
e-value: 6.2E-7
score: 31.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 299..404
e-value: 6.6E-11
score: 43.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 188..298
e-value: 4.6E-13
score: 51.3
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 142..433
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 311..405
NoneNo IPR availablePANTHERPTHR45717:SF20OS07G0598500 PROTEINcoord: 17..490
NoneNo IPR availablePANTHERPTHR45717OS12G0527900 PROTEINcoord: 17..490
IPR019734Tetratricopeptide repeatPROSITEPS50005TPRcoord: 375..408
score: 8.4079

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012021.1Tan0012021.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding