Tan0011222 (gene) Snake gourd v1

Overview
Name: Tan0011222
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG04: 3932078 .. 3935391 (+)
RNA-Seq Expression: Tan0011222
Synteny: Tan0011222
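The Location field packs the linkage group, start, end, and strand into one string. As a minimal sketch, the snippet below parses that string and derives the span length. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page; `parse_location` and `span_length` are illustrative names, not part of any database API.

```python
import re


def parse_location(text):
    """Split a location such as 'LG04: 3932078 .. 3935391 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG04: 3932078 .. 3935391 (+)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under the inclusive-coordinate assumption the gene spans 3,314 bp on the forward strand of LG04.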
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
AGCCAGTCCATCACTGTCTCCCTCAGCGCTCGCAAATCAATGCCATCTACTCTGCAGAATATCTGTTGAAGGTCTTCTTCCTTAACTCCTCCAATTCTTTGCTTTCTTTCTTGCTTGCTGGCTAATGTTCAGGCCAAGCTCCCGAGTCCACTTCTTTTCTCTTTCTTCGTATTTCGAGCTCATACCTATATAAATATATTTGTACTTCTTTCTTCAGTGTTTCCATATATATATACGTACACGTATGTGTGTATGTACAGTTTATGTTTACTTTTTCCAATCGTCTGCTGAACCTGAATGCGTGTTCATGTGGTAATCGTAATGAAGGGAAGAACCCTGTTCTTTATTGAGTTTGAATCTTAAGTAGCTTATCGAGATAAGAGAATGTCAGTGAGAAAGGCTAATTTGTAGCTTTATGTTTGTGTTCGTGGAAGTAAAATTTTGGGGATGCTTCTATGGAAGTTTTTATGTGTTTCTTTTCTTGCATTGATGTGTTGGCGCAAATTGAGGTGATGATTTTTGTGTTGAAATTCCTGAATGGTACAATGACGTTCTATCTGTTTATTTAGGTAGATAGGATCATTGATTTCATGAAAGTCACGATTCATTGTCCATTTCGTGCCAAACGGAATTTGGTTTCGTACCCAAGTAAGCATGCTTTTGGTTCCCACCTTAGATACTGGCGTTCGGCTGCAGAAGGCGATATTGTGCCTTTTAGGACAGAAGATATTGATAATGACTATCTATTGGATACACACAGGATCTCCACGCACGGCCAGCTTTGGCAGGCTCTTTCACTGTTCTATTCCTCTAGACAGCCTCATTCCCGCCAGACCTATGCCCATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTACAAGAAGGCGTTGGACTGCACCGTTACATGATGTCACGGGATCCTATGGACTCATTTGATCTCTTTGTTACCAATCATCTTATTAACATGTACTGTAAATGTGGCCACTTAATCTATGCCTACCAATTATTTAATGACATGCCAAGGAGAAACCTTGTTTCGTGGACTGCACTTATCTCGGGACTTTCTCAGAATGGCCATGTCGATGAGTGCTTCCTTCTATTTTCGAGAATGTTGGTAGATCACCAGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCATTTGGTGAGCATGACGGTGAACGTGGCAAACAGGTACATGGGTTTGCCTTGAAAACGTCTTTAGATGCCTCTGTTTATGTTGCAAATGCTCTTATTACCATGTATAGCAAGAGTTTCTATAAAGGTGGTGCTTTTAATGATAATAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAATTCAATGATTGCAGGGTTTTGTTTCCAGAAACTCGGAAACCAAGCTATCCATTTATTTATACAAATGAATCGTCAAGGAATTGGGTTTGATCGTGCAACAATTTTAAGTACTTTGTCTTCCGTAAGTCTCTGCAATTGGGATGAATTTGGCCTGGGTCTGAGCTTTTGTCATGAGTTACACTGTCAAGCATTAAAAACTGCTTTCATCTCAGAAGTTGAAATAATTACTGCGTTAGTGAAAACTTATGCAGAACTAGAAGGGGACATCGCGGATAGTTATAGGCTTTTTGTTGAAGCAGGATATAATCGGGATATAGTTTTATGGACTAGCATTATGACAGCTTTTGTAGAACATGACCCTGGGAAAACCCTTTCCCTTTTTCATCAGTTCCGACAAGAAGGCTTAATTCCAGATGGACACAATTTTTCAATTGTATTAAAGGCTTGTGCTGGATTTTTGACCGAGAAGCATGCCTCAACATATCATTCACTGCTAATTAAATATATGTCTGAGAATGACATTGTCCTTAACAATGCCTTGATTCATGCTTATGCGAGGTGTGGTTCAATTACTTCCTCTGAGAAAGTATTTGATCAAATGAAACATCGAGATTTGGTTTCTTGGAACACAATG
ATGAAGGCCTATGCTGTCCATGGGCAAGCTGAGAAAGCTTTGCAGGTTTTTTCAAAGATGAATGTTCCACCTGATTCTACTACATTTGTCTCTCTCCTTTCAGCCTGTAGCCATGCTGGGCTCGTGGAAGAAGGGACCAGACTTTTCAATTCAATAGCAAATTATGGTATTGCTTGTCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGAGCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTGTGGAACCTGATTTTGTTGTTTGGAGTTCATTCCTCGGATCATGTAGAAAGCATGGTGCAACGCAATTGGCCAAATTATCATCTAATAAATTGAAAGAGTTAGATCCTAGCAATTCTCTGGCTTATGTGCAAATGTCAAATCTATATTGCTTCAGTGGTAGCTTTTATGATGCAGACTTAATTAGGACGGAAATGAAAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTGCATGAGTTTGCATCTGGAGGTCGCCGTCATCCACAGAGAGAGGTAATATGCAATGAGCTTGAAGAACTCGTTGGGAGGTTAAAGGAGATTGGTTATGTGCCTGAGACAAGCTTAGCATTGCATGACGTGGAGCAAGAGCAAAAGGAGGAGCAACTATATCATCATAGCGAGAAGCTGGCTTTGGTTTTCTCTGTAATGAATGATAGTAACTTGTGTCGCATTGGTACTCCTATAAGGATTATGAAAAACATCCGAATTTGTGTGGATTGTCATAATTTCATGAAGTTAGCTTCAACGCTACTTAAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCACAGCAGGTTTATGCTCTTGCAATGATTATTGGTAATTAATTGGCTTCAAACTTTCAAATACCTAAGGTTCACCTGCATTTACCAGTAGCTTGTAACCTCCAATATGAATAGAGACAAGTGATGATCTAGAATATTGACACTATATATCAATAAACTCTCAGTTCCTTTCACAACTGAGGACATTTTAGGAGGTGAAGATGTGGTTAGTCCAAACACATCCCAGACAAGATGGATACAGGAAAGATCAATTCCAGACTCCACGAAGTTGCTTTTACAAATTTGCAAACATGAAGCTTCTCCATTTGAGCGAGTGTAAATTACTACCAATCATTATACAACCTCCAATTCATTTGGATATCCTAATCCTTGTCTTTATATTAACTAACTAGGTTCATCCGAACTTCCTGTTTTGTCTTTCAGCAGATTTCCTCTAGTTTGGTCATAAA

mRNA sequence

AGCCAGTCCATCACTGTCTCCCTCAGCGCTCGCAAATCAATGCCATCTACTCTGCAGAATATCTGTTGAAGGTAGATAGGATCATTGATTTCATGAAAGTCACGATTCATTGTCCATTTCGTGCCAAACGGAATTTGGTTTCGTACCCAAGTAAGCATGCTTTTGGTTCCCACCTTAGATACTGGCGTTCGGCTGCAGAAGGCGATATTGTGCCTTTTAGGACAGAAGATATTGATAATGACTATCTATTGGATACACACAGGATCTCCACGCACGGCCAGCTTTGGCAGGCTCTTTCACTGTTCTATTCCTCTAGACAGCCTCATTCCCGCCAGACCTATGCCCATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTACAAGAAGGCGTTGGACTGCACCGTTACATGATGTCACGGGATCCTATGGACTCATTTGATCTCTTTGTTACCAATCATCTTATTAACATGTACTGTAAATGTGGCCACTTAATCTATGCCTACCAATTATTTAATGACATGCCAAGGAGAAACCTTGTTTCGTGGACTGCACTTATCTCGGGACTTTCTCAGAATGGCCATGTCGATGAGTGCTTCCTTCTATTTTCGAGAATGTTGGTAGATCACCAGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCATTTGGTGAGCATGACGGTGAACGTGGCAAACAGGTACATGGGTTTGCCTTGAAAACGTCTTTAGATGCCTCTGTTTATGTTGCAAATGCTCTTATTACCATGTATAGCAAGAGTTTCTATAAAGGTGGTGCTTTTAATGATAATAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAATTCAATGATTGCAGGGTTTTGTTTCCAGAAACTCGGAAACCAAGCTATCCATTTATTTATACAAATGAATCGTCAAGGAATTGGGTTTGATCGTGCAACAATTTTAAGTACTTTGTCTTCCGTAAGTCTCTGCAATTGGGATGAATTTGGCCTGGGTCTGAGCTTTTGTCATGAGTTACACTGTCAAGCATTAAAAACTGCTTTCATCTCAGAAGTTGAAATAATTACTGCGTTAGTGAAAACTTATGCAGAACTAGAAGGGGACATCGCGGATAGTTATAGGCTTTTTGTTGAAGCAGGATATAATCGGGATATAGTTTTATGGACTAGCATTATGACAGCTTTTGTAGAACATGACCCTGGGAAAACCCTTTCCCTTTTTCATCAGTTCCGACAAGAAGGCTTAATTCCAGATGGACACAATTTTTCAATTGTATTAAAGGCTTGTGCTGGATTTTTGACCGAGAAGCATGCCTCAACATATCATTCACTGCTAATTAAATATATGTCTGAGAATGACATTGTCCTTAACAATGCCTTGATTCATGCTTATGCGAGGTGTGGTTCAATTACTTCCTCTGAGAAAGTATTTGATCAAATGAAACATCGAGATTTGGTTTCTTGGAACACAATGATGAAGGCCTATGCTGTCCATGGGCAAGCTGAGAAAGCTTTGCAGGTTTTTTCAAAGATGAATGTTCCACCTGATTCTACTACATTTGTCTCTCTCCTTTCAGCCTGTAGCCATGCTGGGCTCGTGGAAGAAGGGACCAGACTTTTCAATTCAATAGCAAATTATGGTATTGCTTGTCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGAGCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTGTGGAACCTGATTTTGTTGTTTGGAGTTCATTCCTCGGATCATGTAGAAAGCATGGTGCAACGCAATTGGCCAAATTATCATCTAATAAATTGAAAGAGTTAGATCCTAGCAATTCTCTGGCTTATGTGCAAATGTCAAATCTATATTGCTTCAGTGGTAGCTTTTATGATGCAGACTTAATTAGGACGGAAATGAAAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGG
GTAGAAATAGAAAATCAAGTGCATGAGTTTGCATCTGGAGGTCGCCGTCATCCACAGAGAGAGGTAATATGCAATGAGCTTGAAGAACTCGTTGGGAGGTTAAAGGAGATTGGTTATGTGCCTGAGACAAGCTTAGCATTGCATGACGTGGAGCAAGAGCAAAAGGAGGAGCAACTATATCATCATAGCGAGAAGCTGGCTTTGGTTTTCTCTGTAATGAATGATAGTAACTTGTGTCGCATTGGTACTCCTATAAGGATTATGAAAAACATCCGAATTTGTGTGGATTGTCATAATTTCATGAAGTTAGCTTCAACGCTACTTAAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCACAGCAGGTTTATGCTCTTGCAATGATTATTGGTAATTAATTGGCTTCAAACTTTCAAATACCTAAGGTTCACCTGCATTTACCAGTAGCTTGTAACCTCCAATATGAATAGAGACAAGTGATGATCTAGAATATTGACACTATATATCAATAAACTCTCAGTTCCTTTCACAACTGAGGACATTTTAGGAGGTGAAGATGTGGTTAGTCCAAACACATCCCAGACAAGATGGATACAGGAAAGATCAATTCCAGACTCCACGAAGTTGCTTTTACAAATTTGCAAACATGAAGCTTCTCCATTTGAGCGAGTGTAAATTACTACCAATCATTATACAACCTCCAATTCATTTGGATATCCTAATCCTTGTCTTTATATTAACTAACTAGGTTCATCCGAACTTCCTGTTTTGTCTTTCAGCAGATTTCCTCTAGTTTGGTCATAAA

Coding sequence (CDS)

ATGAAAGTCACGATTCATTGTCCATTTCGTGCCAAACGGAATTTGGTTTCGTACCCAAGTAAGCATGCTTTTGGTTCCCACCTTAGATACTGGCGTTCGGCTGCAGAAGGCGATATTGTGCCTTTTAGGACAGAAGATATTGATAATGACTATCTATTGGATACACACAGGATCTCCACGCACGGCCAGCTTTGGCAGGCTCTTTCACTGTTCTATTCCTCTAGACAGCCTCATTCCCGCCAGACCTATGCCCATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTACAAGAAGGCGTTGGACTGCACCGTTACATGATGTCACGGGATCCTATGGACTCATTTGATCTCTTTGTTACCAATCATCTTATTAACATGTACTGTAAATGTGGCCACTTAATCTATGCCTACCAATTATTTAATGACATGCCAAGGAGAAACCTTGTTTCGTGGACTGCACTTATCTCGGGACTTTCTCAGAATGGCCATGTCGATGAGTGCTTCCTTCTATTTTCGAGAATGTTGGTAGATCACCAGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCATTTGGTGAGCATGACGGTGAACGTGGCAAACAGGTACATGGGTTTGCCTTGAAAACGTCTTTAGATGCCTCTGTTTATGTTGCAAATGCTCTTATTACCATGTATAGCAAGAGTTTCTATAAAGGTGGTGCTTTTAATGATAATAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAATTCAATGATTGCAGGGTTTTGTTTCCAGAAACTCGGAAACCAAGCTATCCATTTATTTATACAAATGAATCGTCAAGGAATTGGGTTTGATCGTGCAACAATTTTAAGTACTTTGTCTTCCGTAAGTCTCTGCAATTGGGATGAATTTGGCCTGGGTCTGAGCTTTTGTCATGAGTTACACTGTCAAGCATTAAAAACTGCTTTCATCTCAGAAGTTGAAATAATTACTGCGTTAGTGAAAACTTATGCAGAACTAGAAGGGGACATCGCGGATAGTTATAGGCTTTTTGTTGAAGCAGGATATAATCGGGATATAGTTTTATGGACTAGCATTATGACAGCTTTTGTAGAACATGACCCTGGGAAAACCCTTTCCCTTTTTCATCAGTTCCGACAAGAAGGCTTAATTCCAGATGGACACAATTTTTCAATTGTATTAAAGGCTTGTGCTGGATTTTTGACCGAGAAGCATGCCTCAACATATCATTCACTGCTAATTAAATATATGTCTGAGAATGACATTGTCCTTAACAATGCCTTGATTCATGCTTATGCGAGGTGTGGTTCAATTACTTCCTCTGAGAAAGTATTTGATCAAATGAAACATCGAGATTTGGTTTCTTGGAACACAATGATGAAGGCCTATGCTGTCCATGGGCAAGCTGAGAAAGCTTTGCAGGTTTTTTCAAAGATGAATGTTCCACCTGATTCTACTACATTTGTCTCTCTCCTTTCAGCCTGTAGCCATGCTGGGCTCGTGGAAGAAGGGACCAGACTTTTCAATTCAATAGCAAATTATGGTATTGCTTGTCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGAGCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTGTGGAACCTGATTTTGTTGTTTGGAGTTCATTCCTCGGATCATGTAGAAAGCATGGTGCAACGCAATTGGCCAAATTATCATCTAATAAATTGAAAGAGTTAGATCCTAGCAATTCTCTGGCTTATGTGCAAATGTCAAATCTATATTGCTTCAGTGGTAGCTTTTATGATGCAGACTTAATTAGGACGGAAATGAAAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTGCATGAGTTTGCATCTGGAGGTCGCCGTCATCCACAGAGAGAGGTAATATGCAATGAGCTTGAAGAACTCGT
TGGGAGGTTAAAGGAGATTGGTTATGTGCCTGAGACAAGCTTAGCATTGCATGACGTGGAGCAAGAGCAAAAGGAGGAGCAACTATATCATCATAGCGAGAAGCTGGCTTTGGTTTTCTCTGTAATGAATGATAGTAACTTGTGTCGCATTGGTACTCCTATAAGGATTATGAAAAACATCCGAATTTGTGTGGATTGTCATAATTTCATGAAGTTAGCTTCAACGCTACTTAAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCACAGCAGGTTTATGCTCTTGCAATGATTATTGGTAA

Protein sequence

MKVTIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
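
The protein sequence is the conceptual translation of the coding sequence. The sketch below shows that relationship with the standard genetic code, built from the usual compact TCAG-ordered table. The `translate` function is a hypothetical helper written for illustration, and the two inputs are short fragments copied from the start and end of the CDS above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAAGTCACGATTCATTGTCCATTTCGT"  # first 30 nt of the CDS
cds_end = "TGCAATGATTATTGGTAA"                # last 18 nt, including the stop codon
print(translate(cds_start))  # MKVTIHCPFR
print(translate(cds_end))    # CNDYW
```

The two outputs match the first ten and last five residues of the protein sequence listed above.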
Homology
BLAST of Tan0011222 vs. ExPASy Swiss-Prot
Match: Q9C9H9 (Pentatricopeptide repeat-containing protein At1g71420 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H70 PE=2 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 4.2e-208
Identity = 375/715 (52.45%), Positives = 499/715 (69.79%), Query Frame = 0

Query: 62  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVT 121
           G + +A+SLFYS+  +  S+Q YA LF ACA  R L +G+ LH +M+S     S ++ + 
Sbjct: 40  GDIRRAVSLFYSAPVELQSQQAYAALFQACAEQRNLLDGINLHHHMLSHPYCYSQNVILA 99

Query: 122 NHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQP 181
           N LINMY KCG+++YA Q+F+ MP RN+VSWTALI+G  Q G+  E F LFS ML    P
Sbjct: 100 NFLINMYAKCGNILYARQVFDTMPERNVVSWTALITGYVQAGNEQEGFCLFSSMLSHCFP 159

Query: 182 NEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNK 241
           NEFT++S+LTS      E GKQVHG ALK  L  S+YVANA+I+MY +      A+    
Sbjct: 160 NEFTLSSVLTSCRY---EPGKQVHGLALKLGLHCSIYVANAVISMYGRCHDGAAAY---- 219

Query: 242 DDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSV 301
            +AWT+F++I+  +L+TWNSMIA F    LG +AI +F++M+  G+GFDRAT+L+  SS+
Sbjct: 220 -EAWTVFEAIKFKNLVTWNSMIAAFQCCNLGKKAIGVFMRMHSDGVGFDRATLLNICSSL 279

Query: 302 SLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY 361
              +          C +LH   +K+  +++ E+ TAL+K Y+E+  D  D Y+LF+E  +
Sbjct: 280 YKSSDLVPNEVSKCCLQLHSLTVKSGLVTQTEVATALIKVYSEMLEDYTDCYKLFMEMSH 339

Query: 362 NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTY 421
            RDIV W  I+TAF  +DP + + LF Q RQE L PD + FS VLKACAG +T +HA + 
Sbjct: 340 CRDIVAWNGIITAFAVYDPERAIHLFGQLRQEKLSPDWYTFSSVLKACAGLVTARHALSI 399

Query: 422 HSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQA 481
           H+ +IK     D VLNN+LIHAYA+CGS+    +VFD M  RD+VSWN+M+KAY++HGQ 
Sbjct: 400 HAQVIKGGFLADTVLNNSLIHAYAKCGSLDLCMRVFDDMDSRDVVSWNSMLKAYSLHGQV 459

Query: 482 EKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVD 541
           +  L VF KM++ PDS TF++LLSACSHAG VEEG R+F S+        QL+HYAC++D
Sbjct: 460 DSILPVFQKMDINPDSATFIALLSACSHAGRVEEGLRIFRSMFEKPETLPQLNHYACVID 519

Query: 542 ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSL 601
           +L RA R  EAE+ I +MP++PD VVW + LGSCRKHG T+L KL+++KLKEL +P+NS+
Sbjct: 520 MLSRAERFAEAEEVIKQMPMDPDAVVWIALLGSCRKHGNTRLGKLAADKLKELVEPTNSM 579

Query: 602 AYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV 661
           +Y+QMSN+Y   GSF +A+L   EM+  RVRKEP LSW EI N+VHEFASGGR  P +E 
Sbjct: 580 SYIQMSNIYNAEGSFNEANLSIKEMETWRVRKEPDLSWTEIGNKVHEFASGGRHRPDKEA 639

Query: 662 ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLC 721
           +  EL+ L+  LKE+GYVPE   A  D+E +EQ+E+ L HHSEKLAL F+VM    S+ C
Sbjct: 640 VYRELKRLISWLKEMGYVPEMRSASQDIEDEEQEEDNLLHHSEKLALAFAVMEGRKSSDC 699

Query: 722 RIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
            +   I+IMKN RIC+DCHNFMKLAS LL KEI++RDSNRFHHF    CSCNDYW
Sbjct: 700 GVNL-IQIMKNTRICIDCHNFMKLASKLLGKEILMRDSNRFHHFKDSSCSCNDYW 745
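
Each HSP summary line reports identities and positives as counts over the alignment length, followed by a percentage. As a rough sketch, the snippet below extracts those numbers from a summary line and recomputes the percentages. The regular expression accepts both "Positives" and the "Postives" spelling that some BLAST viewers emit; `parse_hsp_stats` and `percent` are illustrative names, not functions of any BLAST toolkit.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as displayed in the report."""
    return round(100.0 * count / total, 2)


stats = parse_hsp_stats(
    "Identity = 375/715 (52.45%), Positives = 499/715 (69.79%), Query Frame = 0"
)
print(percent(stats["identities"], stats["alignment_length"]))  # 52.45
print(percent(stats["positives"], stats["alignment_length"]))   # 69.79
```

Recomputing the percentages from the counts is a cheap consistency check when these lines are scraped in bulk.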

BLAST of Tan0011222 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 458.0 bits (1177), Expect = 2.1e-127
Identity = 253/688 (36.77%), Positives = 406/688 (59.01%), Query Frame = 0

Query: 96  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALI 155
           L++G  +H ++++   +D F + + N L+NMY KCG +  A ++F  M  ++ VSW ++I
Sbjct: 329 LKKGREVHGHVITTGLVD-FMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMI 388

Query: 156 SGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGEHD-GERGKQVHGFALKTSLD 215
           +GL QNG   E    +  M   D  P  FT+ S L+S       + G+Q+HG +LK  +D
Sbjct: 389 TGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGID 448

Query: 216 ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGN 275
            +V V+NAL+T+Y+++ Y         ++   +F S+     ++WNS+I      ++   
Sbjct: 449 LNVSVSNALMTLYAETGY--------LNECRKIFSSMPEHDQVSWNSIIGALARSERSLP 508

Query: 276 QAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVE 335
           +A+  F+   R G   +R T  S LS+VS  ++ E G       ++H  ALK     E  
Sbjct: 509 EAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELG------KQIHGLALKNNIADEAT 568

Query: 336 IITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ 395
              AL+  Y +  G++    ++F      RD V W S+++ ++ ++   K L L     Q
Sbjct: 569 TENALIACYGKC-GEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQ 628

Query: 396 EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITS 455
            G   D   ++ VL A A   T +     H+  ++   E+D+V+ +AL+  Y++CG +  
Sbjct: 629 TGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDY 688

Query: 456 SEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNV----PPDSTTFVSLLSACS 515
           + + F+ M  R+  SWN+M+  YA HGQ E+AL++F  M +    PPD  TFV +LSACS
Sbjct: 689 ALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACS 748

Query: 516 HAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVW 575
           HAGL+EEG + F S++ +YG+A +++H++CM D+LGRAG + + EDFI KMP++P+ ++W
Sbjct: 749 HAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIW 808

Query: 576 SSFLGS-CRKHG-ATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMK 635
            + LG+ CR +G   +L K ++  L +L+P N++ YV + N+Y   G + D    R +MK
Sbjct: 809 RTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMK 868

Query: 636 GSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH 695
            + V+KE G SWV +++ VH F +G + HP  +VI  +L+EL  ++++ GYVP+T  AL+
Sbjct: 869 DADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALY 928

Query: 696 DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLAST 755
           D+EQE KEE L +HSEKLA+ F  +    S L     PIRIMKN+R+C DCH+  K  S 
Sbjct: 929 DLEQENKEEILSYHSEKLAVAFVLAAQRSSTL-----PIRIMKNLRVCGDCHSAFKYISK 988

Query: 756 LLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           +  ++I++RDSNRFHHF  G CSC+D+W
Sbjct: 989 IEGRQIILRDSNRFHHFQDGACSCSDFW 995
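
The identity count in an HSP is the number of alignment columns where query and subject carry the same residue. A minimal sketch of that count for a single aligned block follows. It assumes both rows are already aligned to equal length with `-` marking gaps; `count_identities` is a hypothetical helper, and the example rows are copied from the final block of the alignment above.

```python
def count_identities(query_row, subject_row, gap="-"):
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1
        for q, s in zip(query_row, subject_row)
        if q == s and q != gap  # a gap never counts as an identity
    )
    return identical, len(query_row)


query_row = "LLKKEIVIRDSNRFHHFTAGLCSCNDYW"
subject_row = "IEGRQIILRDSNRFHHFQDGACSCSDFW"
identical, columns = count_identities(query_row, subject_row)
print(f"{identical}/{columns} identical columns")
```

Summing these per-block counts over every block of an HSP should reproduce the numerator and denominator of its Identity field. Counting positives would additionally require the substitution matrix used for the search, which this page does not specify.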

BLAST of Tan0011222 vs. ExPASy Swiss-Prot
Match: Q0WSH6 (Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX=3702 GN=LOI1 PE=1 SV=1)

HSP 1 Score: 434.5 bits (1116), Expect = 2.5e-120
Identity = 247/668 (36.98%), Positives = 375/668 (56.14%), Query Frame = 0

Query: 118 FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVD 177
           F+ N+LINMY K  H   A  +    P RN+VSWT+LISGL+QNGH     + F  M  +
Sbjct: 43  FLANYLINMYSKLDHPESARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRRE 102

Query: 178 H-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKS 237
              PN+FT       VASL           GKQ+H  A+K      V+V  +   MY K+
Sbjct: 103 GVVPNDFTFPCAFKAVASLRLPV------TGKQIHALAVKCGRILDVFVGCSAFDMYCKT 162

Query: 238 FYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFD 297
                     +DDA  +F  I   +L TWN+ I+         +AI  FI+  R     +
Sbjct: 163 RL--------RDDARKLFDEIPERNLETWNAFISNSVTDGRPREAIEAFIEFRRIDGHPN 222

Query: 298 RATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIA 357
             T  + L++ S  +W    LG+    +LH   L++ F ++V +   L+  Y + +  I 
Sbjct: 223 SITFCAFLNACS--DWLHLNLGM----QLHGLVLRSGFDTDVSVCNGLIDFYGKCK-QIR 282

Query: 358 DSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC 417
            S  +F E G  ++ V W S++ A+V+ H+  K   L+ + R++ +       S VL AC
Sbjct: 283 SSEIIFTEMG-TKNAVSWCSLVAAYVQNHEDEKASVLYLRSRKDIVETSDFMISSVLSAC 342

Query: 418 AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWN 477
           AG    +   + H+  +K   E  I + +AL+  Y +CG I  SE+ FD+M  ++LV+ N
Sbjct: 343 AGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGCIEDSEQAFDEMPEKNLVTRN 402

Query: 478 TMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSI- 537
           +++  YA  GQ + AL +F +M        P+  TFVSLLSACS AG VE G ++F+S+ 
Sbjct: 403 SLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENGMKIFDSMR 462

Query: 538 ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLA 597
           + YGI    +HY+C+VD+LGRAG ++ A +FI KMP++P   VW +   +CR HG  QL 
Sbjct: 463 STYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPTISVWGALQNACRMHGKPQLG 522

Query: 598 KLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQ 657
            L++  L +LDP +S  +V +SN +  +G + +A+ +R E+KG  ++K  G SW+ ++NQ
Sbjct: 523 LLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREELKGVGIKKGAGYSWITVKNQ 582

Query: 658 VHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL 717
           VH F +  R H   + I   L +L   ++  GY P+  L+L+D+E+E+K  ++ HHSEKL
Sbjct: 583 VHAFQAKDRSHILNKEIQTTLAKLRNEMEAAGYKPDLKLSLYDLEEEEKAAEVSHHSEKL 642

Query: 718 ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAG 771
           AL F +++      +  PIRI KN+RIC DCH+F K  S  +K+EI++RD+NRFH F  G
Sbjct: 643 ALAFGLLS----LPLSVPIRITKNLRICGDCHSFFKFVSGSVKREIIVRDNNRFHRFKDG 684

BLAST of Tan0011222 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 430.6 bits (1106), Expect = 3.6e-119
Identity = 254/731 (34.75%), Positives = 399/731 (54.58%), Query Frame = 0

Query: 99  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGL 158
           G  LH   +  D M     F  N +++ Y K G +    + F+ +P+R+ VSWT +I G 
Sbjct: 63  GYALHARKLF-DEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGY 122

Query: 159 SQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHDGERGKQVHGFALKTSLDASV 218
              G   +   +   M+ +  +P +FT+ ++L S       E GK+VH F +K  L  +V
Sbjct: 123 KNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNV 182

Query: 219 YVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENP 278
            V+N+L+ MY+K          +      D                 D A   F+ +   
Sbjct: 183 SVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAER 242

Query: 279 SLITWNSMIAGFCFQKLGNQAIHLFIQMNRQG-IGFDRATILSTLSSVS----LC----- 338
            ++TWNSMI+GF  +    +A+ +F +M R   +  DR T+ S LS+ +    LC     
Sbjct: 243 DIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQI 302

Query: 339 -------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE 398
                   +D  G+ L+    ++  C  ++TA   I +       +E  TAL+  Y +L 
Sbjct: 303 HSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKL- 362

Query: 399 GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIV 458
           GD+  +  +FV    +RD+V WT+++  + +H   G+ ++LF      G  P+ +  + +
Sbjct: 363 GDMNQAKNIFVSL-KDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAM 422

Query: 459 LKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK-HRD 518
           L   +   +  H    H   +K      + ++NALI  YA+ G+ITS+ + FD ++  RD
Sbjct: 423 LSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERD 482

Query: 519 LVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFN 578
            VSW +M+ A A HG AE+AL++F  M    + PD  T+V + SAC+HAGLV +G + F+
Sbjct: 483 TVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFD 542

Query: 579 SIANYG-IACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGAT 638
            + +   I   L HYACMVD+ GRAG +QEA++FI KMP+EPD V W S L +CR H   
Sbjct: 543 MMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNI 602

Query: 639 QLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI 698
            L K+++ +L  L+P NS AY  ++NLY   G + +A  IR  MK  RV+KE G SW+E+
Sbjct: 603 DLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEV 662

Query: 699 ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHS 758
           +++VH F      HP++  I   ++++   +K++GYVP+T+  LHD+E+E KE+ L HHS
Sbjct: 663 KHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHS 722

Query: 759 EKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHF 771
           EKLA+ F +++  +     T +RIMKN+R+C DCH  +K  S L+ +EI++RD+ RFHHF
Sbjct: 723 EKLAIAFGLISTPD----KTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHF 782

BLAST of Tan0011222 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 3.1e-118
Identity = 227/662 (34.29%), Positives = 386/662 (58.31%), Query Frame = 0

Query: 117 LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML- 176
           L V+N LINMYCK     +A  +F++M  R+L+SW ++I+G++QNG   E   LF ++L 
Sbjct: 350 LTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLR 409

Query: 177 VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKG 236
              +P+++T+ S+L +           KQVH  A+K +  +  +V+ ALI  YS+     
Sbjct: 410 CGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR----- 469

Query: 237 GAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATI 296
              N    +A  +F+   N  L+ WN+M+AG+     G++ + LF  M++QG   D  T+
Sbjct: 470 ---NRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTL 529

Query: 297 LSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYR 356
            +   +        F   ++   ++H  A+K+ +  ++ + + ++  Y +  GD++ +  
Sbjct: 530 ATVFKTCG------FLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKC-GDMSAAQF 589

Query: 357 LFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL 416
            F       D V WT++++  +E+ +  +   +F Q R  G++PD    + + KA +   
Sbjct: 590 AFDSIPVPDD-VAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLT 649

Query: 417 TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMK 476
             +     H+  +K    ND  +  +L+  YA+CGSI  +  +F +++  ++ +WN M+ 
Sbjct: 650 ALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLV 709

Query: 477 AYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSI-ANYGIA 536
             A HG+ ++ LQ+F +M    + PD  TF+ +LSACSH+GLV E  +   S+  +YGI 
Sbjct: 710 GLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIK 769

Query: 537 CQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNK 596
            +++HY+C+ D LGRAG +++AE+ I  M +E    ++ + L +CR  G T+  K  + K
Sbjct: 770 PEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATK 829

Query: 597 LKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFAS 656
           L EL+P +S AYV +SN+Y  +  + +  L RT MKG +V+K+PG SW+E++N++H F  
Sbjct: 830 LLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVV 889

Query: 657 GGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV 716
             R + Q E+I  ++++++  +K+ GYVPET   L DVE+E+KE  LY+HSEKLA+ F +
Sbjct: 890 DDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGL 949

Query: 717 MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCND 771
           ++        TPIR++KN+R+C DCHN MK  + +  +EIV+RD+NRFH F  G+CSC D
Sbjct: 950 LSTPP----STPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGD 990

BLAST of Tan0011222 vs. NCBI nr
Match: XP_022139075.1 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Momordica charantia])

HSP 1 Score: 1389.4 bits (3595), Expect = 0.0e+00
Identity = 671/767 (87.48%), Positives = 711/767 (92.70%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G 
Sbjct: 49  TIHYPFLAKRNLVLYPSKWAFPIHLRYWRSAAESDFVPSRTEDIDNDYLWDTRVISTRGH 108

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLHRYMMSRD MDSFDLFVTNHL
Sbjct: 109 LRHALSLFYSFRQPHSRQTYAYLFHACARLRCLHEGMGLHRYMMSRDLMDSFDLFVTNHL 168

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEF
Sbjct: 169 INMYCKCGHLDYAWQLFDEMPRRNLVSWTVLISGLSQYGHVDECFLLFPRMLVDCRPNEF 228

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFGEHDGERG+QVHGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDA
Sbjct: 229 TVASLLTSFGEHDGERGRQVHGFALKTSLDAFVYVANALITMYSKSFCKGGIFNDSNDDA 288

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Sbjct: 289 WTMFKSIENPSLITWNSMIAGFCFRKLGNQAIYLFMKMNREGIGFDRATLLSTLSSLNLC 348

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ D
Sbjct: 349 NRDEFGLGLSFCHELHCLAFKTAFISEIEVATALVKTYADLGGDIADSYRLFVEAGYHWD 408

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSIVLKACAG+LTEKHASTYHSL
Sbjct: 409 IVLWTSIMTALVEHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGYLTEKHASTYHSL 468

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ A
Sbjct: 469 LIKSMSEDDIVLNNALIHAYGRCGSITLSKKVFKEMKYRDLVSWNTMMKAYAIHGQAKNA 528

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           L +FSKM+VPPDSTTFVSLLSACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR
Sbjct: 529 LHLFSKMDVPPDSTTFVSLLSACSHAGLVEEGTSLFNSIKYYGIVCQLDHYACMVDILGR 588

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
            GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Sbjct: 589 VGRVQEAEYFISKMPIEPDFVVWSSFLGSCRKHGATQLAKLASNKLKELDPSNSLAYVQM 648

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNEL
Sbjct: 649 SNLYCFSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREEICNEL 708

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLALVFS+MNDSNLC +GT +RI
Sbjct: 709 EELIGRLKQLGYVPETSIALHDVEQEQKEEQLYHHSEKLALVFSIMNDSNLCHVGTLVRI 768

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Sbjct: 769 MKNIRICVDCHNFMKLASRLLKKEIVIRDSNRFHHFTTGLCSCNDYW 815

BLAST of Tan0011222 vs. NCBI nr
Match: XP_022139076.1 (pentatricopeptide repeat-containing protein At1g71420 isoform X2 [Momordica charantia])

HSP 1 Score: 1389.4 bits (3595), Expect = 0.0e+00
Identity = 671/767 (87.48%), Positives = 711/767 (92.70%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G 
Sbjct: 5   TIHYPFLAKRNLVLYPSKWAFPIHLRYWRSAAESDFVPSRTEDIDNDYLWDTRVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLHRYMMSRD MDSFDLFVTNHL
Sbjct: 65  LRHALSLFYSFRQPHSRQTYAYLFHACARLRCLHEGMGLHRYMMSRDLMDSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEF
Sbjct: 125 INMYCKCGHLDYAWQLFDEMPRRNLVSWTVLISGLSQYGHVDECFLLFPRMLVDCRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFGEHDGERG+QVHGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDA
Sbjct: 185 TVASLLTSFGEHDGERGRQVHGFALKTSLDAFVYVANALITMYSKSFCKGGIFNDSNDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Sbjct: 245 WTMFKSIENPSLITWNSMIAGFCFRKLGNQAIYLFMKMNREGIGFDRATLLSTLSSLNLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ D
Sbjct: 305 NRDEFGLGLSFCHELHCLAFKTAFISEIEVATALVKTYADLGGDIADSYRLFVEAGYHWD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSIVLKACAG+LTEKHASTYHSL
Sbjct: 365 IVLWTSIMTALVEHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGYLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ A
Sbjct: 425 LIKSMSEDDIVLNNALIHAYGRCGSITLSKKVFKEMKYRDLVSWNTMMKAYAIHGQAKNA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           L +FSKM+VPPDSTTFVSLLSACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR
Sbjct: 485 LHLFSKMDVPPDSTTFVSLLSACSHAGLVEEGTSLFNSIKYYGIVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
            GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Sbjct: 545 VGRVQEAEYFISKMPIEPDFVVWSSFLGSCRKHGATQLAKLASNKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNEL
Sbjct: 605 SNLYCFSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREEICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLALVFS+MNDSNLC +GT +RI
Sbjct: 665 EELIGRLKQLGYVPETSIALHDVEQEQKEEQLYHHSEKLALVFSIMNDSNLCHVGTLVRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLKKEIVIRDSNRFHHFTTGLCSCNDYW 771

BLAST of Tan0011222 vs. NCBI nr
Match: XP_023511808.1 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1388.2 bits (3592), Expect = 0.0e+00
Identity = 669/767 (87.22%), Positives = 712/767 (92.83%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH  F AKRNLV YPSK+AFGS LR+WRS AEGDIV FRTED  +DYL  ++ IST G 
Sbjct: 5   TIHFRFLAKRNLVLYPSKYAFGSQLRFWRSGAEGDIVSFRTEDFRHDYLFGSNVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L QALSLFY SRQPHS QTYA+LFHACARLRCL+EGV LHRYMMS DPM SFDLFVTNHL
Sbjct: 65  LGQALSLFY-SRQPHSLQTYAYLFHACARLRCLREGVELHRYMMSLDPMGSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEF
Sbjct: 125 INMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFG+HDGERG+QVHGFALK SLDA VYVANALITMYSK+++KGGAFND KDDA
Sbjct: 185 TVASLLTSFGDHDGERGRQVHGFALKRSLDAFVYVANALITMYSKNYFKGGAFNDGKDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+K GN+A+HLF+QMN +GIGFDRAT+LSTLSS+SLC
Sbjct: 245 WTMFKSIENPSLITWNSMIAGFCFRKHGNRAVHLFMQMNHKGIGFDRATLLSTLSSISLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           NWDE  LGL FC ELHCQALKTAF SEVEIITALVKTYAEL GDI DSYRLF+EAGYNRD
Sbjct: 305 NWDELDLGLGFCRELHCQALKTAFTSEVEIITALVKTYAELGGDITDSYRLFIEAGYNRD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSIVLKACAGFLTEKHASTYHSL
Sbjct: 365 IVLWTSIMTAFVDHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE A
Sbjct: 425 LIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           LQ+FSKM VPPDSTTFVSLLSACSHAGLVEEGT LFNSIANYG+ CQLDHYACMVDILGR
Sbjct: 485 LQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
           +GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Sbjct: 545 SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNEL
Sbjct: 605 SNLYCLSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRHHPEREVICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLALVFSVMND+NL  + TPIRI
Sbjct: 665 EELIGRLKEIGYVPETSLAIHDVEQEQKEEQLYHHSEKLALVFSVMNDNNLGCVSTPIRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMAGLCSCNDYW 770
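
The percentages in each HSP header are the reported counts divided by the alignment length. As a quick consistency check, the sketch below parses one header line and recomputes both values. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST API.

```python
import re

# Matches e.g. "Identity = 669/767 (87.22%), Positives = 712/767 (92.83%)".
# The second label is accepted as "Positives" or as the "Postives"
# spelling that some result pages print.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def hsp_percentages(line):
    """Recompute (identity %, positives %) from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))


line = "Identity = 669/767 (87.22%), Positives = 712/767 (92.83%), Query Frame = 0"
print(hsp_percentages(line))  # (87.22, 92.83)
```

A recomputed value that disagrees with the printed percentage would suggest the header was damaged during extraction.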

BLAST of Tan0011222 vs. NCBI nr
Match: XP_022957425.1 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 668/767 (87.09%), Positives = 710/767 (92.57%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH  F AKRNLV YPSK+ FGS LR+WRS AEGDIV FRTED  +DYL  +  IST G 
Sbjct: 5   TIHFRFLAKRNLVLYPSKYGFGSQLRFWRSGAEGDIVSFRTEDFRHDYLFGSPVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LHRYMMS DPM SFDLFVTNHL
Sbjct: 65  LEQALSLFY-SRQPHSFQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEF
Sbjct: 125 INMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFG+HDGERG+Q+HGFALK SLDA VYVANALITMYSKS+ KGGAFND+KDDA
Sbjct: 185 TVASLLTSFGDHDGERGRQIHGFALKRSLDAFVYVANALITMYSKSYSKGGAFNDSKDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENP LITWNSMIAGFCF+K GN A+HLF+QMNRQGIGFDRAT+LSTLSS+SLC
Sbjct: 245 WTMFKSIENPGLITWNSMIAGFCFRKHGNCAVHLFMQMNRQGIGFDRATLLSTLSSLSLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           NWDE  LGL FC ELHCQALKTAF SEVEIITAL+KTYAEL GDIADSYRLF+EAGYNRD
Sbjct: 305 NWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSYRLFIEAGYNRD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSIVLKACAGFLTEKHASTYHSL
Sbjct: 365 IVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE A
Sbjct: 425 LIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           LQ+FSKM VPPDSTTFVSLLSACSHAGLVEEGT+LFNSIANYG+ CQLDHYACMVDILGR
Sbjct: 485 LQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTKLFNSIANYGLVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
           +GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Sbjct: 545 SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNEL
Sbjct: 605 SNLYCLSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRHHPEREVICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLALVFSVMND+NL  + TPIRI
Sbjct: 665 EELIGRLKEIGYVPETSLAIHDVEQEQKEEQLYHHSEKLALVFSVMNDNNLGCVSTPIRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMAGLCSCNDYW 770

BLAST of Tan0011222 vs. NCBI nr
Match: XP_038892212.1 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Benincasa hispida])

HSP 1 Score: 1376.7 bits (3562), Expect = 0.0e+00
Identity = 670/768 (87.24%), Positives = 709/768 (92.32%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TI+CPF AKRNLVSYPSKHAFG   R WRSAAEGDIV  RTEDIDNDYLL++  IST G 
Sbjct: 5   TIYCPFLAKRNLVSYPSKHAFGLQFRCWRSAAEGDIV-HRTEDIDNDYLLESRPISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSR-DPMDSFDLFVTNH 123
           L QALSLFYSSRQPHS QTYA+LFHACARLRCLQEG+GLHRYMMSR DPM++FDLFVTNH
Sbjct: 65  LRQALSLFYSSRQPHSHQTYANLFHACARLRCLQEGMGLHRYMMSRDDPMNTFDLFVTNH 124

Query: 124 LINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNE 183
           LINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ G VDECFL+FSRMLVDH+PNE
Sbjct: 125 LINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGLVDECFLIFSRMLVDHRPNE 184

Query: 184 FTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDD 243
           FTVASLLTSFGEHDGERG+Q+HGF LK SLD  VYVANALI MYSKS+ K GA+ND+KDD
Sbjct: 185 FTVASLLTSFGEHDGERGRQIHGFVLKRSLDVFVYVANALIAMYSKSYSKDGAYNDSKDD 244

Query: 244 AWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSL 303
           AWTMFKSIE P+LITWNSMIAGFCF+KLG+QAI+LF+QMN QGIGFDRAT+LSTLSS SL
Sbjct: 245 AWTMFKSIEKPNLITWNSMIAGFCFRKLGHQAIYLFMQMNHQGIGFDRATLLSTLSSTSL 304

Query: 304 CNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNR 363
           CNWDEFG GL FCH++HCQALKTAFISEVEIITALVKT AEL GDIADSYRLFVE GYNR
Sbjct: 305 CNWDEFGDGLGFCHQIHCQALKTAFISEVEIITALVKTNAELGGDIADSYRLFVEGGYNR 364

Query: 364 DIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHS 423
           DIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSIVLKACAGFLTEKHASTYHS
Sbjct: 365 DIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHS 424

Query: 424 LLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEK 483
           LLIK MSE+D VLNNALIHAY RCGSI+SS+KVFDQMKH DLVSWNTMMKAYAVHGQAE 
Sbjct: 425 LLIKSMSEDDTVLNNALIHAYGRCGSISSSKKVFDQMKHHDLVSWNTMMKAYAVHGQAEI 484

Query: 484 ALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILG 543
           ALQ+F+ MNVPPD+TTFVSLLSACSHAGLVEEG  LFNSI +YGI CQLDHYACMVDILG
Sbjct: 485 ALQLFTNMNVPPDATTFVSLLSACSHAGLVEEGISLFNSITDYGIVCQLDHYACMVDILG 544

Query: 544 RAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ 603
           R+G+IQEA DFISKMP+EPDFVVWSSFLGSCRKHGAT+LAKL+S KLKELDP NSLAYVQ
Sbjct: 545 RSGQIQEAHDFISKMPIEPDFVVWSSFLGSCRKHGATELAKLASYKLKELDPGNSLAYVQ 604

Query: 604 MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNE 663
           MSNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGG RHPQREVI NE
Sbjct: 605 MSNLYCFSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGCRHPQREVIWNE 664

Query: 664 LEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIR 723
           LEEL+GRLKEIGYVPETSLALHDVE EQKEEQLYHHSEKLALVFSVMND NL R  TPIR
Sbjct: 665 LEELIGRLKEIGYVPETSLALHDVEHEQKEEQLYHHSEKLALVFSVMNDFNLVRADTPIR 724

Query: 724 IMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           IMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Sbjct: 725 IMKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMAGLCSCNDYW 771

BLAST of Tan0011222 vs. ExPASy TrEMBL
Match: A0A6J1CBK6 (pentatricopeptide repeat-containing protein At1g71420 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111010082 PE=3 SV=1)

HSP 1 Score: 1389.4 bits (3595), Expect = 0.0e+00
Identity = 671/767 (87.48%), Positives = 711/767 (92.70%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G 
Sbjct: 5   TIHYPFLAKRNLVLYPSKWAFPIHLRYWRSAAESDFVPSRTEDIDNDYLWDTRVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLHRYMMSRD MDSFDLFVTNHL
Sbjct: 65  LRHALSLFYSFRQPHSRQTYAYLFHACARLRCLHEGMGLHRYMMSRDLMDSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEF
Sbjct: 125 INMYCKCGHLDYAWQLFDEMPRRNLVSWTVLISGLSQYGHVDECFLLFPRMLVDCRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFGEHDGERG+QVHGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDA
Sbjct: 185 TVASLLTSFGEHDGERGRQVHGFALKTSLDAFVYVANALITMYSKSFCKGGIFNDSNDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Sbjct: 245 WTMFKSIENPSLITWNSMIAGFCFRKLGNQAIYLFMKMNREGIGFDRATLLSTLSSLNLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ D
Sbjct: 305 NRDEFGLGLSFCHELHCLAFKTAFISEIEVATALVKTYADLGGDIADSYRLFVEAGYHWD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSIVLKACAG+LTEKHASTYHSL
Sbjct: 365 IVLWTSIMTALVEHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGYLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ A
Sbjct: 425 LIKSMSEDDIVLNNALIHAYGRCGSITLSKKVFKEMKYRDLVSWNTMMKAYAIHGQAKNA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           L +FSKM+VPPDSTTFVSLLSACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR
Sbjct: 485 LHLFSKMDVPPDSTTFVSLLSACSHAGLVEEGTSLFNSIKYYGIVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
            GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Sbjct: 545 VGRVQEAEYFISKMPIEPDFVVWSSFLGSCRKHGATQLAKLASNKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNEL
Sbjct: 605 SNLYCFSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREEICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLALVFS+MNDSNLC +GT +RI
Sbjct: 665 EELIGRLKQLGYVPETSIALHDVEQEQKEEQLYHHSEKLALVFSIMNDSNLCHVGTLVRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLKKEIVIRDSNRFHHFTTGLCSCNDYW 771

BLAST of Tan0011222 vs. ExPASy TrEMBL
Match: A0A6J1CBA2 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010082 PE=3 SV=1)

HSP 1 Score: 1389.4 bits (3595), Expect = 0.0e+00
Identity = 671/767 (87.48%), Positives = 711/767 (92.70%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G 
Sbjct: 49  TIHYPFLAKRNLVLYPSKWAFPIHLRYWRSAAESDFVPSRTEDIDNDYLWDTRVISTRGH 108

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLHRYMMSRD MDSFDLFVTNHL
Sbjct: 109 LRHALSLFYSFRQPHSRQTYAYLFHACARLRCLHEGMGLHRYMMSRDLMDSFDLFVTNHL 168

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEF
Sbjct: 169 INMYCKCGHLDYAWQLFDEMPRRNLVSWTVLISGLSQYGHVDECFLLFPRMLVDCRPNEF 228

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFGEHDGERG+QVHGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDA
Sbjct: 229 TVASLLTSFGEHDGERGRQVHGFALKTSLDAFVYVANALITMYSKSFCKGGIFNDSNDDA 288

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Sbjct: 289 WTMFKSIENPSLITWNSMIAGFCFRKLGNQAIYLFMKMNREGIGFDRATLLSTLSSLNLC 348

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ D
Sbjct: 349 NRDEFGLGLSFCHELHCLAFKTAFISEIEVATALVKTYADLGGDIADSYRLFVEAGYHWD 408

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSIVLKACAG+LTEKHASTYHSL
Sbjct: 409 IVLWTSIMTALVEHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGYLTEKHASTYHSL 468

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ A
Sbjct: 469 LIKSMSEDDIVLNNALIHAYGRCGSITLSKKVFKEMKYRDLVSWNTMMKAYAIHGQAKNA 528

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           L +FSKM+VPPDSTTFVSLLSACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR
Sbjct: 529 LHLFSKMDVPPDSTTFVSLLSACSHAGLVEEGTSLFNSIKYYGIVCQLDHYACMVDILGR 588

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
            GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Sbjct: 589 VGRVQEAEYFISKMPIEPDFVVWSSFLGSCRKHGATQLAKLASNKLKELDPSNSLAYVQM 648

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNEL
Sbjct: 649 SNLYCFSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREEICNEL 708

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLALVFS+MNDSNLC +GT +RI
Sbjct: 709 EELIGRLKQLGYVPETSIALHDVEQEQKEEQLYHHSEKLALVFSIMNDSNLCHVGTLVRI 768

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Sbjct: 769 MKNIRICVDCHNFMKLASRLLKKEIVIRDSNRFHHFTTGLCSCNDYW 815

BLAST of Tan0011222 vs. ExPASy TrEMBL
Match: A0A6J1H0I1 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458832 PE=3 SV=1)

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 668/767 (87.09%), Positives = 710/767 (92.57%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH  F AKRNLV YPSK+ FGS LR+WRS AEGDIV FRTED  +DYL  +  IST G 
Sbjct: 5   TIHFRFLAKRNLVLYPSKYGFGSQLRFWRSGAEGDIVSFRTEDFRHDYLFGSPVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LHRYMMS DPM SFDLFVTNHL
Sbjct: 65  LEQALSLFY-SRQPHSFQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEF
Sbjct: 125 INMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFG+HDGERG+Q+HGFALK SLDA VYVANALITMYSKS+ KGGAFND+KDDA
Sbjct: 185 TVASLLTSFGDHDGERGRQIHGFALKRSLDAFVYVANALITMYSKSYSKGGAFNDSKDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENP LITWNSMIAGFCF+K GN A+HLF+QMNRQGIGFDRAT+LSTLSS+SLC
Sbjct: 245 WTMFKSIENPGLITWNSMIAGFCFRKHGNCAVHLFMQMNRQGIGFDRATLLSTLSSLSLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           NWDE  LGL FC ELHCQALKTAF SEVEIITAL+KTYAEL GDIADSYRLF+EAGYNRD
Sbjct: 305 NWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSYRLFIEAGYNRD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSIVLKACAGFLTEKHASTYHSL
Sbjct: 365 IVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE A
Sbjct: 425 LIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           LQ+FSKM VPPDSTTFVSLLSACSHAGLVEEGT+LFNSIANYG+ CQLDHYACMVDILGR
Sbjct: 485 LQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTKLFNSIANYGLVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
           +GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Sbjct: 545 SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNEL
Sbjct: 605 SNLYCLSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQVHEFASGGRHHPEREVICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLALVFSVMND+NL  + TPIRI
Sbjct: 665 EELIGRLKEIGYVPETSLAIHDVEQEQKEEQLYHHSEKLALVFSVMNDNNLGCVSTPIRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMAGLCSCNDYW 770

BLAST of Tan0011222 vs. ExPASy TrEMBL
Match: A0A6J1JQJ9 (pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486594 PE=3 SV=1)

HSP 1 Score: 1372.5 bits (3551), Expect = 0.0e+00
Identity = 661/767 (86.18%), Positives = 705/767 (91.92%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TIH  F AKRNLV YPSK+AFGS LR+WRS  EGDIV FRTED   DYL  ++ IST G 
Sbjct: 5   TIHFRFLAKRNLVLYPSKYAFGSQLRFWRSGPEGDIVSFRTEDFRRDYLFGSNVISTRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LHRYMMS DPM SFDLFVTNHL
Sbjct: 65  LEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ  HVDECFL+FSRMLVDH+PNEF
Sbjct: 125 INMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYDHVDECFLIFSRMLVDHRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFG+HDGERG+QVHGFALK SLDA VYVANALITMYSKS++KGGAFND KDDA
Sbjct: 185 TVASLLTSFGDHDGERGRQVHGFALKRSLDAFVYVANALITMYSKSYFKGGAFNDGKDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+K GN+A+HLF+QMN +GIGFDRAT+LSTLSS+SLC
Sbjct: 245 WTMFKSIENPSLITWNSMIAGFCFRKHGNRAVHLFMQMNHKGIGFDRATLLSTLSSISLC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           NWDE  LGL FC ELHCQALKTAF SEVEIITALVKTYAEL GDI DSYRLF+EAGYNRD
Sbjct: 305 NWDELDLGLGFCRELHCQALKTAFTSEVEIITALVKTYAELGGDITDSYRLFIEAGYNRD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSIVLKACAGFLTEKHASTYHSL
Sbjct: 365 IVLWTSIMTAFVDHDPGKTLSLFRQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK  SE+D V+NNALIHAY RCGSITSS+KVFDQMKH DLVSWNTMMK YAVHGQAE A
Sbjct: 425 LIKSTSEDDTVINNALIHAYGRCGSITSSKKVFDQMKHHDLVSWNTMMKVYAVHGQAEIA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           LQ+FSKM VPPDSTTFVSLLSACSHAGLVEEGT+LFNSI NYG+ CQLDHYACMVDILGR
Sbjct: 485 LQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTKLFNSITNYGLVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
           +GRI+EAE F+SKMP+EPD+VVWSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Sbjct: 545 SGRIKEAEYFMSKMPIEPDYVVWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQ+HEFASGGR HP+REVICNEL
Sbjct: 605 SNLYCLSGSFYEADLIRMEMKGSRVRKEPGLSWVEIENQIHEFASGGRHHPEREVICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLALVFSVMND+NL  +G PIRI
Sbjct: 665 EELIGRLKEIGYVPETSLAIHDVEQEQKEEQLYHHSEKLALVFSVMNDNNLGCVGIPIRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF  GLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMGGLCSCNDYW 770

BLAST of Tan0011222 vs. ExPASy TrEMBL
Match: A0A5D3D022 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G003030 PE=3 SV=1)

HSP 1 Score: 1349.7 bits (3492), Expect = 0.0e+00
Identity = 652/767 (85.01%), Positives = 698/767 (91.00%), Query Frame = 0

Query: 4   TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQ 63
           TI+C F   RNLVS PSKHAFG   R WRSAAEGDIV FRTEDIDNDYLL+T  IS+ G 
Sbjct: 5   TIYCSFHGIRNLVSCPSKHAFGFQFRCWRSAAEGDIVHFRTEDIDNDYLLETRTISSRGH 64

Query: 64  LWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHL 123
           L +ALSLFYSSRQPHS QTYA+LFH CARLRCLQEGVGLHRYM+S++PM SFDLFVTNHL
Sbjct: 65  LRRALSLFYSSRQPHSHQTYAYLFHVCARLRCLQEGVGLHRYMLSQNPMVSFDLFVTNHL 124

Query: 124 INMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEF 183
           INMYCKCGHL YA QLFN+MPRRN VSWT LISGLSQ GHVDECF +FSRMLVD +PNEF
Sbjct: 125 INMYCKCGHLDYANQLFNEMPRRNHVSWTVLISGLSQYGHVDECFYIFSRMLVDQRPNEF 184

Query: 184 TVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDA 243
           TVASLLTSFGEHDGERG+Q+HGFALK SLDASVYVANALITMYSKS+ + G FND KDDA
Sbjct: 185 TVASLLTSFGEHDGERGRQIHGFALKRSLDASVYVANALITMYSKSYSEDGTFNDGKDDA 244

Query: 244 WTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC 303
           WTMFKSIENPSLITWNSMIAGFCF+KLG QAI+LF+QMNR GIGFDRAT+LSTLSS   C
Sbjct: 245 WTMFKSIENPSLITWNSMIAGFCFRKLGYQAIYLFMQMNRHGIGFDRATLLSTLSSTRFC 304

Query: 304 NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRD 363
           N DEFG  L FCH++HCQALKTAF SE+EIITALVKTYAEL G+IADSY+LFVEAGYNRD
Sbjct: 305 NRDEFGWRLGFCHQIHCQALKTAFTSEIEIITALVKTYAELGGNIADSYKLFVEAGYNRD 364

Query: 364 IVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSL 423
           IVLWTSIM AF++HDPGKTLSLF QFRQEGL PDGH FS+VLKACAGFLTEKHAS YHSL
Sbjct: 365 IVLWTSIMAAFIDHDPGKTLSLFCQFRQEGLTPDGHTFSVVLKACAGFLTEKHASIYHSL 424

Query: 424 LIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKA 483
           LIK MSE+D VLNNALIHAY RCGSI+SS+KVF+QMKH DLVSWNTMMKAYA+HGQAE A
Sbjct: 425 LIKSMSEDDTVLNNALIHAYGRCGSISSSKKVFNQMKHHDLVSWNTMMKAYALHGQAEIA 484

Query: 484 LQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGR 543
           LQ+F+KMNVPPD+TTFVSLLSACSHAGLVEEGT LFNSI NYGI CQLDHYACMVDILGR
Sbjct: 485 LQLFTKMNVPPDATTFVSLLSACSHAGLVEEGTSLFNSITNYGIVCQLDHYACMVDILGR 544

Query: 544 AGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM 603
           +GR+QEA DFISKMP+EPDFVVWSSFLGSCRK+GA  LAKL+S KLKELDPSNSLAYVQM
Sbjct: 545 SGRVQEAHDFISKMPIEPDFVVWSSFLGSCRKYGAIGLAKLASCKLKELDPSNSLAYVQM 604

Query: 604 SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNEL 663
           SNLYCF+GSFY+ADLIRTEM GSRVRKEPGLSWVEIENQVHEFASGGR HPQREVICNEL
Sbjct: 605 SNLYCFNGSFYEADLIRTEMTGSRVRKEPGLSWVEIENQVHEFASGGRCHPQREVICNEL 664

Query: 664 EELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRI 723
           EEL+GRLKEIGYVPET LA +DVEQEQKEEQLYHHSEKLALVFSVMND NL  +  PIRI
Sbjct: 665 EELIGRLKEIGYVPETRLAFYDVEQEQKEEQLYHHSEKLALVFSVMNDYNLGCVNNPIRI 724

Query: 724 MKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           MKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Sbjct: 725 MKNIRICVDCHNFMKLASRLLQKEIVIRDSNRFHHFMAGLCSCNDYW 771

BLAST of Tan0011222 vs. TAIR 10
Match: AT1G71420.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 726.1 bits (1873), Expect = 3.0e-209
Identity = 375/715 (52.45%), Positives = 499/715 (69.79%), Query Frame = 0

Query: 62  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVT 121
           G + +A+SLFYS+  +  S+Q YA LF ACA  R L +G+ LH +M+S     S ++ + 
Sbjct: 40  GDIRRAVSLFYSAPVELQSQQAYAALFQACAEQRNLLDGINLHHHMLSHPYCYSQNVILA 99

Query: 122 NHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQP 181
           N LINMY KCG+++YA Q+F+ MP RN+VSWTALI+G  Q G+  E F LFS ML    P
Sbjct: 100 NFLINMYAKCGNILYARQVFDTMPERNVVSWTALITGYVQAGNEQEGFCLFSSMLSHCFP 159

Query: 182 NEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNK 241
           NEFT++S+LTS      E GKQVHG ALK  L  S+YVANA+I+MY +      A+    
Sbjct: 160 NEFTLSSVLTSCRY---EPGKQVHGLALKLGLHCSIYVANAVISMYGRCHDGAAAY---- 219

Query: 242 DDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSV 301
            +AWT+F++I+  +L+TWNSMIA F    LG +AI +F++M+  G+GFDRAT+L+  SS+
Sbjct: 220 -EAWTVFEAIKFKNLVTWNSMIAAFQCCNLGKKAIGVFMRMHSDGVGFDRATLLNICSSL 279

Query: 302 SLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY 361
              +          C +LH   +K+  +++ E+ TAL+K Y+E+  D  D Y+LF+E  +
Sbjct: 280 YKSSDLVPNEVSKCCLQLHSLTVKSGLVTQTEVATALIKVYSEMLEDYTDCYKLFMEMSH 339

Query: 362 NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTY 421
            RDIV W  I+TAF  +DP + + LF Q RQE L PD + FS VLKACAG +T +HA + 
Sbjct: 340 CRDIVAWNGIITAFAVYDPERAIHLFGQLRQEKLSPDWYTFSSVLKACAGLVTARHALSI 399

Query: 422 HSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQA 481
           H+ +IK     D VLNN+LIHAYA+CGS+    +VFD M  RD+VSWN+M+KAY++HGQ 
Sbjct: 400 HAQVIKGGFLADTVLNNSLIHAYAKCGSLDLCMRVFDDMDSRDVVSWNSMLKAYSLHGQV 459

Query: 482 EKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVD 541
           +  L VF KM++ PDS TF++LLSACSHAG VEEG R+F S+        QL+HYAC++D
Sbjct: 460 DSILPVFQKMDINPDSATFIALLSACSHAGRVEEGLRIFRSMFEKPETLPQLNHYACVID 519

Query: 542 ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSL 601
           +L RA R  EAE+ I +MP++PD VVW + LGSCRKHG T+L KL+++KLKEL +P+NS+
Sbjct: 520 MLSRAERFAEAEEVIKQMPMDPDAVVWIALLGSCRKHGNTRLGKLAADKLKELVEPTNSM 579

Query: 602 AYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV 661
           +Y+QMSN+Y   GSF +A+L   EM+  RVRKEP LSW EI N+VHEFASGGR  P +E 
Sbjct: 580 SYIQMSNIYNAEGSFNEANLSIKEMETWRVRKEPDLSWTEIGNKVHEFASGGRHRPDKEA 639

Query: 662 ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLC 721
           +  EL+ L+  LKE+GYVPE   A  D+E +EQ+E+ L HHSEKLAL F+VM    S+ C
Sbjct: 640 VYRELKRLISWLKEMGYVPEMRSASQDIEDEEQEEDNLLHHSEKLALAFAVMEGRKSSDC 699

Query: 722 RIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
            +   I+IMKN RIC+DCHNFMKLAS LL KEI++RDSNRFHHF    CSCNDYW
Sbjct: 700 GVNL-IQIMKNTRICIDCHNFMKLASKLLGKEILMRDSNRFHHFKDSSCSCNDYW 745

BLAST of Tan0011222 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 458.0 bits (1177), Expect = 1.5e-128
Identity = 253/688 (36.77%), Positives = 406/688 (59.01%), Query Frame = 0

Query: 96  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALI 155
           L++G  +H ++++   +D F + + N L+NMY KCG +  A ++F  M  ++ VSW ++I
Sbjct: 329 LKKGREVHGHVITTGLVD-FMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMI 388

Query: 156 SGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGEHD-GERGKQVHGFALKTSLD 215
           +GL QNG   E    +  M   D  P  FT+ S L+S       + G+Q+HG +LK  +D
Sbjct: 389 TGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGID 448

Query: 216 ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGN 275
            +V V+NAL+T+Y+++ Y         ++   +F S+     ++WNS+I      ++   
Sbjct: 449 LNVSVSNALMTLYAETGY--------LNECRKIFSSMPEHDQVSWNSIIGALARSERSLP 508

Query: 276 QAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVE 335
           +A+  F+   R G   +R T  S LS+VS  ++ E G       ++H  ALK     E  
Sbjct: 509 EAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELG------KQIHGLALKNNIADEAT 568

Query: 336 IITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ 395
              AL+  Y +  G++    ++F      RD V W S+++ ++ ++   K L L     Q
Sbjct: 569 TENALIACYGKC-GEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQ 628

Query: 396 EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITS 455
            G   D   ++ VL A A   T +     H+  ++   E+D+V+ +AL+  Y++CG +  
Sbjct: 629 TGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDY 688

Query: 456 SEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNV----PPDSTTFVSLLSACS 515
           + + F+ M  R+  SWN+M+  YA HGQ E+AL++F  M +    PPD  TFV +LSACS
Sbjct: 689 ALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACS 748

Query: 516 HAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVW 575
           HAGL+EEG + F S++ +YG+A +++H++CM D+LGRAG + + EDFI KMP++P+ ++W
Sbjct: 749 HAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIW 808

Query: 576 SSFLGS-CRKHG-ATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMK 635
            + LG+ CR +G   +L K ++  L +L+P N++ YV + N+Y   G + D    R +MK
Sbjct: 809 RTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMK 868

Query: 636 GSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH 695
            + V+KE G SWV +++ VH F +G + HP  +VI  +L+EL  ++++ GYVP+T  AL+
Sbjct: 869 DADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALY 928

Query: 696 DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLAST 755
           D+EQE KEE L +HSEKLA+ F  +    S L     PIRIMKN+R+C DCH+  K  S 
Sbjct: 929 DLEQENKEEILSYHSEKLAVAFVLAAQRSSTL-----PIRIMKNLRVCGDCHSAFKYISK 988

Query: 756 LLKKEIVIRDSNRFHHFTAGLCSCNDYW 771
           +  ++I++RDSNRFHHF  G CSC+D+W
Sbjct: 989 IEGRQIILRDSNRFHHFQDGACSCSDFW 995

BLAST of Tan0011222 vs. TAIR 10
Match: AT4G14850.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 434.5 bits (1116), Expect = 1.8e-121
Identity = 247/668 (36.98%), Positives = 375/668 (56.14%), Query Frame = 0

Query: 118 FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVD 177
           F+ N+LINMY K  H   A  +    P RN+VSWT+LISGL+QNGH     + F  M  +
Sbjct: 43  FLANYLINMYSKLDHPESARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRRE 102

Query: 178 H-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKS 237
              PN+FT       VASL           GKQ+H  A+K      V+V  +   MY K+
Sbjct: 103 GVVPNDFTFPCAFKAVASLRLPV------TGKQIHALAVKCGRILDVFVGCSAFDMYCKT 162

Query: 238 FYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFD 297
                     +DDA  +F  I   +L TWN+ I+         +AI  FI+  R     +
Sbjct: 163 RL--------RDDARKLFDEIPERNLETWNAFISNSVTDGRPREAIEAFIEFRRIDGHPN 222

Query: 298 RATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIA 357
             T  + L++ S  +W    LG+    +LH   L++ F ++V +   L+  Y + +  I 
Sbjct: 223 SITFCAFLNACS--DWLHLNLGM----QLHGLVLRSGFDTDVSVCNGLIDFYGKCK-QIR 282

Query: 358 DSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC 417
            S  +F E G  ++ V W S++ A+V+ H+  K   L+ + R++ +       S VL AC
Sbjct: 283 SSEIIFTEMG-TKNAVSWCSLVAAYVQNHEDEKASVLYLRSRKDIVETSDFMISSVLSAC 342

Query: 418 AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWN 477
           AG    +   + H+  +K   E  I + +AL+  Y +CG I  SE+ FD+M  ++LV+ N
Sbjct: 343 AGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGCIEDSEQAFDEMPEKNLVTRN 402

Query: 478 TMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSI- 537
           +++  YA  GQ + AL +F +M        P+  TFVSLLSACS AG VE G ++F+S+ 
Sbjct: 403 SLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENGMKIFDSMR 462

Query: 538 ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLA 597
           + YGI    +HY+C+VD+LGRAG ++ A +FI KMP++P   VW +   +CR HG  QL 
Sbjct: 463 STYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPTISVWGALQNACRMHGKPQLG 522

Query: 598 KLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQ 657
            L++  L +LDP +S  +V +SN +  +G + +A+ +R E+KG  ++K  G SW+ ++NQ
Sbjct: 523 LLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREELKGVGIKKGAGYSWITVKNQ 582

Query: 658 VHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL 717
           VH F +  R H   + I   L +L   ++  GY P+  L+L+D+E+E+K  ++ HHSEKL
Sbjct: 583 VHAFQAKDRSHILNKEIQTTLAKLRNEMEAAGYKPDLKLSLYDLEEEEKAAEVSHHSEKL 642

Query: 718 ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAG 771
           AL F +++      +  PIRI KN+RIC DCH+F K  S  +K+EI++RD+NRFH F  G
Sbjct: 643 ALAFGLLS----LPLSVPIRITKNLRICGDCHSFFKFVSGSVKREIIVRDNNRFHRFKDG 684

BLAST of Tan0011222 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 430.6 bits (1106), Expect = 2.6e-120
Identity = 254/731 (34.75%), Positives = 399/731 (54.58%), Query Frame = 0

Query: 99  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGL 158
           G  LH   +  D M     F  N +++ Y K G +    + F+ +P+R+ VSWT +I G 
Sbjct: 63  GYALHARKLF-DEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGY 122

Query: 159 SQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHDGERGKQVHGFALKTSLDASV 218
              G   +   +   M+ +  +P +FT+ ++L S       E GK+VH F +K  L  +V
Sbjct: 123 KNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNV 182

Query: 219 YVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENP 278
            V+N+L+ MY+K          +      D                 D A   F+ +   
Sbjct: 183 SVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAER 242

Query: 279 SLITWNSMIAGFCFQKLGNQAIHLFIQMNRQG-IGFDRATILSTLSSVS----LC----- 338
            ++TWNSMI+GF  +    +A+ +F +M R   +  DR T+ S LS+ +    LC     
Sbjct: 243 DIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQI 302

Query: 339 -------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE 398
                   +D  G+ L+    ++  C  ++TA   I +       +E  TAL+  Y +L 
Sbjct: 303 HSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKL- 362

Query: 399 GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIV 458
           GD+  +  +FV    +RD+V WT+++  + +H   G+ ++LF      G  P+ +  + +
Sbjct: 363 GDMNQAKNIFVSL-KDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAM 422

Query: 459 LKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK-HRD 518
           L   +   +  H    H   +K      + ++NALI  YA+ G+ITS+ + FD ++  RD
Sbjct: 423 LSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERD 482

Query: 519 LVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFN 578
            VSW +M+ A A HG AE+AL++F  M    + PD  T+V + SAC+HAGLV +G + F+
Sbjct: 483 TVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFD 542

Query: 579 SIANYG-IACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGAT 638
            + +   I   L HYACMVD+ GRAG +QEA++FI KMP+EPD V W S L +CR H   
Sbjct: 543 MMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNI 602

Query: 639 QLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI 698
            L K+++ +L  L+P NS AY  ++NLY   G + +A  IR  MK  RV+KE G SW+E+
Sbjct: 603 DLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEV 662

Query: 699 ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHS 758
           +++VH F      HP++  I   ++++   +K++GYVP+T+  LHD+E+E KE+ L HHS
Sbjct: 663 KHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHS 722

Query: 759 EKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHF 771
           EKLA+ F +++  +     T +RIMKN+R+C DCH  +K  S L+ +EI++RD+ RFHHF
Sbjct: 723 EKLAIAFGLISTPD----KTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHF 782

BLAST of Tan0011222 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 427.6 bits (1098), Expect = 2.2e-119
Identity = 227/662 (34.29%), Positives = 386/662 (58.31%), Query Frame = 0

Query: 117 LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML- 176
           L V+N LINMYCK     +A  +F++M  R+L+SW ++I+G++QNG   E   LF ++L 
Sbjct: 350 LTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLR 409

Query: 177 VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKG 236
              +P+++T+ S+L +           KQVH  A+K +  +  +V+ ALI  YS+     
Sbjct: 410 CGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR----- 469

Query: 237 GAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATI 296
              N    +A  +F+   N  L+ WN+M+AG+     G++ + LF  M++QG   D  T+
Sbjct: 470 ---NRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTL 529

Query: 297 LSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYR 356
            +   +        F   ++   ++H  A+K+ +  ++ + + ++  Y +  GD++ +  
Sbjct: 530 ATVFKTCG------FLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKC-GDMSAAQF 589

Query: 357 LFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL 416
            F       D V WT++++  +E+ +  +   +F Q R  G++PD    + + KA +   
Sbjct: 590 AFDSIPVPDD-VAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLT 649

Query: 417 TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMK 476
             +     H+  +K    ND  +  +L+  YA+CGSI  +  +F +++  ++ +WN M+ 
Sbjct: 650 ALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLV 709

Query: 477 AYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSI-ANYGIA 536
             A HG+ ++ LQ+F +M    + PD  TF+ +LSACSH+GLV E  +   S+  +YGI 
Sbjct: 710 GLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIK 769

Query: 537 CQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNK 596
            +++HY+C+ D LGRAG +++AE+ I  M +E    ++ + L +CR  G T+  K  + K
Sbjct: 770 PEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATK 829

Query: 597 LKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFAS 656
           L EL+P +S AYV +SN+Y  +  + +  L RT MKG +V+K+PG SW+E++N++H F  
Sbjct: 830 LLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVV 889

Query: 657 GGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV 716
             R + Q E+I  ++++++  +K+ GYVPET   L DVE+E+KE  LY+HSEKLA+ F +
Sbjct: 890 DDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGL 949

Query: 717 MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCND 771
           ++        TPIR++KN+R+C DCHN MK  + +  +EIV+RD+NRFH F  G+CSC D
Sbjct: 950 LSTPP----STPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGD 990
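
The percentages in each HSP summary line follow directly from the residue counts (matches divided by alignment length). A minimal Python sketch of that arithmetic is given here for readers who want to check or reuse the values. The function names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression is an assumption about the summary-line layout shown on this page; it is not part of any BLAST tool.

```python
import re

# Assumed layout of the HSP summary line as printed on this page.
# "Pos(?:i)?tives" accepts both the spelling "Positives" and the
# variant "Postives" that some report renderings emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as in the report."""
    return round(100.0 * count / total, 2)
```

For the AT2G22070.1 hit, `percent(254, 731)` reproduces the reported 34.75 and `percent(399, 731)` the reported 54.58.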

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9C9H9 | 4.2e-208 | 52.45 | Pentatricopeptide repeat-containing protein At1g71420 OS=Arabidopsis thaliana OX...
Q9FIB2 | 2.1e-127 | 36.77 | Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
Q0WSH6 | 2.5e-120 | 36.98 | Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX...
Q9SHZ8 | 3.6e-119 | 34.75 | Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9SMZ2 | 3.1e-118 | 34.29 | Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
XP_022139075.1 | 0.0e+00 | 87.48 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Momordica char...
XP_022139076.1 | 0.0e+00 | 87.48 | pentatricopeptide repeat-containing protein At1g71420 isoform X2 [Momordica char...
XP_023511808.1 | 0.0e+00 | 87.22 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita pepo...
XP_022957425.1 | 0.0e+00 | 87.09 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita mosc...
XP_038892212.1 | 0.0e+00 | 87.24 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Benincasa hisp...

Match Name | E-value | Identity (%) | Description
A0A6J1CBK6 | 0.0e+00 | 87.48 | pentatricopeptide repeat-containing protein At1g71420 isoform X2 OS=Momordica ch...
A0A6J1CBA2 | 0.0e+00 | 87.48 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Momordica ch...
A0A6J1H0I1 | 0.0e+00 | 87.09 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Cucurbita mo...
A0A6J1JQJ9 | 0.0e+00 | 86.18 | pentatricopeptide repeat-containing protein At1g71420 isoform X1 OS=Cucurbita ma...
A0A5D3D022 | 0.0e+00 | 85.01 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT1G71420.1 | 3.0e-209 | 52.45 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G09950.1 | 1.5e-128 | 36.77 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14850.1 | 1.8e-121 | 36.98 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1 | 2.6e-120 | 34.75 | pentatricopeptide (PPR) repeat-containing protein
AT4G33170.1 | 2.2e-119 | 34.29 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 268..410, e-value: 2.7E-11, score: 45.4; coord: 56..196, e-value: 8.9E-27, score: 96.3; coord: 492..644, e-value: 3.3E-21, score: 78.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 411..491, e-value: 6.5E-10, score: 40.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 77..616
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 149..175, e-value: 1.5E-6, score: 28.0; coord: 121..147, e-value: 1.0E-4, score: 22.3; coord: 256..286, e-value: 2.2E-4, score: 21.2; coord: 534..558, e-value: 0.11, score: 12.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 121..148, e-value: 0.0021, score: 16.1; coord: 149..175, e-value: 4.8E-5, score: 21.3; coord: 437..465, e-value: 2.1E-4, score: 19.3; coord: 465..490, e-value: 8.4E-6, score: 23.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 463..507, e-value: 5.8E-9, score: 36.0
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 254..288, score: 9.240434
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 116..150, score: 9.930995
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 495..529, score: 9.13082
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 432..466, score: 9.985802
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 632..759, e-value: 3.2E-36, score: 124.0
None | No IPR available | PANTHER | PTHR47924 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 179..649; coord: 58..409
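
Several of the Pentatricopeptide repeat (IPR002885) hits listed above come from different member databases and cover overlapping stretches of the protein. As an illustration only, the following Python sketch merges those hits into non-overlapping regions. The coordinates are copied from the InterPro listing. The function name `merge_intervals` and the rule that two hits are merged when they share at least one residue are assumptions made for this example and are not part of the InterPro analysis.

```python
# PPR (IPR002885) hit coordinates copied from the InterPro listing above.
PPR_HITS = [
    (149, 175), (121, 147), (256, 286), (534, 558),  # PFAM PF01535
    (121, 148), (149, 175), (437, 465), (465, 490),  # TIGRFAM TIGR00756
    (463, 507),                                      # PFAM PF13041
    (254, 288), (116, 150), (495, 529), (432, 466),  # PROSITE PS51375
]


def merge_intervals(intervals):
    """Merge inclusive (start, end) intervals that share at least one position."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(PPR_HITS)
# Inclusive coordinates, so each region spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in regions)
```

With the coordinates listed, `regions` holds four merged stretches (116..175, 254..288, 432..529 and 534..558), all upstream of the DYW domain hit at 632..759.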

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0011222.1 | Tan0011222.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding