Tan0001607 (gene) Snake gourd v1

Overview
NameTan0001607
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG04: 9638542 .. 9642122 (-)
RNA-Seq ExpressionTan0001607
SyntenyTan0001607
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTCCATCCAGGAAAAGCCACGCCAAAGAATGAAATTTCCCAAAAGATGGGGTTTTCGCTAACCGTTTGTGGCGGTGGGGGAGACGACCAAATTTTCCGGGAAAGTGGTCGCTTTTTTTTCTTGCTCGCTTCACTTTCTCGGCAGACCGTCCTTTTCGCATTTTCAAGGTAAATTCTCTCTCTCTCTCTGACCTTGCTTGAATCATTTCCTTCTTCGATGAACTATTCAGCTTCATCCTGCTCTCGCGTTTTACTTTCTGGGTCGAATTTCAACTACTTTTAGTACTCTGGAACGGCGTCTTATTTACCAATTTCATTTTAAATATACTTCGTCATTTCCATGGCCGGAATTTTATTCAAGTTGTTTTGAAGTTGAGATCATTTACATTCTTAGTAGTAAGATACCTTTGCTCTTGGTCATTCTTGTCCTCTGCTATCCACTACATCGTGTAAGAGTCTAACTATTATGTACTGAAATGGCTATCTTGTGTATTCCAACATGCCAATTCACTCTCACAAAACCAAACTCCACATTTTCCAAGAATGAGTTCATAAATCAACCCCACCCACTTTCCCTTTTGTCCAAATGTACATCTCTCAGAGACCTCAAGCAGATTCAAGCCTATACCATCAAAACCAATCTTCAGAATGACATTTCTGTTCTCACAAAGCTCATTAATATCTGCACGCGCAGCCCCACCACTTCGTCCATGGACCATGCCCACCATCTGTTCGATCAAATTCCCGACAAGGATATTGTCCTGTTTAACATAATGGCACGTGGTTATGCTCGCTCTAATTCTCCATATCTTGCATTTTCTCTTTTTGCTCAAGTCCTGTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCTAGTTCTAAGGCCTTGAAAGAAGGTAGACAGTTGCATTGCTTTGCTATTAAACTTGGACTTAGTCACAATATTTACATATGCCCAACCCTTATAAATATGTATGCGGAGTGTAATTACATGGATGCAGCGCGTGGAGTCTTTGATGGAATGGAGGAGCCATGTATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCGGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCTAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTTCTCTGTTGGGAGCATTCGACCTGGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGTGGAAGTCTTGCTGATGCTGTTTCTATCTTTGAGGGAATGCGAGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCATTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCCATGTTTGAAGAGATGAAGAAGGCAGGGGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTAGAAGGTATTTCCATAGCATGTCTAAAATCTATGGAATAACTCCAGGGATCAAGCATTATGGATGTATGGTTGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAAGTTAATAGATGAATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTATTATCGTCTTGCAGCAACCATGGTAATGTCGACTTGGCAAAGCGGGTCATTGAACGAATTTTTGAATTAGATGATTCCCATGGAGGGGACTACGTCATATTATCAAACTTGTTTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAAGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACACGAGTTCTTCTCTGGGGACGGAGTTCACTCCATTTCGGAGGAGTTGCGCCGAGCACTCGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTGAGATATCACAGTGAAAAATTGGCAATGGCTTTTGGGCTCCTGAATACACCCCCTGGTACAACTATAAGGGTAGTCAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCGTCATTAGGGACGTTCAACGATTCCATCGTTTTGAAGACGGGGAATGCTCCTGTTGTGATTTCTGGTGATAGAATATGTTGAAATCATAAGCCATTCTTACAGTTGGTTCTTTTTTCCTGCAATGGCATGATTTGCAATTCATTTAGTGTAACTTCATGGCCCATGGGGAACCTTTATGTTTGTATAGCCATGAAGGATCTCAGAACAAGATTCATTTAGAGTTTGGATCTCCCCATATATTCATCGGTGTAGCATTGATCATTACATATACTGGATCAAAATAAAGTATATAACTATGGGATGAACATATTTCTACATCATCAACTAATTTGCTTTCACTTAATAACCCTCCTCAGATGTGGATTTGAAAATCTGCAGGTCTTTTAATAATCATTTGATACAAAATCGAGATACTTTTGTAGTTTCTCTCTCATTCACTCAACGGTGTAAAACTACCAGCAACTAGCTTGTTTTATATACTTGTTTTTGACAGTCAACCATGGGTTTCATTGCTATGTCCAAGTCAATAGTTCCTTTTAACAACAATCAGAGTACAGTAGCCAAATTCAAGATGCAGTACAAAAGTAGATCAAATATTGGTGGCTCAAAGTTTGTTTCAATTATAACTGCTTTCAATTTAATGTGGTATCTGTATTTAGAGTGGACAGCTTTCTTGAGCATTATTACTTTTTCTTTCTTTGTACTAATTTATTTATTGTTTCTTTCATAAATTATTGATCATTTTTATCAAACAAAAAAAAAAATTATTCATCATTAAGTTTGCTTCCCCCTATTTATGTCCTTGTGGAAAGTTTAGGATTTATTGGCTATACTCAATAAATAGGAAGTAATGAAAATTTATTTGGTCAAGACAATGCAAACGTTGATATTTGGTGTGACAATTACTCTTTTTTGGAGCACATTATCCACTCACGCGTCTTTTAAAACAAATCTTACTGATTTTTTTTCCCTCTTTCTCCTCTTAAAATGAGAACAACTTTGATTCGTTTATATTTTTTTCTTTTTTTCTTTTTTTTTTCTTTTTTGACCATTGAGCTGCATCTTTAAGCCCATTGACCATTCTATGTTATCTTAATTTACCAGTCAAATATTAGGAAATTGTGATTAATGTCTAAAATAAATAATCTAAAATGTCTAAAAAGGTTTTATATTGTAAATTTGTCTAACTATTTTTCAGGGAGGAGTGTGAAGAAAATTGTATACCCATTTTGCCCCTATTATATAATTATATTTACAGTCAAACTAAATTCATCATAATATTTGGGGAAATATTTGATTCCCCAAATTTTTAGCAATTCGGTCGATTTTGTTGTTGGTTGTTCACTGTGTTGTAGGTTCCC

mRNA sequence

CGTCCATCCAGGAAAAGCCACGCCAAAGAATGAAATTTCCCAAAAGATGGGGTTTTCGCTAACCGTTTGTGGCGGTGGGGGAGACGACCAAATTTTCCGGGAAAGTGGTCGCTTTTTTTTCTTGCTCGCTTCACTTTCTCGGCAGACCGTCCTTTTCGCATTTTCAAGGTAAATTCTCTCTCTCTCTCTGACCTTGCTTGAATCATTTCCTTCTTCGATGAACTATTCAGCTTCATCCTGCTCTCGCGTTTTACTTTCTGGGTCGAATTTCAACTACTTTTAGTACTCTGGAACGGCGTCTTATTTACCAATTTCATTTTAAATATACTTCGTCATTTCCATGGCCGGAATTTTATTCAAGTTGTTTTGAAGTTGAGATCATTTACATTCTTAGTAGTAAGATACCTTTGCTCTTGGTCATTCTTGTCCTCTGCTATCCACTACATCGTGTAAGAGTCTAACTATTATGTACTGAAATGGCTATCTTGTGTATTCCAACATGCCAATTCACTCTCACAAAACCAAACTCCACATTTTCCAAGAATGAGTTCATAAATCAACCCCACCCACTTTCCCTTTTGTCCAAATGTACATCTCTCAGAGACCTCAAGCAGATTCAAGCCTATACCATCAAAACCAATCTTCAGAATGACATTTCTGTTCTCACAAAGCTCATTAATATCTGCACGCGCAGCCCCACCACTTCGTCCATGGACCATGCCCACCATCTGTTCGATCAAATTCCCGACAAGGATATTGTCCTGTTTAACATAATGGCACGTGGTTATGCTCGCTCTAATTCTCCATATCTTGCATTTTCTCTTTTTGCTCAAGTCCTGTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCTAGTTCTAAGGCCTTGAAAGAAGGTAGACAGTTGCATTGCTTTGCTATTAAACTTGGACTTAGTCACAATATTTACATATGCCCAACCCTTATAAATATGTATGCGGAGTGTAATTACATGGATGCAGCGCGTGGAGTCTTTGATGGAATGGAGGAGCCATGTATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCGGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCTAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTTCTCTGTTGGGAGCATTCGACCTGGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGTGGAAGTCTTGCTGATGCTGTTTCTATCTTTGAGGGAATGCGAGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCATTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCCATGTTTGAAGAGATGAAGAAGGCAGGGGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTAGAAGGTATTTCCATAGCATGTCTAAAATCTATGGAATAACTCCAGGGATCAAGCATTATGGATGTATGGTTGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAAGTTAATAGATGAATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTATTATCGTCTTGCAGCAACCATGGTAATGTCGACTTGGCAAAGCGGGTCATTGAACGAATTTTTGAATTAGATGATTCCCATGGAGGGGACTACGTCATATTATCAAACTTGTTTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAAGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACACGAGTTCTTCTCTGGGGACGGAGTTCACTCCATTTCGGAGGAGTTGCGCCGAGCACTCGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTGAGATATCACAGTGAAAAATTGGCAATGGCTTTTGGGCTCCTGAATACACCCCCTGGTACAACTATAAGGGTAGTCAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCGTCATTAGGGACGTTCAACGATTCCATCGTTTTGAAGACGGGGAATGCTCCTGTTGTGATTTCTGGTGATAGAATATGTTGAAATCATAAGCCATTCTTACAGTTGGTTCTTTTTTCCTGCAATGGCATGATTTGCAATTCATTTAGTGTAACTTCATGGCCCATGGGGAACCTTTATGTTTGTATAGCCATGAAGGATCTCAGAACAAGATTCATTTAGAGTTTGGATCTCCCCATATATTCATCGGTGTAGCATTGATCATTACATATACTGGATCAAAATAAAGTATATAACTATGGGATGAACATATTTCTACATCATCAACTAATTTGCTTTCACTTAATAACCCTCCTCAGATGTGGATTTGAAAATCTGCAGGTCTTTTAATAATCATTTGATACAAAATCGAGATACTTTTGTAGTTTCTCTCTCATTCACTCAACGGTGTAAAACTACCAGCAACTAGCTTGTTTTATATACTTGTTTTTGACAGTCAACCATGGGTTTCATTGCTATGTCCAAGTCAATAGTTCCTTTTAACAACAATCAGAGTACAGTAGCCAAATTCAAGATGCAGTACAAAAGTAGATCAAATATTGGTGGCTCAAAGTTTGTTTCAATTATAACTGCTTTCAATTTAATGTGGTATCTGTATTTAGAGTGGACAGCTTTCTTGAGCATTATTACTTTTTCTTTCTTTGTACTAATTTATTTATTGTTTCTTTCATAAATTATTGATCATTTTTATCAAACAAAAAAAAAAATTATTCATCATTAAGTTTGCTTCCCCCTATTTATGTCCTTGTGGAAAGTTTAGGATTTATTGGCTATACTCAATAAATAGGAAGTAATGAAAATTTATTTGGTCAAGACAATGCAAACGTTGATATTTGGTGTGACAATTACTCTTTTTTGGAGCACATTATCCACTCACGCGTCTTTTAAAACAAATCTTACTGATTTTTTTTCCCTCTTTCTCCTCTTAAAATGAGAACAACTTTGATTCGTTTATATTTTTTTCTTTTTTTCTTTTTTTTTTCTTTTTTGACCATTGAGCTGCATCTTTAAGCCCATTGACCATTCTATGTTATCTTAATTTACCAGTCAAATATTAGGAAATTGTGATTAATGTCTAAAATAAATAATCTAAAATGTCTAAAAAGGTTTTATATTGTAAATTTGTCTAACTATTTTTCAGGGAGGAGTGTGAAGAAAATTGTATACCCATTTTGCCCCTATTATATAATTATATTTACAGTCAAACTAAATTCATCATAATATTTGGGGAAATATTTGATTCCCCAAATTTTTAGCAATTCGGTCGATTTTGTTGTTGGTTGTTCACTGTGTTGTAGGTTCCC

Coding sequence (CDS)

ATGGCTATCTTGTGTATTCCAACATGCCAATTCACTCTCACAAAACCAAACTCCACATTTTCCAAGAATGAGTTCATAAATCAACCCCACCCACTTTCCCTTTTGTCCAAATGTACATCTCTCAGAGACCTCAAGCAGATTCAAGCCTATACCATCAAAACCAATCTTCAGAATGACATTTCTGTTCTCACAAAGCTCATTAATATCTGCACGCGCAGCCCCACCACTTCGTCCATGGACCATGCCCACCATCTGTTCGATCAAATTCCCGACAAGGATATTGTCCTGTTTAACATAATGGCACGTGGTTATGCTCGCTCTAATTCTCCATATCTTGCATTTTCTCTTTTTGCTCAAGTCCTGTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCTAGTTCTAAGGCCTTGAAAGAAGGTAGACAGTTGCATTGCTTTGCTATTAAACTTGGACTTAGTCACAATATTTACATATGCCCAACCCTTATAAATATGTATGCGGAGTGTAATTACATGGATGCAGCGCGTGGAGTCTTTGATGGAATGGAGGAGCCATGTATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCGGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCTAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTTCTCTGTTGGGAGCATTCGACCTGGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGTGGAAGTCTTGCTGATGCTGTTTCTATCTTTGAGGGAATGCGAGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCATTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCCATGTTTGAAGAGATGAAGAAGGCAGGGGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTAGAAGGTATTTCCATAGCATGTCTAAAATCTATGGAATAACTCCAGGGATCAAGCATTATGGATGTATGGTTGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAAGTTAATAGATGAATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTATTATCGTCTTGCAGCAACCATGGTAATGTCGACTTGGCAAAGCGGGTCATTGAACGAATTTTTGAATTAGATGATTCCCATGGAGGGGACTACGTCATATTATCAAACTTGTTTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAAGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACACGAGTTCTTCTCTGGGGACGGAGTTCACTCCATTTCGGAGGAGTTGCGCCGAGCACTCGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTGAGATATCACAGTGAAAAATTGGCAATGGCTTTTGGGCTCCTGAATACACCCCCTGGTACAACTATAAGGGTAGTCAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCGTCATTAGGGACGTTCAACGATTCCATCGTTTTGAAGACGGGGAATGCTCCTGTTGTGATTTCTGGTGA

Protein sequence

MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCCDFW
Homology
BLAST of Tan0001607 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 842.4 bits (2175), Expect = 3.1e-243
Identity = 394/583 (67.58%), Postives = 482/583 (82.68%), Query Frame = 0

Query: 21  SKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMD 80
           SK + +N  +P+ L+SKC SLR+L QIQAY IK++++ D+S + KLIN CT SPT SSM 
Sbjct: 22  SKIDTVNTQNPILLISKCNSLRELMQIQAYAIKSHIE-DVSFVAKLINFCTESPTESSMS 81

Query: 81  HAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACA 140
           +A HLF+ + + DIV+FN MARGY+R  +P   FSLF ++L  G+LPD+YTF SLLKACA
Sbjct: 82  YARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACA 141

Query: 141 SSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNA 200
            +KAL+EGRQLHC ++KLGL  N+Y+CPTLINMY EC  +D+AR VFD + EPC+V YNA
Sbjct: 142 VAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNA 201

Query: 201 IITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKG 260
           +ITGYAR +RPNEALSLFRE+Q   L+P ++T+LS++ SC+LLG+ DLG+WIH+Y KK  
Sbjct: 202 MITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHS 261

Query: 261 FDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFE 320
           F KYVKVNTALIDM+AKCGSL DAVSIFE MR +DTQAWSAMIVA+A HG   K++ MFE
Sbjct: 262 FCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFE 321

Query: 321 EMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRT 380
            M+   V+PDEITFLGLL ACSH G VEEGR+YF  M   +GI P IKHYG MVDLL R 
Sbjct: 322 RMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRA 381

Query: 381 GHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILS 440
           G+L++AY+ ID+L I PTP+LWR LL++CS+H N+DLA++V ERIFELDDSHGGDYVILS
Sbjct: 382 GNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILS 441

Query: 441 NLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALD 500
           NL+AR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV S + +L RALD
Sbjct: 442 NLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALD 501

Query: 501 ELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNL 560
           E++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVKNL
Sbjct: 502 EMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNL 561

Query: 561 RICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCCDFW 604
           R+C DCHNAAKLISLIFGR++V+RDVQRFH FEDG+CSC DFW
Sbjct: 562 RVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0001607 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 503.4 bits (1295), Expect = 3.5e-141
Identity = 246/605 (40.66%), Postives = 376/605 (62.15%), Query Frame = 0

Query: 29  PHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLIN-------------ICTRSP- 88
           P  L   +K  + ++ +QI  + +K     D+ V T LI+             +  +SP 
Sbjct: 138 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 197

Query: 89  --------------TTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 148
                         +   +++A  LFD+IP KD+V +N M  GYA + +   A  LF  +
Sbjct: 198 RDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDM 257

Query: 149 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 208
           + + + PD+ T  +++ ACA S +++ GRQ+H +    G   N+ I   LI++Y++C  +
Sbjct: 258 MKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGEL 317

Query: 209 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 268
           + A G+F+ +    ++S+N +I GY   +   EAL LF+E+  S   P DVTMLSI+ +C
Sbjct: 318 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 377

Query: 269 SLLGAFDLGRWIHEYVKK--KGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQA 328
           + LGA D+GRWIH Y+ K  KG      + T+LIDMYAKCG +  A  +F  +  +   +
Sbjct: 378 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 437

Query: 329 WSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMS 388
           W+AMI  FA HG    +  +F  M+K G++PD+ITF+GLL ACSH+G+++ GR  F +M+
Sbjct: 438 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 497

Query: 389 KIYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLA 448
           + Y +TP ++HYGCM+DLLG +G   EA ++I+ +E++P  ++W +LL +C  HGNV+L 
Sbjct: 498 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 557

Query: 449 KRVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNV 508
           +   E + +++  + G YV+LSN++A  GRW +V   R L+ D+G+ KVPGCSS+E+++V
Sbjct: 558 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 617

Query: 509 VHEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEK 568
           VHEF  GD  H  + E+   L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEK
Sbjct: 618 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEK 677

Query: 569 LAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECS 604
           LA+AFGL++T PGT + +VKNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CS
Sbjct: 678 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 737

BLAST of Tan0001607 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 2.7e-138
Identity = 236/603 (39.14%), Postives = 364/603 (60.36%), Query Frame = 0

Query: 32  LSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMDHAHHLFDQIPD 91
           +S L +C+   +LKQI A  +KT L  D   +TK ++ C  S ++  + +A  +FD    
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  KDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEGRQL 151
            D  L+N+M RG++ S+ P  +  L+ ++LCS    + YTF SLLKAC++  A +E  Q+
Sbjct: 78  PDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQI 137

Query: 152 HCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNAIITGYARSSRP 211
           H    KLG  +++Y   +LIN YA       A  +FD + EP  VS+N++I GY ++ + 
Sbjct: 138 HAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKM 197

Query: 212 N-------------------------------EALSLFRELQASNLEPTDVTMLSIIMSC 271
           +                               EAL LF E+Q S++EP +V++ + + +C
Sbjct: 198 DIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSAC 257

Query: 272 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 331
           + LGA + G+WIH Y+ K        +   LIDMYAKCG + +A+ +F+ ++ +  QAW+
Sbjct: 258 AQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWT 317

Query: 332 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 391
           A+I  +A HG G +AIS F EM+K G++P+ ITF  +L ACS+ GLVEEG+  F+SM + 
Sbjct: 318 ALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERD 377

Query: 392 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 451
           Y + P I+HYGC+VDLLGR G LDEA + I E+ +KP  ++W  LL +C  H N++L + 
Sbjct: 378 YNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE 437

Query: 452 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 511
           + E +  +D  HGG YV  +N+ A   +W+     R+LMK++GV KVPGCS++ +    H
Sbjct: 438 IGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTH 497

Query: 512 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 571
           EF +GD  H   E+++     + ++++  GYVP+   +    ++++ +E ++  HSEKLA
Sbjct: 498 EFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLA 557

Query: 572 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 604
           + +GL+ T PGT IR++KNLR+C DCH   KLIS I+ R IV+RD  RFH F DG+CSC 
Sbjct: 558 ITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCG 617

BLAST of Tan0001607 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 8.0e-138
Identity = 239/575 (41.57%), Postives = 361/575 (62.78%), Query Frame = 0

Query: 33  SLLSKCTSLRDLKQ---IQAYTIKTNLQNDISVLTKLINICTRSPTTSSMDHAHHLFDQI 92
           +LL KCT  + L Q   + A+ +++  ++DI +   L+N+  +     S++ A  +F+++
Sbjct: 65  TLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAK---CGSLEEARKVFEKM 124

Query: 93  PDKDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEGR 152
           P +D V +  +  GY++ + P  A   F Q+L  G  P+++T SS++KA A+ +    G 
Sbjct: 125 PQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGH 184

Query: 153 QLHCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNAIITGYARSS 212
           QLH F +K G   N+++   L+++Y     MD A+ VFD +E    VS+NA+I G+AR S
Sbjct: 185 QLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRS 244

Query: 213 RPNEALSLFRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKGFDKYVKVNT 272
              +AL LF+ +      P+  +  S+  +CS  G  + G+W+H Y+ K G         
Sbjct: 245 GTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGN 304

Query: 273 ALIDMYAKCGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRP 332
            L+DMYAK GS+ DA  IF+ +  RD  +W++++ A+A HG G +A+  FEEM++ G+RP
Sbjct: 305 TLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRP 364

Query: 333 DEITFLGLLYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRTGHLDEAYKL 392
           +EI+FL +L ACSH+GL++EG  Y+  M K  GI P   HY  +VDLLGR G L+ A + 
Sbjct: 365 NEISFLSVLTACSHSGLLDEGWHYYELMKK-DGIVPEAWHYVTVVDLLGRAGDLNRALRF 424

Query: 393 IDELEIKPTPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLFARVGRW 452
           I+E+ I+PT  +W+ LL++C  H N +L     E +FELD    G +VIL N++A  GRW
Sbjct: 425 IEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRW 484

Query: 453 EDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALDELIKEIKLV 512
            D   +RK MK+ GV K P CS VE+ N +H F + D  H   EE+ R  +E++ +IK +
Sbjct: 485 NDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKEL 544

Query: 513 GYVPDTS-LVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHN 572
           GYVPDTS ++ H D +E  +E+ L+YHSEK+A+AF LLNTPPG+TI + KN+R+CGDCH 
Sbjct: 545 GYVPDTSHVIVHVDQQE--REVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHT 604

Query: 573 AAKLISLIFGRQIVIRDVQRFHRFEDGECSCCDFW 604
           A KL S + GR+I++RD  RFH F+DG CSC D+W
Sbjct: 605 AIKLASKVVGREIIVRDTNRFHHFKDGNCSCKDYW 633

BLAST of Tan0001607 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 6.7e-137
Identity = 243/626 (38.82%), Postives = 373/626 (59.58%), Query Frame = 0

Query: 30  HPLSL---LSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPT-TSSMDHAHHL 89
           HP SL   ++ C ++RDL QI A  IK+    D     +++  C  S      +D+AH +
Sbjct: 22  HPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKI 81

Query: 90  FDQIPDKDIVLFNIMARGYARS--NSPYLAFSLFAQVLCSGLL-PDDYTFSSLLKACASS 149
           F+Q+P ++   +N + RG++ S  +   +A +LF +++    + P+ +TF S+LKACA +
Sbjct: 82  FNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKT 141

Query: 150 KALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM---------------------- 209
             ++EG+Q+H  A+K G   + ++   L+ MY  C +M                      
Sbjct: 142 GKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDR 201

Query: 210 -----------------------DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSL 269
                                   AAR +FD M +  +VS+N +I+GY+ +    +A+ +
Sbjct: 202 RKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEV 261

Query: 270 FRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAK 329
           FRE++  ++ P  VT++S++ + S LG+ +LG W+H Y +  G      + +ALIDMY+K
Sbjct: 262 FREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSK 321

Query: 330 CGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGL 389
           CG +  A+ +FE +   +   WSAMI  FA HG    AI  F +M++AGVRP ++ ++ L
Sbjct: 322 CGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINL 381

Query: 390 LYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKP 449
           L ACSH GLVEEGRRYF  M  + G+ P I+HYGCMVDLLGR+G LDEA + I  + IKP
Sbjct: 382 LTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKP 441

Query: 450 TPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRK 509
             ++W+ LL +C   GNV++ KRV   + ++     G YV LSN++A  G W +V+ +R 
Sbjct: 442 DDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRL 501

Query: 510 LMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSL 569
            MK++ + K PGCS ++++ V+HEF   D  H  ++E+   L E+  +++L GY P T+ 
Sbjct: 502 RMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQ 561

Query: 570 VYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIF 604
           V   ++EEE KE VL YHSEK+A AFGL++T PG  IR+VKNLRIC DCH++ KLIS ++
Sbjct: 562 VL-LNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 621

BLAST of Tan0001607 vs. NCBI nr
Match: XP_038893049.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Benincasa hispida])

HSP 1 Score: 1143.3 bits (2956), Expect = 0.0e+00
Identity = 552/603 (91.54%), Postives = 580/603 (96.19%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           M + CIPTCQFTLTKP+STFS NEFINQPHPLSLLSKCTSLR+LKQIQAYTIKTNLQND+
Sbjct: 1   MTVSCIPTCQFTLTKPSSTFSNNEFINQPHPLSLLSKCTSLRELKQIQAYTIKTNLQNDV 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLINICTR+PTTSSMD+AHHLFDQI DKDIVLFNIMARGYARSNSP LAFSLFA+V
Sbjct: 61  SVLTKLINICTRNPTTSSMDYAHHLFDQISDKDIVLFNIMARGYARSNSPNLAFSLFAKV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           L SGLLPDDYTFSSLLKACASSKA K+G +LHCFAIKLGL+HNIYICPTLINMYAECN M
Sbjct: 121 LSSGLLPDDYTFSSLLKACASSKAFKQGMELHCFAIKLGLNHNIYICPTLINMYAECNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFD +E+PCIVSYNAIITGYARSS+PNEALSLFRELQAS+LEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDEIEQPCIVSYNAIITGYARSSQPNEALSLFRELQASHLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDKYVKVNTALIDM+AKCGSLADA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVAFATHGDGLKAISMFEEMK+ GVRPDEITFLGLLYACSHAGLVE+GR YF++MSK 
Sbjct: 301 AMIVAFATHGDGLKAISMFEEMKRTGVRPDEITFLGLLYACSHAGLVEQGREYFYNMSKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           YGITPGIKHYGCMVDLLGRTG LDEAY  IDELEIKPTP+LWRTLLS+CS HGNVD+AKR
Sbjct: 361 YGITPGIKHYGCMVDLLGRTGRLDEAYNFIDELEIKPTPVLWRTLLSACSTHGNVDMAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           V ERIFELDDSHGGDYVILSNL ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VTERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVH IS ELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHCISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQI+IRDVQRFHRFE+G+CSC 
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIIIRDVQRFHRFENGKCSCS 600

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 603

BLAST of Tan0001607 vs. NCBI nr
Match: XP_022968061.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 553/603 (91.71%), Postives = 571/603 (94.69%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           MAI CIPTCQF LTKP     KNEFINQPHPLSL SKC SLR+LKQIQAYTIKTNL NDI
Sbjct: 1   MAISCIPTCQFALTKP-----KNEFINQPHPLSLFSKCASLRELKQIQAYTIKTNLHNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLIN CTR PTTSSMDHAHHLFD++ DKDIVLFNIMARGYARSNSPYL FSLFAQV
Sbjct: 61  SVLTKLINFCTRYPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLG  HNIYICPTLINMYA CN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGFGHNIYICPTLINMYAACNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFDGMEEPCIVSYNAIITGYARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGS+ DA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVA+ATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGR YF+SM K 
Sbjct: 301 AMIVAYATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRGYFYSMYKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           +G+TPGIKHYGCMVDLLGRTG LDEAYK IDEL IKPTPILWRTLLS+CSNHGNVDLAKR
Sbjct: 361 HGMTPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELI+EIKL GYVPDTSLVYHADMEEE KELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIQEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCC 598

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 598

BLAST of Tan0001607 vs. NCBI nr
Match: XP_023541252.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1134.8 bits (2934), Expect = 0.0e+00
Identity = 554/603 (91.87%), Postives = 572/603 (94.86%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           MAI CIPT QF L+KP     KNEFINQPHPLSL SKC+SLR+LKQIQAYTIKTNL NDI
Sbjct: 1   MAISCIPTSQFALSKP-----KNEFINQPHPLSLFSKCSSLRELKQIQAYTIKTNLHNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLIN CTR+PT SSMDHAHHLFD++ DKDIVLFNIMARGYARSNSPYL FSLFAQV
Sbjct: 61  SVLTKLINFCTRNPTISSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLGL HNIYICPTLINMYA CN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGLGHNIYICPTLINMYAACNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFDGMEEPCIVSYNAIITGYARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGS+ DA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGR YF+SM K 
Sbjct: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRGYFYSMYKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           +GITPGIKHYGCMVDLLGRTG LDEAYK IDEL IKPTPILWRTLLS+CSNHGNVDLAKR
Sbjct: 361 HGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELIKEIKL GYVPDTSLVYHADMEEE KELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIKEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCC 598

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 598

BLAST of Tan0001607 vs. NCBI nr
Match: XP_022151060.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 1132.9 bits (2929), Expect = 0.0e+00
Identity = 547/603 (90.71%), Postives = 572/603 (94.86%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           M I CIP+CQFTL KPNS F  NEF N PHPL LLSKCTSLR+LKQIQA+TIKTNLQNDI
Sbjct: 1   MGISCIPSCQFTLAKPNSAFPNNEFTNPPHPLFLLSKCTSLRELKQIQAFTIKTNLQNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTK+IN CT +P+TSSMDHAHHLFDQIPDKDIVLFNIMARGYARSN+PYLAFSLF+QV
Sbjct: 61  SVLTKIINFCTLNPSTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNTPYLAFSLFSQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACASSKA  EGRQLHCFAIKLGL+HNIYICP+LIN+YAECN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACASSKAFSEGRQLHCFAIKLGLNHNIYICPSLINLYAECNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFD ME PCIVSYNAIITG+ARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDEMEAPCIVSYNAIITGHARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGSL DA+SIFE MRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSLVDAISIFEDMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVA+ATHGDGLKAISMFEEMK+AGVRPDEITFLGLLYACSHAGLVEEGR YF+SMSK 
Sbjct: 301 AMIVAYATHGDGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLVEEGRGYFNSMSKY 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           YGI PGIKHYGCMVDLLGRTGHLDEAYK ID  EIKPTPILWRTLLS+CSN GNVDLAKR
Sbjct: 361 YGIAPGIKHYGCMVDLLGRTGHLDEAYKFIDGSEIKPTPILWRTLLSACSNRGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL ARVGRWEDVNH+RKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARVGRWEDVNHIRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           +AFGLLN+PPGT IRVVKNLRICGDCH AAKLIS IFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 IAFGLLNSPPGTPIRVVKNLRICGDCHTAAKLISFIFGRQIVIRDVQRFHRFEDGKCSCC 600

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 603

BLAST of Tan0001607 vs. NCBI nr
Match: KAG7013140.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1132.5 bits (2928), Expect = 0.0e+00
Identity = 553/603 (91.71%), Postives = 570/603 (94.53%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           MAI CIPT QF LTKP     K EFINQPHPLSL SKCTSLR+LKQIQAYTIKTNL NDI
Sbjct: 1   MAISCIPTSQFALTKP-----KKEFINQPHPLSLFSKCTSLRELKQIQAYTIKTNLHNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLIN CTR+PTTSSMDHAHHLFD++ DKDIVLFNIMARGYARSNSPYL FSLFAQV
Sbjct: 61  SVLTKLINFCTRNPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLGL HNIYICPTLINMYA CN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGLGHNIYICPTLINMYAACNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFDGMEEPCIVSYNAIITGYARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGS+ DA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLV+EGR YF+SM K 
Sbjct: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVDEGRGYFYSMYKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           +GITPGIKHYGCMVDLLGRTG LDEAYK IDEL IKPTPILWRTLLS+CSNHGNVDLAKR
Sbjct: 361 HGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL AR+GRWEDVN LRKLMKDRGV KVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVGKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELIKEIKL GY PDTSLVYHADMEEE KELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIKEIKLAGYAPDTSLVYHADMEEEAKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCC 598

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 598

BLAST of Tan0001607 vs. ExPASy TrEMBL
Match: A0A6J1HTT7 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111467412 PE=3 SV=1)

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 553/603 (91.71%), Postives = 571/603 (94.69%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           MAI CIPTCQF LTKP     KNEFINQPHPLSL SKC SLR+LKQIQAYTIKTNL NDI
Sbjct: 1   MAISCIPTCQFALTKP-----KNEFINQPHPLSLFSKCASLRELKQIQAYTIKTNLHNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLIN CTR PTTSSMDHAHHLFD++ DKDIVLFNIMARGYARSNSPYL FSLFAQV
Sbjct: 61  SVLTKLINFCTRYPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLG  HNIYICPTLINMYA CN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGFGHNIYICPTLINMYAACNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFDGMEEPCIVSYNAIITGYARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGS+ DA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVA+ATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGR YF+SM K 
Sbjct: 301 AMIVAYATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRGYFYSMYKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           +G+TPGIKHYGCMVDLLGRTG LDEAYK IDEL IKPTPILWRTLLS+CSNHGNVDLAKR
Sbjct: 361 HGMTPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELI+EIKL GYVPDTSLVYHADMEEE KELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIQEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCC 598

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 598

BLAST of Tan0001607 vs. ExPASy TrEMBL
Match: A0A6J1DA68 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019083 PE=3 SV=1)

HSP 1 Score: 1132.9 bits (2929), Expect = 0.0e+00
Identity = 547/603 (90.71%), Postives = 572/603 (94.86%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           M I CIP+CQFTL KPNS F  NEF N PHPL LLSKCTSLR+LKQIQA+TIKTNLQNDI
Sbjct: 1   MGISCIPSCQFTLAKPNSAFPNNEFTNPPHPLFLLSKCTSLRELKQIQAFTIKTNLQNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTK+IN CT +P+TSSMDHAHHLFDQIPDKDIVLFNIMARGYARSN+PYLAFSLF+QV
Sbjct: 61  SVLTKIINFCTLNPSTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNTPYLAFSLFSQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           LCSGLLPDDYTFSSLLKACASSKA  EGRQLHCFAIKLGL+HNIYICP+LIN+YAECN M
Sbjct: 121 LCSGLLPDDYTFSSLLKACASSKAFSEGRQLHCFAIKLGLNHNIYICPSLINLYAECNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFD ME PCIVSYNAIITG+ARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDEMEAPCIVSYNAIITGHARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGSL DA+SIFE MRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSLVDAISIFEDMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVA+ATHGDGLKAISMFEEMK+AGVRPDEITFLGLLYACSHAGLVEEGR YF+SMSK 
Sbjct: 301 AMIVAYATHGDGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLVEEGRGYFNSMSKY 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           YGI PGIKHYGCMVDLLGRTGHLDEAYK ID  EIKPTPILWRTLLS+CSN GNVDLAKR
Sbjct: 361 YGIAPGIKHYGCMVDLLGRTGHLDEAYKFIDGSEIKPTPILWRTLLSACSNRGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL ARVGRWEDVNH+RKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARVGRWEDVNHIRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           +AFGLLN+PPGT IRVVKNLRICGDCH AAKLIS IFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 IAFGLLNSPPGTPIRVVKNLRICGDCHTAAKLISFIFGRQIVIRDVQRFHRFEDGKCSCC 600

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 603

BLAST of Tan0001607 vs. ExPASy TrEMBL
Match: A0A6J1FZ61 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111449232 PE=3 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 552/603 (91.54%), Postives = 569/603 (94.36%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDI 60
           MAI CIPT QF L+KP     KNE INQPHPLSL SKCTSLR+LKQIQAYTIKTNL NDI
Sbjct: 1   MAISCIPTSQFALSKP-----KNESINQPHPLSLFSKCTSLRELKQIQAYTIKTNLHNDI 60

Query: 61  SVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 120
           SVLTKLIN CTRSPTTSSMDHAHHLFD++ DKDIVLFNIMARGYARSNSPYL FSLFAQV
Sbjct: 61  SVLTKLINFCTRSPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLIFSLFAQV 120

Query: 121 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 180
           L SGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLG  HNIYICPTLINMYA CN M
Sbjct: 121 LFSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGFGHNIYICPTLINMYAACNDM 180

Query: 181 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 240
           +AARGVFDGMEEPCIVSYNAIITGYARSS+PNEALSLFRELQASNLEPTDVTMLSIIMSC
Sbjct: 181 NAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSC 240

Query: 241 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 300
           +LLGA DLGRWIHEYVKKKGFDK+VKVNTALIDMYAKCGS+ DA+SIFEGMRVRDTQAWS
Sbjct: 241 ALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWS 300

Query: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 360
           AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLV+EG  YF+SM K 
Sbjct: 301 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVDEGTGYFYSMYKN 360

Query: 361 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 420
           +GITPGIKHYGCMVDLLGRTG LDEAYK IDEL IKPTPILWRTLLS+CSNHGNVDLAKR
Sbjct: 361 HGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKR 420

Query: 421 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 480
           VIERIFELDDSHGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVVH
Sbjct: 421 VIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVVH 480

Query: 481 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 540
           EFFSGDGVHSIS ELRRALDELIKEIKL GYVPDTSLVYHADMEEE KELVLRYHSEKLA
Sbjct: 481 EFFSGDGVHSISVELRRALDELIKEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKLA 540

Query: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 600
           MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDG+CSCC
Sbjct: 541 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCC 598

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 598

BLAST of Tan0001607 vs. ExPASy TrEMBL
Match: A0A5A7STH8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1932G00400 PE=3 SV=1)

HSP 1 Score: 1126.7 bits (2913), Expect = 0.0e+00
Identity = 541/604 (89.57%), Postives = 580/604 (96.03%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEF-INQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQND 60
           MAI CIPTCQFTLTKP+STFSKNEF INQ HPLSLLSKCTSL++LKQIQAYTIKTNLQ+D
Sbjct: 1   MAISCIPTCQFTLTKPSSTFSKNEFVINQLHPLSLLSKCTSLKELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQ 120
           ISVLTKLIN CT +PTTS MDHAHHLFDQI DKDI+LFNIMARGYARSNSPYLAFSLFAQ
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQ 120

Query: 121 VLCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNY 180
           +LCSGLLPDDYTFSSLLKACASSKAL++G  LHCFA+KLGL+HNIYICPTLINMYAECN 
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MDAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMS 240
           M+AARGVFD ME+PCIVSYNAIITGYARSS+PNEALSLFRELQAS++EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMS 240

Query: 241 CSLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAW 300
           C+LLGA DLG+WIHEYVKKKGFDKYVKVNTALIDM+AKCGSL DA+SIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSK 360
           SAMIVAFATHGDGLK+IS+FEEMK+AGVRPDEITFLGLLYACSHAGLVE+GR YF+SMS+
Sbjct: 301 SAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSR 360

Query: 361 IYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAK 420
            YGITPGIKHYGCMVDLLGRTG LDEAY  +DELEIKPTPILWRTLLS+CS HGNV++AK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDDSHGGDYVILSNL+ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +S ELRRALDEL+KEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSC 600
           AMAFGLLNTPPGTTIRV KNLRICGDCHNAAKLIS IFGR+IVIRDVQRFH+FEDG+CSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSC 600

Query: 601 CDFW 604
            DFW
Sbjct: 601 GDFW 604

BLAST of Tan0001607 vs. ExPASy TrEMBL
Match: A0A1S3BFK0 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103489124 PE=3 SV=1)

HSP 1 Score: 1126.7 bits (2913), Expect = 0.0e+00
Identity = 541/604 (89.57%), Postives = 580/604 (96.03%), Query Frame = 0

Query: 1   MAILCIPTCQFTLTKPNSTFSKNEF-INQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQND 60
           MAI CIPTCQFTLTKP+STFSKNEF INQ HPLSLLSKCTSL++LKQIQAYTIKTNLQ+D
Sbjct: 1   MAISCIPTCQFTLTKPSSTFSKNEFVINQLHPLSLLSKCTSLKELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINICTRSPTTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQ 120
           ISVLTKLIN CT +PTTS MDHAHHLFDQI DKDI+LFNIMARGYARSNSPYLAFSLFAQ
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQ 120

Query: 121 VLCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNY 180
           +LCSGLLPDDYTFSSLLKACASSKAL++G  LHCFA+KLGL+HNIYICPTLINMYAECN 
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MDAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMS 240
           M+AARGVFD ME+PCIVSYNAIITGYARSS+PNEALSLFRELQAS++EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMS 240

Query: 241 CSLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAW 300
           C+LLGA DLG+WIHEYVKKKGFDKYVKVNTALIDM+AKCGSL DA+SIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSK 360
           SAMIVAFATHGDGLK+IS+FEEMK+AGVRPDEITFLGLLYACSHAGLVE+GR YF+SMS+
Sbjct: 301 SAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSR 360

Query: 361 IYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAK 420
            YGITPGIKHYGCMVDLLGRTG LDEAY  +DELEIKPTPILWRTLLS+CS HGNV++AK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDDSHGGDYVILSNL+ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +S ELRRALDEL+KEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSC 600
           AMAFGLLNTPPGTTIRV KNLRICGDCHNAAKLIS IFGR+IVIRDVQRFH+FEDG+CSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSC 600

Query: 601 CDFW 604
            DFW
Sbjct: 601 GDFW 604

BLAST of Tan0001607 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 842.4 bits (2175), Expect = 2.2e-244
Identity = 394/583 (67.58%), Postives = 482/583 (82.68%), Query Frame = 0

Query: 21  SKNEFINQPHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMD 80
           SK + +N  +P+ L+SKC SLR+L QIQAY IK++++ D+S + KLIN CT SPT SSM 
Sbjct: 22  SKIDTVNTQNPILLISKCNSLRELMQIQAYAIKSHIE-DVSFVAKLINFCTESPTESSMS 81

Query: 81  HAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACA 140
           +A HLF+ + + DIV+FN MARGY+R  +P   FSLF ++L  G+LPD+YTF SLLKACA
Sbjct: 82  YARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACA 141

Query: 141 SSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNA 200
            +KAL+EGRQLHC ++KLGL  N+Y+CPTLINMY EC  +D+AR VFD + EPC+V YNA
Sbjct: 142 VAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNA 201

Query: 201 IITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKG 260
           +ITGYAR +RPNEALSLFRE+Q   L+P ++T+LS++ SC+LLG+ DLG+WIH+Y KK  
Sbjct: 202 MITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHS 261

Query: 261 FDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFE 320
           F KYVKVNTALIDM+AKCGSL DAVSIFE MR +DTQAWSAMIVA+A HG   K++ MFE
Sbjct: 262 FCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFE 321

Query: 321 EMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRT 380
            M+   V+PDEITFLGLL ACSH G VEEGR+YF  M   +GI P IKHYG MVDLL R 
Sbjct: 322 RMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRA 381

Query: 381 GHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILS 440
           G+L++AY+ ID+L I PTP+LWR LL++CS+H N+DLA++V ERIFELDDSHGGDYVILS
Sbjct: 382 GNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILS 441

Query: 441 NLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALD 500
           NL+AR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV S + +L RALD
Sbjct: 442 NLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALD 501

Query: 501 ELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNL 560
           E++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVKNL
Sbjct: 502 EMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNL 561

Query: 561 RICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCCDFW 604
           R+C DCHNAAKLISLIFGR++V+RDVQRFH FEDG+CSC DFW
Sbjct: 562 RVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Tan0001607 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 503.4 bits (1295), Expect = 2.5e-142
Identity = 246/605 (40.66%), Postives = 376/605 (62.15%), Query Frame = 0

Query: 29  PHPLSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLIN-------------ICTRSP- 88
           P  L   +K  + ++ +QI  + +K     D+ V T LI+             +  +SP 
Sbjct: 138 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 197

Query: 89  --------------TTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNSPYLAFSLFAQV 148
                         +   +++A  LFD+IP KD+V +N M  GYA + +   A  LF  +
Sbjct: 198 RDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDM 257

Query: 149 LCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM 208
           + + + PD+ T  +++ ACA S +++ GRQ+H +    G   N+ I   LI++Y++C  +
Sbjct: 258 MKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGEL 317

Query: 209 DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSLFRELQASNLEPTDVTMLSIIMSC 268
           + A G+F+ +    ++S+N +I GY   +   EAL LF+E+  S   P DVTMLSI+ +C
Sbjct: 318 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 377

Query: 269 SLLGAFDLGRWIHEYVKK--KGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQA 328
           + LGA D+GRWIH Y+ K  KG      + T+LIDMYAKCG +  A  +F  +  +   +
Sbjct: 378 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 437

Query: 329 WSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMS 388
           W+AMI  FA HG    +  +F  M+K G++PD+ITF+GLL ACSH+G+++ GR  F +M+
Sbjct: 438 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 497

Query: 389 KIYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLA 448
           + Y +TP ++HYGCM+DLLG +G   EA ++I+ +E++P  ++W +LL +C  HGNV+L 
Sbjct: 498 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 557

Query: 449 KRVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNV 508
           +   E + +++  + G YV+LSN++A  GRW +V   R L+ D+G+ KVPGCSS+E+++V
Sbjct: 558 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 617

Query: 509 VHEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEK 568
           VHEF  GD  H  + E+   L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEK
Sbjct: 618 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEK 677

Query: 569 LAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECS 604
           LA+AFGL++T PGT + +VKNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CS
Sbjct: 678 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 737

BLAST of Tan0001607 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 493.8 bits (1270), Expect = 1.9e-139
Identity = 236/603 (39.14%), Postives = 364/603 (60.36%), Query Frame = 0

Query: 32  LSLLSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMDHAHHLFDQIPD 91
           +S L +C+   +LKQI A  +KT L  D   +TK ++ C  S ++  + +A  +FD    
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  KDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEGRQL 151
            D  L+N+M RG++ S+ P  +  L+ ++LCS    + YTF SLLKAC++  A +E  Q+
Sbjct: 78  PDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQI 137

Query: 152 HCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNAIITGYARSSRP 211
           H    KLG  +++Y   +LIN YA       A  +FD + EP  VS+N++I GY ++ + 
Sbjct: 138 HAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKM 197

Query: 212 N-------------------------------EALSLFRELQASNLEPTDVTMLSIIMSC 271
           +                               EAL LF E+Q S++EP +V++ + + +C
Sbjct: 198 DIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSAC 257

Query: 272 SLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAKCGSLADAVSIFEGMRVRDTQAWS 331
           + LGA + G+WIH Y+ K        +   LIDMYAKCG + +A+ +F+ ++ +  QAW+
Sbjct: 258 AQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWT 317

Query: 332 AMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRRYFHSMSKI 391
           A+I  +A HG G +AIS F EM+K G++P+ ITF  +L ACS+ GLVEEG+  F+SM + 
Sbjct: 318 ALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERD 377

Query: 392 YGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKPTPILWRTLLSSCSNHGNVDLAKR 451
           Y + P I+HYGC+VDLLGR G LDEA + I E+ +KP  ++W  LL +C  H N++L + 
Sbjct: 378 YNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE 437

Query: 452 VIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVH 511
           + E +  +D  HGG YV  +N+ A   +W+     R+LMK++GV KVPGCS++ +    H
Sbjct: 438 IGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTH 497

Query: 512 EFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLA 571
           EF +GD  H   E+++     + ++++  GYVP+   +    ++++ +E ++  HSEKLA
Sbjct: 498 EFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLA 557

Query: 572 MAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGECSCC 604
           + +GL+ T PGT IR++KNLR+C DCH   KLIS I+ R IV+RD  RFH F DG+CSC 
Sbjct: 558 ITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCG 617

BLAST of Tan0001607 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 489.2 bits (1258), Expect = 4.8e-138
Identity = 243/626 (38.82%), Postives = 373/626 (59.58%), Query Frame = 0

Query: 30  HPLSL---LSKCTSLRDLKQIQAYTIKTNLQNDISVLTKLINICTRSPT-TSSMDHAHHL 89
           HP SL   ++ C ++RDL QI A  IK+    D     +++  C  S      +D+AH +
Sbjct: 22  HPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKI 81

Query: 90  FDQIPDKDIVLFNIMARGYARS--NSPYLAFSLFAQVLCSGLL-PDDYTFSSLLKACASS 149
           F+Q+P ++   +N + RG++ S  +   +A +LF +++    + P+ +TF S+LKACA +
Sbjct: 82  FNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKT 141

Query: 150 KALKEGRQLHCFAIKLGLSHNIYICPTLINMYAECNYM---------------------- 209
             ++EG+Q+H  A+K G   + ++   L+ MY  C +M                      
Sbjct: 142 GKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDR 201

Query: 210 -----------------------DAARGVFDGMEEPCIVSYNAIITGYARSSRPNEALSL 269
                                   AAR +FD M +  +VS+N +I+GY+ +    +A+ +
Sbjct: 202 RKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEV 261

Query: 270 FRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKGFDKYVKVNTALIDMYAK 329
           FRE++  ++ P  VT++S++ + S LG+ +LG W+H Y +  G      + +ALIDMY+K
Sbjct: 262 FREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSK 321

Query: 330 CGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGL 389
           CG +  A+ +FE +   +   WSAMI  FA HG    AI  F +M++AGVRP ++ ++ L
Sbjct: 322 CGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINL 381

Query: 390 LYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRTGHLDEAYKLIDELEIKP 449
           L ACSH GLVEEGRRYF  M  + G+ P I+HYGCMVDLLGR+G LDEA + I  + IKP
Sbjct: 382 LTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKP 441

Query: 450 TPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLFARVGRWEDVNHLRK 509
             ++W+ LL +C   GNV++ KRV   + ++     G YV LSN++A  G W +V+ +R 
Sbjct: 442 DDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRL 501

Query: 510 LMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALDELIKEIKLVGYVPDTSL 569
            MK++ + K PGCS ++++ V+HEF   D  H  ++E+   L E+  +++L GY P T+ 
Sbjct: 502 RMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQ 561

Query: 570 VYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIF 604
           V   ++EEE KE VL YHSEK+A AFGL++T PG  IR+VKNLRIC DCH++ KLIS ++
Sbjct: 562 VL-LNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 621

BLAST of Tan0001607 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 482.3 bits (1240), Expect = 5.9e-136
Identity = 236/575 (41.04%), Postives = 365/575 (63.48%), Query Frame = 0

Query: 32  LSLLSKCTSLRDL---KQIQAYTIKTNLQNDISVLTKLINICTRSPTTSSMDHAHHLFDQ 91
           +S+L   ++LR +   K+I  Y +++   + +++ T L+++  +     S++ A  LFD 
Sbjct: 240 VSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK---CGSLETARQLFDG 299

Query: 92  IPDKDIVLFNIMARGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEG 151
           + ++++V +N M   Y ++ +P  A  +F ++L  G+ P D +    L ACA    L+ G
Sbjct: 300 MLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERG 359

Query: 152 RQLHCFAIKLGLSHNIYICPTLINMYAECNYMDAARGVFDGMEEPCIVSYNAIITGYARS 211
           R +H  +++LGL  N+ +  +LI+MY +C  +D A  +F  ++   +VS+NA+I G+A++
Sbjct: 360 RFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQN 419

Query: 212 SRPNEALSLFRELQASNLEPTDVTMLSIIMSCSLLGAFDLGRWIHEYVKKKGFDKYVKVN 271
            RP +AL+ F ++++  ++P   T +S+I + + L      +WIH  V +   DK V V 
Sbjct: 420 GRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVT 479

Query: 272 TALIDMYAKCGSLADAVSIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVR 331
           TAL+DMYAKCG++  A  IF+ M  R    W+AMI  + THG G  A+ +FEEM+K  ++
Sbjct: 480 TALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIK 539

Query: 332 PDEITFLGLLYACSHAGLVEEGRRYFHSMSKIYGITPGIKHYGCMVDLLGRTGHLDEAYK 391
           P+ +TFL ++ ACSH+GLVE G + F+ M + Y I   + HYG MVDLLGR G L+EA+ 
Sbjct: 540 PNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWD 599

Query: 392 LIDELEIKPTPILWRTLLSSCSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLFARVGR 451
            I ++ +KP   ++  +L +C  H NV+ A++  ER+FEL+   GG +V+L+N++     
Sbjct: 600 FIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASM 659

Query: 452 WEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISEELRRALDELIKEIKL 511
           WE V  +R  M  +G+ K PGCS VE+ N VH FFSG   H  S+++   L++LI  IK 
Sbjct: 660 WEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKE 719

Query: 512 VGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHN 571
            GYVPDT+LV    +E + KE +L  HSEKLA++FGLLNT  GTTI V KNLR+C DCHN
Sbjct: 720 AGYVPDTNLV--LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHN 779

Query: 572 AAKLISLIFGRQIVIRDVQRFHRFEDGECSCCDFW 604
           A K ISL+ GR+IV+RD+QRFH F++G CSC D+W
Sbjct: 780 ATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8LK933.1e-24367.58Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9LN013.5e-14140.66Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9FJY72.7e-13839.14Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9LIQ78.0e-13841.57Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
Q9FI806.7e-13738.82Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038893049.10.0e+0091.54pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Benincasa ... [more]
XP_022968061.10.0e+0091.71pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita ... [more]
XP_023541252.10.0e+0091.87pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita ... [more]
XP_022151060.10.0e+0090.71pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ... [more]
KAG7013140.10.0e+0091.71Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A0A6J1HTT70.0e+0091.71pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbit... [more]
A0A6J1DA680.0e+0090.71pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ... [more]
A0A6J1FZ610.0e+0091.54pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbit... [more]
A0A5A7STH80.0e+0089.57Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BFK00.0e+0089.57pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT2G02980.12.2e-24467.58Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.12.5e-14240.66Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.11.9e-13939.14Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.14.8e-13838.82Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.15.9e-13641.04Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 468..593
e-value: 4.8E-38
score: 129.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 29..142
e-value: 2.6E-14
score: 55.3
coord: 266..505
e-value: 7.6E-43
score: 149.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 143..265
e-value: 6.7E-22
score: 80.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 401..425
e-value: 0.25
score: 11.7
coord: 169..192
e-value: 0.37
score: 11.2
coord: 370..394
e-value: 0.025
score: 14.8
coord: 269..294
e-value: 0.0064
score: 16.7
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 92..140
e-value: 1.9E-8
score: 34.4
coord: 295..341
e-value: 1.1E-9
score: 38.3
coord: 193..239
e-value: 6.8E-10
score: 39.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 196..229
e-value: 9.8E-8
score: 29.7
coord: 298..331
e-value: 1.7E-6
score: 25.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 194..228
score: 11.783455
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 295..329
score: 11.542307
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 93..127
score: 9.426776
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 21..589
NoneNo IPR availablePANTHERPTHR47926:SF132BNAA02G26650D PROTEINcoord: 21..589

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0001607.1Tan0001607.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
biological_process GO:0031425 chloroplast RNA processing
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding