Tan0004167 (gene) Snake gourd v1

Overview
NameTan0004167
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG08: 5418102 .. 5420212 (-)
RNA-Seq ExpressionTan0004167
SyntenyTan0004167
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTTCATCCAACCGGAGCCAGTTCGACGGCAGTCGCTCAGTCTCAAACAAAACAGGTCGCCGTCGCCGAGTCTGATTTCATGGATGAAACTCAAATGGGTATTTCAAAAACTGAGCTCCAAGCTTCCCTCTTGGGCCTCTTCTCTAATCTCCCCTTTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTCAACCATGTAGACACAAGCTTCCTTCTATCCATTTGTGGAAGAGAAGGGCTCCTCCATTTGGGCTCTTCCCTCCATGCCTCCATCATCAAGAGCTTCGAGCTCTCCAACCAAGAAAATGGGGTCATCATAATGAACTCTCTCATCTCCATGTACGAGAGGTGCGGTAAATTGCCCGATGCAGTCAAGATGTTTGATGAAATGCCCACAAGAGATTCTGTTTCGTGGAACGCATTGATCGGTGGGTTTATGAGAAATGGGATGTTTTGTGTTGGTTTTAGCTATTTTAAGGCTATGTGTTTGGTTGGTGATTGTAAATTTGACAAAGCTACTTTGACGACGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGCATTATCACAACGATGCATGGTTTGGCGTTTTTGAGTGGGTATGAACGAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTGGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTGATTACTTGGACGGCTGTGATCTCTGGTTTGGCTCAAAATGGGCATCACGAGCACAGTCTGAACCTGTTTAGGGAGATGATGAGTTGTGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGCTTACTCGCTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAATCAGATTTGTGCATTGGGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGAGATGCTTGGAAGATTTTCGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATCCTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAAATGGGGATCAAGATTGACGAAAATGTCGTTTCGGCCGTGCTTGGAGTGTTTGGTGCTGATACATCTTTGAAGCTGGGTCAACAAGTTCACTCGTTTATTGTCAAGAAGAACTTTGGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGATGAGTCAGTCAAGGTCTTTGATCGGATGCAAGAAAAGAATTCGGTCACATGGAACTCCATGATTGCAGCATTTGCCCGGCATGGAGATGGCTCGAAAGCTCTACATCTTTATGAGAACATGAAACTGGAAGGTGCAAAGCCAACGGATGTCACATTTCTATCGATACTTCATGCCTGTAGTCATGTCGGCCTAGTCAAAAAGGGCATGGAGTTCCTTGAATCCATGAGAAAAGACCACGGGATGAATCCGAGGAGTGAGCACTATGCTTGTGTCGTGGACATGTTAGGTAGGGCAGGACTGCTGTCTGAAGCTAGAAACTTCATTGAGAAACTGCCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCGTTGCTCGGCGCCTGCAGCCTTTATGGCGATTCCGAAATGGGGGAATATGCAGCCGAGCATCTGTTTTCGGAAACTCCATATAGCCCGGTCCCATATGTTTTGTTAGCCAACATATATTCTTCTGAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTAGGCATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCGGACATCATTTATCGAGTTTTGATGGAGCTGTTTGAAATAATGGTTGATGAAGGATATGTGCCTGATAAGAAGTTCATACTCTACTACTTGGATCCCGATGACAAGAGGAAACCAGTCGATAACGGTCGGGCTAACTGTCGGAGTTTCACAGGAACCGAAACTCCTTGGGAGTTGTTTTAA

mRNA sequence

TCTTCATCCAACCGGAGCCAGTTCGACGGCAGTCGCTCAGTCTCAAACAAAACAGGTCGCCGTCGCCGAGTCTGATTTCATGGATGAAACTCAAATGGGTATTTCAAAAACTGAGCTCCAAGCTTCCCTCTTGGGCCTCTTCTCTAATCTCCCCTTTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTCAACCATGTAGACACAAGCTTCCTTCTATCCATTTGTGGAAGAGAAGGGCTCCTCCATTTGGGCTCTTCCCTCCATGCCTCCATCATCAAGAGCTTCGAGCTCTCCAACCAAGAAAATGGGGTCATCATAATGAACTCTCTCATCTCCATGTACGAGAGGTGCGGTAAATTGCCCGATGCAGTCAAGATGTTTGATGAAATGCCCACAAGAGATTCTGTTTCGTGGAACGCATTGATCGGTGGGTTTATGAGAAATGGGATGTTTTGTGTTGGTTTTAGCTATTTTAAGGCTATGTGTTTGGTTGGTGATTGTAAATTTGACAAAGCTACTTTGACGACGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGCATTATCACAACGATGCATGGTTTGGCGTTTTTGAGTGGGTATGAACGAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTGGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTGATTACTTGGACGGCTGTGATCTCTGGTTTGGCTCAAAATGGGCATCACGAGCACAGTCTGAACCTGTTTAGGGAGATGATGAGTTGTGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGCTTACTCGCTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAATCAGATTTGTGCATTGGGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGAGATGCTTGGAAGATTTTCGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATCCTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAAATGGGGATCAAGATTGACGAAAATGTCGTTTCGGCCGTGCTTGGAGTGTTTGGTGCTGATACATCTTTGAAGCTGGGTCAACAAGTTCACTCGTTTATTGTCAAGAAGAACTTTGGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGATGAGTCAGTCAAGGTCTTTGATCGGATGCAAGAAAAGAATTCGGTCACATGGAACTCCATGATTGCAGCATTTGCCCGGCATGGAGATGGCTCGAAAGCTCTACATCTTTATGAGAACATGAAACTGGAAGGTGCAAAGCCAACGGATGTCACATTTCTATCGATACTTCATGCCTGTAGTCATGTCGGCCTAGTCAAAAAGGGCATGGAGTTCCTTGAATCCATGAGAAAAGACCACGGGATGAATCCGAGGAGTGAGCACTATGCTTGTGTCGTGGACATGTTAGGTAGGGCAGGACTGCTGTCTGAAGCTAGAAACTTCATTGAGAAACTGCCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCGTTGCTCGGCGCCTGCAGCCTTTATGGCGATTCCGAAATGGGGGAATATGCAGCCGAGCATCTGTTTTCGGAAACTCCATATAGCCCGGTCCCATATGTTTTGTTAGCCAACATATATTCTTCTGAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTAGGCATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCGGACATCATTTATCGAGTTTTGATGGAGCTGTTTGAAATAATGGTTGATGAAGGATATGTGCCTGATAAGAAGTTCATACTCTACTACTTGGATCCCGATGACAAGAGGAAACCAGTCGATAACGGTCGGGCTAACTGTCGGAGTTTCACAGGAACCGAAACTCCTTGGGAGTTGTTTTAA

Coding sequence (CDS)

ATGAAACTCAAATGGGTATTTCAAAAACTGAGCTCCAAGCTTCCCTCTTGGGCCTCTTCTCTAATCTCCCCTTTCAGAAACCAATTCCATCAAAACCCATTTGCAGAAACCTCCTCAACATTCGTCCTCAACCATGTAGACACAAGCTTCCTTCTATCCATTTGTGGAAGAGAAGGGCTCCTCCATTTGGGCTCTTCCCTCCATGCCTCCATCATCAAGAGCTTCGAGCTCTCCAACCAAGAAAATGGGGTCATCATAATGAACTCTCTCATCTCCATGTACGAGAGGTGCGGTAAATTGCCCGATGCAGTCAAGATGTTTGATGAAATGCCCACAAGAGATTCTGTTTCGTGGAACGCATTGATCGGTGGGTTTATGAGAAATGGGATGTTTTGTGTTGGTTTTAGCTATTTTAAGGCTATGTGTTTGGTTGGTGATTGTAAATTTGACAAAGCTACTTTGACGACGATTTTATCTGCTTGTGATGGCTTGGAGTTGTGTTGCATTATCACAACGATGCATGGTTTGGCGTTTTTGAGTGGGTATGAACGAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTGGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTGATTACTTGGACGGCTGTGATCTCTGGTTTGGCTCAAAATGGGCATCACGAGCACAGTCTGAACCTGTTTAGGGAGATGATGAGTTGTGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGCTTACTCGCTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAATCAGATTTGTGCATTGGGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGAGATGCTTGGAAGATTTTCGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATCCTTGCAGGGTTCACACAGAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAAATGGGGATCAAGATTGACGAAAATGTCGTTTCGGCCGTGCTTGGAGTGTTTGGTGCTGATACATCTTTGAAGCTGGGTCAACAAGTTCACTCGTTTATTGTCAAGAAGAACTTTGGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGATGAGTCAGTCAAGGTCTTTGATCGGATGCAAGAAAAGAATTCGGTCACATGGAACTCCATGATTGCAGCATTTGCCCGGCATGGAGATGGCTCGAAAGCTCTACATCTTTATGAGAACATGAAACTGGAAGGTGCAAAGCCAACGGATGTCACATTTCTATCGATACTTCATGCCTGTAGTCATGTCGGCCTAGTCAAAAAGGGCATGGAGTTCCTTGAATCCATGAGAAAAGACCACGGGATGAATCCGAGGAGTGAGCACTATGCTTGTGTCGTGGACATGTTAGGTAGGGCAGGACTGCTGTCTGAAGCTAGAAACTTCATTGAGAAACTGCCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCGTTGCTCGGCGCCTGCAGCCTTTATGGCGATTCCGAAATGGGGGAATATGCAGCCGAGCATCTGTTTTCGGAAACTCCATATAGCCCGGTCCCATATGTTTTGTTAGCCAACATATATTCTTCTGAAGGGAATTGGAAGGAAAGAGCAAGGACAATTAGGAAGATGAAGGAGGTAGGCATGGCCAAAGAAACTGGTATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTGGAGACAAAATGCATCCACAAGCGGACATCATTTATCGAGTTTTGATGGAGCTGTTTGAAATAATGGTTGATGAAGGATATGTGCCTGATAAGAAGTTCATACTCTACTACTTGGATCCCGATGACAAGAGGAAACCAGTCGATAACGGTCGGGCTAACTGTCGGAGTTTCACAGGAACCGAAACTCCTTGGGAGTTGTTTTAA

Protein sequence

MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGLLHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLSGYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPVDNGRANCRSFTGTETPWELF
Homology
BLAST of Tan0004167 vs. ExPASy Swiss-Prot
Match: Q9MA85 (Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E83 PE=2 SV=2)

HSP 1 Score: 760.8 bits (1963), Expect = 1.3e-218
Identity = 391/656 (59.60%), Postives = 474/656 (72.26%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           M  +WV QKL+S LPS  S+++SP +    Q+P  +  STF+LNHVD S LLSICGREG 
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQV-STFLLNHVDMSLLLSICGREGW 60

Query: 61  L-HLGSSLHASIIKSFELSN------QENGVIIMNSLISMYERCGKLPDAVKMFDEMPTR 120
             HLG  LHASIIK+ E           N +++ NSL+S+Y +CGKL DA+K+FDEMP R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 121 DSVSWNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTM 180
           D +S N +  GF+RN     GF   K M  +G   FD ATLT +LS CD  E C +   +
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 181 HGLAFLSGYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHH 240
           H LA LSGY++EI+VGN LI+SYFKCGC   GR VFD M  RNVIT TAVISGL +N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 241 EHSLNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSA 300
           E  L LF  +M  G V PNS+TYLS LAACSG + + EG QIH L+ K GI+S+LCI SA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 301 LMDMYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKID 360
           LMDMYSK GSI DAW IFES  E+D VS+TVIL G  QNG EEEAIQ F++ML+ G++ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 361 ENVVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFD 420
            NVVSAVLGV   D SL LG+Q+HS ++K+ F  N FV+NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 421 RMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKK 480
           RM ++N V+WNSMIAAFARHG G  AL LYE M     KPTDVTFLS+LHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 481 GMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGAC 540
           G E L  M++ HG+ PR+EHY C++DMLGRAGLL EA++FI+ LP +P   +WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 541 SLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETG 600
           S +GD+E+GEYAAE LF   P S   ++L+ANIYSS G WKERA+TI++MK +G+ KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 601 ISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPD 650
           IS IEI+ K HSF V DK+HPQA+ IY VL  LF +MVDEGY PDK+FIL Y   D
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDD 652

BLAST of Tan0004167 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.9e-112
Identity = 218/573 (38.05%), Postives = 338/573 (58.99%), Query Frame = 0

Query: 86  IMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNALIGGFMRNGMFCVGFSYFKAMCLVG 145
           I N L++MY +CG + DA ++F  M  +DSVSWN++I G  +NG F      +K+M    
Sbjct: 351 IGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSM-RRH 410

Query: 146 DCKFDKATLTTILSACDGLELCCIITTMHGLAFLSGYEREITVGNALISSYFKCGCVGLG 205
           D      TL + LS+C  L+   +   +HG +   G +  ++V NAL++ Y + G +   
Sbjct: 411 DILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNEC 470

Query: 206 RQVFDEMEERNVITWTAVISGLAQNGHH-EHSLNLFREMMSCGSVEPNSLTYLSLLAACS 265
           R++F  M E + ++W ++I  LA++      ++  F      G  + N +T+ S+L+A S
Sbjct: 471 RKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSVLSAVS 530

Query: 266 GLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGSIGDAWKIF-ESAEELDMVSLT 325
            L   E G QIHGL LK  I  +    +AL+  Y K G +    KIF   AE  D V+  
Sbjct: 531 SLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWN 590

Query: 326 VILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLKLGQQVHSFIVKK 385
            +++G+  N    +A+ +   ML+ G ++D  + + VL  F +  +L+ G +VH+  V+ 
Sbjct: 591 SMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRA 650

Query: 386 NFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLY 445
               +  V + L++MYSKCG LD +++ F+ M  +NS +WNSMI+ +ARHG G +AL L+
Sbjct: 651 CLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLF 710

Query: 446 ENMKLEGAKPTD-VTFLSILHACSHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLG 505
           E MKL+G  P D VTF+ +L ACSH GL+++G +  ESM   +G+ PR EH++C+ D+LG
Sbjct: 711 ETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLG 770

Query: 506 RAGLLSEARNFIEKLPERPGLLVWQALLGACSLYG--DSEMGEYAAEHLFSETPYSPVPY 565
           RAG L +  +FIEK+P +P +L+W+ +LGAC       +E+G+ AAE LF   P + V Y
Sbjct: 771 RAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNY 830

Query: 566 VLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIY 625
           VLL N+Y++ G W++  +  +KMK+  + KE G SW+ +   VH F  GDK HP AD+IY
Sbjct: 831 VLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIY 890

Query: 626 RVLMELFEIMVDEGYVPDKKFILYYLDPDDKRK 654
           + L EL   M D GYVP   F LY L+ ++K +
Sbjct: 891 KKLKELNRKMRDAGYVPQTGFALYDLEQENKEE 921

BLAST of Tan0004167 vs. ExPASy Swiss-Prot
Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 406.0 bits (1042), Expect = 8.4e-112
Identity = 213/601 (35.44%), Postives = 348/601 (57.90%), Query Frame = 0

Query: 52  LSICGREGLLHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMP 111
           L+ C       LG  +HAS++KS   S   + + + N+LI+MY RCGK+P A ++  +M 
Sbjct: 291 LTACDGFSYAKLGKEIHASVLKS---STHSSELYVCNALIAMYTRCGKMPQAERILRQMN 350

Query: 112 TRDSVSWNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIIT 171
             D V+WN+LI G+++N M+     +F  M   G  K D+ ++T+I++A   L       
Sbjct: 351 NADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGH-KSDEVSMTSIIAASGRLSNLLAGM 410

Query: 172 TMHGLAFLSGYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNG 231
            +H      G++  + VGN LI  Y KC       + F  M ++++I+WT VI+G AQN 
Sbjct: 411 ELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQND 470

Query: 232 HHEHSLNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIG 291
            H  +L LFR++     +E + +   S+L A S L+++    +IH  IL+ G+  D  I 
Sbjct: 471 CHVEALELFRDVAK-KRMEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGL-LDTVIQ 530

Query: 292 SALMDMYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIK 351
           + L+D+Y K  ++G A ++FES +  D+VS T +++    NG E EA+++F +M++ G+ 
Sbjct: 531 NELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLS 590

Query: 352 IDENVVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKV 411
            D   +  +L    + ++L  G+++H ++++K F     ++  +++MY+ CG L  +  V
Sbjct: 591 ADSVALLCILSAAASLSALNKGREIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAV 650

Query: 412 FDRMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLV 471
           FDR++ K  + + SMI A+  HG G  A+ L++ M+ E   P  ++FL++L+ACSH GL+
Sbjct: 651 FDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLL 710

Query: 472 KKGMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLG 531
            +G  FL+ M  ++ + P  EHY C+VDMLGRA  + EA  F++ +   P   VW ALL 
Sbjct: 711 DEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLA 770

Query: 532 ACSLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKE 591
           AC  + + E+GE AA+ L    P +P   VL++N+++ +G W +  +   KMK  GM K 
Sbjct: 771 ACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKH 830

Query: 592 TGISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDE-GYVPDKKFILYYLDPDD 651
            G SWIE+D KVH FT  DK HP++  IY  L E+   +  E GYV D KF+L+ +D  +
Sbjct: 831 PGCSWIEMDGKVHKFTARDKSHPESKEIYEKLSEVTRKLEREVGYVADTKFVLHNVDEGE 885

BLAST of Tan0004167 vs. ExPASy Swiss-Prot
Match: Q7Y211 (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 399.4 bits (1025), Expect = 7.9e-110
Identity = 218/609 (35.80%), Postives = 340/609 (55.83%), Query Frame = 0

Query: 58  EGLLHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVS 117
           EGL+ +G  +HA  ++  EL++      I+N+L++MY + GKL  +  +      RD V+
Sbjct: 216 EGLM-MGKQVHAYGLRKGELNS-----FIINTLVAMYGKLGKLASSKVLLGSFGGRDLVT 275

Query: 118 WNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLA 177
           WN ++    +N        Y + M L G  + D+ T++++L AC  LE+      +H  A
Sbjct: 276 WNTVLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEMLRTGKELHAYA 335

Query: 178 FLSG-YEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHS 237
             +G  +    VG+AL+  Y  C  V  GR+VFD M +R +  W A+I+G +QN H + +
Sbjct: 336 LKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEA 395

Query: 238 LNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMD 297
           L LF  M     +  NS T   ++ AC    A      IHG ++K G+  D  + + LMD
Sbjct: 396 LLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMD 455

Query: 298 MYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDEN- 357
           MYS+ G I  A +IF   E+ D+V+   ++ G+  +   E+A+ +  KM  +  K+ +  
Sbjct: 456 MYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGA 515

Query: 358 ----------VVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGAL 417
                      +  +L    A ++L  G+++H++ +K N   +  V + L++MY+KCG L
Sbjct: 516 SRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCL 575

Query: 418 DESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHAC 477
             S KVFD++ +KN +TWN +I A+  HG+G +A+ L   M ++G KP +VTF+S+  AC
Sbjct: 576 QMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAAC 635

Query: 478 SHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLP---ERPG 537
           SH G+V +G+     M+ D+G+ P S+HYACVVD+LGRAG + EA   +  +P    + G
Sbjct: 636 SHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAG 695

Query: 538 LLVWQALLGACSLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRK 597
              W +LLGA  ++ + E+GE AA++L    P     YVLLANIYSS G W +     R 
Sbjct: 696 --AWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRN 755

Query: 598 MKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFI 652
           MKE G+ KE G SWIE   +VH F  GD  HPQ++ +   L  L+E M  EGYVPD   +
Sbjct: 756 MKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCV 815

BLAST of Tan0004167 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 3.0e-109
Identity = 216/575 (37.57%), Postives = 331/575 (57.57%), Query Frame = 0

Query: 81  ENGVIIMNSLISMYERC-GKLPDAVKMFDEMPTRDSVSWNALIGGFMRNGMFCVGFSYFK 140
           E+ V +  SLI M+ +      +A K+FD+M   + V+W  +I   M+ G       +F 
Sbjct: 199 ESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFL 258

Query: 141 AMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLSGYEREITVGNALISSYFKC 200
            M L G  + DK TL+++ SAC  LE   +   +H  A  SG   ++    +L+  Y KC
Sbjct: 259 DMVLSG-FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDVEC--SLVDMYAKC 318

Query: 201 ---GCVGLGRQVFDEMEERNVITWTAVISGLAQNGH-HEHSLNLFREMMSCGSVEPNSLT 260
              G V   R+VFD ME+ +V++WTA+I+G  +N +    ++NLF EM++ G VEPN  T
Sbjct: 319 SADGSVDDCRKVFDRMEDHSVMSWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFT 378

Query: 261 YLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGSIGDAWKIFESAE 320
           + S   AC  L     G Q+ G   K G+ S+  + ++++ M+ KS  + DA + FES  
Sbjct: 379 FSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLS 438

Query: 321 ELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLKLGQQ 380
           E ++VS    L G  +N   E+A ++  ++ +  + +     +++L       S++ G+Q
Sbjct: 439 EKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQ 498

Query: 381 VHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNSVTWNSMIAAFARHGD 440
           +HS +VK    CN  V N LI+MYSKCG++D + +VF+ M+ +N ++W SMI  FA+HG 
Sbjct: 499 IHSQVVKLGLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGF 558

Query: 441 GSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLESMRKDHGMNPRSEHYA 500
             + L  +  M  EG KP +VT+++IL ACSHVGLV +G     SM +DH + P+ EHYA
Sbjct: 559 AIRVLETFNQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYA 618

Query: 501 CVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSEMGEYAAEHLFSETPY 560
           C+VD+L RAGLL++A  FI  +P +  +LVW+  LGAC ++ ++E+G+ AA  +    P 
Sbjct: 619 CMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPN 678

Query: 561 SPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQ 620
            P  Y+ L+NIY+  G W+E     RKMKE  + KE G SWIE+  K+H F VGD  HP 
Sbjct: 679 EPAAYIQLSNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPN 738

Query: 621 ADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDD 651
           A  IY  L  L   +   GYVPD   +L+ L+ ++
Sbjct: 739 AHQIYDELDRLITEIKRCGYVPDTDLVLHKLEEEN 770

BLAST of Tan0004167 vs. NCBI nr
Match: XP_022946254.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. NCBI nr
Match: XP_022999024.1 (pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 605/657 (92.09%), Postives = 628/657 (95.59%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQ+LSSKLPSWASS ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQRLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG F  GFSYFKAMCLVGDCKFDKATLTTILSACDG E+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGSEMCCIIEMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GYE+EITVGNALISSYFKCGCVG GRQ+F EMEERNVITWTAVISGLAQNGHHEHSL LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGRQLFYEMEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REMMSCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+ID NVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDANVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETPYS VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPVDN 658
           KKVHSFTVGDK HPQADIIY VLM+LF +MVDEGYVPDK FIL+YLDPDDK++P+DN
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVLMVDEGYVPDKNFILFYLDPDDKKEPIDN 657

BLAST of Tan0004167 vs. NCBI nr
Match: XP_022946256.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita moschata])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. NCBI nr
Match: XP_022946255.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. NCBI nr
Match: KAG6599242.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1235.3 bits (3195), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 627/663 (94.57%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQ LSSKLPSWASS ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR G 
Sbjct: 1   MKLKWVFQNLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRHGN 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG F  GFSYFKAMCLVGDCKFDKATLTTILSACDG E+ CII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGAEMYCIIEMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GYE+EITVGNALISSYFKCGCVG G+QVF E EERNVITWTAVISGLAQNGHHEHSL LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGKQVFYETEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REMMSCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFE AEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFELAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETPYS VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. ExPASy TrEMBL
Match: A0A6J1KFW9 (pentatricopeptide repeat-containing protein At3g05340 OS=Cucurbita maxima OX=3661 GN=LOC111493539 PE=3 SV=1)

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 605/657 (92.09%), Postives = 628/657 (95.59%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQ+LSSKLPSWASS ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQRLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG F  GFSYFKAMCLVGDCKFDKATLTTILSACDG E+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGSEMCCIIEMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GYE+EITVGNALISSYFKCGCVG GRQ+F EMEERNVITWTAVISGLAQNGHHEHSL LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGRQLFYEMEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REMMSCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+ID NVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDANVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETPYS VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPVDN 658
           KKVHSFTVGDK HPQADIIY VLM+LF +MVDEGYVPDK FIL+YLDPDDK++P+DN
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVLMVDEGYVPDKNFILFYLDPDDKKEPIDN 657

BLAST of Tan0004167 vs. ExPASy TrEMBL
Match: A0A6J1G350 (pentatricopeptide repeat-containing protein At3g05340 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. ExPASy TrEMBL
Match: A0A6J1G3C0 (pentatricopeptide repeat-containing protein At3g05340 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. ExPASy TrEMBL
Match: A0A6J1G3A2 (pentatricopeptide repeat-containing protein At3g05340 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 607/663 (91.55%), Postives = 632/663 (95.32%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQKLSSKLPSWA+S ISPFRNQFHQNPFAETSSTFVLNHVD SFLLS+CGR+G 
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           L+LGSSLHASIIKSFELSN ENGV+IMNSLISMYERCGKLPDAVK+FDEMPTRD+VSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGFMRNG FC GFSYFKAMCLVGDCKFDKATLTTILSACDGLE+CCII  MHGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           GY++EITVGNALISSYFKCGCVG G+QVF EMEERNVITWTAVISGLAQNG+HEHSL LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           REM+SCGSVEPNSLTYLSLL ACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
            GSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+IDENVVSAV
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSFIVKKNF CNPFVSNGLINMYSKCGALDESVKVFDRMQ +NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGDG KALHLYENMKLEGAKPTD+TFLS+LHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDH MNPRSEHYACVVDMLGRAGLLSEAR FIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAAEHLFSETP S VPYVLLANIYSSEGNWKERARTIRKMKE GMAKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPV--DNG 660
           KKVHSFTVGDK HPQADIIY VLM+LF  MVDEGYVPDKKFIL+YLDPDDK++P+  DNG
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEPIDNDNG 660

Query: 661 RAN 662
           R N
Sbjct: 661 RVN 663

BLAST of Tan0004167 vs. ExPASy TrEMBL
Match: A0A5A7U175 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold385G00490 PE=3 SV=1)

HSP 1 Score: 1194.9 bits (3090), Expect = 0.0e+00
Identity = 587/675 (86.96%), Postives = 624/675 (92.44%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           MKLKWVFQK SS LPS  +SLI PFRNQFHQNPFAETSSTFVLNH+D SFLLSICGREG 
Sbjct: 1   MKLKWVFQKSSSHLPSLVTSLIFPFRNQFHQNPFAETSSTFVLNHLDVSFLLSICGREGN 60

Query: 61  LHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNA 120
           LHLGSSLHASIIKSFE SN  NGV+IMNSLISMY+RCGKL DAVK+FDEM TRD++SWNA
Sbjct: 61  LHLGSSLHASIIKSFEPSNHYNGVVIMNSLISMYDRCGKLSDAVKVFDEMLTRDTISWNA 120

Query: 121 LIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLS 180
           LIGGF+RNG F  GFSYFKAMCLVGDCKFDKATLTTILSACDGLE CCII  MHGLAFLS
Sbjct: 121 LIGGFVRNGKFFAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEFCCIIKMMHGLAFLS 180

Query: 181 GYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHSLNLF 240
           G+ +EITVGNAL+SSY KCGCVGLG QVFDEM ERNVITWTAVISGLA+NGHHEHSL LF
Sbjct: 181 GFGQEITVGNALVSSYLKCGCVGLGMQVFDEMGERNVITWTAVISGLARNGHHEHSLKLF 240

Query: 241 REMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300
           +EMMS GSVEPNSLTYLSLL ACSGLEAL+EGCQIHGLILKLGIQSDLCIGSALMDMYSK
Sbjct: 241 KEMMSYGSVEPNSLTYLSLLTACSGLEALKEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 301 SGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAV 360
           SG IG+AWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGI+ID NVVS V
Sbjct: 301 SGRIGEAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDGNVVSVV 360

Query: 361 LGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNS 420
           LGVFGADTSL+LGQQVHSF+VKKNF CNPFVSNGLINMYSKCGALDES+KVFDRM+E+NS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKKNFICNPFVSNGLINMYSKCGALDESMKVFDRMRERNS 420

Query: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLES 480
           VTWNSMIAAFARHGD SKAL LYENM+LEGAKPTDVTFLS+LHACSH GLVKKGMEFL+S
Sbjct: 421 VTWNSMIAAFARHGDASKALQLYENMQLEGAKPTDVTFLSLLHACSHAGLVKKGMEFLKS 480

Query: 481 MRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSE 540
           M KDHGMNPRSEHYACVVDMLGRAG+LSEARNFIEKLPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHGMNPRSEHYACVVDMLGRAGMLSEARNFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 541 MGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEID 600
           MG+YAA+HLF ETP+S VPYVLLANIYSSEGNWKERARTIR+MKEVG AKETGISWIEID
Sbjct: 541 MGKYAADHLFLETPHSTVPYVLLANIYSSEGNWKERARTIRRMKEVGTAKETGISWIEID 600

Query: 601 KKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDDKRKPVDNGRA 660
           KKVHSFTVGDKMHPQ +IIY VL ELF +MVDEGYVPDKKFILYYLD DD+R P+ N +A
Sbjct: 601 KKVHSFTVGDKMHPQTEIIYGVLTELFVLMVDEGYVPDKKFILYYLD-DDRRDPIHNDQA 660

Query: 661 NCRSFTGTETPWELF 676
             ++   TE  WELF
Sbjct: 661 TRQNAIETEVVWELF 674

BLAST of Tan0004167 vs. TAIR 10
Match: AT3G05340.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 760.8 bits (1963), Expect = 9.6e-220
Identity = 391/656 (59.60%), Postives = 474/656 (72.26%), Query Frame = 0

Query: 1   MKLKWVFQKLSSKLPSWASSLISPFRNQFHQNPFAETSSTFVLNHVDTSFLLSICGREGL 60
           M  +WV QKL+S LPS  S+++SP +    Q+P  +  STF+LNHVD S LLSICGREG 
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQV-STFLLNHVDMSLLLSICGREGW 60

Query: 61  L-HLGSSLHASIIKSFELSN------QENGVIIMNSLISMYERCGKLPDAVKMFDEMPTR 120
             HLG  LHASIIK+ E           N +++ NSL+S+Y +CGKL DA+K+FDEMP R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 121 DSVSWNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTM 180
           D +S N +  GF+RN     GF   K M  +G   FD ATLT +LS CD  E C +   +
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 181 HGLAFLSGYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHH 240
           H LA LSGY++EI+VGN LI+SYFKCGC   GR VFD M  RNVIT TAVISGL +N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 241 EHSLNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSA 300
           E  L LF  +M  G V PNS+TYLS LAACSG + + EG QIH L+ K GI+S+LCI SA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 301 LMDMYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKID 360
           LMDMYSK GSI DAW IFES  E+D VS+TVIL G  QNG EEEAIQ F++ML+ G++ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 361 ENVVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFD 420
            NVVSAVLGV   D SL LG+Q+HS ++K+ F  N FV+NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 421 RMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKK 480
           RM ++N V+WNSMIAAFARHG G  AL LYE M     KPTDVTFLS+LHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 481 GMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGAC 540
           G E L  M++ HG+ PR+EHY C++DMLGRAGLL EA++FI+ LP +P   +WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 541 SLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETG 600
           S +GD+E+GEYAAE LF   P S   ++L+ANIYSS G WKERA+TI++MK +G+ KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 601 ISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPD 650
           IS IEI+ K HSF V DK+HPQA+ IY VL  LF +MVDEGY PDK+FIL Y   D
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDD 652

BLAST of Tan0004167 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 407.5 bits (1046), Expect = 2.1e-113
Identity = 218/573 (38.05%), Postives = 338/573 (58.99%), Query Frame = 0

Query: 86  IMNSLISMYERCGKLPDAVKMFDEMPTRDSVSWNALIGGFMRNGMFCVGFSYFKAMCLVG 145
           I N L++MY +CG + DA ++F  M  +DSVSWN++I G  +NG F      +K+M    
Sbjct: 351 IGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSM-RRH 410

Query: 146 DCKFDKATLTTILSACDGLELCCIITTMHGLAFLSGYEREITVGNALISSYFKCGCVGLG 205
           D      TL + LS+C  L+   +   +HG +   G +  ++V NAL++ Y + G +   
Sbjct: 411 DILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNEC 470

Query: 206 RQVFDEMEERNVITWTAVISGLAQNGHH-EHSLNLFREMMSCGSVEPNSLTYLSLLAACS 265
           R++F  M E + ++W ++I  LA++      ++  F      G  + N +T+ S+L+A S
Sbjct: 471 RKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSVLSAVS 530

Query: 266 GLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGSIGDAWKIF-ESAEELDMVSLT 325
            L   E G QIHGL LK  I  +    +AL+  Y K G +    KIF   AE  D V+  
Sbjct: 531 SLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWN 590

Query: 326 VILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLKLGQQVHSFIVKK 385
            +++G+  N    +A+ +   ML+ G ++D  + + VL  F +  +L+ G +VH+  V+ 
Sbjct: 591 SMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRA 650

Query: 386 NFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLY 445
               +  V + L++MYSKCG LD +++ F+ M  +NS +WNSMI+ +ARHG G +AL L+
Sbjct: 651 CLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLF 710

Query: 446 ENMKLEGAKPTD-VTFLSILHACSHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLG 505
           E MKL+G  P D VTF+ +L ACSH GL+++G +  ESM   +G+ PR EH++C+ D+LG
Sbjct: 711 ETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLG 770

Query: 506 RAGLLSEARNFIEKLPERPGLLVWQALLGACSLYG--DSEMGEYAAEHLFSETPYSPVPY 565
           RAG L +  +FIEK+P +P +L+W+ +LGAC       +E+G+ AAE LF   P + V Y
Sbjct: 771 RAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNY 830

Query: 566 VLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIY 625
           VLL N+Y++ G W++  +  +KMK+  + KE G SW+ +   VH F  GDK HP AD+IY
Sbjct: 831 VLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIY 890

Query: 626 RVLMELFEIMVDEGYVPDKKFILYYLDPDDKRK 654
           + L EL   M D GYVP   F LY L+ ++K +
Sbjct: 891 KKLKELNRKMRDAGYVPQTGFALYDLEQENKEE 921

BLAST of Tan0004167 vs. TAIR 10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 399.4 bits (1025), Expect = 5.6e-111
Identity = 218/609 (35.80%), Postives = 340/609 (55.83%), Query Frame = 0

Query: 58  EGLLHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFDEMPTRDSVS 117
           EGL+ +G  +HA  ++  EL++      I+N+L++MY + GKL  +  +      RD V+
Sbjct: 216 EGLM-MGKQVHAYGLRKGELNS-----FIINTLVAMYGKLGKLASSKVLLGSFGGRDLVT 275

Query: 118 WNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLA 177
           WN ++    +N        Y + M L G  + D+ T++++L AC  LE+      +H  A
Sbjct: 276 WNTVLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEMLRTGKELHAYA 335

Query: 178 FLSG-YEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLAQNGHHEHS 237
             +G  +    VG+AL+  Y  C  V  GR+VFD M +R +  W A+I+G +QN H + +
Sbjct: 336 LKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEA 395

Query: 238 LNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMD 297
           L LF  M     +  NS T   ++ AC    A      IHG ++K G+  D  + + LMD
Sbjct: 396 LLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMD 455

Query: 298 MYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDEN- 357
           MYS+ G I  A +IF   E+ D+V+   ++ G+  +   E+A+ +  KM  +  K+ +  
Sbjct: 456 MYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGA 515

Query: 358 ----------VVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGAL 417
                      +  +L    A ++L  G+++H++ +K N   +  V + L++MY+KCG L
Sbjct: 516 SRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCL 575

Query: 418 DESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHAC 477
             S KVFD++ +KN +TWN +I A+  HG+G +A+ L   M ++G KP +VTF+S+  AC
Sbjct: 576 QMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAAC 635

Query: 478 SHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLP---ERPG 537
           SH G+V +G+     M+ D+G+ P S+HYACVVD+LGRAG + EA   +  +P    + G
Sbjct: 636 SHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAG 695

Query: 538 LLVWQALLGACSLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRK 597
              W +LLGA  ++ + E+GE AA++L    P     YVLLANIYSS G W +     R 
Sbjct: 696 --AWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRN 755

Query: 598 MKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIYRVLMELFEIMVDEGYVPDKKFI 652
           MKE G+ KE G SWIE   +VH F  GD  HPQ++ +   L  L+E M  EGYVPD   +
Sbjct: 756 MKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCV 815

BLAST of Tan0004167 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 397.5 bits (1020), Expect = 2.1e-110
Identity = 216/575 (37.57%), Postives = 331/575 (57.57%), Query Frame = 0

Query: 81  ENGVIIMNSLISMYERC-GKLPDAVKMFDEMPTRDSVSWNALIGGFMRNGMFCVGFSYFK 140
           E+ V +  SLI M+ +      +A K+FD+M   + V+W  +I   M+ G       +F 
Sbjct: 199 ESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFL 258

Query: 141 AMCLVGDCKFDKATLTTILSACDGLELCCIITTMHGLAFLSGYEREITVGNALISSYFKC 200
            M L G  + DK TL+++ SAC  LE   +   +H  A  SG   ++    +L+  Y KC
Sbjct: 259 DMVLSG-FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDVEC--SLVDMYAKC 318

Query: 201 ---GCVGLGRQVFDEMEERNVITWTAVISGLAQNGH-HEHSLNLFREMMSCGSVEPNSLT 260
              G V   R+VFD ME+ +V++WTA+I+G  +N +    ++NLF EM++ G VEPN  T
Sbjct: 319 SADGSVDDCRKVFDRMEDHSVMSWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFT 378

Query: 261 YLSLLAACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGSIGDAWKIFESAE 320
           + S   AC  L     G Q+ G   K G+ S+  + ++++ M+ KS  + DA + FES  
Sbjct: 379 FSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLS 438

Query: 321 ELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLKLGQQ 380
           E ++VS    L G  +N   E+A ++  ++ +  + +     +++L       S++ G+Q
Sbjct: 439 EKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQ 498

Query: 381 VHSFIVKKNFGCNPFVSNGLINMYSKCGALDESVKVFDRMQEKNSVTWNSMIAAFARHGD 440
           +HS +VK    CN  V N LI+MYSKCG++D + +VF+ M+ +N ++W SMI  FA+HG 
Sbjct: 499 IHSQVVKLGLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGF 558

Query: 441 GSKALHLYENMKLEGAKPTDVTFLSILHACSHVGLVKKGMEFLESMRKDHGMNPRSEHYA 500
             + L  +  M  EG KP +VT+++IL ACSHVGLV +G     SM +DH + P+ EHYA
Sbjct: 559 AIRVLETFNQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYA 618

Query: 501 CVVDMLGRAGLLSEARNFIEKLPERPGLLVWQALLGACSLYGDSEMGEYAAEHLFSETPY 560
           C+VD+L RAGLL++A  FI  +P +  +LVW+  LGAC ++ ++E+G+ AA  +    P 
Sbjct: 619 CMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPN 678

Query: 561 SPVPYVLLANIYSSEGNWKERARTIRKMKEVGMAKETGISWIEIDKKVHSFTVGDKMHPQ 620
            P  Y+ L+NIY+  G W+E     RKMKE  + KE G SWIE+  K+H F VGD  HP 
Sbjct: 679 EPAAYIQLSNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPN 738

Query: 621 ADIIYRVLMELFEIMVDEGYVPDKKFILYYLDPDD 651
           A  IY  L  L   +   GYVPD   +L+ L+ ++
Sbjct: 739 AHQIYDELDRLITEIKRCGYVPDTDLVLHKLEEEN 770

BLAST of Tan0004167 vs. TAIR 10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 395.6 bits (1015), Expect = 8.1e-110
Identity = 211/581 (36.32%), Postives = 335/581 (57.66%), Query Frame = 0

Query: 49  SFLLSICGREGLLHLGSSLHASIIKSFELSNQENGVIIMNSLISMYERCGKLPDAVKMFD 108
           S +LS C     L  G  +HA I++       E    +MN LI  Y +CG++  A K+F+
Sbjct: 253 STVLSACSILPFLEGGKQIHAHILR----YGLEMDASLMNVLIDSYVKCGRVIAAHKLFN 312

Query: 109 EMPTRDSVSWNALIGGFMRNGMFCVGFSYFKAMCLVGDCKFDKATLTTILSACDGLELCC 168
            MP ++ +SW  L+ G+ +N +       F +M   G  K D    ++IL++C  L    
Sbjct: 313 GMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFG-LKPDMYACSSILTSCASLHALG 372

Query: 169 IITTMHGLAFLSGYEREITVGNALISSYFKCGCVGLGRQVFDEMEERNVITWTAVISGLA 228
             T +H     +    +  V N+LI  Y KC C+   R+VFD     +V+ + A+I G +
Sbjct: 373 FGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGYS 432

Query: 229 QNGHH---EHSLNLFREMMSCGSVEPNSLTYLSLLAACSGLEALEEGCQIHGLILKLGIQ 288
           + G       +LN+FR+ M    + P+ LT++SLL A + L +L    QIHGL+ K G+ 
Sbjct: 433 RLGTQWELHEALNIFRD-MRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLN 492

Query: 289 SDLCIGSALMDMYSKSGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKM 348
            D+  GSAL+D+YS    + D+  +F+  +  D+V    + AG+ Q    EEA+ +FL++
Sbjct: 493 LDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLEL 552

Query: 349 LKMGIKIDENVVSAVLGVFGADTSLKLGQQVHSFIVKKNFGCNPFVSNGLINMYSKCGAL 408
                + DE   + ++   G   S++LGQ+ H  ++K+   CNP+++N L++MY+KCG+ 
Sbjct: 553 QLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSP 612

Query: 409 DESVKVFDRMQEKNSVTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDVTFLSILHAC 468
           +++ K FD    ++ V WNS+I+++A HG+G KAL + E M  EG +P  +TF+ +L AC
Sbjct: 613 EDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSAC 672

Query: 469 SHVGLVKKGMEFLESMRKDHGMNPRSEHYACVVDMLGRAGLLSEARNFIEKLPERPGLLV 528
           SH GLV+ G++  E M +  G+ P +EHY C+V +LGRAG L++AR  IEK+P +P  +V
Sbjct: 673 SHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVSLLGRAGRLNKARELIEKMPTKPAAIV 732

Query: 529 WQALLGACSLYGDSEMGEYAAEHLFSETPYSPVPYVLLANIYSSEGNWKERARTIRKMKE 588
           W++LL  C+  G+ E+ E+AAE      P     + +L+NIY+S+G W E  +   +MK 
Sbjct: 733 WRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKKVRERMKV 792

Query: 589 VGMAKETGISWIEIDKKVHSFTVGDKMHPQADIIYRVLMEL 627
            G+ KE G SWI I+K+VH F   DK H +A+ IY VL +L
Sbjct: 793 EGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDL 826

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9MA851.3e-21859.60Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX... [more]
Q9FIB22.9e-11238.05Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... [more]
Q9M1V38.4e-11235.44Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop... [more]
Q7Y2117.9e-11035.80Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop... [more]
Q5G1T13.0e-10937.57Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_022946254.10.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita mosc... [more]
XP_022999024.10.0e+0092.09pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima][more]
XP_022946256.10.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita mosc... [more]
XP_022946255.10.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita mosc... [more]
KAG6599242.10.0e+0091.55Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A6J1KFW90.0e+0092.09pentatricopeptide repeat-containing protein At3g05340 OS=Cucurbita maxima OX=366... [more]
A0A6J1G3500.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X3 OS=Cucurbita mo... [more]
A0A6J1G3C00.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X2 OS=Cucurbita mo... [more]
A0A6J1G3A20.0e+0091.55pentatricopeptide repeat-containing protein At3g05340 isoform X1 OS=Cucurbita mo... [more]
A0A5A7U1750.0e+0086.96Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT3G05340.19.6e-22059.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G09950.12.1e-11338.05Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G57430.15.6e-11135.80Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G49170.12.1e-11037.57Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G39530.18.1e-11036.32Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 484..616
e-value: 8.3E-8
score: 34.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 28..165
e-value: 3.9E-20
score: 73.9
coord: 271..368
e-value: 7.6E-15
score: 56.7
coord: 172..270
e-value: 1.1E-21
score: 79.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 369..483
e-value: 2.2E-28
score: 101.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 215..263
e-value: 6.5E-10
score: 39.1
coord: 418..466
e-value: 1.6E-12
score: 47.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 218..252
e-value: 3.4E-7
score: 28.0
coord: 321..354
e-value: 1.0E-4
score: 20.2
coord: 190..216
e-value: 1.1E-4
score: 20.1
coord: 421..454
e-value: 1.6E-7
score: 29.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 292..315
e-value: 0.7
score: 10.3
coord: 87..113
e-value: 4.5E-5
score: 23.4
coord: 116..141
e-value: 0.074
score: 13.3
coord: 320..350
e-value: 5.9E-4
score: 19.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 216..250
score: 10.665402
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 419..453
score: 12.605553
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 83..117
score: 9.119859
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 318..352
score: 9.711769
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 388..418
score: 9.525427
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 593..653
e-value: 8.6E-10
score: 38.6
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 30..642
NoneNo IPR availablePANTHERPTHR47925:SF89BNAA05G31840D PROTEINcoord: 30..642

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004167.1Tan0004167.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding