Tan0004841 (gene) Snake gourd v1

Overview
Name: Tan0004841
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG11: 2449757 .. 2453337 (+)
RNA-Seq Expression: Tan0004841
Synteny: Tan0004841
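The location field can be turned into a feature length with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention in GFF-style annotation that this page does not state explicitly); the function names `parse_location` and `feature_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG11: 2449757 .. 2453337 (+)'
    into (sequence id, start, end, strand)."""
    match = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

seqid, start, end, strand = parse_location("LG11: 2449757 .. 2453337 (+)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene spans 2453337 - 2449757 + 1 = 3581 bp on the plus strand of LG11.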
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGCACAACAAATTGTAGGTGTCACTTTTACTCTAGTTACTGTGCACTCTCGCCTAATTACCTATGACACTCTAAATTCTAGCTTACTCATGTGACTCTCACTCTAGCTTACTCTGTGGCAATCTCACTCTAGTTACTGTTTGGAAAATCTTTAAAAGAATGTGTTGTTCTTTAGGAACACACGTCTCAAAAAAAGTGTTCTCAAGGAAATCGTATTGACTTACCATCACGGGATTTTGAGACTCTCGGTCTCGATGAACATAAAAATTGTAGGTGTCACTCTCACTCTAAGTTCTTGTGGCACTCTCGCCTAGCAATACCTATGACACTCTCTCTAATTTCTGGGCTATACGTGACACACTCACACTCTAGATTACCTATGACACTCTCACTCTAGTTAACTTTTCTTTTATTAAAGAATTAGACTTTTGTTGAATTTATAGATTGAATTTCTTTTAAATCTATTGTTGATTTATCAATCTTATAATATTTTGGAATTATTGATAATTTCGTTTATAATATGAATTGATGTTGAAATATTATTACCAATTAGTTTTGTGTGAGAGTTATTTATGTATTCTTAAAATTTTCAACTTTATAAATTTTGCATCTTTAACTTATTTTGCTAGTTAATGTTTTGATCTCAAATAAACATTAAAAATTTTCAACTTATGAATCGTAATATTTAAATAAGTGATATATGTTGACCTAAAAAGTATTTAGAAAATACAAAAAACATAAAGTATAGAGGTTATGTTAATTTGAACTAGAAATTTGAGTGTCACAGTAAATGTGAGAGCGAGAGTGATACCTACAATTTACTTGTGACACTCTCTAAATTCTAATTATGACGTTATCAATCGTTCGTTAGCATAGTTTAGGAGTTGACGGAGAGAAACAAAACCCGATGAGTCATTTATGATGCTAAAATAATCAACATGTACCTAGATCGAGAGTGGTTGACGACATCAAATGGGTCTTTTGTAGTTAAATCTAAAATATAAGTCGATAATCCAACCAAAAAGAGGTCATCCGTACAAAAACACCGAAATGGATGTGGATACCTCCAATTGATGAACTTGAAACCAGAAAGAACCACTAATTCCAAGGCGGAACTGGAAGCACCGTCAGTAACGGGGCTACTTGCTGAAGTAGCTTGCTAAAGCTCCACGAGAAACTATCATGAGTTTCTCCATCAATTTTGCCGTCGTCCAACCTGCTCAATCCATTCTCTATCCTTCTCGAACACCCAATTTCCAGGCTAGTCTCTCATTCCCTCGAGCAATTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCAGATTTGGGATTTATTTGCGTAATGCCTATAAATTTTGACAGACAAGGAATGCAGCTTTAACACCAACACCTTCTGCAACCCAAAAAGCTATTGCAGCCTCTGGTATTATAATTCCCAATTCGGTTACAGCTGATAAAACTAATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTCCTGCGTGATTGTGTGGACGTAAGGTTTCTGAAACAAGCTAAGACTATTCATGGGTTTCTGTTAAAATCAAAATTTTCAAACCACGAGTCCCTTGTATTGCTTAATCATGTAGCTCACGCTTATTCGAAATGCTCCGATATTGGTGCTGCCTGTCGTCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCGTGGACTGTCATAATTGTTGGATTGGCTGATAATGGTTTGTTTCTCGATGGGTTTGAGTTATTCTGTGAAATGCAGAGCCTGGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTCCACTGACTTGGGCAGAATGGTTCATGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTGTCTACTGCTCTTCTTAATATGTATGCAAAGTTACAACAGATTGAGGATTCATTCAAGGTGTTTAACAACATGACTGAAGCTAATGTAGTCTCGTGG
AATGCTATGATCTCAGGGTTCACATCAAATGGTCTTTACTTAGAGGCTTTTGATCATTTTCTCAGAATGAAGAGAGAAGGAGTAACACCCAATGCACAAACATTTATCGGTCTTGCAAAAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCACTTTGCTTCTGAGTTAGGTGTGGACTCTAATACTCTTGTGGGAACTGCCCTCATTGATATGCATTCTAAATGTGGATCATTGCAAGAGGCAGGATATATCTTTGACTCACATTTTACAAATTGTCGGGTTAATGCCCCGTGGAATGCAATGATTTCGGGGTATTTACAGAGCGAATGTAATGAAAAAGCATTGGAATTGTTTGCCAAAATGTGTCAAAACGACATACACCTGGACCATTACACTTATTGTAGTGTATTTAATGCTATAGCCGTGTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGATTGGAAGTGAATCATATAAATATCTCCAATGCAGTGGCAAATGCATATGCTAAATGTGGATCGCTGGACGATGTAAGTAAGGTCTTTTACAGGATTGAAGAGAGAGATTTAGTATCTTGGACCACCCTAGTGACTGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGAGAGAAGGAGGTTTTACACCCAATCAATTTGCCTTTTCTAGTGTGCTCGTTTCATGTGCTAGCCTTTGCTTACTCGAGTATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGATATGGACAAATGCATAGAAAGTGCTCTGATTGACATGTATGCCAAATGCGGTAATCTGGTGGAGGCGAAGAAGGTTTTCAATAGAATCTCTAATGCCGATACAGTTTCATGGACTGCTATAATATCGGGTCATGCTCAACATGGTGTTGTGGATGACGCACTTCAACTCTTTAGAAGGATGGAACAGTTAGGTGTGGAGCCCAATGCTGTCACTTTTTTGTGTGTTCTATTTGCATGTAGCCATGGAGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATGGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATCGTTGATCTCTTAAGTCGTGTGGGGCGTCTAAACGATGCAATGGAGTTTATAAGTAGGATGCCCATAGAGCCCAATGAAATGGTTTGGCAGACCTTGTTGGGAGCATGCAGGGTCCATGGTAATGTTGGATTGGGAGAGCTAGCTGCTCAGAAGATACTTGCTTTTAGAGCAGAAAACTCTGCTACCTTTGTTCTTTTATCCAACACCTATATCGAATCAGGGAGTTACAAAGATGGACTTAGTTTGCGACATGCGATGAAAGAGCAGGGCGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCATAAGTTTTATGCAGGTGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTAAGGTTGAAGGCCAATTCTTTGGATGATGTACCAGATTTGAGTTATGAGCTATAA

mRNA sequence

ATGTGCACAACAAATTGTAGGTGTCACTTTTACTCTAGTTACTGTACAAGGAATGCAGCTTTAACACCAACACCTTCTGCAACCCAAAAAGCTATTGCAGCCTCTGGTATTATAATTCCCAATTCGGTTACAGCTGATAAAACTAATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTCCTGCGTGATTGTGTGGACGTAAGGTTTCTGAAACAAGCTAAGACTATTCATGGGTTTCTGTTAAAATCAAAATTTTCAAACCACGAGTCCCTTGTATTGCTTAATCATGTAGCTCACGCTTATTCGAAATGCTCCGATATTGGTGCTGCCTGTCGTCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCGTGGACTGTCATAATTGTTGGATTGGCTGATAATGGTTTGTTTCTCGATGGGTTTGAGTTATTCTGTGAAATGCAGAGCCTGGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTCCACTGACTTGGGCAGAATGGTTCATGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTGTCTACTGCTCTTCTTAATATGTATGCAAAGTTACAACAGATTGAGGATTCATTCAAGGTGTTTAACAACATGACTGAAGCTAATGTAGTCTCGTGGAATGCTATGATCTCAGGGTTCACATCAAATGGTCTTTACTTAGAGGCTTTTGATCATTTTCTCAGAATGAAGAGAGAAGGAGTAACACCCAATGCACAAACATTTATCGGTCTTGCAAAAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCACTTTGCTTCTGAGTTAGGTGTGGACTCTAATACTCTTGTGGGAACTGCCCTCATTGATATGCATTCTAAATGTGGATCATTGCAAGAGGCAGGATATATCTTTGACTCACATTTTACAAATTGTCGGGTTAATGCCCCGTGGAATGCAATGATTTCGGGGTATTTACAGAGCGAATGTAATGAAAAAGCATTGGAATTGTTTGCCAAAATGTGTCAAAACGACATACACCTGGACCATTACACTTATTGTAGTGTATTTAATGCTATAGCCGTGTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGATTGGAAGTGAATCATATAAATATCTCCAATGCAGTGGCAAATGCATATGCTAAATGTGGATCGCTGGACGATGTAAGTAAGGTCTTTTACAGGATTGAAGAGAGAGATTTAGTATCTTGGACCACCCTAGTGACTGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGAGAGAAGGAGGTTTTACACCCAATCAATTTGCCTTTTCTAGTGTGCTCGTTTCATGTGCTAGCCTTTGCTTACTCGAGTATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGATATGGACAAATGCATAGAAAGTGCTCTGATTGACATGTATGCCAAATGCGGTAATCTGGTGGAGGCGAAGAAGGTTTTCAATAGAATCTCTAATGCCGATACAGTTTCATGGACTGCTATAATATCGGGTCATGCTCAACATGGTGTTGTGGATGACGCACTTCAACTCTTTAGAAGGATGGAACAGTTAGGTGTGGAGCCCAATGCTGTCACTTTTTTGTGTGTTCTATTTGCATGTAGCCATGGAGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATGGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATCGTTGATCTCTTAAGTCGTGTGGGGCGTCTAAACGATGCAATGGAGTTTATAAGTAGGATGCCCATAGAGCCCAATGAAATGGTTTGGCAGACCTTGTTGGGAGCATGCAGGGTCCATGGTAATGTTGGATTGGGAGAGCTAGCTGCTCAGAAGATACTTGCTTTTAGAGCAGAAAACTCTGC
TACCTTTGTTCTTTTATCCAACACCTATATCGAATCAGGGAGTTACAAAGATGGACTTAGTTTGCGACATGCGATGAAAGAGCAGGGCGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCATAAGTTTTATGCAGGTGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTAAGGTTGAAGGCCAATTCTTTGGATGATGTACCAGATTTGAGTTATGAGCTATAA

Coding sequence (CDS)

ATGTGCACAACAAATTGTAGGTGTCACTTTTACTCTAGTTACTGTACAAGGAATGCAGCTTTAACACCAACACCTTCTGCAACCCAAAAAGCTATTGCAGCCTCTGGTATTATAATTCCCAATTCGGTTACAGCTGATAAAACTAATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTCCTGCGTGATTGTGTGGACGTAAGGTTTCTGAAACAAGCTAAGACTATTCATGGGTTTCTGTTAAAATCAAAATTTTCAAACCACGAGTCCCTTGTATTGCTTAATCATGTAGCTCACGCTTATTCGAAATGCTCCGATATTGGTGCTGCCTGTCGTCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCGTGGACTGTCATAATTGTTGGATTGGCTGATAATGGTTTGTTTCTCGATGGGTTTGAGTTATTCTGTGAAATGCAGAGCCTGGGAATTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTCCACTGACTTGGGCAGAATGGTTCATGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTGTCTACTGCTCTTCTTAATATGTATGCAAAGTTACAACAGATTGAGGATTCATTCAAGGTGTTTAACAACATGACTGAAGCTAATGTAGTCTCGTGGAATGCTATGATCTCAGGGTTCACATCAAATGGTCTTTACTTAGAGGCTTTTGATCATTTTCTCAGAATGAAGAGAGAAGGAGTAACACCCAATGCACAAACATTTATCGGTCTTGCAAAAGCTATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCCACTTTGCTTCTGAGTTAGGTGTGGACTCTAATACTCTTGTGGGAACTGCCCTCATTGATATGCATTCTAAATGTGGATCATTGCAAGAGGCAGGATATATCTTTGACTCACATTTTACAAATTGTCGGGTTAATGCCCCGTGGAATGCAATGATTTCGGGGTATTTACAGAGCGAATGTAATGAAAAAGCATTGGAATTGTTTGCCAAAATGTGTCAAAACGACATACACCTGGACCATTACACTTATTGTAGTGTATTTAATGCTATAGCCGTGTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGATTGGAAGTGAATCATATAAATATCTCCAATGCAGTGGCAAATGCATATGCTAAATGTGGATCGCTGGACGATGTAAGTAAGGTCTTTTACAGGATTGAAGAGAGAGATTTAGTATCTTGGACCACCCTAGTGACTGCTTATTCTCAATGTTCTGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGAGAGAAGGAGGTTTTACACCCAATCAATTTGCCTTTTCTAGTGTGCTCGTTTCATGTGCTAGCCTTTGCTTACTCGAGTATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGATATGGACAAATGCATAGAAAGTGCTCTGATTGACATGTATGCCAAATGCGGTAATCTGGTGGAGGCGAAGAAGGTTTTCAATAGAATCTCTAATGCCGATACAGTTTCATGGACTGCTATAATATCGGGTCATGCTCAACATGGTGTTGTGGATGACGCACTTCAACTCTTTAGAAGGATGGAACAGTTAGGTGTGGAGCCCAATGCTGTCACTTTTTTGTGTGTTCTATTTGCATGTAGCCATGGAGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATGGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATCGTTGATCTCTTAAGTCGTGTGGGGCGTCTAAACGATGCAATGGAGTTTATAAGTAGGATGCCCATAGAGCCCAATGAAATGGTTTGGCAGACCTTGTTGGGAGCATGCAGGGTCCATGGTAATGTTGGATTGGGAGAGCTAGCTGCTCAGAAGATACTTGCTTTTAGAGCAGAAAACTCTGC
TACCTTTGTTCTTTTATCCAACACCTATATCGAATCAGGGAGTTACAAAGATGGACTTAGTTTGCGACATGCGATGAAAGAGCAGGGCGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCATAAGTTTTATGCAGGTGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTAAGGTTGAAGGCCAATTCTTTGGATGATGTACCAGATTTGAGTTATGAGCTATAA

Protein sequence

MCTTNCRCHFYSSYCTRNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLDDVPDLSYEL
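The protein sequence above appears to be the translation of the coding sequence. The sketch below shows how such a translation could be reproduced, assuming the standard genetic code applies (reasonable for a nuclear gene, but an assumption here); `CODON_TABLE` and `translate` are illustrative names, not the database's own tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons and last 3 codons of the CDS listed above.
print(translate("ATGTGCACAACAAATTGTAGGTGTCACTTTTAC"))
print(translate("GAGCTATAA"))
```

The two fragments translate to the start (MCTTNCRCHFY) and the end (EL) of the listed protein, which is consistent with the CDS and protein records describing the same product.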
Homology
BLAST of Tan0004841 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 8.1e-132
Identity = 251/671 (37.41%), Positives = 387/671 (57.68%), Query Frame = 0

Query: 84  KFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNGLFLDGFE 143
           KF   + + +   +   Y K S+     ++FD+M +RN+ +WT +I G A N +  +   
Sbjct: 121 KFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMNDEVLT 180

Query: 144 LFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVSTALLNMYA 203
           LF  MQ+ G  P+ F ++  L +         G  VH  +V  G      VS +L+N+Y 
Sbjct: 181 LFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINLYL 240

Query: 204 KLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTPNAQTFIG 263
           K   +  +  +F+     +VV+WN+MISG+ +NGL LEA   F  M+   V  +  +F  
Sbjct: 241 KCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSFAS 300

Query: 264 LAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIFDSHFTNC 323
           + K    L+++   +++     + G   +  + TAL+  +SKC ++ +A  +F      C
Sbjct: 301 VIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE--IGC 360

Query: 324 RVN-APWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKCLSSGKK 383
             N   W AMISG+LQ++  E+A++LF++M +  +  + +TY  +  A+ V+    S  +
Sbjct: 361 VGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVI----SPSE 420

Query: 384 VHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVTAYSQCS 443
           VHA+ +K+  E     +  A+ +AY K G +++ +KVF  I+++D+V+W+ ++  Y+Q  
Sbjct: 421 VHAQVVKTNYE-RSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTG 480

Query: 444 EWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASL-CLLEYGQQVHGFLCKVGLDMDKCIE 503
           E + AI++F  + +GG  PN+F FSS+L  CA+    +  G+Q HGF  K  LD   C+ 
Sbjct: 481 ETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVS 540

Query: 504 SALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRMEQLGVE 563
           SAL+ MYAK GN+  A++VF R    D VSW ++ISG+AQHG    AL +F+ M++  V+
Sbjct: 541 SALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVK 600

Query: 564 PNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRLNDAME 623
            + VTF+ V  AC+H GLVEEG +YF +M     + P  EH SC+VDL SR G+L  AM+
Sbjct: 601 MDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMK 660

Query: 624 FISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTYIESGS 683
            I  MP      +W+T+L ACRVH    LG LAA+KI+A + E+SA +VLLSN Y ESG 
Sbjct: 661 VIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGD 720

Query: 684 YKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS 743
           +++   +R  M E+ VKKEPG SWI V    + F AGD+ HP KD+IY KLE+L  +   
Sbjct: 721 WQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKD 780

Query: 744 LDDVPDLSYEL 753
           L   PD SY L
Sbjct: 781 LGYEPDTSYVL 784
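
The percentages on each HSP statistics line are plain ratios of matched columns to alignment length. As a minimal sketch (the helper name `hsp_percentages` is hypothetical, and the input format is assumed to follow the statistics lines shown on this page), they can be recomputed like this:

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'Label = matched/length' pairs
    found on a BLAST HSP statistics line."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

line = "Identity = 251/671 (37.41%), Positives = 387/671 (57.68%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first Swiss-Prot hit, 251/671 gives 37.41% and 387/671 gives 57.68%, in agreement with the reported values.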

BLAST of Tan0004841 vs. ExPASy Swiss-Prot
Match: Q9LFI1 (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 8.2e-124
Identity = 242/694 (34.87%), Positives = 377/694 (54.32%), Query Frame = 0

Query: 45  ADKTNS-HLEIQPLVDLLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSK 104
           A K +S  + ++  + L+  C   R L Q + IH  +L S        +L NH+   Y K
Sbjct: 57  AQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHILNSNCK--YDTILNNHILSMYGK 116

Query: 105 CSDIGAACRLFDQMSQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGI 164
           C  +  A  +FD M +RN+ S+T +I G + NG   +   L+ +M    + PDQFA+  I
Sbjct: 117 CGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIRLYLKMLQEDLVPDQFAFGSI 176

Query: 165 LQICIGLESTDLGRMVHAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANV 224
           ++ C       LG+ +HAQ++    +SH     AL+ MY +  Q+ D+ +VF  +   ++
Sbjct: 177 IKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDL 236

Query: 225 VSWNAMISGFTSNGLYLEAFDHFLRMKREGV-TPNAQTFIGLAKAIGMLRDVNKAKEVSH 284
           +SW+++I+GF+  G   EA  H   M   GV  PN   F    KA   L   +   ++  
Sbjct: 237 ISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHG 296

Query: 285 FASELGVDSNTLVGTALIDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECN 344
              +  +  N + G +L DM+++CG L  A  +FD         A WN +I+G   +   
Sbjct: 297 LCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQ--IERPDTASWNVIIAGLANNGYA 356

Query: 345 EKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNA 404
           ++A+ +F++M  +    D  +  S+  A      LS G ++H+  IK G  +  + + N+
Sbjct: 357 DEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGF-LADLTVCNS 416

Query: 405 VANAYAKCGSLDDVSKVFYRIEER-DLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTP 464
           +   Y  C  L     +F       D VSW T++TA  Q  +  + + +F  M      P
Sbjct: 417 LLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSECEP 476

Query: 465 NQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVF 524
           +     ++L  C  +  L+ G QVH +  K GL  ++ I++ LIDMYAKCG+L +A+++F
Sbjct: 477 DHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIF 536

Query: 525 NRISNADTVSWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVE 584
           + + N D VSW+ +I G+AQ G  ++AL LF+ M+  G+EPN VTF+ VL ACSH GLVE
Sbjct: 537 DSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVGLVE 596

Query: 585 EGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGA 644
           EGL+ +  M+  +G+ P  EH SC+VDLL+R GRLN+A  FI  M +EP+ +VW+TLL A
Sbjct: 597 EGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTLLSA 656

Query: 645 CRVHGNVGLGELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEP 704
           C+  GNV L + AA+ IL     NS   VLL + +  SG++++   LR +MK+  VKK P
Sbjct: 657 CKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVKKIP 716

Query: 705 GCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 736
           G SWI +   +H F+A D  HPE+D IY  L  +
Sbjct: 717 GQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNI 745

BLAST of Tan0004841 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 5.3e-123
Identity = 228/688 (33.14%), Positives = 390/688 (56.69%), Query Frame = 0

Query: 52  LEIQPLVDLLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAAC 111
           ++ + L  +L+ C D + LK  K +  F+  + F    +L   + ++  Y+ C D+  A 
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLG--SKLSLMYTNCGDLKEAS 151

Query: 112 RLFDQMSQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLE 171
           R+FD++       W +++  LA +G F     LF +M S G+  D + +S + +    L 
Sbjct: 152 RVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLR 211

Query: 172 STDLGRMVHAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMIS 231
           S   G  +H  I+  GF     V  +L+  Y K Q+++ + KVF+ MTE +V+SWN++I+
Sbjct: 212 SVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIIN 271

Query: 232 GFTSNGLYLEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDS 291
           G+ SNGL  +    F++M   G+  +  T + +       R ++  + V     +     
Sbjct: 272 GYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSR 331

Query: 292 NTLVGTALIDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAK 351
                  L+DM+SKCG L  A  +F     + R    + +MI+GY +     +A++LF +
Sbjct: 332 EDRFCNTLLDMYSKCGDLDSAKAVFRE--MSDRSVVSYTSMIAGYAREGLAGEAVKLFEE 391

Query: 352 MCQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCG 411
           M +  I  D YT  +V N  A  + L  GK+VH    ++ L  + I +SNA+ + YAKCG
Sbjct: 392 MEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFD-IFVSNALMDMYAKCG 451

Query: 412 SLDDVSKVFYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFS-NMREGGFTPNQFAFSSVL 471
           S+ +   VF  +  +D++SW T++  YS+    ++A+ +F+  + E  F+P++   + VL
Sbjct: 452 SMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVL 511

Query: 472 VSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTV 531
            +CASL   + G+++HG++ + G   D+ + ++L+DMYAKCG L+ A  +F+ I++ D V
Sbjct: 512 PACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLV 571

Query: 532 SWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLM 591
           SWT +I+G+  HG   +A+ LF +M Q G+E + ++F+ +L+ACSH GLV+EG ++F +M
Sbjct: 572 SWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIM 631

Query: 592 KETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGL 651
           +    + P +EHY+CIVD+L+R G L  A  FI  MPI P+  +W  LL  CR+H +V L
Sbjct: 632 RHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKL 691

Query: 652 GELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNG 711
            E  A+K+     EN+  +VL++N Y E+  ++    LR  + ++G++K PGCSWI + G
Sbjct: 692 AEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKG 751

Query: 712 TLHKFYAGDQQHPEKDKIYAKLEELRLK 739
            ++ F AGD  +PE + I A L ++R +
Sbjct: 752 RVNIFVAGDSSNPETENIEAFLRKVRAR 774

BLAST of Tan0004841 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 443.0 bits (1138), Expect = 6.9e-123
Identity = 245/688 (35.61%), Positives = 370/688 (53.78%), Query Frame = 0

Query: 60  LLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQ 119
           +L  C  +  L+  + +HG +LK  FS+     + N +   Y    ++ +A  +F  MSQ
Sbjct: 294 VLSACKKIESLEIGEQLHGLVLKLGFSS--DTYVCNALVSLYFHLGNLISAEHIFSNMSQ 353

Query: 120 RNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMV 179
           R+  ++  +I GL+  G      ELF  M   G+ PD    + ++  C    +   G+ +
Sbjct: 354 RDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQL 413

Query: 180 HAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLY 239
           HA     GFAS+  +  ALLN+YAK   IE +   F      NVV WN M+  +      
Sbjct: 414 HAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDL 473

Query: 240 LEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTAL 299
             +F  F +M+ E + PN  T+  + K    L D+   +++     +     N  V + L
Sbjct: 474 RNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVL 533

Query: 300 IDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHL 359
           IDM++K G L  A  I    F    V   W  MI+GY Q   ++KAL  F +M    I  
Sbjct: 534 IDMYAKLGKLDTAWDIL-IRFAGKDV-VSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRS 593

Query: 360 DHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKV 419
           D     +  +A A L+ L  G+++HA+A  SG   + +   NA+   Y++CG +++    
Sbjct: 594 DEVGLTNAVSACAGLQALKEGQQIHAQACVSGFS-SDLPFQNALVTLYSRCGKIEESYLA 653

Query: 420 FYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLL 479
           F + E  D ++W  LV+ + Q    ++A+ +F  M   G   N F F S + + +    +
Sbjct: 654 FEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANM 713

Query: 480 EYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGH 539
           + G+QVH  + K G D +  + +ALI MYAKCG++ +A+K F  +S  + VSW AII+ +
Sbjct: 714 KQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAY 773

Query: 540 AQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPE 599
           ++HG   +AL  F +M    V PN VT + VL ACSH GLV++G+ YF+ M   YGL P+
Sbjct: 774 SKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPK 833

Query: 600 MEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKIL 659
            EHY C+VD+L+R G L+ A EFI  MPI+P+ +VW+TLL AC VH N+ +GE AA  +L
Sbjct: 834 PEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLL 893

Query: 660 AFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGD 719
               E+SAT+VLLSN Y  S  +      R  MKE+GVKKEPG SWI V  ++H FY GD
Sbjct: 894 ELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGD 953

Query: 720 QQHPEKDKIYAKLEELRLKANSLDDVPD 748
           Q HP  D+I+   ++L  +A+ +  V D
Sbjct: 954 QNHPLADEIHEYFQDLTKRASEIGYVQD 976

BLAST of Tan0004841 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 2.6e-122
Identity = 241/699 (34.48%), Positives = 399/699 (57.08%), Query Frame = 0

Query: 60  LLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQM-- 119
           LL+ C+  R  +  K +H  L+  +F      VL N +   YSK  D   A  +F+ M  
Sbjct: 68  LLKSCIRARDFRLGKLVHARLI--EFDIEPDSVLYNSLISLYSKSGDSAKAEDVFETMRR 127

Query: 120 -SQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLG 179
             +R++ SW+ ++    +NG  LD  ++F E   LG+ P+ + Y+ +++ C   +   +G
Sbjct: 128 FGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGLVPNDYCYTAVIRACSNSDFVGVG 187

Query: 180 RMVHAQIVIRG-FASHTFVSTALLNMYAKLQ-QIEDSFKVFNNMTEANVVSWNAMISGFT 239
           R+    ++  G F S   V  +L++M+ K +   E+++KVF+ M+E NVV+W  MI+   
Sbjct: 188 RVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCM 247

Query: 240 SNGLYLEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTL 299
             G   EA   FL M   G   +  T   +  A   L +++  K++  +A   G+  +  
Sbjct: 248 QMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDD-- 307

Query: 300 VGTALIDMHSKC---GSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECN--EKALELF 359
           V  +L+DM++KC   GS+ +   +FD    +  ++  W A+I+GY+++ CN   +A+ LF
Sbjct: 308 VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMS--WTALITGYMKN-CNLATEAINLF 367

Query: 360 AKM-CQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYA 419
           ++M  Q  +  +H+T+ S F A   L     GK+V  +A K GL  N  +++N+V + + 
Sbjct: 368 SEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNS-SVANSVISMFV 427

Query: 420 KCGSLDDVSKVFYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSS 479
           K   ++D  + F  + E++LVS+ T +    +   +++A ++ S + E     + F F+S
Sbjct: 428 KSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFAS 487

Query: 480 VLVSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNAD 539
           +L   A++  +  G+Q+H  + K+GL  ++ + +ALI MY+KCG++  A +VFN + N +
Sbjct: 488 LLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRN 547

Query: 540 TVSWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFK 599
            +SWT++I+G A+HG     L+ F +M + GV+PN VT++ +L ACSH GLV EG ++F 
Sbjct: 548 VISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFN 607

Query: 600 LMKETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNV 659
            M E + + P+MEHY+C+VDLL R G L DA EFI+ MP + + +VW+T LGACRVH N 
Sbjct: 608 SMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNT 667

Query: 660 GLGELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISV 719
            LG+LAA+KIL       A ++ LSN Y  +G +++   +R  MKE+ + KE GCSWI V
Sbjct: 668 ELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEV 727

Query: 720 NGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLDDVPD 748
              +HKFY GD  HP   +IY +L+ L  +      VPD
Sbjct: 728 GDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVPD 758

BLAST of Tan0004841 vs. NCBI nr
Match: XP_022939229.1 (pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita moschata] >KAG6578466.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1325.8 bits (3430), Expect = 0.0e+00
Identity = 649/736 (88.18%), Positives = 691/736 (93.89%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI  SGI IPNSV+ DK+NS LEIQPLVDLLR CVD RFLKQAKT+
Sbjct: 28  RNSSLTTTHSATQKAITTSGIKIPNSVSVDKSNSRLEIQPLVDLLRGCVDARFLKQAKTV 87

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH+SLVLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 88  HGFLLKSKFSNHDSLVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 147

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 148 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 207

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 208 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 267

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +ASELG+DSN LVGTALIDMHSKCGSLQEA  IF
Sbjct: 268 DAQTFISIAKAIGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIF 327

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NEKALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 328 DSHFTNCRVNGPWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 387

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 388 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 447

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVL+SCASLCLLEYGQQVHGF+ KVGLDM
Sbjct: 448 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDM 507

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKKVF++IS+ADT+SWTAII+GHAQHG+VDDALQLFRRME
Sbjct: 508 DKCIQSALIDMYAKCGSLAEAKKVFDKISDADTISWTAIIAGHAQHGMVDDALQLFRRME 567

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 568 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 627

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAA+KI +F+AENSAT+VLLSNTY
Sbjct: 628 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAARKIRSFKAENSATYVLLSNTY 687

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 688 IESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 747

Query: 737 LKANSLDDVPDLSYEL 753
           LK NS DDVPDLSYEL
Sbjct: 748 LKVNSSDDVPDLSYEL 763

BLAST of Tan0004841 vs. NCBI nr
Match: KAG7016032.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1325.8 bits (3430), Expect = 0.0e+00
Identity = 649/736 (88.18%), Positives = 691/736 (93.89%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI  SGI IPNSV+ DK+NS LEIQPLVDLLR CVD RFLKQAKT+
Sbjct: 10  RNSSLTTTHSATQKAITTSGIKIPNSVSVDKSNSRLEIQPLVDLLRGCVDARFLKQAKTV 69

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH+SLVLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 70  HGFLLKSKFSNHDSLVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 129

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 130 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 189

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 190 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 249

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +ASELG+DSN LVGTALIDMHSKCGSLQEA  IF
Sbjct: 250 DAQTFISIAKAIGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIF 309

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NEKALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 310 DSHFTNCRVNGPWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 369

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 370 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 429

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVL+SCASLCLLEYGQQVHGF+ KVGLDM
Sbjct: 430 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDM 489

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKKVF++IS+ADT+SWTAII+GHAQHG+VDDALQLFRRME
Sbjct: 490 DKCIQSALIDMYAKCGSLAEAKKVFDKISDADTISWTAIIAGHAQHGMVDDALQLFRRME 549

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 550 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 609

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAA+KI +F+AENSAT+VLLSNTY
Sbjct: 610 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAARKIRSFKAENSATYVLLSNTY 669

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 670 IESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 729

Query: 737 LKANSLDDVPDLSYEL 753
           LK NS DDVPDLSYEL
Sbjct: 730 LKVNSSDDVPDLSYEL 745

BLAST of Tan0004841 vs. NCBI nr
Match: XP_038884632.1 (pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida] >XP_038884633.1 pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida])

HSP 1 Score: 1324.7 bits (3427), Expect = 0.0e+00
Identity = 658/740 (88.92%), Positives = 693/740 (93.65%), Query Frame = 0

Query: 13  SYCTRNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQ 72
           +Y  RN+ALT T SATQKAIA S I IP+SVT  KT+SHLEIQ LVDLLRDCVD RFLKQ
Sbjct: 24  NYQIRNSALTITHSATQKAIANSAIKIPDSVTVHKTDSHLEIQQLVDLLRDCVDARFLKQ 83

Query: 73  AKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGL 132
           AKT+HGFLLKS+FSNH+SLVLLNHVAHAYSKCSDI AACRLFDQMSQRNIFSWTVIIVGL
Sbjct: 84  AKTVHGFLLKSEFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGL 143

Query: 133 ADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHT 192
           A+NGLFLDGFE FCEMQS GIFPDQFAYSGILQICIGL+S +LG+MVHAQI IRGFASHT
Sbjct: 144 AENGLFLDGFEFFCEMQSHGIFPDQFAYSGILQICIGLDSLELGKMVHAQIFIRGFASHT 203

Query: 193 FVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKRE 252
           FVSTALLNMYAKLQQIE+S+KVFN MTE NVVSWNAMISGFTSNGLYL+AFD FLRMKRE
Sbjct: 204 FVSTALLNMYAKLQQIENSYKVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDLFLRMKRE 263

Query: 253 GVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEA 312
           GVT +AQTFIG+AKAIGMLRDVNKAKEVS  ASELGVDSNT VGTALIDMHSKCGSL+EA
Sbjct: 264 GVTLDAQTFIGVAKAIGMLRDVNKAKEVSCSASELGVDSNTFVGTALIDMHSKCGSLREA 323

Query: 313 GYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIA 372
             IFDSHFTNCRVNAPWNAMISGYLQSE NEKALELFAKMC NDIHLDHYTYCSVFNAIA
Sbjct: 324 RSIFDSHFTNCRVNAPWNAMISGYLQSEFNEKALELFAKMCLNDIHLDHYTYCSVFNAIA 383

Query: 373 VLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWT 432
            LKCLSSGKKVHARAIKSGLEVN+I+ISNAVANAYAKCG L+DV KVFYR+E+RDLVSWT
Sbjct: 384 ALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGLLEDVRKVFYRMEDRDLVSWT 443

Query: 433 TLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKV 492
           TLVTAYSQCSEWDKAIEIFSNMRE G+ PNQFAFSSVLVSCASLCLLEYGQQVHGF+ KV
Sbjct: 444 TLVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFIYKV 503

Query: 493 GLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLF 552
           GLDMD CIESALIDMYAKCG L EAKKVF+RISNADTVSWTAII+GHAQHG+VDDALQLF
Sbjct: 504 GLDMDTCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLF 563

Query: 553 RRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSR 612
           RRM Q GVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETY LVPEMEHYSCIVD+LSR
Sbjct: 564 RRMVQSGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYSLVPEMEHYSCIVDILSR 623

Query: 613 VGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLL 672
           VG LNDAMEFISRMP+EPNEMVWQTLLGACR+HGN+ LGELAAQKIL+ +AENSATFVLL
Sbjct: 624 VGHLNDAMEFISRMPVEPNEMVWQTLLGACRIHGNIELGELAAQKILSSKAENSATFVLL 683

Query: 673 SNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL 732
           SNTYIESGSYKDGLSLR  MKEQGVKKEPG SWIS+NGTLHKFYAGDQQHPEKDKIYAKL
Sbjct: 684 SNTYIESGSYKDGLSLRLVMKEQGVKKEPGFSWISMNGTLHKFYAGDQQHPEKDKIYAKL 743

Query: 733 EELRLKANSLDDVPDLSYEL 753
           EEL+L A SLDDVPDLSYEL
Sbjct: 744 EELKLAAISLDDVPDLSYEL 763
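
The percentages in each HSP header are the reported counts divided by the alignment length (for this hit, 658/740 and 693/740). A minimal Python sketch that parses such a header line and recomputes both percentages is given here; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Hypothetical helper (not part of BLAST): matches the statistics line of an
# HSP header, e.g. "Identity = 658/740 (88.92%), Postives = 693/740 (93.65%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 658/740 (88.92%), Postives = 693/740 (93.65%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when these headers are copied between tools.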

BLAST of Tan0004841 vs. NCBI nr
Match: XP_023549692.1 (pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1322.0 bits (3420), Expect = 0.0e+00
Identity = 646/736 (87.77%), Positives = 690/736 (93.75%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI   GI IPNSV+ DK+NS LEIQPLVDLLR CVD RFLKQAKT+
Sbjct: 28  RNSSLTTTHSATQKAITTPGIKIPNSVSVDKSNSRLEIQPLVDLLRGCVDARFLKQAKTV 87

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH+SLVLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 88  HGFLLKSKFSNHDSLVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 147

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 148 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 207

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 208 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 267

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +ASELG+DSNTLVGT LIDMHSKCGSLQEA  IF
Sbjct: 268 DAQTFISIAKAIGMLRDVNKAKEISRYASELGMDSNTLVGTGLIDMHSKCGSLQEARSIF 327

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NEKALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 328 DSHFTNCRVNGPWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 387

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 388 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 447

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVLVSCASLCLLEYGQQVHG +CKVGLDM
Sbjct: 448 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGVICKVGLDM 507

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKKVF++IS+ADT+SWTAII+GHAQHG+VDDALQLFRRM+
Sbjct: 508 DKCIQSALIDMYAKCGSLAEAKKVFDKISDADTISWTAIIAGHAQHGMVDDALQLFRRMD 567

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 568 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 627

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAA+KI +F+AENSAT+VLLSNTY
Sbjct: 628 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAARKIRSFKAENSATYVLLSNTY 687

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLR+ MKEQGVKKEPGCSWI+VNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 688 IESGSYKDGLSLRNVMKEQGVKKEPGCSWIAVNGTLHKFYAGDQQHPEKDKIYAKLEELR 747

Query: 737 LKANSLDDVPDLSYEL 753
           LK NS DDVPDLSYEL
Sbjct: 748 LKVNSSDDVPDLSYEL 763
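
The identity count in an HSP header is the number of aligned columns in which the query and subject residues are the same; these are the columns that show a letter in the middle line of each block. As a rough sketch, assuming the two sequence rows of a block have already been extracted as equal-length strings (the function name `count_identities` is a hypothetical helper, and `-` is assumed to be the gap character), the count for one block can be reproduced as follows:

```python
def count_identities(query_row, sbjct_row, gap="-"):
    """Count columns where query and subject carry the same residue."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("alignment rows must have equal length")
    # A column counts as identical only if both residues match and it is not a gap.
    return sum(1 for q, s in zip(query_row, sbjct_row) if q == s and q != gap)


# Final block of the Cucurbita pepo alignment above.
query = "LKANSLDDVPDLSYEL"
sbjct = "LKVNSSDDVPDLSYEL"
identical = count_identities(query, sbjct)
print(identical, "of", len(query), "columns identical")
```

Summing this count over all blocks of one HSP should give the numerator of the Identity field in its header.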

BLAST of Tan0004841 vs. NCBI nr
Match: XP_022993736.1 (pentatricopeptide repeat-containing protein At2g27610-like [Cucurbita maxima])

HSP 1 Score: 1320.1 bits (3415), Expect = 0.0e+00
Identity = 646/736 (87.77%), Positives = 691/736 (93.89%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI  SGI IPNSV+ DK+NS LEIQPLVDLLR C+D RFLKQAKT+
Sbjct: 28  RNSSLTTTHSATQKAITISGIKIPNSVSVDKSNSRLEIQPLVDLLRGCLDARFLKQAKTV 87

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSK SNH+S+VLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 88  HGFLLKSKLSNHDSMVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 147

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 148 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 207

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 208 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 267

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +AS+LG+DSNTLVGTALIDMHSKCGSLQEA  IF
Sbjct: 268 DAQTFISIAKAIGMLRDVNKAKEISRYASKLGMDSNTLVGTALIDMHSKCGSLQEARSIF 327

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NE+ALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 328 DSHFTNCRVNGPWNAMISGYLQSEFNEEALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 387

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 388 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 447

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVLVSCASLCLLEYGQQVHGF+CKVGLDM
Sbjct: 448 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLDM 507

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKK F++IS+ADTVSWTAII+GHAQHG+VD+ALQLFRRME
Sbjct: 508 DKCIQSALIDMYAKCGSLAEAKKAFDKISDADTVSWTAIIAGHAQHGMVDNALQLFRRME 567

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 568 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 627

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAAQKI +F+AENSAT+VLLSNTY
Sbjct: 628 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAAQKIRSFKAENSATYVLLSNTY 687

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESG YK+GLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 688 IESGIYKNGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 747

Query: 737 LKANSLDDVPDLSYEL 753
           LKANSLD VPDLSYEL
Sbjct: 748 LKANSLDVVPDLSYEL 763

BLAST of Tan0004841 vs. ExPASy TrEMBL
Match: A0A6J1FGJ6 (pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata OX=3662 GN=LOC111445205 PE=4 SV=1)

HSP 1 Score: 1325.8 bits (3430), Expect = 0.0e+00
Identity = 649/736 (88.18%), Positives = 691/736 (93.89%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI  SGI IPNSV+ DK+NS LEIQPLVDLLR CVD RFLKQAKT+
Sbjct: 28  RNSSLTTTHSATQKAITTSGIKIPNSVSVDKSNSRLEIQPLVDLLRGCVDARFLKQAKTV 87

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH+SLVLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 88  HGFLLKSKFSNHDSLVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 147

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 148 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 207

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 208 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 267

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +ASELG+DSN LVGTALIDMHSKCGSLQEA  IF
Sbjct: 268 DAQTFISIAKAIGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSIF 327

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NEKALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 328 DSHFTNCRVNGPWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 387

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 388 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 447

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVL+SCASLCLLEYGQQVHGF+ KVGLDM
Sbjct: 448 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLDM 507

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKKVF++IS+ADT+SWTAII+GHAQHG+VDDALQLFRRME
Sbjct: 508 DKCIQSALIDMYAKCGSLAEAKKVFDKISDADTISWTAIIAGHAQHGMVDDALQLFRRME 567

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 568 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 627

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAA+KI +F+AENSAT+VLLSNTY
Sbjct: 628 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAARKIRSFKAENSATYVLLSNTY 687

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 688 IESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 747

Query: 737 LKANSLDDVPDLSYEL 753
           LK NS DDVPDLSYEL
Sbjct: 748 LKVNSSDDVPDLSYEL 763

BLAST of Tan0004841 vs. ExPASy TrEMBL
Match: A0A6J1JX63 (pentatricopeptide repeat-containing protein At2g27610-like OS=Cucurbita maxima OX=3661 GN=LOC111489649 PE=4 SV=1)

HSP 1 Score: 1320.1 bits (3415), Expect = 0.0e+00
Identity = 646/736 (87.77%), Positives = 691/736 (93.89%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN++LT T SATQKAI  SGI IPNSV+ DK+NS LEIQPLVDLLR C+D RFLKQAKT+
Sbjct: 28  RNSSLTTTHSATQKAITISGIKIPNSVSVDKSNSRLEIQPLVDLLRGCLDARFLKQAKTV 87

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSK SNH+S+VLLNHVA AYSKCSDI AACRLFD+MSQRNIFSWTVII GLA NG
Sbjct: 88  HGFLLKSKLSNHDSMVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKNG 147

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LF DGFE FCEMQS  IFPDQFAYSG+LQICIGLES +LG+MVHAQIVIRGFASHTFVST
Sbjct: 148 LFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVST 207

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+I+DS++VFN MTE NVVSWNAMISGFTSNGLY +AFDHFLRMK EGVTP
Sbjct: 208 ALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVTP 267

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFI +AKAIGMLRDVNKAKE+S +AS+LG+DSNTLVGTALIDMHSKCGSLQEA  IF
Sbjct: 268 DAQTFISIAKAIGMLRDVNKAKEISRYASKLGMDSNTLVGTALIDMHSKCGSLQEARSIF 327

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           DSHFTNCRVN PWNAMISGYLQSE NE+ALELFAKMC N++HLD YTYCSVFNAIA LKC
Sbjct: 328 DSHFTNCRVNGPWNAMISGYLQSEFNEEALELFAKMCLNNVHLDRYTYCSVFNAIAALKC 387

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+I+ISNAVANAYAKCGSL+D+ KVFY +EERDLVSWTTLVT
Sbjct: 388 LSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLVT 447

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMRE GF PNQFAFSSVLVSCASLCLLEYGQQVHGF+CKVGLDM
Sbjct: 448 AYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLDM 507

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCI+SALIDMYAKCG+L EAKK F++IS+ADTVSWTAII+GHAQHG+VD+ALQLFRRME
Sbjct: 508 DKCIQSALIDMYAKCGSLAEAKKAFDKISDADTVSWTAIIAGHAQHGMVDNALQLFRRME 567

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVGRL
Sbjct: 568 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGRL 627

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFIS+MPIEPNEMVWQTLLGACRVHGNV LGELAAQKI +F+AENSAT+VLLSNTY
Sbjct: 628 NDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAAQKIRSFKAENSATYVLLSNTY 687

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESG YK+GLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR
Sbjct: 688 IESGIYKNGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 747

Query: 737 LKANSLDDVPDLSYEL 753
           LKANSLD VPDLSYEL
Sbjct: 748 LKANSLDVVPDLSYEL 763

BLAST of Tan0004841 vs. ExPASy TrEMBL
Match: A0A0A0KBQ4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013890 PE=4 SV=1)

HSP 1 Score: 1310.4 bits (3390), Expect = 0.0e+00
Identity = 643/736 (87.36%), Positives = 683/736 (92.80%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           RN+ALT T SA QK  A SGI  PNSV  DKT+SHL+IQPLVDLLRDCVD RFLKQAKT+
Sbjct: 18  RNSALTITHSAIQKPFATSGIKTPNSVKVDKTDSHLQIQPLVDLLRDCVDARFLKQAKTV 77

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH SLVLLNHVAHAYSKCSDI AACRLFDQMSQRN FSWTV+I GLA+NG
Sbjct: 78  HGFLLKSKFSNHHSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNTFSWTVLIAGLAENG 137

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LFLDGFE FCEMQS GIFPDQFAYSGILQICIGL+S +LG MVHAQIVIRGF SHTFVST
Sbjct: 138 LFLDGFEFFCEMQSQGIFPDQFAYSGILQICIGLDSIELGNMVHAQIVIRGFTSHTFVST 197

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+IEDS+KVFN MTE NVVSWNAMI+GFTSN LYL+AFD FLRM  EGVTP
Sbjct: 198 ALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFTSNDLYLDAFDLFLRMMGEGVTP 257

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFIG+AKAIGMLRDVNKAKEVS +A ELGVDSNTLVGTALIDM+SKCGSLQEA  IF
Sbjct: 258 DAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMNSKCGSLQEARSIF 317

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           +SHF  CR NAPWNAMISGYL+S  NEKALELFAKMCQNDI+LDHYTYCSVFNAIA LKC
Sbjct: 318 NSHFITCRFNAPWNAMISGYLRSGFNEKALELFAKMCQNDIYLDHYTYCSVFNAIAALKC 377

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           LS GKKVHARAIKSGLEVN+++ISNAVANAYAKCGSL+DV KVF R+E+RDL+SWT+LVT
Sbjct: 378 LSLGKKVHARAIKSGLEVNYVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVT 437

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMR  G  PNQF FSSVLVSCA+LCLLEYGQQVHG +CKVGLDM
Sbjct: 438 AYSQCSEWDKAIEIFSNMRAEGIAPNQFTFSSVLVSCANLCLLEYGQQVHGIICKVGLDM 497

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCIESAL+DMYAKCG L +AKKVFNRISNADTVSWTAII+GHAQHG+VDDALQLFRRM 
Sbjct: 498 DKCIESALVDMYAKCGCLGDAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMV 557

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
           QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMK+TYGLVPEMEHY+CIVDLLSRVG L
Sbjct: 558 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGHL 617

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAMEFISRMP+EPNEMVWQTLLGACRVHGNV LGELAAQKIL+F+AENSAT+VLLSNTY
Sbjct: 618 NDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTY 677

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLRH MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL+
Sbjct: 678 IESGSYKDGLSLRHLMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELK 737

Query: 737 LKANSLDDVPDLSYEL 753
           LK  SLDDVPDLSYEL
Sbjct: 738 LKLISLDDVPDLSYEL 753

BLAST of Tan0004841 vs. ExPASy TrEMBL
Match: A0A1S4E171 (pentatricopeptide repeat-containing protein At2g27610-like OS=Cucumis melo OX=3656 GN=LOC103496600 PE=4 SV=1)

HSP 1 Score: 1308.1 bits (3384), Expect = 0.0e+00
Identity = 645/740 (87.16%), Positives = 685/740 (92.57%), Query Frame = 0

Query: 13  SYCTRNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQ 72
           +Y  R +ALT T SATQKAIA SGI IPNSV  DKT+SHLEIQPLVDLLR CVD RFLKQ
Sbjct: 24  NYQIRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQ 83

Query: 73  AKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGL 132
           AKT+HGFLLKSKFSNH+SLVLLNHVAHAYSKCSDI AACR+FDQMSQRNIFSWT II GL
Sbjct: 84  AKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGL 143

Query: 133 ADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHT 192
           A+NGLFLDGFE FCEMQS GIFPD FAYSGILQICIGL+S +LG+MVHAQIVIRGF SHT
Sbjct: 144 AENGLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHT 203

Query: 193 FVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKRE 252
           FVSTALLNMYAKLQ+IEDS KVFN MTE NVVSWNAMI+GFTSNG YL+AFD FLRMK E
Sbjct: 204 FVSTALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGE 263

Query: 253 GVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEA 312
           GVTP+AQTFIG+AKAIGMLRDVNKAKEVS +A ELGVDSNTLVGTALIDMHSKCGSLQEA
Sbjct: 264 GVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEA 323

Query: 313 GYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIA 372
             IF+SHF  CR NAPWNAMISGYLQS  NEKALELFAKMCQ+DIHLD YTYCSVFNAIA
Sbjct: 324 RSIFNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIA 383

Query: 373 VLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWT 432
            LKCL SGKKVHARAIKSGLEVN ++ISNAVANAYAKCGSL+DV KVF R+E+RDL+SWT
Sbjct: 384 SLKCLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWT 443

Query: 433 TLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKV 492
           +LVTAYSQCSEWDKAIEIFSNMR  G+ PNQFAFSSVLVSCA+LCLLEYGQQVHG +CKV
Sbjct: 444 SLVTAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKV 503

Query: 493 GLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLF 552
           GLDMDKCIESAL+DMYAKCG L +AKKVFNRISNADTVSWTAII+GHAQHG+VDDALQLF
Sbjct: 504 GLDMDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLF 563

Query: 553 RRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSR 612
           RRM  LGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMK+TYGLVPEMEHY+CIVDLLSR
Sbjct: 564 RRMVLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSR 623

Query: 613 VGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLL 672
           VGRLNDAM FIS+MP+EPNEMVWQTLLGACRVHGNV LGELAAQKIL+F+AENSAT+VLL
Sbjct: 624 VGRLNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLL 683

Query: 673 SNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL 732
           SNTYIESGSYKDGLSLRH MKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL
Sbjct: 684 SNTYIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKL 743

Query: 733 EELRLKANSLDDVPDLSYEL 753
           EEL+LK  SLDDVP LSYEL
Sbjct: 744 EELKLKLISLDDVPYLSYEL 763

BLAST of Tan0004841 vs. ExPASy TrEMBL
Match: A0A5D3C2B1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G001490 PE=4 SV=1)

HSP 1 Score: 1307.4 bits (3382), Expect = 0.0e+00
Identity = 644/736 (87.50%), Positives = 683/736 (92.80%), Query Frame = 0

Query: 17  RNAALTPTPSATQKAIAASGIIIPNSVTADKTNSHLEIQPLVDLLRDCVDVRFLKQAKTI 76
           R +ALT T SATQKAIA SGI IPNSV  DKT+SHLEIQPLVDLLR CVD RFLKQAKT+
Sbjct: 45  RTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQAKTV 104

Query: 77  HGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNG 136
           HGFLLKSKFSNH+SLVLLNHVAHAYSKCSDI AACR+FDQMSQRNIFSWT II GLA+NG
Sbjct: 105 HGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLAENG 164

Query: 137 LFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVST 196
           LFLDGFE FCEMQS GIFPD FAYSGILQICIGL+S +LG+MVHAQIVIRGF SHTFVST
Sbjct: 165 LFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFVST 224

Query: 197 ALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTP 256
           ALLNMYAKLQ+IEDS KVFN MTE NVVSWNAMI+GFTSNG YL+AFD FLRMK EGVTP
Sbjct: 225 ALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGVTP 284

Query: 257 NAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIF 316
           +AQTFIG+AKAIGMLRDVNKAKEVS +A ELGVDSNTLVGTALIDMHSKCGSLQEA  IF
Sbjct: 285 DAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARSIF 344

Query: 317 DSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKC 376
           +SHF  CR NAPWNAMISGYLQS  NEKALELFAKMCQ+DIHLD YTYCSVFNAIA LKC
Sbjct: 345 NSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASLKC 404

Query: 377 LSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVT 436
           L SGKKVHARAIKSGLEVN ++ISNAVANAYAKCGSL+DV KVF R+E+RDL+SWT+LVT
Sbjct: 405 LLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVT 464

Query: 437 AYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDM 496
           AYSQCSEWDKAIEIFSNMR  G+ PNQFAFSSVLVSCA+LCLLEYGQQVHG +CKVGLDM
Sbjct: 465 AYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGLDM 524

Query: 497 DKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRME 556
           DKCIESAL+DMYAKCG L +AKKVFNRISNADTVSWTAII+GHAQHG+VDDALQLFRRM 
Sbjct: 525 DKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMV 584

Query: 557 QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRL 616
            LGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMK+TYGLVPEMEHY+CIVDLLSRVGRL
Sbjct: 585 LLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGRL 644

Query: 617 NDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTY 676
           NDAM FIS+MP+EPNEMVWQTLLGACRVHGNV LGELAAQKIL+F+AENSAT+VLLSNTY
Sbjct: 645 NDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTY 704

Query: 677 IESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELR 736
           IESGSYKDGLSLRH MKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL+
Sbjct: 705 IESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELK 764

Query: 737 LKANSLDDVPDLSYEL 753
           LK  SLDDVP LSYEL
Sbjct: 765 LKLISLDDVPYLSYEL 780

BLAST of Tan0004841 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 472.6 bits (1215), Expect = 5.8e-133
Identity = 251/671 (37.41%), Positives = 387/671 (57.68%), Query Frame = 0

Query: 84  KFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQRNIFSWTVIIVGLADNGLFLDGFE 143
           KF   + + +   +   Y K S+     ++FD+M +RN+ +WT +I G A N +  +   
Sbjct: 121 KFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMNDEVLT 180

Query: 144 LFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMVHAQIVIRGFASHTFVSTALLNMYA 203
           LF  MQ+ G  P+ F ++  L +         G  VH  +V  G      VS +L+N+Y 
Sbjct: 181 LFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINLYL 240

Query: 204 KLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLYLEAFDHFLRMKREGVTPNAQTFIG 263
           K   +  +  +F+     +VV+WN+MISG+ +NGL LEA   F  M+   V  +  +F  
Sbjct: 241 KCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSFAS 300

Query: 264 LAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTALIDMHSKCGSLQEAGYIFDSHFTNC 323
           + K    L+++   +++     + G   +  + TAL+  +SKC ++ +A  +F      C
Sbjct: 301 VIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE--IGC 360

Query: 324 RVN-APWNAMISGYLQSECNEKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKCLSSGKK 383
             N   W AMISG+LQ++  E+A++LF++M +  +  + +TY  +  A+ V+    S  +
Sbjct: 361 VGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVI----SPSE 420

Query: 384 VHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKVFYRIEERDLVSWTTLVTAYSQCS 443
           VHA+ +K+  E     +  A+ +AY K G +++ +KVF  I+++D+V+W+ ++  Y+Q  
Sbjct: 421 VHAQVVKTNYE-RSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTG 480

Query: 444 EWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASL-CLLEYGQQVHGFLCKVGLDMDKCIE 503
           E + AI++F  + +GG  PN+F FSS+L  CA+    +  G+Q HGF  K  LD   C+ 
Sbjct: 481 ETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVS 540

Query: 504 SALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGHAQHGVVDDALQLFRRMEQLGVE 563
           SAL+ MYAK GN+  A++VF R    D VSW ++ISG+AQHG    AL +F+ M++  V+
Sbjct: 541 SALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVK 600

Query: 564 PNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRLNDAME 623
            + VTF+ V  AC+H GLVEEG +YF +M     + P  EH SC+VDL SR G+L  AM+
Sbjct: 601 MDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMK 660

Query: 624 FISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKILAFRAENSATFVLLSNTYIESGS 683
            I  MP      +W+T+L ACRVH    LG LAA+KI+A + E+SA +VLLSN Y ESG 
Sbjct: 661 VIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGD 720

Query: 684 YKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS 743
           +++   +R  M E+ VKKEPG SWI V    + F AGD+ HP KD+IY KLE+L  +   
Sbjct: 721 WQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKD 780

Query: 744 LDDVPDLSYEL 753
           L   PD SY L
Sbjct: 781 LGYEPDTSYVL 784

BLAST of Tan0004841 vs. TAIR 10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 446.0 bits (1146), Expect = 5.8e-125
Identity = 242/694 (34.87%), Positives = 377/694 (54.32%), Query Frame = 0

Query: 45  ADKTNS-HLEIQPLVDLLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSK 104
           A K +S  + ++  + L+  C   R L Q + IH  +L S        +L NH+   Y K
Sbjct: 57  AQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHILNSNCK--YDTILNNHILSMYGK 116

Query: 105 CSDIGAACRLFDQMSQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGI 164
           C  +  A  +FD M +RN+ S+T +I G + NG   +   L+ +M    + PDQFA+  I
Sbjct: 117 CGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIRLYLKMLQEDLVPDQFAFGSI 176

Query: 165 LQICIGLESTDLGRMVHAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANV 224
           ++ C       LG+ +HAQ++    +SH     AL+ MY +  Q+ D+ +VF  +   ++
Sbjct: 177 IKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDL 236

Query: 225 VSWNAMISGFTSNGLYLEAFDHFLRMKREGV-TPNAQTFIGLAKAIGMLRDVNKAKEVSH 284
           +SW+++I+GF+  G   EA  H   M   GV  PN   F    KA   L   +   ++  
Sbjct: 237 ISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHG 296

Query: 285 FASELGVDSNTLVGTALIDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECN 344
              +  +  N + G +L DM+++CG L  A  +FD         A WN +I+G   +   
Sbjct: 297 LCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQ--IERPDTASWNVIIAGLANNGYA 356

Query: 345 EKALELFAKMCQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNA 404
           ++A+ +F++M  +    D  +  S+  A      LS G ++H+  IK G  +  + + N+
Sbjct: 357 DEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGF-LADLTVCNS 416

Query: 405 VANAYAKCGSLDDVSKVFYRIEER-DLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTP 464
           +   Y  C  L     +F       D VSW T++TA  Q  +  + + +F  M      P
Sbjct: 417 LLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSECEP 476

Query: 465 NQFAFSSVLVSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVF 524
           +     ++L  C  +  L+ G QVH +  K GL  ++ I++ LIDMYAKCG+L +A+++F
Sbjct: 477 DHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIF 536

Query: 525 NRISNADTVSWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVE 584
           + + N D VSW+ +I G+AQ G  ++AL LF+ M+  G+EPN VTF+ VL ACSH GLVE
Sbjct: 537 DSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVGLVE 596

Query: 585 EGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGA 644
           EGL+ +  M+  +G+ P  EH SC+VDLL+R GRLN+A  FI  M +EP+ +VW+TLL A
Sbjct: 597 EGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTLLSA 656

Query: 645 CRVHGNVGLGELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEP 704
           C+  GNV L + AA+ IL     NS   VLL + +  SG++++   LR +MK+  VKK P
Sbjct: 657 CKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVKKIP 716

Query: 705 GCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 736
           G SWI +   +H F+A D  HPE+D IY  L  +
Sbjct: 717 GQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNI 745

BLAST of Tan0004841 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 443.4 bits (1139), Expect = 3.8e-124
Identity = 228/688 (33.14%), Positives = 390/688 (56.69%), Query Frame = 0

Query: 52  LEIQPLVDLLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAAC 111
           ++ + L  +L+ C D + LK  K +  F+  + F    +L   + ++  Y+ C D+  A 
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLG--SKLSLMYTNCGDLKEAS 151

Query: 112 RLFDQMSQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLE 171
           R+FD++       W +++  LA +G F     LF +M S G+  D + +S + +    L 
Sbjct: 152 RVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLR 211

Query: 172 STDLGRMVHAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMIS 231
           S   G  +H  I+  GF     V  +L+  Y K Q+++ + KVF+ MTE +V+SWN++I+
Sbjct: 212 SVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIIN 271

Query: 232 GFTSNGLYLEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDS 291
           G+ SNGL  +    F++M   G+  +  T + +       R ++  + V     +     
Sbjct: 272 GYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSR 331

Query: 292 NTLVGTALIDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAK 351
                  L+DM+SKCG L  A  +F     + R    + +MI+GY +     +A++LF +
Sbjct: 332 EDRFCNTLLDMYSKCGDLDSAKAVFRE--MSDRSVVSYTSMIAGYAREGLAGEAVKLFEE 391

Query: 352 MCQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCG 411
           M +  I  D YT  +V N  A  + L  GK+VH    ++ L  + I +SNA+ + YAKCG
Sbjct: 392 MEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFD-IFVSNALMDMYAKCG 451

Query: 412 SLDDVSKVFYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFS-NMREGGFTPNQFAFSSVL 471
           S+ +   VF  +  +D++SW T++  YS+    ++A+ +F+  + E  F+P++   + VL
Sbjct: 452 SMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVL 511

Query: 472 VSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTV 531
            +CASL   + G+++HG++ + G   D+ + ++L+DMYAKCG L+ A  +F+ I++ D V
Sbjct: 512 PACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLV 571

Query: 532 SWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLM 591
           SWT +I+G+  HG   +A+ LF +M Q G+E + ++F+ +L+ACSH GLV+EG ++F +M
Sbjct: 572 SWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIM 631

Query: 592 KETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGL 651
           +    + P +EHY+CIVD+L+R G L  A  FI  MPI P+  +W  LL  CR+H +V L
Sbjct: 632 RHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKL 691

Query: 652 GELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNG 711
            E  A+K+     EN+  +VL++N Y E+  ++    LR  + ++G++K PGCSWI + G
Sbjct: 692 AEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKG 751

Query: 712 TLHKFYAGDQQHPEKDKIYAKLEELRLK 739
            ++ F AGD  +PE + I A L ++R +
Sbjct: 752 RVNIFVAGDSSNPETENIEAFLRKVRAR 774

BLAST of Tan0004841 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 443.0 bits (1138), Expect = 4.9e-124
Identity = 245/688 (35.61%), Positives = 370/688 (53.78%), Query Frame = 0

Query: 60  LLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQMSQ 119
           +L  C  +  L+  + +HG +LK  FS+     + N +   Y    ++ +A  +F  MSQ
Sbjct: 294 VLSACKKIESLEIGEQLHGLVLKLGFSS--DTYVCNALVSLYFHLGNLISAEHIFSNMSQ 353

Query: 120 RNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLGRMV 179
           R+  ++  +I GL+  G      ELF  M   G+ PD    + ++  C    +   G+ +
Sbjct: 354 RDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQL 413

Query: 180 HAQIVIRGFASHTFVSTALLNMYAKLQQIEDSFKVFNNMTEANVVSWNAMISGFTSNGLY 239
           HA     GFAS+  +  ALLN+YAK   IE +   F      NVV WN M+  +      
Sbjct: 414 HAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDL 473

Query: 240 LEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTLVGTAL 299
             +F  F +M+ E + PN  T+  + K    L D+   +++     +     N  V + L
Sbjct: 474 RNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVL 533

Query: 300 IDMHSKCGSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECNEKALELFAKMCQNDIHL 359
           IDM++K G L  A  I    F    V   W  MI+GY Q   ++KAL  F +M    I  
Sbjct: 534 IDMYAKLGKLDTAWDIL-IRFAGKDV-VSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRS 593

Query: 360 DHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYAKCGSLDDVSKV 419
           D     +  +A A L+ L  G+++HA+A  SG   + +   NA+   Y++CG +++    
Sbjct: 594 DEVGLTNAVSACAGLQALKEGQQIHAQACVSGFS-SDLPFQNALVTLYSRCGKIEESYLA 653

Query: 420 FYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSSVLVSCASLCLL 479
           F + E  D ++W  LV+ + Q    ++A+ +F  M   G   N F F S + + +    +
Sbjct: 654 FEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANM 713

Query: 480 EYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNADTVSWTAIISGH 539
           + G+QVH  + K G D +  + +ALI MYAKCG++ +A+K F  +S  + VSW AII+ +
Sbjct: 714 KQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAY 773

Query: 540 AQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPE 599
           ++HG   +AL  F +M    V PN VT + VL ACSH GLV++G+ YF+ M   YGL P+
Sbjct: 774 SKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPK 833

Query: 600 MEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNVGLGELAAQKIL 659
            EHY C+VD+L+R G L+ A EFI  MPI+P+ +VW+TLL AC VH N+ +GE AA  +L
Sbjct: 834 PEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLL 893

Query: 660 AFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISVNGTLHKFYAGD 719
               E+SAT+VLLSN Y  S  +      R  MKE+GVKKEPG SWI V  ++H FY GD
Sbjct: 894 ELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGD 953

Query: 720 QQHPEKDKIYAKLEELRLKANSLDDVPD 748
           Q HP  D+I+   ++L  +A+ +  V D
Sbjct: 954 QNHPLADEIHEYFQDLTKRASEIGYVQD 976

BLAST of Tan0004841 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 441.0 bits (1133), Expect = 1.9e-123
Identity = 241/699 (34.48%), Positives = 399/699 (57.08%), Query Frame = 0

Query: 60  LLRDCVDVRFLKQAKTIHGFLLKSKFSNHESLVLLNHVAHAYSKCSDIGAACRLFDQM-- 119
           LL+ C+  R  +  K +H  L+  +F      VL N +   YSK  D   A  +F+ M  
Sbjct: 68  LLKSCIRARDFRLGKLVHARLI--EFDIEPDSVLYNSLISLYSKSGDSAKAEDVFETMRR 127

Query: 120 -SQRNIFSWTVIIVGLADNGLFLDGFELFCEMQSLGIFPDQFAYSGILQICIGLESTDLG 179
             +R++ SW+ ++    +NG  LD  ++F E   LG+ P+ + Y+ +++ C   +   +G
Sbjct: 128 FGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGLVPNDYCYTAVIRACSNSDFVGVG 187

Query: 180 RMVHAQIVIRG-FASHTFVSTALLNMYAKLQ-QIEDSFKVFNNMTEANVVSWNAMISGFT 239
           R+    ++  G F S   V  +L++M+ K +   E+++KVF+ M+E NVV+W  MI+   
Sbjct: 188 RVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCM 247

Query: 240 SNGLYLEAFDHFLRMKREGVTPNAQTFIGLAKAIGMLRDVNKAKEVSHFASELGVDSNTL 299
             G   EA   FL M   G   +  T   +  A   L +++  K++  +A   G+  +  
Sbjct: 248 QMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDD-- 307

Query: 300 VGTALIDMHSKC---GSLQEAGYIFDSHFTNCRVNAPWNAMISGYLQSECN--EKALELF 359
           V  +L+DM++KC   GS+ +   +FD    +  ++  W A+I+GY+++ CN   +A+ LF
Sbjct: 308 VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMS--WTALITGYMKN-CNLATEAINLF 367

Query: 360 AKM-CQNDIHLDHYTYCSVFNAIAVLKCLSSGKKVHARAIKSGLEVNHINISNAVANAYA 419
           ++M  Q  +  +H+T+ S F A   L     GK+V  +A K GL  N  +++N+V + + 
Sbjct: 368 SEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNS-SVANSVISMFV 427

Query: 420 KCGSLDDVSKVFYRIEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMREGGFTPNQFAFSS 479
           K   ++D  + F  + E++LVS+ T +    +   +++A ++ S + E     + F F+S
Sbjct: 428 KSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERELGVSAFTFAS 487

Query: 480 VLVSCASLCLLEYGQQVHGFLCKVGLDMDKCIESALIDMYAKCGNLVEAKKVFNRISNAD 539
           +L   A++  +  G+Q+H  + K+GL  ++ + +ALI MY+KCG++  A +VFN + N +
Sbjct: 488 LLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTASRVFNFMENRN 547

Query: 540 TVSWTAIISGHAQHGVVDDALQLFRRMEQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFK 599
            +SWT++I+G A+HG     L+ F +M + GV+PN VT++ +L ACSH GLV EG ++F 
Sbjct: 548 VISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHVGLVSEGWRHFN 607

Query: 600 LMKETYGLVPEMEHYSCIVDLLSRVGRLNDAMEFISRMPIEPNEMVWQTLLGACRVHGNV 659
            M E + + P+MEHY+C+VDLL R G L DA EFI+ MP + + +VW+T LGACRVH N 
Sbjct: 608 SMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTFLGACRVHSNT 667

Query: 660 GLGELAAQKILAFRAENSATFVLLSNTYIESGSYKDGLSLRHAMKEQGVKKEPGCSWISV 719
            LG+LAA+KIL       A ++ LSN Y  +G +++   +R  MKE+ + KE GCSWI V
Sbjct: 668 ELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKERNLVKEGGCSWIEV 727

Query: 720 NGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLDDVPD 748
              +HKFY GD  HP   +IY +L+ L  +      VPD
Sbjct: 728 GDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVPD 758
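
The percentages in each HSP summary line follow directly from the match counts and the alignment length printed beside them. A minimal Python sketch of that arithmetic, using the AT4G13650.1 values reported above; the regular expression and the helper name `check_hsp_line` are illustrative assumptions, not part of any BLAST tool:

```python
import re

# Matches fields such as "Identity = 245/688 (35.61%)" in an HSP summary line.
# Fields without a fraction (e.g. "Query Frame = 0") are ignored.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    out = {}
    for label, num, den, pct in FIELD.findall(line):
        num, den = int(num), int(den)
        out[label] = (num, den, float(pct), round(100 * num / den, 2))
    return out

# Values copied from the AT4G13650.1 HSP summary line.
stats = check_hsp_line(
    "Identity = 245/688 (35.61%), Positives = 370/688 (53.78%), Query Frame = 0"
)
for label, (num, den, reported, recomputed) in stats.items():
    print(label, num, den, reported, recomputed)
```

The recomputed value is the match count divided by the alignment length (688 columns here, gaps included), so it can be compared against the reported percentage as a sanity check when parsing such records.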

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9ZUW3 | 8.1e-132 | 37.41 | Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...
Q9LFI1 | 8.2e-124 | 34.87 | Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop...
Q9SN39 | 5.3e-123 | 33.14 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9SVP7 | 6.9e-123 | 35.61 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Q5G1T1 | 2.6e-122 | 34.48 | Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...

Match Name | E-value | Identity (%) | Description
XP_022939229.1 | 0.0e+00 | 88.18 | pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita moschata] ...
KAG7016032.1 | 0.0e+00 | 88.18 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_038884632.1 | 0.0e+00 | 88.92 | pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida] >...
XP_023549692.1 | 0.0e+00 | 87.77 | pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita pepo subsp...
XP_022993736.1 | 0.0e+00 | 87.77 | pentatricopeptide repeat-containing protein At2g27610-like [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
A0A6J1FGJ6 | 0.0e+00 | 88.18 | pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata...
A0A6J1JX63 | 0.0e+00 | 87.77 | pentatricopeptide repeat-containing protein At2g27610-like OS=Cucurbita maxima O...
A0A0A0KBQ4 | 0.0e+00 | 87.36 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013890 PE=4 SV=1
A0A1S4E171 | 0.0e+00 | 87.16 | pentatricopeptide repeat-containing protein At2g27610-like OS=Cucumis melo OX=36...
A0A5D3C2B1 | 0.0e+00 | 87.50 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT2G27610.1 | 5.8e-133 | 37.41 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53360.1 | 5.8e-125 | 34.87 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1 | 3.8e-124 | 33.14 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G13650.1 | 4.9e-124 | 35.61 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G49170.1 | 1.9e-123 | 34.48 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 329..370, e-value: 9.5E-10, score: 38.5; coord: 427..474, e-value: 4.9E-8, score: 33.0; coord: 528..574, e-value: 2.8E-11, score: 43.4; coord: 222..264, e-value: 3.5E-11, score: 43.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 329..360, e-value: 2.8E-4, score: 18.8; coord: 123..156, e-value: 8.8E-6, score: 23.6; coord: 530..564, e-value: 2.1E-8, score: 31.8; coord: 429..462, e-value: 4.4E-7, score: 27.7; coord: 565..598, e-value: 0.0032, score: 15.5; coord: 224..258, e-value: 4.4E-6, score: 24.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 95..122, e-value: 0.31, score: 11.4; coord: 123..153, e-value: 1.1E-4, score: 22.2; coord: 602..626, e-value: 0.55, score: 10.6; coord: 502..524, e-value: 0.0045, score: 17.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 427..461, score: 12.550746
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 528..562, score: 13.022082
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 222..256, score: 12.024604
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 121..155, score: 10.55579
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 268..375, e-value: 6.1E-15, score: 57.0; coord: 173..267, e-value: 7.0E-18, score: 66.6; coord: 39..172, e-value: 1.4E-16, score: 62.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 376..491, e-value: 1.8E-19, score: 72.3; coord: 499..727, e-value: 4.6E-41, score: 143.2
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 321..735; coord: 12..221
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 224..318
None | No IPR available | PANTHER | PTHR24015:SF1875 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN | coord: 321..735; coord: 12..221
None | No IPR available | PANTHER | PTHR24015:SF1875 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN | coord: 224..318
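
The `coord: start..end` entries above can be turned into span lengths with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names `parse_coords` and `span_lengths` are hypothetical. The input string holds the four PROSITE PS51375 hits listed above.

```python
import re

# Matches entries of the form "coord: 427..461".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coords(text):
    """Extract (start, end) pairs from 'coord: A..B' entries, sorted by start."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))

def span_lengths(spans):
    """Length of each span, ASSUMING 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in spans]

# PROSITE PS51375 (PPR) hits copied from the InterPro annotations above.
prosite_hits = "coord: 427..461 coord: 528..562 coord: 222..256 coord: 121..155"

spans = parse_coords(prosite_hits)
lengths = span_lengths(spans)
print(spans)    # [(121, 155), (222, 256), (427, 461), (528, 562)]
print(lengths)  # [35, 35, 35, 35]
```

Under that inclusive-coordinate assumption each PROSITE hit spans 35 positions, which is consistent with the roughly 35-residue motif that gives pentatricopeptide repeats their name.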

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0004841.1 | Tan0004841.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding