Tan0016523 (gene) Snake gourd v1

Overview
Name: Tan0016523
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: P-loop containing nucleoside triphosphate hydrolases superfamily protein
Location: LG04: 7809196 .. 7811565 (+)
RNA-Seq Expression: Tan0016523
Synteny: Tan0016523
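The Location field can be read programmatically. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style genome browsers, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Pattern for strings such as "LG04: 7809196 .. 7811565 (+)".
_LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and span length.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = _LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("LG04: 7809196 .. 7811565 (+)"))
```

Under that assumption the feature spans 2370 bp on the forward strand of LG04.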
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAACAACAGCTCCGTGTAGGAGAAAATCTCTGGCCGTTGAAGCTTCAAAAGAGATTACGAACGTTTCTTCTCAGAACCCAACAAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCGGCGGTATTCGCCGCTCAGGTCCGTGTATTTGTGTGTTTTTTGGAGTGGGCTTTATGAATTTTTGGAGTTCTAGCACTTTGTGTGATATTGTCTGTGGACACCAACTTCTTTGCTTGGGATTTGAGATTCGTAGGGGCTATACTTTCTCTATGCTTGACTTCTCGCTTCCTGATTTAAACATGAACTATTGAGTGTGGGAAAGGACTTTATGGACCATAATGATATATATCCATAGGGCGCCATTCTGAAGCTAATTGATGGATGTATGGTGTATTGTATCTGGCATCTTGCCAATTATTACTTTTAGACTTTTAGTCCCTGTATAAATGCTCTCTTAGAAGTTATATGAACAAATTTTGTTACTTTTCTAGTTTTCTTACTTCTTACTTTTCATTTATGTATGTATTTTTGGATTTGATGGGTCGTCAGCACTTTGCATTGTCTGGATTTCGCACATGCCTGGCGATACATAATCAAGATCCTACTCCTGAGTACCTTCTGGATTGTAAAATTGGGAATTCTCCTGTTGAATGCGGTCACAACCTCTCGGCTGTTAGGTTGGAAACCACTCAAGTGTGTTTTTCCGTTCACTTCTCTTTTTGTCATCAAGGTTCTACTTTGGATTTCAGACTTATCTTGTCAATGACAACAGATGCTTCTTGAACCTCTCAAATGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTGGGTATGTGAACTTTAGAATATGCCAATATGGTGCATCATTAAATGCAAGATTGCATTTCAGTGTTTCTTCTTTGGAGGACTTGCAATGTATCTTCTTGATGTTTTAGTAGGTCGTTGGAGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGATTTCTTGGGTTCTCAGGGATTATGGCCCAAAGAGACCAAAAAGCTAAATATGACATAGTAATATATGACGGTATCTGCACTGAAGAAACAGTAAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGAGATGCTCTCTCTCTCCCCCTCCCCTTGCACAAATCTGACAAAAATTGTATTACTAAAATAACTTTTGATAGGTTGTACTTAAAATATCTGAAGAGCATTGCTGAAAAAACTGATCTTGGGAGGTTGGCTACTCCTTCAATTTTGAGGTTTGTTGATGAAGCCATGAGTATAAGCGGGCCAGGCTCCCCTCTCAGTAGTAGAACCAGCACTGATATATGGGAGGCACTTGAACGCATGTTAGAGGTAAATTGGTTTGCACTCTTGTGAAACAACATGAATGAAATCTAGTGAAATTGTTTAATGGAAATTCGTGGTACAGAAAGCATCTTCCGCATTTTCAGAGCAAAGTAAATTTAGCTGCTTCATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAATCTGCATTACGGTACTGGGGCTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCACAATTGGATGCAGAATCCATTGCTAGATTGAAGGAAAATTTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAAGTGGTTCCCCTGTAGATTGGAACACAGTTCTTCGCGATGCATCAAGTAAAGGCCCGAGGGACCTTCTTTCTACGTCAAAAAACCACACCAGCAGTCTGCTATCACCCGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCTGAAATCAAGCTTTACCAGGTACGTTCATTCTCTCTATTAGTCGGAGAG
AAAATATCCTACATCGCAGGGAACTATGAGTTAAGTATCAAATAATGCTCCATTGTGATGATGTTAAGTATGCATTAGACTTGGTTAATGCTTAAGTACGCATTTTCTGGAATTGTCTTGGAAGATTCTCAACTCATTGCAGTTTTGTTTATATTCTAACTTTCTGAGGCACAAGATTTATGACAGAAACACTTCTTACCATCATGCAACTCATTAGTTCCATTCTTTGAAAATGCAGTATAGGGGAGGGTCTGAGCTATTGGTGGAAGCTGGTGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCACGGATAGAAGTCTTGTGATCACAATGCGTTGA

mRNA sequence

ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAACAACAGCTCCGTGTAGGAGAAAATCTCTGGCCGTTGAAGCTTCAAAAGAGATTACGAACGTTTCTTCTCAGAACCCAACAAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCGGCGGTATTCGCCGCTCAGCACTTTGCATTGTCTGGATTTCGCACATGCCTGGCGATACATAATCAAGATCCTACTCCTGAGTACCTTCTGGATTGTAAAATTGGGAATTCTCCTGTTGAATGCGGTCACAACCTCTCGGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAATGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTCGTTGGAGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGATTTCTTGGGTTCTCAGGGATTATGGCCCAAAGAGACCAAAAAGCTAAATATGACATAGTAATATATGACGGTATCTGCACTGAAGAAACAGTAAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAAGAGCATTGCTGAAAAAACTGATCTTGGGAGGTTGGCTACTCCTTCAATTTTGAGGTTTGTTGATGAAGCCATGAGTATAAGCGGGCCAGGCTCCCCTCTCAGTAGTAGAACCAGCACTGATATATGGGAGGCACTTGAACGCATGTTAGAGAAAGCATCTTCCGCATTTTCAGAGCAAAGTAAATTTAGCTGCTTCATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAATCTGCATTACGGTACTGGGGCTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCACAATTGGATGCAGAATCCATTGCTAGATTGAAGGAAAATTTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAAGTGGTTCCCCTGTAGATTGGAACACAGTTCTTCGCGATGCATCAAGTAAAGGCCCGAGGGACCTTCTTTCTACGTCAAAAAACCACACCAGCAGTCTGCTATCACCCGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCTGAAATCAAGCTTTACCAGTATAGGGGAGGGTCTGAGCTATTGGTGGAAGCTGGTGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCACGGATAGAAGTCTTGTGATCACAATGCGTTGA

Coding sequence (CDS)

ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAACAACAGCTCCGTGTAGGAGAAAATCTCTGGCCGTTGAAGCTTCAAAAGAGATTACGAACGTTTCTTCTCAGAACCCAACAAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCGGCGGTATTCGCCGCTCAGCACTTTGCATTGTCTGGATTTCGCACATGCCTGGCGATACATAATCAAGATCCTACTCCTGAGTACCTTCTGGATTGTAAAATTGGGAATTCTCCTGTTGAATGCGGTCACAACCTCTCGGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAATGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTCGTTGGAGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGATTTCTTGGGTTCTCAGGGATTATGGCCCAAAGAGACCAAAAAGCTAAATATGACATAGTAATATATGACGGTATCTGCACTGAAGAAACAGTAAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAAGAGCATTGCTGAAAAAACTGATCTTGGGAGGTTGGCTACTCCTTCAATTTTGAGGTTTGTTGATGAAGCCATGAGTATAAGCGGGCCAGGCTCCCCTCTCAGTAGTAGAACCAGCACTGATATATGGGAGGCACTTGAACGCATGTTAGAGAAAGCATCTTCCGCATTTTCAGAGCAAAGTAAATTTAGCTGCTTCATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAATCTGCATTACGGTACTGGGGCTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCACAATTGGATGCAGAATCCATTGCTAGATTGAAGGAAAATTTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAAGTGGTTCCCCTGTAGATTGGAACACAGTTCTTCGCGATGCATCAAGTAAAGGCCCGAGGGACCTTCTTTCTACGTCAAAAAACCACACCAGCAGTCTGCTATCACCCGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCTGAAATCAAGCTTTACCAGTATAGGGGAGGGTCTGAGCTATTGGTGGAAGCTGGTGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCACGGATAGAAGTCTTGTGATCACAATGCGTTGA

Protein sequence

MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
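
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated on this page; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are rendered as 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First four codons and last four codons of the CDS listed above.
    print(translate("ATGGCTTCGTCT"))   # start of the protein
    print(translate("ACAATGCGTTGA"))   # end of the protein, TGA is the stop
```

Passing the full CDS string to `translate` should reproduce the protein sequence if the assumption about the genetic code holds.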
Homology
BLAST of Tan0016523 vs. ExPASy Swiss-Prot
Match: Q6DYE4 (Uncharacterized protein At1g26090, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g26090 PE=1 SV=1)

HSP 1 Score: 498.8 bits (1283), Expect = 6.3e-140
Identity = 264/456 (57.89%), Positives = 336/456 (73.68%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLT 60
           + +S L  +S   N +PI +RT T    RK     +A  +S+++ +    SSQ  T+ +T
Sbjct: 4   LVNSSLTCSSLTLNLLPI-LRTETPSLSRKRRAAYVAATSSRDVNDTAADSSQKLTKFVT 63

Query: 61  FLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAV 120
           FLGKGGSGKTT+AVFAAQH+AL+G  TCL IHNQDP+ E+LL  KIG SP     NLS +
Sbjct: 64  FLGKGGSGKTTAAVFAAQHYALAGLSTCLVIHNQDPSAEFLLGSKIGTSPTLINDNLSVI 123

Query: 121 RLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSG 180
           RLETT+MLLEPLK LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LER +GF  
Sbjct: 124 RLETTKMLLEPLKQLKQADARLNMTQGVLEGVVGEELGVLPGMDSIFSMLELERLVGFFR 183

Query: 181 IMAQRDQKAK-YDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSIL 240
              +++ K K +D++IYDGI TEET+RMIG +SK RLY KYL+S+AEKTDLGRL +PSI+
Sbjct: 184 QATRKNHKGKPFDVIIYDGISTEETLRMIGLSSKTRLYAKYLRSLAEKTDLGRLTSPSIM 243

Query: 241 RFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ 300
           RFVDE+M+I+   SP    TS  +W+ LER LE  +SA+ +  +F  F+VMDP +P SV+
Sbjct: 244 RFVDESMNINSNKSPFDGMTSPAMWDTLERFLETGASAWRDPERFRSFLVMDPNNPMSVK 303

Query: 301 SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNT 360
           +ALRYWGCT+QAG+ +SGAFA  SS L ++     K +F PL  A      + + +DW+ 
Sbjct: 304 AALRYWGCTVQAGSHVSGAFAISSSHLTSQI---PKADFVPLPFASASVPFTITGLDWDK 363

Query: 361 VLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSEL 420
           +L D ++   R+LLS + +H +SL   V F+   K VTL MPGFEKSEIKLYQYRGGSEL
Sbjct: 364 ILLDQANSSIRELLSETVSHGTSLTQTVMFDTAKKLVTLFMPGFEKSEIKLYQYRGGSEL 423

Query: 421 LVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR 449
           L+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Sbjct: 424 LIEAGDQRRVIHLPSQIQGKVGGAKFVDRSLIVTMR 455
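
The percentages on each HSP summary line are the identical or positive-scoring positions divided by the alignment length. The sketch below parses such a line and recomputes both ratios; the regular expression is written against the line layout shown on this page only and is not a general BLAST parser, and `parse_hsp_stats` is a hypothetical helper name.

```python
import re

# Matches e.g. "Identity = 264/456 (57.89%), Positives = 336/456 (73.68%), Query Frame = 0".
# "Pos\w*tives" also accepts the misspelling "Postives" seen in some exports.
_HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w*tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = _HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positives, length_pos = map(int, match.groups())
    if length != length_pos:
        raise ValueError("identity and positives report different alignment lengths")
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 264/456 (57.89%), Positives = 336/456 (73.68%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages rather than trusting the printed values gives a quick consistency check when hit tables are collected from many records.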

BLAST of Tan0016523 vs. ExPASy Swiss-Prot
Match: Q46465 (Putative arsenical pump-driving ATPase OS=Prosthecochloris vibrioformis OX=1098 PE=3 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 4.9e-15
Identity = 94/415 (22.65%), Positives = 182/415 (43.86%), Query Frame = 0

Query: 50  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHN 109
           R+LTF GKGG GKT+ +   A   +  G RT +   +   +     + ++G  P +   N
Sbjct: 2   RILTFTGKGGVGKTSVSAATAVRLSEMGHRTLVLSTDPAHSLSDSFNLQLGAEPTKIKEN 61

Query: 110 LSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFL 169
           L A+ +     L E    +++  +R+ M QGV  GV+ +E+ ILPGM+ +FS+L+++R+ 
Sbjct: 62  LHAIEVNPYVDLKENWHSVQKYYTRVFMAQGV-SGVMADEMTILPGMEELFSLLRIKRY- 121

Query: 170 GFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATP 229
                         YD ++ D   T ET+R++              S+ +    G  A  
Sbjct: 122 ---------KSTGLYDALVLDTAPTGETLRLL--------------SLPDTLSWGMKAVK 181

Query: 230 SILRFVDEAMSISGPGSPLSSRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFI 289
           ++ +++     +S P S +S + +      D  E+++++   LE      ++  K +  +
Sbjct: 182 NVNKYI--VRPLSKPLSKMSDKIAYYIPPEDAIESVDQVFDELEDIRDILTDNVKSTVRL 241

Query: 290 VMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ 349
           VM+     S++  +R        G ++      ++  LDA+  +   E +  +   ++ +
Sbjct: 242 VMN-AEKMSIKETMRALTYLNLYGFKVD--MVLVNKLLDAQENSGYLEKWKGIQQKYLGE 301

Query: 350 FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTL 409
              G SP+    + + D    G + L          T  +       P+KF        +
Sbjct: 302 IEEGFSPLPVKKLKMYDQEIVGVKSLEVFAHDIYGDTDPSDMMYDEPPIKFVRKGDIYEV 361

Query: 410 LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGAKFTDRSLVI 446
            +     + + +  +  G EL V+ G+QR++I+LP  + G + G A F D+ L I
Sbjct: 362 QLKLMFANPVDIDVWVTGDELFVQIGNQRKIITLPVSLTGLEPGDAVFRDKWLHI 386

BLAST of Tan0016523 vs. ExPASy Swiss-Prot
Match: Q46366 (Putative arsenical pump-driving ATPase OS=Chlorobaculum tepidum (strain ATCC 49652 / DSM 12025 / NBRC 103806 / TLS) OX=194439 GN=CT1945 PE=3 SV=2)

HSP 1 Score: 82.8 bits (203), Expect = 1.1e-14
Identity = 93/415 (22.41%), Positives = 182/415 (43.86%), Query Frame = 0

Query: 50  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHN 109
           R+LTF GKGG GKT+ +   A   +  G RT +   +   +     + ++G  P +   N
Sbjct: 2   RILTFTGKGGVGKTSVSAATAVRLSEMGHRTLVLSTDPAHSLSDSFNIQLGAEPTKIKEN 61

Query: 110 LSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFL 169
           L A+ +     L +    +++  +R+ M QGV  GV+ +E+ ILPGM+ +FS+L+++R+ 
Sbjct: 62  LHAIEVNPYVDLKQNWHSVQKYYTRIFMAQGV-SGVMADEMTILPGMEELFSLLRIKRY- 121

Query: 170 GFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATP 229
                         YD ++ D   T ET+R++              S+ +    G  A  
Sbjct: 122 ---------KSAGLYDALVLDTAPTGETLRLL--------------SLPDTLSWGMKAVK 181

Query: 230 SILRFVDEAMSISGPGSPLSSRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFI 289
           ++ +++     +S P S +S + +      D  E+++++   LE      ++  K +  +
Sbjct: 182 NVNKYI--VRPLSKPLSKMSDKIAYYIPPEDAIESVDQVFDELEDIREILTDNVKSTVRL 241

Query: 290 VMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ 349
           VM+     S++  +R        G ++      ++  LDA+  +   E +  +   ++ +
Sbjct: 242 VMN-AEKMSIKETMRALTYLNLYGFKVD--MVLVNKLLDAQENSGYLEKWKGIQQKYLGE 301

Query: 350 FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTL 409
              G SP+    + + D    G + L          T  +       P+KF        +
Sbjct: 302 IEEGFSPLPVKKLKMYDQEIVGVKSLEVFAHDIYGDTDPSGMMYDEPPIKFVRQGDVYEV 361

Query: 410 LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGAKFTDRSLVI 446
            +     + + +  +  G EL V+ G+QR++I+LP  + G + G A F D+ L I
Sbjct: 362 QLKLMFANPVDIDVWVTGDELFVQIGNQRKIITLPVSLTGLEPGDAVFKDKWLHI 386

BLAST of Tan0016523 vs. ExPASy Swiss-Prot
Match: Q55794 (Putative arsenical pump-driving ATPase OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=sll0086 PE=3 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 1.5e-11
Identity = 91/409 (22.25%), Positives = 162/409 (39.61%), Query Frame = 0

Query: 50  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHN 109
           R++   GKGG GKT+ A       A  G +T +   +   +     D ++G+ P     N
Sbjct: 2   RVILMTGKGGVGKTSVAAATGLRCAELGHKTLVLSTDPAHSLADSFDLELGHEPRLVKEN 61

Query: 110 LSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFL 169
           L    L+    L      +K+  +++   +G L+GV  EEL ILPGMD IF +++++R  
Sbjct: 62  LWGAELDALMELEGNWGAVKRYITQVLQARG-LDGVQAEELAILPGMDEIFGLVRMKRHY 121

Query: 170 GFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATP 229
                      +A YD++I D   T   +R++        Y++      +   +      
Sbjct: 122 ----------DEADYDVLIIDSAPTGTALRLLSLPEVGGWYMRRFYKPLQGMSVA----- 181

Query: 230 SILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPA 289
             LR + E +     G  L  +   D        +E      ++ ++ S  +V +P    
Sbjct: 182 --LRPLVEPLFRPIAGFSLPDKEVMDAPYEFYEQIEALEKVLTDNTQTSVRLVTNPEKMV 241

Query: 290 SVQSALRYWGCTI-QAGAQISGAFASISSQLDAESIARLK-----------ENFSPLSLA 349
             +S   +   ++      +  A   +   +D     R K           +NF PL + 
Sbjct: 242 LKESLRAHAYLSLYNVSTDLVIANRILPETIDDPFFQRWKSNQQVYKQEIYDNFHPLPVK 301

Query: 350 FMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFE 409
             P FS    +     L        +D   +   +  + ++ V+ +  + S+ L +PG  
Sbjct: 302 EAPLFS--EEMCGLAALERLKDTLYKDEDPSQVYYKENTINIVQGSNNDYSLELYLPGIP 361

Query: 410 KSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGAKFTDRSLVI 446
           K +I+L   + G EL V  G+ RR + LP+ +      GAK  D  L I
Sbjct: 362 KEQIQL--NKTGDELNVRIGNHRRNLVLPQALAALSPAGAKMEDDYLKI 388

BLAST of Tan0016523 vs. ExPASy Swiss-Prot
Match: O50593 (Arsenical pump-driving ATPase OS=Acidiphilium multivorum (strain DSM 11245 / JCM 8867 / NBRC 100883 / AIU301) OX=926570 GN=arsA PE=3 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 7.8e-05
Identity = 43/165 (26.06%), Positives = 68/165 (41.21%), Query Frame = 0

Query: 46  QNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNS--P 105
           QN    L F GKGG GKT+ +   A H A  G R  L   +       + D  IGN+  P
Sbjct: 5   QNIPPYLFFTGKGGVGKTSISCATAIHLAEQGKRVLLVSTDPASNVGQVFDLAIGNTIRP 64

Query: 106 VECGHNLSAVRLETTQ-------MLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGM 165
           V     LSA+ ++  +        +++P+K L   D  +N     L G    E+      
Sbjct: 65  VTAVPGLSALEIDPQEAARQYRARIVDPIKGL-LPDDVVNSISEQLSGACTTEIA----- 124

Query: 166 DSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMI 202
                      F  F+G++       ++D +I+D   T  T+R++
Sbjct: 125 ----------AFDEFTGLLTDASLLTRFDHIIFDTAPTGHTIRLL 153

BLAST of Tan0016523 vs. NCBI nr
Match: XP_022979170.1 (uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima])

HSP 1 Score: 776.2 bits (2003), Expect = 1.6e-220
Identity = 396/448 (88.39%), Positives = 419/448 (93.53%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGS
Sbjct: 1   MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG+SPVEC HNLSAVRLETTQM
Sbjct: 61  GKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGSSPVECSHNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQ DQ
Sbjct: 121 LLEPLKRLKQADSPLNMTQGTLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQTDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           KAKYDIV+YDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAM+
Sbjct: 181 KAKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMN 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SALRYWGC
Sbjct: 241 ISSPGSHLSGRTSTDTWQALEHMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSALRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQISGAFASISS LDAES ARLKENF PL LAFMPQ S GSPVDWNTVL DASSK
Sbjct: 301 TIQAGAQISGAFASISSGLDAESAARLKENFLPLPLAFMPQISVGSPVDWNTVLPDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPR+LLS+SK+H+S+LLSPVKF+PGNKSVTLLMPGFEKSEI+LYQYRGGSELLVEAGDQR
Sbjct: 361 GPRNLLSSSKSHSSNLLSPVKFDPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. NCBI nr
Match: XP_023529730.1 (uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 776.2 bits (2003), Expect = 1.6e-220
Identity = 394/448 (87.95%), Positives = 418/448 (93.30%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGS
Sbjct: 1   MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVF A+HFALSG RTCL IHNQD TPEYLLDCKIGNSPVEC HNLSAVRLETTQM
Sbjct: 61  GKTTSAVFTARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSHNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQ
Sbjct: 121 LLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           KAKYDIV+YDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM+
Sbjct: 181 KAKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMN 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Sbjct: 241 ISSPGSHLSGRTSTDTWQALEHMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQISGAFASISS LDAES ARLKENFSPL LAFMPQ S GSPVDWNTVL DASSK
Sbjct: 301 TIQAGAQISGAFASISSGLDAESAARLKENFSPLPLAFMPQISVGSPVDWNTVLLDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPR+LLS+SK+H+++L SPVKFNPGNKSVTLLMPGFEKSEI+LYQYRGGSELLVEAGDQR
Sbjct: 361 GPRNLLSSSKSHSTNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKFTDRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 448

BLAST of Tan0016523 vs. NCBI nr
Match: XP_022956773.1 (uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata])

HSP 1 Score: 773.5 bits (1996), Expect = 1.0e-219
Identity = 394/448 (87.95%), Positives = 416/448 (92.86%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGS
Sbjct: 1   MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIGNSPVEC  NLSAVRLETTQM
Sbjct: 61  GKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQ
Sbjct: 121 LLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           K KYDIV+YDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM 
Sbjct: 181 KPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMK 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTD W+ALERMLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Sbjct: 241 ISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQISGAFASISS LDAES ARLKENFSPLSL FMPQ S GSPVDWNTVL DASSK
Sbjct: 301 TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPR+LLS+SK+H+S+L SPVKFNPGNKSVTLLMPGFEKSEI+LYQYRGGSELLVEAGDQR
Sbjct: 361 GPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. NCBI nr
Match: XP_038891424.1 (uncharacterized protein At1g26090, chloroplastic [Benincasa hispida])

HSP 1 Score: 773.1 bits (1995), Expect = 1.3e-219
Identity = 398/448 (88.84%), Positives = 414/448 (92.41%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSL FSASFFGNPIPISIRT TAPCR + +A++ASKEIT+VSSQNPTRMLTFLGKGGS
Sbjct: 25  MASSLHFSASFFGNPIPISIRTRTAPCRTRFIALQASKEITDVSSQNPTRMLTFLGKGGS 84

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAAQHFALSG RTCL IHNQDPTPEYLLDCKIGNSPVEC  NLSAVRLETTQM
Sbjct: 85  GKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSRNLSAVRLETTQM 144

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQG+LEGVVGEELG+LPG DSIFS+LQLERFLGFSGIM QRDQ
Sbjct: 145 LLEPLKRLKQADSRLNMTQGILEGVVGEELGVLPGTDSIFSMLQLERFLGFSGIMGQRDQ 204

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           K KYD+VIYDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAMS
Sbjct: 205 KDKYDMVIYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMS 264

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTDIWEALE +LEK SSAF+E  KFSCFIVMDPTSPASVQSALRYWGC
Sbjct: 265 ISRPGSHLSVRTSTDIWEALEHVLEKGSSAFAEPRKFSCFIVMDPTSPASVQSALRYWGC 324

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAG QISGA A ISS L AES A LKE FSPLSLAFMPQFS+GS VDWNTVLRDASSK
Sbjct: 325 TIQAGGQISGALAFISSHLSAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDASSK 384

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPRDLLS SK+ TSSLLSPVKF+PGNKSVTLLMPGF KSEIKLYQYRGGSELLVEAGDQR
Sbjct: 385 GPRDLLSLSKSVTSSLLSPVKFDPGNKSVTLLMPGFGKSEIKLYQYRGGSELLVEAGDQR 444

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAK TDR LVITMR
Sbjct: 445 RVISLPKEIQGKVGGAKLTDRCLVITMR 472

BLAST of Tan0016523 vs. NCBI nr
Match: XP_008446550.1 (PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo])

HSP 1 Score: 748.8 bits (1932), Expect = 2.7e-212
Identity = 382/448 (85.27%), Positives = 405/448 (90.40%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFS SFFGNPIPISIRT T PC  + + ++ASK+  +VSSQNPTR+LTFLGKGGS
Sbjct: 1   MASSLLFSPSFFGNPIPISIRTRTPPCSTRIIILQASKQTMDVSSQNPTRLLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIGNSPVEC HNLSAVRLETTQM
Sbjct: 61  GKTTSAVFAAQHFALSGLRTCLVIRNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLERF+GFSGIM QRDQ
Sbjct: 121 LLEPLKRLKQADSRLNMTQGVLEGVVGEELAVLPGMDSIFSILQLERFVGFSGIMGQRDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           K KYDIVIYDG+CTEET+RMIGATSK RLYLKYL+SIAEKTDLGRLATPSILR VDEAMS
Sbjct: 181 KDKYDIVIYDGVCTEETIRMIGATSKIRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMS 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+IVMDPTSPASVQSALRYWGC
Sbjct: 241 ISRPGSHLGGRTSTDIWETLEHVLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQFS GS VDWNTVLRDASSK
Sbjct: 301 TIQAGAQICGALAFTSSHFNAEASASLKEKFSPLSLAFIPQFSIGSSVDWNTVLRDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSEIKLYQYRGGSELLVEAGDQR
Sbjct: 361 GPRDLLSSSKSLTSSLIPPVKFDPGNKSVTLLMPGFGKSEIKLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. ExPASy TrEMBL
Match: A0A6J1ISG8 (uncharacterized protein At1g26090, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111478864 PE=3 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 7.6e-221
Identity = 396/448 (88.39%), Positives = 419/448 (93.53%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGS
Sbjct: 1   MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG+SPVEC HNLSAVRLETTQM
Sbjct: 61  GKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGSSPVECSHNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQ DQ
Sbjct: 121 LLEPLKRLKQADSPLNMTQGTLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQTDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           KAKYDIV+YDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAM+
Sbjct: 181 KAKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMN 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SALRYWGC
Sbjct: 241 ISSPGSHLSGRTSTDTWQALEHMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSALRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQISGAFASISS LDAES ARLKENF PL LAFMPQ S GSPVDWNTVL DASSK
Sbjct: 301 TIQAGAQISGAFASISSGLDAESAARLKENFLPLPLAFMPQISVGSPVDWNTVLPDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPR+LLS+SK+H+S+LLSPVKF+PGNKSVTLLMPGFEKSEI+LYQYRGGSELLVEAGDQR
Sbjct: 361 GPRNLLSSSKSHSSNLLSPVKFDPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. ExPASy TrEMBL
Match: A0A6J1GY43 (uncharacterized protein At1g26090, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111458379 PE=3 SV=1)

HSP 1 Score: 773.5 bits (1996), Expect = 4.9e-220
Identity = 394/448 (87.95%), Positives = 416/448 (92.86%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGS
Sbjct: 1   MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIGNSPVEC  NLSAVRLETTQM
Sbjct: 61  GKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQ
Sbjct: 121 LLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           K KYDIV+YDGICTEET+RMIGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM 
Sbjct: 181 KPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMK 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS LS RTSTD W+ALERMLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Sbjct: 241 ISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQISGAFASISS LDAES ARLKENFSPLSL FMPQ S GSPVDWNTVL DASSK
Sbjct: 301 TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPR+LLS+SK+H+S+L SPVKFNPGNKSVTLLMPGFEKSEI+LYQYRGGSELLVEAGDQR
Sbjct: 361 GPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. ExPASy TrEMBL
Match: A0A1S3BET7 (uncharacterized protein At1g26090, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103489245 PE=3 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 1.3e-212
Identity = 382/448 (85.27%), Positives = 405/448 (90.40%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGS 60
           MASSLLFS SFFGNPIPISIRT T PC  + + ++ASK+  +VSSQNPTR+LTFLGKGGS
Sbjct: 1   MASSLLFSPSFFGNPIPISIRTRTPPCSTRIIILQASKQTMDVSSQNPTRLLTFLGKGGS 60

Query: 61  GKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQM 120
           GKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIGNSPVEC HNLSAVRLETTQM
Sbjct: 61  GKTTSAVFAAQHFALSGLRTCLVIRNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQM 120

Query: 121 LLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQ 180
           LLEPLK LKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLERF+GFSGIM QRDQ
Sbjct: 121 LLEPLKRLKQADSRLNMTQGVLEGVVGEELAVLPGMDSIFSILQLERFVGFSGIMGQRDQ 180

Query: 181 KAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMS 240
           K KYDIVIYDG+CTEET+RMIGATSK RLYLKYL+SIAEKTDLGRLATPSILR VDEAMS
Sbjct: 181 KDKYDIVIYDGVCTEETIRMIGATSKIRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMS 240

Query: 241 ISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC 300
           IS PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+IVMDPTSPASVQSALRYWGC
Sbjct: 241 ISRPGSHLGGRTSTDIWETLEHVLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGC 300

Query: 301 TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSK 360
           TIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQFS GS VDWNTVLRDASSK
Sbjct: 301 TIQAGAQICGALAFTSSHFNAEASASLKEKFSPLSLAFIPQFSIGSSVDWNTVLRDASSK 360

Query: 361 GPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQR 420
           GPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSEIKLYQYRGGSELLVEAGDQR
Sbjct: 361 GPRDLLSSSKSLTSSLIPPVKFDPGNKSVTLLMPGFGKSEIKLYQYRGGSELLVEAGDQR 420

Query: 421 RVISLPKEIQGKVGGAKFTDRSLVITMR 449
           RVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 RVISLPKEIQGKVGGAKFMDRSLVITMR 448

BLAST of Tan0016523 vs. ExPASy TrEMBL
Match: A0A6J1D944 (uncharacterized protein At1g26090, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018739 PE=3 SV=1)

HSP 1 Score: 740.7 bits (1911), Expect = 3.5e-210
Identity = 384/452 (84.96%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPIS--IRTTTAPC--RRKSLAVEASKEITNVSSQNPTRMLTFLG 60
           MASSLL+S SFFGNPIPIS  IRT  A    RR++L V++SKEI +   Q PTR+LTFLG
Sbjct: 1   MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLG 60

Query: 61  KGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLE 120
           KGGSGKT+SAVFAAQHFAL+G RTCL IHNQDPT EYLLDCKIGNSPVECGHNLSAVRLE
Sbjct: 61  KGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLE 120

Query: 121 TTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMA 180
           TTQMLLEPLK L+QADSRLNMTQGVLEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MA
Sbjct: 121 TTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMA 180

Query: 181 QRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVD 240
           QRD+KA YDIVIYDGI TEET+R++GA SKARLYLKY++S AEKTDLGRLATPSILR VD
Sbjct: 181 QRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVD 240

Query: 241 EAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALR 300
           EAM IS PGS LS RTSTDIWEALERMLE+ SSAFSE SKF CFIVMDPTSPASVQSALR
Sbjct: 241 EAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR 300

Query: 301 YWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRD 360
           YWGCTIQAGAQISGAFA ISS LDAES++RLKENFSPLSLAFMP+FS GSPVDWNTVL D
Sbjct: 301 YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHD 360

Query: 361 ASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEA 420
           ASSKGPRDLLS+SK+H SSLLSPVKF+PGN+SVTL MPGFEKSEIKLYQYRGGSELLVEA
Sbjct: 361 ASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQYRGGSELLVEA 420

Query: 421 GDQRRVISLPKEIQGKVGGAKFTDRSLVITMR 449
           GDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 421 GDQRRVISLPKEIQGKVGGAKFMDRSLVITMR 449

BLAST of Tan0016523 vs. ExPASy TrEMBL
Match: A0A5A7STS2 (ArsA_ATPase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G005370 PE=3 SV=1)

HSP 1 Score: 689.9 bits (1779), Expect = 7.1e-195
Identity = 354/408 (86.76%), Positives = 372/408 (91.18%), Query Frame = 0

Query: 42  NVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGN 101
           +VSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIGN
Sbjct: 2   DVSSQNPTRLLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIRNQDPTPEYLLDCKIGN 61

Query: 102 SPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFS 161
           SPVEC HNLSAVRLETTQMLLEPLK LKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS
Sbjct: 62  SPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELAVLPGMDSIFS 121

Query: 162 VLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKT 221
           +LQLERF+GFSGIM QRDQK KYDIVIYDG+CTEET+RMIGATSK RLYLKYL+SIAEKT
Sbjct: 122 ILQLERFVGFSGIMGQRDQKDKYDIVIYDGVCTEETIRMIGATSKIRLYLKYLRSIAEKT 181

Query: 222 DLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFI 281
           DLGRLATPSILR VDEAMSIS PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+I
Sbjct: 182 DLGRLATPSILRLVDEAMSISRPGSHLGGRTSTDIWETLEHVLEKGSSAFAEPRKFSCYI 241

Query: 282 VMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ 341
           VMDPTSPASVQSALRYWGCTIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQ
Sbjct: 242 VMDPTSPASVQSALRYWGCTIQAGAQICGALAFTSSHFNAEASASLKEKFSPLSLAFIPQ 301

Query: 342 FSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEI 401
           FS GS VDWNTVLRDASSKGPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSEI
Sbjct: 302 FSIGSSVDWNTVLRDASSKGPRDLLSSSKSLTSSLIPPVKFDPGNKSVTLLMPGFGKSEI 361

Query: 402 KLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR 449
           KLYQ R GGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Sbjct: 362 KLYQARSGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR 409

BLAST of Tan0016523 vs. TAIR 10
Match: AT1G26090.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 498.8 bits (1283), Expect = 4.5e-141
Identity = 264/456 (57.89%), Positives = 336/456 (73.68%), Query Frame = 0

Query: 1   MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLT 60
           + +S L  +S   N +PI +RT T    RK     +A  +S+++ +    SSQ  T+ +T
Sbjct: 4   LVNSSLTCSSLTLNLLPI-LRTETPSLSRKRRAAYVAATSSRDVNDTAADSSQKLTKFVT 63

Query: 61  FLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAV 120
           FLGKGGSGKTT+AVFAAQH+AL+G  TCL IHNQDP+ E+LL  KIG SP     NLS +
Sbjct: 64  FLGKGGSGKTTAAVFAAQHYALAGLSTCLVIHNQDPSAEFLLGSKIGTSPTLINDNLSVI 123

Query: 121 RLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSG 180
           RLETT+MLLEPLK LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LER +GF  
Sbjct: 124 RLETTKMLLEPLKQLKQADARLNMTQGVLEGVVGEELGVLPGMDSIFSMLELERLVGFFR 183

Query: 181 IMAQRDQKAK-YDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSIL 240
              +++ K K +D++IYDGI TEET+RMIG +SK RLY KYL+S+AEKTDLGRL +PSI+
Sbjct: 184 QATRKNHKGKPFDVIIYDGISTEETLRMIGLSSKTRLYAKYLRSLAEKTDLGRLTSPSIM 243

Query: 241 RFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ 300
           RFVDE+M+I+   SP    TS  +W+ LER LE  +SA+ +  +F  F+VMDP +P SV+
Sbjct: 244 RFVDESMNINSNKSPFDGMTSPAMWDTLERFLETGASAWRDPERFRSFLVMDPNNPMSVK 303

Query: 301 SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNT 360
           +ALRYWGCT+QAG+ +SGAFA  SS L ++     K +F PL  A      + + +DW+ 
Sbjct: 304 AALRYWGCTVQAGSHVSGAFAISSSHLTSQI---PKADFVPLPFASASVPFTITGLDWDK 363

Query: 361 VLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSEL 420
           +L D ++   R+LLS + +H +SL   V F+   K VTL MPGFEKSEIKLYQYRGGSEL
Sbjct: 364 ILLDQANSSIRELLSETVSHGTSLTQTVMFDTAKKLVTLFMPGFEKSEIKLYQYRGGSEL 423

Query: 421 LVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR 449
           L+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Sbjct: 424 LIEAGDQRRVIHLPSQIQGKVGGAKFVDRSLIVTMR 455

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q6DYE4	6.3e-140	57.89	Uncharacterized protein At1g26090, chloroplastic OS=Arabidopsis thaliana OX=3702...
Q46465	4.9e-15	22.65	Putative arsenical pump-driving ATPase OS=Prosthecochloris vibrioformis OX=1098 ...
Q46366	1.1e-14	22.41	Putative arsenical pump-driving ATPase OS=Chlorobaculum tepidum (strain ATCC 496...
Q55794	1.5e-11	22.25	Putative arsenical pump-driving ATPase OS=Synechocystis sp. (strain PCC 6803 / K...
O50593	7.8e-05	26.06	Arsenical pump-driving ATPase OS=Acidiphilium multivorum (strain DSM 11245 / JCM...

Match Name	E-value	Identity	Description
XP_022979170.1	1.6e-220	88.39	uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]
XP_023529730.1	1.6e-220	87.95	uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo]
XP_022956773.1	1.0e-219	87.95	uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]
XP_038891424.1	1.3e-219	88.84	uncharacterized protein At1g26090, chloroplastic [Benincasa hispida]
XP_008446550.1	2.7e-212	85.27	PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo]

Match Name	E-value	Identity	Description
A0A6J1ISG8	7.6e-221	88.39	uncharacterized protein At1g26090, chloroplastic OS=Cucurbita maxima OX=3661 GN=...
A0A6J1GY43	4.9e-220	87.95	uncharacterized protein At1g26090, chloroplastic OS=Cucurbita moschata OX=3662 G...
A0A1S3BET7	1.3e-212	85.27	uncharacterized protein At1g26090, chloroplastic OS=Cucumis melo OX=3656 GN=LOC1...
A0A6J1D944	3.5e-210	84.96	uncharacterized protein At1g26090, chloroplastic OS=Momordica charantia OX=3673 ...
A0A5A7STS2	7.1e-195	86.76	ArsA_ATPase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=...

Match Name	E-value	Identity	Description
AT1G26090.1	4.5e-141	57.89	P-loop containing nucleoside triphosphate hydrolases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR027417	P-loop containing nucleoside triphosphate hydrolase	GENE3D	3.40.50.300	-	coord: 49..361; e-value: 6.2E-48; score: 165.7
IPR027417	P-loop containing nucleoside triphosphate hydrolase	SUPERFAMILY	52540	P-loop containing nucleoside triphosphate hydrolases	coord: 46..260
IPR008978	HSP20-like chaperone	GENE3D	2.60.40.790	-	coord: 378..448; e-value: 1.6E-18; score: 67.9
IPR025723	Anion-transporting ATPase-like domain	PFAM	PF02374	ArsA_ATPase	coord: 49..285; e-value: 1.4E-16; score: 60.6
IPR040612	ArsA, HSP20-like domain	PFAM	PF17886	ArsA_HSP20	coord: 386..447; e-value: 1.7E-14; score: 53.1
None	No IPR available	PANTHER	PTHR43868	OS02G0711200 PROTEIN	coord: 24..448
None	No IPR available	CDD	cd02035	ArsA	coord: 50..344; e-value: 9.87044E-33; score: 122.616

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0016523.1	Tan0016523.1	mRNA