Cla97C01G008700 (gene) Watermelon (97103) v2

Name: Cla97C01G008700
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: WAT1-related protein At3g02690, chloroplastic
Location: Cla97Chr01 : 9279120 .. 9282891 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCCCTGGTGGGTTCCTTCTTCAACCCAAGGAATCGTAGCAGAGCCAAAGACAACAACAATGGCGGGCTGTTTAGCCACTGCCACTCTTACACCCACTTCTCCATTCAACTCTCGCCCTCTTCTTCATTTCAGTTTTAGCACACAATTTCTCGCACCGGAAATTTCCTCCGCCGCTCCAATTCTCCGGCAACGATTCCCCTTTCAATTAGGTTTCAGAAGGTATGGTGAGAATGCCCGGTTTCGTTTCGATTATGTTGCAATTCCAGTGGCGAATTGCACCAGAAGTGGTGCAGATACGGAATTGGATTTGACACAGTCCATCGATTGCGTTGGGACTGCGCAGGATGTGGAGTGTCTGGTTTCCCCTACTGATGAAGATCCTTCATCTTCAGTCGGGGTGCCAGTGGAATTAGGGATTTCTTCCGAATTTGGTGGTGATGGTTCTACGGCAGTGTTGGAGAAAGCCTGGGAGTTTGCGGTGTTGGTTTCGCCGTTTTTCTTTTGGGGTACGGCCATGGTGGCGATGAAGGAGGTGCTTCCAAGGTCTGGTCCTTTTTTCGTTTCTGCCTTTCGTCTTATACCTGCCGGTTTCCTCTTGATTGCCTTTGCTGCTTTCCGTGGTCGCCCCTTTCCTTCTGGGTTTTCTGCTTGGATTTCCATCATTCTCTTTGCTCTCGTCGACGCTACATTCTTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCCGCAGGGTTGGGCAGTGTATGTTTCATTACAGATTTATGAAACTGCTAATTGATGTATGTATATGCCCTAAAAAAAAGAACAAAAATCAAGAACATAACTGAAGACAAGAAAATTATTTGTTTTAACTTCTTCTTGTTTATTTCGTTTGCTTGCTTGATTAGAGAATCCTTTAATATTGTATGCTTTTGATCTTTGAAGAAGATACAAAGATTTACATGGATATATAGTTACATGGAATTCATGTGGACAAACTTATGGAGATTGCTTTTCAGCGATAGTTATAGGTGCCAATGCATCCGACCCATATTTATCTACTTGGAGTTGAGTTCTCATGATAAGAACTTTTGTATTTATGATCTGTCTCTCATTTGTACCAACTGGGGAAATTTTACTTCAGCTTTCGTACGCGCTTTAGATTATCTTCCTTTTCATCTTTCCAATGACAAAAAAAAAAAAAAAAAAACGCGCATGCACATAGACACTTGACACTTTTCTCTTGACACCTCCGTACACATTCAAATATCATGTGGAAATTGTATCTATACAATTGTGAATTTACATTTTCTGTACATTACCTTGATTACAGGTAATAATTGATTCTCAACCATTAACCGTGGCAGTGCTTGCAGCCTTCTTATTTGGTGAGTCCATCGGTTTAGTTGGAGCCGCTGGACTTGTACTTGGAGTTTTAGGACTTTTACTTCTCGAGGTACTCCTTTTTCCCCTTGGGGGCCCTTGTGTTAGTGTTACTGCGTTGCAGATGCATTATATAACTGAATGGTTATTATAGAGATGTGCGTGACTAATGAGACTAACAAAGTGTAACTCAACAAACTCAATTTATGTAATTTGTGGCGGGTTTGGGGAGCTTTCGGATGATGAGACCCAAATTTTTTAGGAATGTAAATCTCAATTTTATAATACTCTGCTCTCATAATGCAGAGAGGCTCTGAATATAAAATTTATACTGAATTTCATGAACTACTACATGCACTTCTGCATCTTTAAAACTGGGATATGATCAAATCTTTTAATATTTCAATCTCGCTATTCATATTCTTTGTAAACTTTGATTTAACCATCTTATGAATGCTCAGCTTGTCTTCCTGGCCTTTTTAAATAAACCACTTTTGTTTTCTCCTCTGCAATAGGATTTTTTTTTTTAATATATATATATATATATATATATATTTTTTATGTGCAGTGTGTATGAATGTCTTTATAATCTTTTAAACATTATTTAAGTACGATTGTTCAAGAATTGA
ATTATATTATATGAAAAGAGGTTACATTCATGAAATTACGAGACTAATCTATCTGCACTTTATCAAAGGATAGCTTCTGTTGATATGGCAGTCGGTTTGACAGGTTCCTTCACTTACCTTGGATGCAAATAGCTTTTCATTGTGGGGAAGTGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCATGGCAGTAGGTACTGTCATGGTCCGCTGGGTTTCCAAGTATTCTGATCCTATTATGGCAACTGGATGGGTAGGGTCAACTGTTGAAGTGATCTCTTATGAAATTACATCACTGTTCTTTCTTATGCTTCTTGTAATACAAATTATTTTGGTGCACCCTTTTTTCATCCTTATTAAAAATGTATATGATCGCATGATTGAAGAAAATGAGTTCTCGATCGACCTTCAGCACATGTTCATAGTCATCATGCGAAAATTTCCTGCTTCCTTCACCCAGTTAACCATCATGCATATCATGATTTTGATAGATCAGAAGAGAGTACAGTTATGAGTTTAATACATGGCGGCTCTGCATTGTACCTTCCCTAATTTCTAATTCCAAGGAAGAGTTGTCAGTTTAGCATGCAGGCAATTAAATACTGAAAGAATCAAACTTCAATGGAATTCCTTATTGGAAGTCATGAAGTACTTGAAGGATGGCTTATACAAGGCTTCAGTCTATTGAAATGAAGGAATTAAAGCAAGTAACTAAATAAATGTAATTTATTTTTTATTTTTTATGTTGCAGCACATGGTGATTGGTGGTCTCCCGCTTTTGATGATCTGTATCCTTAATCATGATCCTGCAGTAAGTGGGAGTCTTAAAGATTTTACAACAAATGATATACTAGCACTCCTTTATGCATCCATTTTTGGGAGTGCTGTTAGCTACGGTTCATTCTTCTTTAGTGCAACAAAAGGTTTCACACTTTCCCCTCCTAGCATTCTCTAGCTTGTAAGGGGCGTTTTGCTTTGAAGTTCCCATTTTATCATCTTGTTGCTTTATTATTATTATTCTTATCGTTACACGTGACTTTCTACTTTGGTTGATAGCTACGATAGATGTTTGGTTCCATGTAAAACATAGTAAAAGACCATGTAAAGGTACAAGCGTGGAAAATATTATAAAAAACAAAGAAGACTAAGAAGTAATGAACCATTTTCAGCTAACACGCACGCAACAATAAACCTGCTGAACTATGATCTATTAGAAATTGAGTAAGTTGATGCAATTATGCTATTTAATTATCAGAATTTTGAATTTTATAAGTATGATCAAAGAAATAAGGACAACATCACCTTATAGATCCTCACCGTGCCAGATTAATAGGCATTTTTAAATCCACTTAGCTTCAAAGTTCACTGATTTTCTGATATGATGTGTGCATCTGATTAAATGTTTAAATGCACCATTATACTTTTTAGACATTCATGGCTTGTTGTTACGTAATAATTACTTTTGTGCTAATTCTCCGTTGTGGATGACTTTTTACTAATAGGTAGTTTGACAAAGCTTAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTAAGGTTCTACATGCTATTTTTCAATAGGGGACAAATACAATGAGTGAAATATGAAATATCCTCTTTACAGACATTTCAATGTTTCTTCTGATCGTTTATGTCAGGTTTCTATATTTGGGAGAGACATTTTCACCTATTCAACTGGTTGGAGCCGTTGTTACTGTGGTTGCTATATACGTAGTCAACTATGGCAGTAGTTTGGAATGA

mRNA sequence

ATGGCGCCCTGGTGGGTTCCTTCTTCAACCCAAGGAATCGTAGCAGAGCCAAAGACAACAACAATGGCGGGCTGTTTAGCCACTGCCACTCTTACACCCACTTCTCCATTCAACTCTCGCCCTCTTCTTCATTTCAGTTTTAGCACACAATTTCTCGCACCGGAAATTTCCTCCGCCGCTCCAATTCTCCGGCAACGATTCCCCTTTCAATTAGGTTTCAGAAGGTATGGTGAGAATGCCCGGTTTCGTTTCGATTATGTTGCAATTCCAGTGGCGAATTGCACCAGAAGTGGTGCAGATACGGAATTGGATTTGACACAGTCCATCGATTGCGTTGGGACTGCGCAGGATGTGGAGTGTCTGGTTTCCCCTACTGATGAAGATCCTTCATCTTCAGTCGGGGTGCCAGTGGAATTAGGGATTTCTTCCGAATTTGGTGGTGATGGTTCTACGGCAGTGTTGGAGAAAGCCTGGGAGTTTGCGGTGTTGGTTTCGCCGTTTTTCTTTTGGGGTACGGCCATGGTGGCGATGAAGGAGGTGCTTCCAAGGTCTGGTCCTTTTTTCGTTTCTGCCTTTCGTCTTATACCTGCCGGTTTCCTCTTGATTGCCTTTGCTGCTTTCCGTGGTCGCCCCTTTCCTTCTGGGTTTTCTGCTTGGATTTCCATCATTCTCTTTGCTCTCGTCGACGCTACATTCTTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCCGCAGGGTTGGGCAGTGTAATAATTGATTCTCAACCATTAACCGTGGCAGTGCTTGCAGCCTTCTTATTTGGTGAGTCCATCGGTTTAGTTGGAGCCGCTGGACTTGTACTTGGAGTTTTAGGACTTTTACTTCTCGAGGTTCCTTCACTTACCTTGGATGCAAATAGCTTTTCATTGTGGGGAAGTGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCATGGCAGTAGGTACTGTCATGGTCCGCTGGGTTTCCAAGTATTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGATGATCTGTATCCTTAATCATGATCCTGCAGTAAGTGGGAGTCTTAAAGATTTTACAACAAATGATATACTAGCACTCCTTTATGCATCCATTTTTGGGAGTGCTGTTAGCTACGGTTCATTCTTCTTTAGTGCAACAAAAGGTAGTTTGACAAAGCTTAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTTTCACCTATTCAACTGGTTGGAGCCGTTGTTACTGTGGTTGCTATATACGTAGTCAACTATGGCAGTAGTTTGGAATGA

Coding sequence (CDS)

ATGGCGCCCTGGTGGGTTCCTTCTTCAACCCAAGGAATCGTAGCAGAGCCAAAGACAACAACAATGGCGGGCTGTTTAGCCACTGCCACTCTTACACCCACTTCTCCATTCAACTCTCGCCCTCTTCTTCATTTCAGTTTTAGCACACAATTTCTCGCACCGGAAATTTCCTCCGCCGCTCCAATTCTCCGGCAACGATTCCCCTTTCAATTAGGTTTCAGAAGGTATGGTGAGAATGCCCGGTTTCGTTTCGATTATGTTGCAATTCCAGTGGCGAATTGCACCAGAAGTGGTGCAGATACGGAATTGGATTTGACACAGTCCATCGATTGCGTTGGGACTGCGCAGGATGTGGAGTGTCTGGTTTCCCCTACTGATGAAGATCCTTCATCTTCAGTCGGGGTGCCAGTGGAATTAGGGATTTCTTCCGAATTTGGTGGTGATGGTTCTACGGCAGTGTTGGAGAAAGCCTGGGAGTTTGCGGTGTTGGTTTCGCCGTTTTTCTTTTGGGGTACGGCCATGGTGGCGATGAAGGAGGTGCTTCCAAGGTCTGGTCCTTTTTTCGTTTCTGCCTTTCGTCTTATACCTGCCGGTTTCCTCTTGATTGCCTTTGCTGCTTTCCGTGGTCGCCCCTTTCCTTCTGGGTTTTCTGCTTGGATTTCCATCATTCTCTTTGCTCTCGTCGACGCTACATTCTTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCCGCAGGGTTGGGCAGTGTAATAATTGATTCTCAACCATTAACCGTGGCAGTGCTTGCAGCCTTCTTATTTGGTGAGTCCATCGGTTTAGTTGGAGCCGCTGGACTTGTACTTGGAGTTTTAGGACTTTTACTTCTCGAGGTTCCTTCACTTACCTTGGATGCAAATAGCTTTTCATTGTGGGGAAGTGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCATGGCAGTAGGTACTGTCATGGTCCGCTGGGTTTCCAAGTATTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGATGATCTGTATCCTTAATCATGATCCTGCAGTAAGTGGGAGTCTTAAAGATTTTACAACAAATGATATACTAGCACTCCTTTATGCATCCATTTTTGGGAGTGCTGTTAGCTACGGTTCATTCTTCTTTAGTGCAACAAAAGGTAGTTTGACAAAGCTTAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTTTCACCTATTCAACTGGTTGGAGCCGTTGTTACTGTGGTTGCTATATACGTAGTCAACTATGGCAGTAGTTTGGAATGA

Protein sequence

MAPWWVPSSTQGIVAEPKTTTMAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAAPILRQRFPFQLGFRRYGENARFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSVGVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSLE
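
The protein sequence above should be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code; the `translate` helper and the compact codon table are illustrative and not part of the database record. Only the first 27 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (third position varies fastest). Assumption: the nuclear code applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed above
cds_start = "ATGGCGCCCTGGTGGGTTCCTTCTTCA"
cds_end = "AGTAGTTTGGAATGA"

print(translate(cds_start))  # MAPWWVPSS
print(translate(cds_end))    # SSLE
```

Both fragments agree with the start (MAPWWVPSS...) and end (...SSLE) of the listed protein sequence. Running the full CDS through the same function would be the complete check.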
BLAST of Cla97C01G008700 vs. NCBI nr
Match: XP_004140354.1 (PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis sativus] >KGN51074.1 hypothetical protein Csa_5G429960 [Cucumis sativus])

HSP 1 Score: 760.4 bits (1962), Expect = 3.5e-216
Identity = 397/432 (91.90%), Positives = 408/432 (94.44%), Query Frame = 0

Query: 22  MAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAAPILRQRFPFQLGFR-RYGENA 81
           MAGCLATATLTPTSP NS P  H        AP ISSAAPILR+R PFQLGFR RY EN+
Sbjct: 1   MAGCLATATLTPTSPSNSPPFFH--------APPISSAAPILRRRLPFQLGFRTRYDENS 60

Query: 82  RFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSVGVPVELG 141
           RFRF YVAIPVANCTRSG DTELD T+SIDCVGTAQDVEC+VSP DEDPSSS+GVP++LG
Sbjct: 61  RFRFHYVAIPVANCTRSGGDTELDFTESIDCVGTAQDVECVVSPNDEDPSSSIGVPLKLG 120

Query: 142 ISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 201
           ISS++ GDGS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Sbjct: 121 ISSDYSGDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 180

Query: 202 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 261
           LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV
Sbjct: 181 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 240

Query: 262 AVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMA 321
           AVLAAFLFGES+GLVGAAGLVLGVLGLLLLEVPSLT DANSFSLWGSGEWWMFLAAQSMA
Sbjct: 241 AVLAAFLFGESLGLVGAAGLVLGVLGLLLLEVPSLTFDANSFSLWGSGEWWMFLAAQSMA 300

Query: 322 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 381
           VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA
Sbjct: 301 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 360

Query: 382 SIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 441
           SIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV
Sbjct: 361 SIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 420

Query: 442 AIYVVNYGSSLE 453
           AIYVVNYGS LE
Sbjct: 421 AIYVVNYGSGLE 424
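
The percentages in a hit summary follow directly from the reported counts: each is the number of matching alignment columns divided by the alignment length. A minimal sketch, using the counts reported for the Cucumis sativus hit above (the `percent` helper is illustrative only):

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / aligned_length:.2f}%"


# Counts from the XP_004140354.1 hit: 432 aligned columns
print(percent(397, 432))  # identical residues -> 91.90%
print(percent(408, 432))  # identical or conservative substitutions -> 94.44%
```

The denominator is the alignment length including gap columns (432 here), not the length of the query (453 residues) or of the subject (424 residues).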

BLAST of Cla97C01G008700 vs. NCBI nr
Match: XP_008463172.1 (PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis melo])

HSP 1 Score: 744.6 bits (1921), Expect = 2.0e-211
Identity = 390/432 (90.28%), Positives = 402/432 (93.06%), Query Frame = 0

Query: 22  MAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAAPILRQRFPFQLGF-RRYGENA 81
           MA CLATATLTPTSP NS P         F AP ISSA PILR+R PF+LGF  RY EN 
Sbjct: 1   MAVCLATATLTPTSPSNSPPF--------FRAPSISSAPPILRRRLPFRLGFGTRYDENP 60

Query: 82  RFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSVGVPVELG 141
            FRF YVAIPVANCTRSG DTELD T+SIDCVGTAQDVEC+VSPTDEDPSSS+GVP+ELG
Sbjct: 61  LFRFHYVAIPVANCTRSGGDTELDFTESIDCVGTAQDVECVVSPTDEDPSSSIGVPLELG 120

Query: 142 ISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 201
           ISS++ G GS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Sbjct: 121 ISSDYSGGGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 180

Query: 202 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 261
           L+AFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV
Sbjct: 181 LVAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 240

Query: 262 AVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMA 321
           AVLAAFLFGES+GL+GAAGLVLGV GLLLLEVPSLT DANSFSLWGSGEWWMFLAAQSMA
Sbjct: 241 AVLAAFLFGESLGLIGAAGLVLGVFGLLLLEVPSLTFDANSFSLWGSGEWWMFLAAQSMA 300

Query: 322 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 381
           VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA
Sbjct: 301 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 360

Query: 382 SIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 441
           SIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV
Sbjct: 361 SIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 420

Query: 442 AIYVVNYGSSLE 453
           AIY VNYGSSLE
Sbjct: 421 AIYAVNYGSSLE 424

BLAST of Cla97C01G008700 vs. NCBI nr
Match: XP_022993106.1 (WAT1-related protein At3g02690, chloroplastic [Cucurbita maxima])

HSP 1 Score: 674.5 bits (1739), Expect = 2.5e-190
Identity = 365/439 (83.14%), Positives = 380/439 (86.56%), Query Frame = 0

Query: 22  MAGCLATATLTPT-------SPFNSRPLLHFSFSTQFLAPEISSAAPIL-RQRFPFQLGF 81
           MAGCLATATLT T       SP N RPL  FSF  Q +   ISS APIL R+R PFQLGF
Sbjct: 1   MAGCLATATLTTTSPQLSSYSPSNCRPL--FSFRRQIVGLGISSVAPILRRRRVPFQLGF 60

Query: 82  RRYGENARFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSV 141
           RRY E  RFR DYVAIPV+NCTRSGADT+LD T+SIDCVGTAQ                 
Sbjct: 61  RRYDEKGRFRVDYVAIPVSNCTRSGADTDLDFTESIDCVGTAQXXXXXXXXXXXXXXXXX 120

Query: 142 GVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 201
                     +  GDGS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR
Sbjct: 121 XF--------DNDGDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 180

Query: 202 LIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVII 261
           L+PAGFLLIAFAAFRGRPFPSGFSAWISI+LFALVDAT FQGFLAQGLQRTSAGLGSVII
Sbjct: 181 LVPAGFLLIAFAAFRGRPFPSGFSAWISILLFALVDATLFQGFLAQGLQRTSAGLGSVII 240

Query: 262 DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMF 321
           DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGL+LLEVPSLTLDA+SFSLWGSGEWWMF
Sbjct: 241 DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLILLEVPSLTLDASSFSLWGSGEWWMF 300

Query: 322 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTND 381
           LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNH+PAVSGSL+DFTTND
Sbjct: 301 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHEPAVSGSLQDFTTND 360

Query: 382 ILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLV 441
           ILAL YASIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQLV
Sbjct: 361 ILALFYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASIFGFLYLGETFSPIQLV 420

Query: 442 GAVVTVVAIYVVNYGSSLE 453
           GAVVTVV+IYVVNYGSSLE
Sbjct: 421 GAVVTVVSIYVVNYGSSLE 429

BLAST of Cla97C01G008700 vs. NCBI nr
Match: XP_023550684.1 (WAT1-related protein At3g02690, chloroplastic [Cucurbita pepo subsp. pepo] >XP_023550685.1 WAT1-related protein At3g02690, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 671.8 bits (1732), Expect = 1.7e-189
Identity = 364/439 (82.92%), Positives = 379/439 (86.33%), Query Frame = 0

Query: 22  MAGCLATATLTPT-------SPFNSRPLLHFSFSTQFLAPEISSAAPIL-RQRFPFQLGF 81
           MAGCLATATLT T       SP N RPL  FSF  Q +   ISS APIL R+R PFQLGF
Sbjct: 1   MAGCLATATLTTTSPQLSSSSPSNCRPL--FSFRRQIVGLGISSVAPILRRRRVPFQLGF 60

Query: 82  RRYGENARFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSV 141
           RRY E  RFR DYVAIPV+NCTRSGADT+LD T+SIDCVGTAQ                 
Sbjct: 61  RRYDEKGRFRVDYVAIPVSNCTRSGADTDLDFTESIDCVGTAQXXXXXXXXXXXXXXXXX 120

Query: 142 GVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 201
                     +  GDGS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR
Sbjct: 121 XF--------DNDGDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 180

Query: 202 LIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVII 261
           L+PAGFLLIAFAAFRGRPFPSGFSAWISI+LFALVDAT FQGFLAQGLQRTSAGLGSVII
Sbjct: 181 LVPAGFLLIAFAAFRGRPFPSGFSAWISILLFALVDATLFQGFLAQGLQRTSAGLGSVII 240

Query: 262 DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMF 321
           DSQPLTVA+LAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDA+SFSLWGSGEWWMF
Sbjct: 241 DSQPLTVALLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDASSFSLWGSGEWWMF 300

Query: 322 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTND 381
           LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL IC LNH+PAVSGSL+DFTTND
Sbjct: 301 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICFLNHEPAVSGSLQDFTTND 360

Query: 382 ILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLV 441
           ILAL YASIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQLV
Sbjct: 361 ILALFYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASIFGFLYLGETFSPIQLV 420

Query: 442 GAVVTVVAIYVVNYGSSLE 453
           GAVVTVV+IYVVNYGSSLE
Sbjct: 421 GAVVTVVSIYVVNYGSSLE 429

BLAST of Cla97C01G008700 vs. NCBI nr
Match: XP_022939282.1 (WAT1-related protein At3g02690, chloroplastic [Cucurbita moschata] >XP_022939283.1 WAT1-related protein At3g02690, chloroplastic [Cucurbita moschata])

HSP 1 Score: 669.1 bits (1725), Expect = 1.1e-188
Identity = 363/439 (82.69%), Positives = 376/439 (85.65%), Query Frame = 0

Query: 22  MAGCLATATLTPT-------SPFNSRPLLHFSFSTQFLAPEISSAAPIL-RQRFPFQLGF 81
           MAGCLAT TLT T       SP N RPL  FSF  Q +   ISS APIL R+R PFQLGF
Sbjct: 1   MAGCLATVTLTTTSLQLSSSSPSNCRPL--FSFRRQIVGLGISSVAPILRRRRVPFQLGF 60

Query: 82  RRYGENARFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSV 141
           RRY E  RFR DYVAIP  NCTRSGADT+LD T+SIDCVGTAQ                 
Sbjct: 61  RRYDEKGRFRVDYVAIPALNCTRSGADTDLDFTESIDCVGTAQXXXXXXXXXXXXXXXXX 120

Query: 142 GVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 201
                     +  GDGS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR
Sbjct: 121 XF--------DNDGDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFR 180

Query: 202 LIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVII 261
           L+PAGFLLIAFAAFRGRPFPSGFSAWISI+LFALVDAT FQGFLAQGLQRTSAGLGSVII
Sbjct: 181 LVPAGFLLIAFAAFRGRPFPSGFSAWISILLFALVDATLFQGFLAQGLQRTSAGLGSVII 240

Query: 262 DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMF 321
           DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDA+SFSLWGSGEWWMF
Sbjct: 241 DSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDASSFSLWGSGEWWMF 300

Query: 322 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTND 381
           LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL IC LNH+PAVSGSL+DFTTND
Sbjct: 301 LAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICFLNHEPAVSGSLQDFTTND 360

Query: 382 ILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLV 441
           ILAL YASIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQLV
Sbjct: 361 ILALFYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASIFGFLYLGETFSPIQLV 420

Query: 442 GAVVTVVAIYVVNYGSSLE 453
           GAVVTVV+IYVVNYGSSLE
Sbjct: 421 GAVVTVVSIYVVNYGSSLE 429

BLAST of Cla97C01G008700 vs. TrEMBL
Match: tr|A0A0A0KQD4|A0A0A0KQD4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G429960 PE=4 SV=1)

HSP 1 Score: 760.4 bits (1962), Expect = 2.3e-216
Identity = 397/432 (91.90%), Positives = 408/432 (94.44%), Query Frame = 0

Query: 22  MAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAAPILRQRFPFQLGFR-RYGENA 81
           MAGCLATATLTPTSP NS P  H        AP ISSAAPILR+R PFQLGFR RY EN+
Sbjct: 1   MAGCLATATLTPTSPSNSPPFFH--------APPISSAAPILRRRLPFQLGFRTRYDENS 60

Query: 82  RFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSVGVPVELG 141
           RFRF YVAIPVANCTRSG DTELD T+SIDCVGTAQDVEC+VSP DEDPSSS+GVP++LG
Sbjct: 61  RFRFHYVAIPVANCTRSGGDTELDFTESIDCVGTAQDVECVVSPNDEDPSSSIGVPLKLG 120

Query: 142 ISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 201
           ISS++ GDGS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Sbjct: 121 ISSDYSGDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 180

Query: 202 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 261
           LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV
Sbjct: 181 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 240

Query: 262 AVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMA 321
           AVLAAFLFGES+GLVGAAGLVLGVLGLLLLEVPSLT DANSFSLWGSGEWWMFLAAQSMA
Sbjct: 241 AVLAAFLFGESLGLVGAAGLVLGVLGLLLLEVPSLTFDANSFSLWGSGEWWMFLAAQSMA 300

Query: 322 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 381
           VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA
Sbjct: 301 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 360

Query: 382 SIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 441
           SIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV
Sbjct: 361 SIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 420

Query: 442 AIYVVNYGSSLE 453
           AIYVVNYGS LE
Sbjct: 421 AIYVVNYGSGLE 424

BLAST of Cla97C01G008700 vs. TrEMBL
Match: tr|A0A1S3CK55|A0A1S3CK55_CUCME (WAT1-related protein At3g02690, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103501378 PE=4 SV=1)

HSP 1 Score: 744.6 bits (1921), Expect = 1.3e-211
Identity = 390/432 (90.28%), Positives = 402/432 (93.06%), Query Frame = 0

Query: 22  MAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAAPILRQRFPFQLGF-RRYGENA 81
           MA CLATATLTPTSP NS P         F AP ISSA PILR+R PF+LGF  RY EN 
Sbjct: 1   MAVCLATATLTPTSPSNSPPF--------FRAPSISSAPPILRRRLPFRLGFGTRYDENP 60

Query: 82  RFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVECLVSPTDEDPSSSVGVPVELG 141
            FRF YVAIPVANCTRSG DTELD T+SIDCVGTAQDVEC+VSPTDEDPSSS+GVP+ELG
Sbjct: 61  LFRFHYVAIPVANCTRSGGDTELDFTESIDCVGTAQDVECVVSPTDEDPSSSIGVPLELG 120

Query: 142 ISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 201
           ISS++ G GS AVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Sbjct: 121 ISSDYSGGGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL 180

Query: 202 LIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 261
           L+AFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV
Sbjct: 181 LVAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTV 240

Query: 262 AVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMA 321
           AVLAAFLFGES+GL+GAAGLVLGV GLLLLEVPSLT DANSFSLWGSGEWWMFLAAQSMA
Sbjct: 241 AVLAAFLFGESLGLIGAAGLVLGVFGLLLLEVPSLTFDANSFSLWGSGEWWMFLAAQSMA 300

Query: 322 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 381
           VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA
Sbjct: 301 VGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYA 360

Query: 382 SIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 441
           SIFGSAVSYGSFF+SATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV
Sbjct: 361 SIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV 420

Query: 442 AIYVVNYGSSLE 453
           AIY VNYGSSLE
Sbjct: 421 AIYAVNYGSSLE 424

BLAST of Cla97C01G008700 vs. TrEMBL
Match: tr|A0A2C9WKL9|A0A2C9WKL9_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G125800 PE=4 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 1.7e-137
Identity = 269/390 (68.97%), Positives = 305/390 (78.21%), Query Frame = 0

Query: 74  RRYGENARFRFDYVAIPVANCTRSGADTELD-----------LTQSIDCVGTAQDVECLV 133
           R +  NAR R     I + +CT S  + EL+            T   DCVGT  DVECLV
Sbjct: 50  RNHSYNARIR-RRRNIFIGSCTTSSKNVELESKSSDSNGDSSSTPDFDCVGTGLDVECLV 109

Query: 134 SPTDEDPSSSVGVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLP 193
           S     PSS     +             + +LE   E  VLVSPFFFWGTAMVAMKEVLP
Sbjct: 110 S---SSPSSETNGTMXXXXXXXXXXXXXSDLLEMMVETGVLVSPFFFWGTAMVAMKEVLP 169

Query: 194 RSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQ 253
             GPFFV+AFRLIPAG LL+AFAA +GRP PSGF+AW+SI LF LVDA  FQGFLA+GLQ
Sbjct: 170 LVGPFFVAAFRLIPAGLLLVAFAASKGRPLPSGFTAWLSIALFGLVDAACFQGFLAEGLQ 229

Query: 254 RTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSF 313
           RTSAGLGSVIIDSQPLTVAVLAA LFGESIGLVG AGLVLGV+GL+LLE+P+L +D ++F
Sbjct: 230 RTSAGLGSVIIDSQPLTVAVLAALLFGESIGLVGVAGLVLGVIGLILLELPALAIDESNF 289

Query: 314 SLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAV 373
           SLWGSGEWWM LAAQSMAVGTVMVRWV+KYSDP+MATGWHMVIGG+PL++I ILNHDPA 
Sbjct: 290 SLWGSGEWWMLLAAQSMAVGTVMVRWVTKYSDPVMATGWHMVIGGIPLVVISILNHDPAF 349

Query: 374 SGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLY 433
           SGSLK+ T +DILALLY SIFGSA+SYG +F+SATKGSLTKLSSLTFLTPMFAS+FGFLY
Sbjct: 350 SGSLKELTGSDILALLYTSIFGSAISYGVYFYSATKGSLTKLSSLTFLTPMFASIFGFLY 409

Query: 434 LGETFSPIQLVGAVVTVVAIYVVNYGSSLE 453
           LGETFSP QLVGA+VT++AIY+VNY  S E
Sbjct: 410 LGETFSPSQLVGAIVTLIAIYMVNYRDSTE 435

BLAST of Cla97C01G008700 vs. TrEMBL
Match: tr|A0A0B2QYG8|A0A0B2QYG8_GLYSO (Putative transporter OS=Glycine soja OX=3848 GN=glysoja_029756 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 2.2e-137
Identity = 284/452 (62.83%), Positives = 331/452 (73.23%), Query Frame = 0

Query: 1   MAPWWVPSSTQGIVAEPKTTTMAGCLATATLTPTSPFNSRPLLHFSFSTQFLAPEISSAA 60
           MA WW PS +          T A C  ++ +  TS F  + L  F  S+  L+   SS  
Sbjct: 1   MASWWCPSPSATFTVPAAAATTATC-HSSLIPHTSQFRIQTLT-FPLSSFPLSTTASS-- 60

Query: 61  PILRQRFPFQLGFRRYGENARFRFDYVAIPVANCTRSGADTELDLTQSIDCVGTAQDVEC 120
                             + RFR     +P +N  ++  +TEL     +DCVGT QDVEC
Sbjct: 61  ------------------SLRFR-----LPCSN--KTAFETELP-EDGVDCVGTGQDVEC 120

Query: 121 LVSPTDEDPSSSVGVPVELGISSEFGGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEV 180
           LV+ T+E  S                      + E  WE AVLVSPFFFWGTAMVAMKEV
Sbjct: 121 LVN-TEEKQS---------XXXXXXXXXXXLCLAEALWEGAVLVSPFFFWGTAMVAMKEV 180

Query: 181 LPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVDATFFQGFLAQG 240
           LP+ GPFFVSAFRLIPAGFLL+AFAA RGR  PSGF AW+SI LFALVDAT FQGFLA+G
Sbjct: 181 LPKCGPFFVSAFRLIPAGFLLVAFAASRGRSLPSGFIAWLSITLFALVDATCFQGFLAEG 240

Query: 241 LQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDAN 300
           LQRTSAGLGS+IIDSQPLTVAVLAA LFGESIG+VGAAGLVLGV+GL+LLE+P+L+ D +
Sbjct: 241 LQRTSAGLGSIIIDSQPLTVAVLAALLFGESIGVVGAAGLVLGVIGLVLLELPALSFDES 300

Query: 301 SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDP 360
           +FSLWGSGEWWM LAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGGLPL++  +LN+DP
Sbjct: 301 NFSLWGSGEWWMLLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGLPLVLFAVLNNDP 360

Query: 361 AVSGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGF 420
           A+S SLK++++ DILALLY S+FGSAVSYG FF+SATKGSLTKLSSLTFLTPMFAS+FGF
Sbjct: 361 ALSLSLKEYSSTDILALLYTSVFGSAVSYGVFFYSATKGSLTKLSSLTFLTPMFASIFGF 412

Query: 421 LYLGETFSPIQLVGAVVTVVAIYVVNYGSSLE 453
           LYLGETFSP+QLVGA+VTV  IY+VN  S+ E
Sbjct: 421 LYLGETFSPVQLVGALVTVAGIYMVNLRSTSE 412

BLAST of Cla97C01G008700 vs. TrEMBL
Match: tr|A0A067LEY3|A0A067LEY3_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_07969 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 2.2e-137
Identity = 262/346 (75.72%), Positives = 295/346 (85.26%), Query Frame = 0

Query: 108 SIDCVGTAQDVECLVSPTD-EDPSSSVGVPVELGISSEFGGDGSTAVLEKAWEFAVLVSP 167
           ++DCVGT  DVECL+SP+D  D  S+  V VE G   E        ++E   E  +LVSP
Sbjct: 100 TVDCVGTGLDVECLISPSDTNDTLSTAAVEVEEGGVKE----SKRNLVETVVETGILVSP 159

Query: 168 FFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFA 227
           FFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG +L+AFAA + RP PSG +AW+SI LF 
Sbjct: 160 FFFWGTAMVAMKEVLPLAGPFFVAAFRLIPAGLILVAFAASKDRPLPSGLTAWLSIALFG 219

Query: 228 LVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLG 287
           +VDA  FQGFLA+GLQRTSAGLGSVIIDSQPLTVAVLAA LFGESIGLVGAAGL+LGV+G
Sbjct: 220 VVDAACFQGFLAEGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESIGLVGAAGLILGVVG 279

Query: 288 LLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIG 347
           LLLLE+P L LD  +FSLWGSGEWWM LAAQSMAVGTVMVRWV+KYSDPIMATGWHMVIG
Sbjct: 280 LLLLELPVLALDEGNFSLWGSGEWWMLLAAQSMAVGTVMVRWVTKYSDPIMATGWHMVIG 339

Query: 348 GLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLTKLSS 407
           GLPL++I ILNHDPA SGSLKD TT+DILALLY SIFGSA+SYG +F+SATKGSLTKLSS
Sbjct: 340 GLPLVVISILNHDPAFSGSLKDLTTSDILALLYTSIFGSAISYGVYFYSATKGSLTKLSS 399

Query: 408 LTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSLE 453
           LTFLTPMFAS+FGFLYLGETFS  QLVGAV+T+VAIY+VNY  S+E
Sbjct: 400 LTFLTPMFASIFGFLYLGETFSSSQLVGAVLTLVAIYMVNYRESIE 441

BLAST of Cla97C01G008700 vs. Swiss-Prot
Match: sp|Q93V85|WTR16_ARATH (WAT1-related protein At3g02690, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g02690 PE=1 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 7.7e-130
Identity = 237/302 (78.48%), Positives = 268/302 (88.74%), Query Frame = 0

Query: 146 GGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFA 205
           GG+G+        E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA
Sbjct: 117 GGEGTFL------EWTVLISPFFFWGTAMVAMKEVLPITGPFFVAAFRLIPAGLLLVAFA 176

Query: 206 AFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA 265
            ++GRP P G +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+
Sbjct: 177 VYKGRPLPEGINAWFSIALFALVDATCFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAS 236

Query: 266 FLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVM 325
           FLFGESIG+V A GL+LGV GLLLLEVPS+T D N+FSLWGSGEWWM LAAQSMA+GTVM
Sbjct: 237 FLFGESIGIVRAGGLLLGVAGLLLLEVPSVTSDGNNFSLWGSGEWWMLLAAQSMAIGTVM 296

Query: 326 VRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGS 385
           VRWVSKYSDPIMATGWHMVIGGLPLL I ++NHDP  +GSL+D +TND++ALLY SIFGS
Sbjct: 297 VRWVSKYSDPIMATGWHMVIGGLPLLAISVINHDPVFNGSLQDLSTNDVIALLYTSIFGS 356

Query: 386 AVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVV 445
           AVSYG +F+SATKGSLTKLSSLTFLTPMFAS+FG+LYL ETFS +QLVGA VT+VAIY+V
Sbjct: 357 AVSYGVYFYSATKGSLTKLSSLTFLTPMFASIFGYLYLNETFSSLQLVGAAVTLVAIYLV 412

Query: 446 NY 448
           N+
Sbjct: 417 NF 412

BLAST of Cla97C01G008700 vs. Swiss-Prot
Match: sp|P74436|Y355_SYNY3 (Uncharacterized transporter sll0355 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=sll0355 PE=3 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.1e-71
Identity = 156/302 (51.66%), Positives = 210/302 (69.54%), Query Frame = 0

Query: 163 LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISI 222
           L++PFF WGTAMVAMK VL  + PFFV+  RLIPAG L++ +A  + RP P  +  W  I
Sbjct: 17  LIAPFFLWGTAMVAMKGVLADTTPFFVATVRLIPAGILVLLWAMGQKRPQPQNWQGWGWI 76

Query: 223 ILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVL 282
           ILFALVD T FQGFLAQGL+RT AGLGSVIIDSQP+ VA+L+++LF E IG +G  GL+L
Sbjct: 77  ILFALVDGTLFQGFLAQGLERTGAGLGSVIIDSQPIAVALLSSWLFKEVIGGIGWLGLLL 136

Query: 283 GVLGLLLLEVP-----------SLTLDANSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSK 342
           GV G+ L+ +P            L+++ +  +L  SGE WM LA+ SMAVGTV++ +VS+
Sbjct: 137 GVGGISLIGLPDEWFYQLWHLQGLSINWSGSALGSSGELWMLLASLSMAVGTVLIPFVSR 196

Query: 343 YSDPIMATGWHMVIGGLPLLMICIL-NHDPAVSGSLKDFTTNDILALLYASIFGSAVSYG 402
             DP++ATGWHM+IGGLPLL I ++ + +P  +  L  +       L YA++FGSA++YG
Sbjct: 197 RVDPVVATGWHMIIGGLPLLAIALVQDSEPWQNIDLWGWGN-----LAYATVFGSAIAYG 256

Query: 403 SFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSS 453
            FF+ A+KG+LT LSSLTFLTP+FA  F  L L E  S +Q +G   T+V+IY++N    
Sbjct: 257 IFFYLASKGNLTSLSSLTFLTPIFALSFSNLILEEQLSSLQWLGVAFTLVSIYLINQREQ 313

BLAST of Cla97C01G008700 vs. Swiss-Prot
Match: sp|P42194|PECM_DICD3 (Protein PecM OS=Dickeya dadantii (strain 3937) OX=198628 GN=pecM PE=3 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.3e-12
Identity = 84/283 (29.68%), Positives = 123/283 (43.46%), Query Frame = 0

Query: 170 WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVD 229
           WGT      + LP   P   +  R +PAG +LI      G+  P     W   +L AL  
Sbjct: 14  WGTTYFVTTQFLPADKPLLAALIRALPAGIILIL-----GKNLPPVGWLWRLFVLGALNI 73

Query: 230 ATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGL-L 289
             FF   L     R   G+ +++   QPL V +L+  L  + +        V G +G+ L
Sbjct: 74  GVFFV-MLFFAAYRLPGGVVALVGSLQPLIVILLSFLLLTQPVLKKQMVAAVAGGIGIVL 133

Query: 290 LLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP-----IMATGWHM 349
           L+ +P   L  N   L  S      LA  SMA G V+ +   K+  P     +  TGW +
Sbjct: 134 LISLPKAPL--NPAGLVASA-----LATMSMASGLVLTK---KWGRPAGMTMLTFTGWQL 193

Query: 350 VIGGLPLLMICILNHDPAVSGSLKDFTT-NDILALLYASIFGSAVSYGSFFFSATKGSLT 409
             GGL +L + +L         L D  T  ++   LY +I GS ++Y  +F      S  
Sbjct: 194 FCGGLVILPVQMLTE------PLPDLVTLTNLAGYLYLAIPGSLLAYFMWFSGLEANSPV 253

Query: 410 KLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVV 446
            +S L FL+P+ A    FL+L +  S  QLVG V    A+ +V
Sbjct: 254 IMSLLGFLSPLVALXXXFLFLQQGLSGAQLVGVVFIFSALIIV 274

BLAST of Cla97C01G008700 vs. Swiss-Prot
Match: sp|O32256|YVBV_BACSU (Uncharacterized transporter YvbV OS=Bacillus subtilis (strain 168) OX=224308 GN=yvbV PE=3 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.3e-12
Identity = 80/286 (27.97%), Positives = 125/286 (43.71%), Query Frame = 0

Query: 170 WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIILFALVD 229
           WG      K  L  S P   +  R +  G LL+  A  R          W   ++ AL++
Sbjct: 20  WGVNWPLSKAALAYSPPLLFAGIRTLIGGLLLVIVALPRIHKLRLK-ETWPIYLVSALLN 79

Query: 230 ATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVLGVLGLLL 289
            T F G    GL    AGL S I+  QP+ + V +    GES+ ++   GL+LG  G+ +
Sbjct: 80  ITLFYGLQTIGLNYLPAGLFSAIVFFQPVLMGVFSWLWLGESMFVMKVIGLILGFAGVAV 139

Query: 290 LEVPSLTLDANSFSLWGSGEWWMFLA---AQSMAVGTVMVRWVSKYSDPIMATGWHMVIG 349
           +            S+ G     + LA   A S A+GTV ++      D I      + IG
Sbjct: 140 ISAAGF---GGHISVIG-----VLLALGSAVSWALGTVYMKKTGSRVDSIWMVALQLTIG 199

Query: 350 GLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLTKLSS 409
            + LL+          S S   +T   I +LL+ S+F  A+ +  FF     G  +K++S
Sbjct: 200 SVFLLISGFWTE----SFSAIQWTAPFITSLLFISVFVIALGWLVFFTLVGSGEASKVAS 259

Query: 410 LTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSLE 453
            TFL P+ + V   ++L E  +   L G ++ V +I +VN  S  +
Sbjct: 260 YTFLIPLISIVASSIFLHEPLTLSLLAGLLLIVTSICLVNTKSKAQ 292

BLAST of Cla97C01G008700 vs. Swiss-Prot
Match: sp|O34416|YOAV_BACSU (Uncharacterized transporter YoaV OS=Bacillus subtilis (strain 168) OX=224308 GN=yoaV PE=3 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 2.5e-11
Identity = 68/282 (24.11%), Positives = 128/282 (45.39%), Query Frame = 0

Query: 163 LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISI 222
           ++S    WG   VAMK  +    P   S  RL      L      + +          S 
Sbjct: 8   IISVTLIWGYTWVAMKVGIHDIPPLLFSGLRLFIGAVPLFLILFIQRKKLSIQKEHLKSY 67

Query: 223 ILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESIGLVGAAGLVL 282
           I+ +L+    + G L  G+Q   +G  SV++ + P+ V V++ F   E + +    GLV 
Sbjct: 68  IIMSLLMGLGYMGILTYGMQFVDSGKTSVLVYTMPIFVTVISHFSLNEKMNVYKTMGLVC 127

Query: 283 GVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWH 342
           G+ GLL +    + L+ +  +L+  GE  + +AA S  +  V  +   K+ D I    WH
Sbjct: 128 GLFGLLFIFGKEM-LNIDQSALF--GELCVLVAALSWGIANVFSKLQFKHIDIIHMNAWH 187

Query: 343 MVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFFSATKGSLT 402
           +++G + LL+   +    AV  +  ++T   + +LL+  +  +  ++  +F+   +   +
Sbjct: 188 LMMGAVMLLVFSFIFE--AVPSA--EWTYQAVWSLLFNGLLSTGFTFVVWFWVLNQIQAS 247

Query: 403 KLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYV 445
           K S      P+ A  FG+L L E  +   ++GA++    I++
Sbjct: 248 KASMALMFVPVLALFFGWLQLHEQITINIILGALLICCGIFM 282

BLAST of Cla97C01G008700 vs. TAIR10
Match: AT3G02690.1 (nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 465.3 bits (1196), Expect = 4.3e-131
Identity = 237/302 (78.48%), Positives = 268/302 (88.74%), Query Frame = 0

Query: 146 GGDGSTAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFA 205
           GG+G+        E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA
Sbjct: 117 GGEGTFL------EWTVLISPFFFWGTAMVAMKEVLPITGPFFVAAFRLIPAGLLLVAFA 176

Query: 206 AFRGRPFPSGFSAWISIILFALVDATFFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA 265
            ++GRP P G +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+
Sbjct: 177 VYKGRPLPEGINAWFSIALFALVDATCFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAS 236

Query: 266 FLFGESIGLVGAAGLVLGVLGLLLLEVPSLTLDANSFSLWGSGEWWMFLAAQSMAVGTVM 325
           FLFGESIG+V A GL+LGV GLLLLEVPS+T D N+FSLWGSGEWWM LAAQSMA+GTVM
Sbjct: 237 FLFGESIGIVRAGGLLLGVAGLLLLEVPSVTSDGNNFSLWGSGEWWMLLAAQSMAIGTVM 296

Query: 326 VRWVSKYSDPIMATGWHMVIGGLPLLMICILNHDPAVSGSLKDFTTNDILALLYASIFGS 385
           VRWVSKYSDPIMATGWHMVIGGLPLL I ++NHDP  +GSL+D +TND++ALLY SIFGS
Sbjct: 297 VRWVSKYSDPIMATGWHMVIGGLPLLAISVINHDPVFNGSLQDLSTNDVIALLYTSIFGS 356

Query: 386 AVSYGSFFFSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVV 445
           AVSYG +F+SATKGSLTKLSSLTFLTPMFAS+FG+LYL ETFS +QLVGA VT+VAIY+V
Sbjct: 357 AVSYGVYFYSATKGSLTKLSSLTFLTPMFASIFGYLYLNETFSSLQLVGAAVTLVAIYLV 412

Query: 446 NY 448
           N+
Sbjct: 417 NF 412
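
The percentages in each HSP summary line follow from the counts next to them: identical (or positive) positions divided by the alignment length. The sketch below is not part of the database record. It is a minimal Python illustration that parses one summary line and recomputes both percentages; the function name `parse_hsp_stats` and the dictionary keys are hypothetical, and the pattern assumes the line layout shown in this record.

```python
import re

# Matches the HSP summary line layout used in this record. The optional "i"
# accepts both the "Positives" and "Postives" spellings of the field label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positive": pos,
        "alignment_length": ident_len,
        "identity_pct_reported": float(ident_pct),
        "identity_pct_recomputed": round(100 * ident / ident_len, 2),
        "positives_pct_reported": float(pos_pct),
        "positives_pct_recomputed": round(100 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    # Counts taken from the TAIR10 hit AT3G02690.1 in this record.
    stats = parse_hsp_stats(
        "Identity = 237/302 (78.48%), Positives = 268/302 (88.74%), Query Frame = 0"
    )
    print(stats["identity_pct_recomputed"], stats["positives_pct_recomputed"])
```

Comparing the reported and recomputed values is a quick consistency check when these summary lines are copied out of the page by hand.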

The following BLAST results are available for this feature:
Match Name                      E-value   Identity (%)  Description
XP_004140354.1                  3.5e-216  91.90         PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis sativus] >KGN5...
XP_008463172.1                  2.0e-211  90.28         PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis melo]
XP_022993106.1                  2.5e-190  83.14         WAT1-related protein At3g02690, chloroplastic [Cucurbita maxima]
XP_023550684.1                  1.7e-189  82.92         WAT1-related protein At3g02690, chloroplastic [Cucurbita pepo subsp. pepo] >XP_0...
XP_022939282.1                  1.1e-188  82.69         WAT1-related protein At3g02690, chloroplastic [Cucurbita moschata] >XP_022939283...

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0KQD4|A0A0A0KQD4_CUCSA  2.3e-216  91.90         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G429960 PE=4 SV=1
tr|A0A1S3CK55|A0A1S3CK55_CUCME  1.3e-211  90.28         WAT1-related protein At3g02690, chloroplastic OS=Cucumis melo OX=3656 GN=LOC1035...
tr|A0A2C9WKL9|A0A2C9WKL9_MANES  1.7e-137  68.97         Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G125800 PE=4 SV=...
tr|A0A0B2QYG8|A0A0B2QYG8_GLYSO  2.2e-137  62.83         Putative transporter OS=Glycine soja OX=3848 GN=glysoja_029756 PE=4 SV=1
tr|A0A067LEY3|A0A067LEY3_JATCU  2.2e-137  75.72         Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_07969 PE=4 SV=1

Match Name                      E-value   Identity (%)  Description
sp|Q93V85|WTR16_ARATH           7.7e-130  78.48         WAT1-related protein At3g02690, chloroplastic OS=Arabidopsis thaliana OX=3702 GN...
sp|P74436|Y355_SYNY3            2.1e-71   51.66         Uncharacterized transporter sll0355 OS=Synechocystis sp. (strain PCC 6803 / Kazu...
sp|P42194|PECM_DICD3            1.3e-12   29.68         Protein PecM OS=Dickeya dadantii (strain 3937) OX=198628 GN=pecM PE=3 SV=1
sp|O32256|YVBV_BACSU            1.3e-12   27.97         Uncharacterized transporter YvbV OS=Bacillus subtilis (strain 168) OX=224308 GN=...
sp|O34416|YOAV_BACSU            2.5e-11   24.11         Uncharacterized transporter YoaV OS=Bacillus subtilis (strain 168) OX=224308 GN=...

Match Name                      E-value   Identity (%)  Description
AT3G02690.1                     4.3e-131  78.48         nodulin MtN21 /EamA-like transporter family protein
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0016021  integral component of membrane
GO:0016020  membrane
Vocabulary: INTERPRO
Term        Definition
IPR000620   EamA_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G008700.1  Cla97C01G008700.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description   Source       Source Term     Source Description                            Alignment
IPR000620  EamA domain       PFAM         PF00892         EamA                                          coord: 164..290; e-value: 5.0E-21; score: 75.2
IPR000620  EamA domain       PFAM         PF00892         EamA                                          coord: 307..446; e-value: 4.6E-26; score: 91.5
None       No IPR available  PANTHER      PTHR22911       ACYL-MALONYL CONDENSING ENZYME-RELATED        coord: 96..450
None       No IPR available  PANTHER      PTHR22911:SF60  SUBFAMILY NOT NAMED                           coord: 96..450
None       No IPR available  SUPERFAMILY  SSF103481       Multidrug resistance efflux transporter EmrE  coord: 192..294
None       No IPR available  SUPERFAMILY  SSF103481       Multidrug resistance efflux transporter EmrE  coord: 341..450

The following gene(s) are paralogous to this gene:

None