ClCG01G010790 (gene) Watermelon (Charleston Gray)

Name: ClCG01G010790
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: nodulin MtN21 /EamA-like transporter family protein LENGTH=374
Location: CG_Chr01 : 16934637 .. 16939885 (+)
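
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function names parse_location and span_length are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01 : 16934637 .. 16939885 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 5249 bp on the forward strand of CG_Chr01.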
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGCCCCTTTAGCTCACACAAAGAAAGAGAAAAGAGAGGAAGAGAGAAAGAGAGAAAGAGAAGTGAGAGCTATGGCTGCTGGTGGTGATTTCTTGCCCCCCTTGGTTATGCTGGTTGTCCAATTCCTATATGCAGGTTTGAATATTACATCAAAGTTAGCCATGGAGTTTGGGATGAACCCACTAGTTCTAGTTGCTTACAGGCAGATGTTTGCTACCATAGCCATAGCTCCTTGTGCATACTGGTTTGAGAGGTATGTGTAGTGTACATATCATTTTACTTAACGACTTCTAAATATCACTCGTTCTAACAAACGAGATGAACTTTCGTTCTTCAGCAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTTAGTCTCAAAGCCAGTGTTGTAACAACCTACCAAACAAGCCATGAGATGGTCCTAAAATCTAATTTGAAGCCTAACATCCAAAAATGGGATTGTTGGATTTTCTTATATCCATGTGGGGAAGGAGTTAGACTTTGGTTTATAACTTCTTAAGCTAGATTTAGAGGTGAGAGGCACAAACAATGATAATAACAATTTTTCATCTTCTAGTTCAAATGACAAAAGTTTTCAGCTAACAAATAAAGAGCCAAGTTAGGTTCTTAAGAAAAATGCAGACTTCATAAGCCAAAATAAATAATATGCCTTCTTCCTCTACTTTCACAAAGAAACAGTAATCTTCAAAAGGAAATTTTATTTGACCACCATGAATACATCCATCAGGACCAGAACCCTCTTGAATTTCATTTGACACAAAAGGATCTAACTATCACAAAACAACTTTTTGTTACATATGTTAGATAGTCCTCCCCCAATCTCTATATTAATGATTAAAAAATTTAGATCTAACTATTTAATATATTATCAACAACATAAGTGGTGCTAGCACTGAGAGTTCAAAGAAACCTTGAACTCCTTTACTCTAATACACCACATGTTCACAACTCAAAATCCAAATGGATTACGAGGAACCTTCTTCATTTTATGAAGTTATCAACTTCTGAACTTGTTACTAACAGATATCATAATAACAAACAAAATCACTGTGCAGAAAAGGGAGACCCAAGATTACAAAGCCTATCCTGTTTCAGATTCTACTTTGTTCCTTAACTGGGTAAGCCAAGCTACTTTCATTTCGTTTCCCTGGCTAAACTTTGTAAAACTCTAATTATAATATATATGCAGGGCAACAGCAAACCAAGTCTTTTACTACGTTGGGTTGAAATATTCAACCCCCACCATCGCATGCGCATTGACCAATGTTCTGCCTGCTGCAACTTTTGTCCTCGCTGTTCTTTTCAGGTACCTAAAAATCCTCTTTTTTCACCATTTTCTGAACACAGTGGCAGACATTTCTGCAACTGTTTCTCCTATGATCCAAAGTTCATACAATCTTTTTCTGTTTGATAGACAAGAATCTGTGGGGATCAAAACAAGCCCAGGTGCAGCTAAGGTGATGGGGACTGTTGTATGTGTCGGAGGGGCAATGTTGCTGTCGTTCTACCGTGGACAAACAATCCAGCTTGGTGAATCTGGTATTCATTGGAAGTATGCC
GAGTTAATGAGGGGGGGAAGCTCAACCAACCAAGGCAGCAGCATCTGGGGCTCCCTCTGCTTGATCATAAGCTCTGTCGCTTGGGCGGCTTGGTTTGTCATCCAAGTCAGTGTCCTCAATTTTGCTTCCCTTGACTTTCTTGTTCTCAGAGGCCATGTTTTTTCCATAGCCAAATCTAAGACTACTATTAATTCCTGCAGGCAAGAGTGAATGAGAAGTTTCCAGCACCCTACACGAGCACAGCTCTCATGACTTTCATGGCAACCATTCAATGTGGAGCAATTGCAGTTGGCGTTGAGCACAAGACGGCTGCAGCTTGGTCATTGAAAAGTTCAATCAGGCTTGTTGGCGCTCTCTATGCGGTATGATCATTCTCCAACTAACCTTTTCATCTAGAAAGAATTGAGAATGTCTAACATTAAGAATCTGATGAAGAAACCTCAATTCCTGATTCCTCTTCTCTTAACAATACATAAGATTCTAACATCACAACGCTAACGTTGTCCATATATGTTAGAACCTACACGAAAAGCTTAATACTATATACAATAATACTCCCCCTCCCCCTAAAGACCTCATTGTACTGATTTTACAAGGCAACACCAATTACACTTTCAGGGAGTGGCGTGCTCTGGGGTGGCATTTTGCCTAACCTCCTGGAGTATCCAAAAGAAAGGTCCTCTCTATGCCTCAGTTTTCAGCCCCTTCTTGCTCGTTATTGTGGCTATCTTCAGCTGGGCCTTTTTTCAAGAAAAATTATACGTCGGAACGTGAGTACAAACTTCCACCTACAACTTCTGATCCATAAGTTTACATTACATATTGAGCTTGTGTTGAAATGCCAACCAAAAGCATATTGTTAACAGTAGTTTTTGTTTTGTTGGCTTGTCTCAGTGTTGTAGGGTCATTGTTGATTGTTGTTGGGCTATATGCTGTCCTATGGGGAAAGACCAAGGAGGTGAAACTACAACAACATATCGAAATGGCGGCGGCAGCAGAGGCAAAGCTAGATGACTGTAATAACAAAGACTTAGAAGAACAGTCCTATGTGGTTTCAAATGCCACTATTCTCCATAAGTAATAGCTCAGAGAAAGGAGTACGAAAGCTAAATATAGCATGGTGTGATATTTGAGAGTTGTGCTGAGAATACAGATCAAAATCCATCCACTTTCAACAGATAGAGTTCAGAACATGTTTAAAGGATACATACATCCCTTTTTATGAAATGAAAATTTTACGTTGTTCAAG

mRNA sequence

AGCCCCTTTAGCTCACACAAAGAAAGAGAAAAGAGAGGAAGAGAGAAAGAGAGAAAGAGAAGTGAGAGCTATGGCTGCTGGTGGTGATTTCTTGCCCCCCTTGGTTATGCTGGTTGTCCAATTCCTATATGCAGGTTTGAATATTACATCAAAGTTAGCCATGGAGTTTGGGATGAACCCACTAGTTCTAGTTGCTTACAGGCAGATGTTTGCTACCATAGCCATAGCTCCTTGTGCATACTGGTTTGAGAGAAAAGGGAGACCCAAGATTACAAAGCCTATCCTGTTTCAGATTCTACTTTGTTCCTTAACTGGGGCAACAGCAAACCAAGTCTTTTACTACGTTGGGTTGAAATATTCAACCCCCACCATCGCATGCGCATTGACCAATGTTCTGCCTGCTGCAACTTTTGTCCTCGCTGTTCTTTTCAGACAAGAATCTGTGGGGATCAAAACAAGCCCAGGTGCAGCTAAGGTGATGGGGACTGTTGTATGTGTCGGAGGGGCAATGTTGCTGTCGTTCTACCGTGGACAAACAATCCAGCTTGGTGAATCTGGTATTCATTGGAAGTATGCCGAGTTAATGAGGGGGGGAAGCTCAACCAACCAAGGCAGCAGCATCTGGGGCTCCCTCTGCTTGATCATAAGCTCTGTCGCTTGGGCGGCTTGGTTTGTCATCCAAGCAAGAGTGAATGAGAAGTTTCCAGCACCCTACACGAGCACAGCTCTCATGACTTTCATGGCAACCATTCAATGTGGAGCAATTGCAGTTGGCGTTGAGCACAAGACGGCTGCAGCTTGGTCATTGAAAAGTTCAATCAGGCTTGTTGGCGCTCTCTATGCGGGAGTGGCGTGCTCTGGGGTGGCATTTTGCCTAACCTCCTGGAGTATCCAAAAGAAAGGTCCTCTCTATGCCTCAGTTTTCAGCCCCTTCTTGCTCGTTATTGTGGCTATCTTCAGCTGGGCCTTTTTTCAAGAAAAATTATACGTCGGAACTGTTGTAGGGTCATTGTTGATTGTTGTTGGGCTATATGCTGTCCTATGGGGAAAGACCAAGGAGGTGAAACTACAACAACATATCGAAATGGCGGCGGCAGCAGAGGCAAAGCTAGATGACTGTAATAACAAAGACTTAGAAGAACAGTCCTATGTGGTTTCAAATGCCACTATTCTCCATAAGTAATAGCTCAGAGAAAGGAGTACGAAAGCTAAATATAGCATGGTGTGATATTTGAGAGTTGTGCTGAGAATACAGATCAAAATCCATCCACTTTCAACAGATAGAGTTCAGAACATGTTTAAAGGATACATACATCCCTTTTTATGAAATGAAAATTTTACGTTGTTCAAG

Coding sequence (CDS)

ATGGCTGCTGGTGGTGATTTCTTGCCCCCCTTGGTTATGCTGGTTGTCCAATTCCTATATGCAGGTTTGAATATTACATCAAAGTTAGCCATGGAGTTTGGGATGAACCCACTAGTTCTAGTTGCTTACAGGCAGATGTTTGCTACCATAGCCATAGCTCCTTGTGCATACTGGTTTGAGAGAAAAGGGAGACCCAAGATTACAAAGCCTATCCTGTTTCAGATTCTACTTTGTTCCTTAACTGGGGCAACAGCAAACCAAGTCTTTTACTACGTTGGGTTGAAATATTCAACCCCCACCATCGCATGCGCATTGACCAATGTTCTGCCTGCTGCAACTTTTGTCCTCGCTGTTCTTTTCAGACAAGAATCTGTGGGGATCAAAACAAGCCCAGGTGCAGCTAAGGTGATGGGGACTGTTGTATGTGTCGGAGGGGCAATGTTGCTGTCGTTCTACCGTGGACAAACAATCCAGCTTGGTGAATCTGGTATTCATTGGAAGTATGCCGAGTTAATGAGGGGGGGAAGCTCAACCAACCAAGGCAGCAGCATCTGGGGCTCCCTCTGCTTGATCATAAGCTCTGTCGCTTGGGCGGCTTGGTTTGTCATCCAAGCAAGAGTGAATGAGAAGTTTCCAGCACCCTACACGAGCACAGCTCTCATGACTTTCATGGCAACCATTCAATGTGGAGCAATTGCAGTTGGCGTTGAGCACAAGACGGCTGCAGCTTGGTCATTGAAAAGTTCAATCAGGCTTGTTGGCGCTCTCTATGCGGGAGTGGCGTGCTCTGGGGTGGCATTTTGCCTAACCTCCTGGAGTATCCAAAAGAAAGGTCCTCTCTATGCCTCAGTTTTCAGCCCCTTCTTGCTCGTTATTGTGGCTATCTTCAGCTGGGCCTTTTTTCAAGAAAAATTATACGTCGGAACTGTTGTAGGGTCATTGTTGATTGTTGTTGGGCTATATGCTGTCCTATGGGGAAAGACCAAGGAGGTGAAACTACAACAACATATCGAAATGGCGGCGGCAGCAGAGGCAAAGCTAGATGACTGTAATAACAAAGACTTAGAAGAACAGTCCTATGTGGTTTCAAATGCCACTATTCTCCATAAGTAA

Protein sequence

MAAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQSYVVSNATILHK
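
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first and last few codons of the CDS listed above, assuming the standard genetic code (NCBI translation table 1); the names CODON_TABLE and translate are hypothetical and are not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the CDS shown above.
print(translate("ATGGCTGCTGGTGGTGATTTCTTG"))
print(translate("CATAAGTAA"))
```

The first call yields the N-terminal residues MAAGGDFL and the second yields HK, matching the start and end of the listed protein; the final TAA codon is a stop and contributes no residue.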
BLAST of ClCG01G010790 vs. Swiss-Prot
Match: WTR2_ARATH (WAT1-related protein At1g09380 OS=Arabidopsis thaliana GN=At1g09380 PE=2 SV=1)

HSP 1 Score: 419.1 bits (1076), Expect = 5.1e-116
Identity = 211/364 (57.97%), Positives = 269/364 (73.90%), Query Frame = 1

Query: 3   AGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERK 62
           A  D LP L M++VQ  YAG+NITSK+AME GM PL+LVAYRQ+FATIA  P A++ ERK
Sbjct: 2   AKSDMLPFLAMVLVQIGYAGMNITSKMAMEAGMKPLILVAYRQIFATIATFPVAFFLERK 61

Query: 63  GRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQ 122
            RPKIT  IL Q+  CS+TGAT NQV Y+VGL+ S+PTIACALTN+LPA TF+LA +FRQ
Sbjct: 62  TRPKITLRILVQVFFCSITGATGNQVLYFVGLQNSSPTIACALTNLLPAVTFLLAAIFRQ 121

Query: 123 ESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAE-LMRGGSSTNQG 182
           E+VGIK + G AKV+GT+VCV GAM+LSFY G TI +GES IHW YAE + + GSS+   
Sbjct: 122 ETVGIKKASGQAKVIGTLVCVIGAMVLSFYHGHTIGIGESKIHWAYAENITKHGSSSGHS 181

Query: 183 SSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTA 242
           +   G   ++ ++V+WAAWF+IQ +++E F APYTST LM  M +IQCGAIA+  +H T 
Sbjct: 182 NFFLGPFLIMAAAVSWAAWFIIQTKMSETFAAPYTSTLLMCLMGSIQCGAIALISDH-TI 241

Query: 243 AAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF 302
           + WSL S +R + ALYAGV  S +AFCL SW++Q+KGPLY SVFSP LLV+VAIFSWA  
Sbjct: 242 SDWSLSSPLRFISALYAGVVASALAFCLMSWAMQRKGPLYVSVFSPLLLVVVAIFSWALL 301

Query: 303 QEKLYVGTVVGSLLIVVGLYAVLWGKTKEV-KLQQHIEMAAAAEAKLDDCNNKDLEEQSY 362
           +EKLY GT +GS L+V+GLY VLWGK +EV + ++  E       K+   +N+D+E +  
Sbjct: 302 EEKLYTGTFMGSALVVIGLYGVLWGKDREVSEKEEEREKVKQQNHKVKSESNEDIESRLP 361

Query: 363 VVSN 365
           V S+
Sbjct: 362 VASS 364
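
The percentages reported in each HSP summary are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the hit above (211 identical and 269 similar positions over 364 aligned columns); the helper name percent_of_alignment is hypothetical.

```python
def percent_of_alignment(fraction):
    """Convert a 'matches/aligned' string such as '211/364' to a percentage string."""
    matches, aligned = (int(part) for part in fraction.split("/"))
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}"

print(percent_of_alignment("211/364"))  # identical positions
print(percent_of_alignment("269/364"))  # identical plus conservative substitutions
```

The two values reproduce the 57.97% and 73.90% figures shown in the summary line for this hit.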

BLAST of ClCG01G010790 vs. Swiss-Prot
Match: WTR38_ARATH (WAT1-related protein At5g07050 OS=Arabidopsis thaliana GN=At5g07050 PE=2 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 1.4e-68
Identity = 137/327 (41.90%), Positives = 210/327 (64.22%), Query Frame = 1

Query: 9   PPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKIT 68
           P   M+ +QF YAG+NI +K+++  GM+  VLV YR   AT  IAP A++FERK +PKIT
Sbjct: 18  PYFAMISLQFGYAGMNIITKISLNTGMSHYVLVVYRHAIATAVIAPFAFFFERKAQPKIT 77

Query: 69  KPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIK 128
             I  Q+ +  L G   +Q FYY+GLKY++PT +CA++N+LPA TF+LAVLFR E + +K
Sbjct: 78  FSIFMQLFILGLLGPVIDQNFYYMGLKYTSPTFSCAMSNMLPAMTFILAVLFRMEMLDLK 137

Query: 129 TSPGAAKVMGTVVCVGGAMLLSFYRGQTIQL-GESGIHWKYAELMRGGSSTNQGSS---I 188
                AK+ GTVV V GAML++ Y+G  ++L     +H + +      SS N  S    +
Sbjct: 138 KLWCQAKIAGTVVTVAGAMLMTIYKGPIVELFWTKYMHIQDSSHANTTSSKNSSSDKEFL 197

Query: 189 WGSLCLIISSVAWAAWFVIQARVNEKFPAPYTS-TALMTFMATIQCGAIAVGVEHKTAAA 248
            GS+ LI +++AWA+ FV+QA++ + +     S T L+ F+ T+Q  A+   +EH   +A
Sbjct: 198 KGSILLIFATLAWASLFVLQAKILKTYAKHQLSLTTLICFIGTLQAVAVTFVMEH-NPSA 257

Query: 249 WSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQE 308
           W +   + L+ A Y+G+  S +++ +    ++K+GP++A+ FSP ++VIVA+       E
Sbjct: 258 WRIGWDMNLLAAAYSGIVASSISYYVQGIVMKKRGPVFATAFSPLMMVIVAVMGSFVLAE 317

Query: 309 KLYVGTVVGSLLIVVGLYAVLWGKTKE 331
           K+++G V+G++LIV+GLYAVLWGK KE
Sbjct: 318 KIFLGGVIGAVLIVIGLYAVLWGKQKE 343

BLAST of ClCG01G010790 vs. Swiss-Prot
Match: WTR24_ARATH (WAT1-related protein At3g30340 OS=Arabidopsis thaliana GN=At3g30340 PE=2 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 4.2e-62
Identity = 111/322 (34.47%), Positives = 202/322 (62.73%), Query Frame = 1

Query: 11  LVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKITKP 70
           L+M ++    + +N+  K  ++ G+N +V   YR    T+ + P A + ER  RPK+T  
Sbjct: 13  LMMSMINIGLSVVNVMFKKMIDEGLNRMVATTYRLAVGTLFLIPFAIFLERHNRPKLTGR 72

Query: 71  ILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIKTS 130
           IL  +   +L G +  Q F+ +GL+Y++ T + A +N++P+ TF LA++FRQE++ IK++
Sbjct: 73  ILCSLFFSALLGTSLVQYFFLIGLEYTSSTFSLAFSNMVPSVTFALALVFRQETLNIKSN 132

Query: 131 PGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSIW--GSL 190
            G AK++GT++C+ GA++L+ Y+G  +    S  H  + E      ST   +  W  GS+
Sbjct: 133 VGRAKLLGTMICICGALVLTLYKGTAL----SREHSTHMETHTRTDSTGAMTQKWAMGSI 192

Query: 191 CLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAWSLKS 250
            L+IS + W++WF++QA+++  +P  YTST +++F   IQ   +++ +  ++ + W +K 
Sbjct: 193 MLVISIIIWSSWFIVQAKISRVYPCQYTSTTILSFFGVIQSALLSL-ISERSTSMWVVKD 252

Query: 251 SIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEKLYVG 310
             +++  LY+G+  SG+ +   SW ++++G ++ S F P + V  AIFS++F  E++Y G
Sbjct: 253 KFQVLALLYSGIVGSGLCYVGMSWCLRQRGAVFTSSFIPLIQVFAAIFSFSFLHEQIYCG 312

Query: 311 TVVGSLLIVVGLYAVLWGKTKE 331
           +V+GS++I+VGLY +LWGK+K+
Sbjct: 313 SVIGSMVIIVGLYILLWGKSKD 329

BLAST of ClCG01G010790 vs. Swiss-Prot
Match: WTR14_ARATH (WAT1-related protein At2g39510 OS=Arabidopsis thaliana GN=At2g39510 PE=2 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 5.5e-62
Identity = 122/323 (37.77%), Positives = 199/323 (61.61%), Query Frame = 1

Query: 9   PPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKIT 68
           P + ++ +QF YAGL+I +K A+  GM+P VL +YR + ATI IAP AY+ +RK RPK+T
Sbjct: 8   PFITVVSLQFGYAGLSIIAKFALNQGMSPHVLASYRHIVATIFIAPFAYFLDRKIRPKMT 67

Query: 69  KPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIK 128
             I F+ILL  L   T +Q  YY G+KY++ T   A+TNVLPA  F++A +FR E V +K
Sbjct: 68  LSIFFKILLLGLLEPTIDQNLYYTGMKYTSATFTAAMTNVLPAFAFIMAWIFRLEKVNVK 127

Query: 129 TSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSIW-GS 188
                AK++GT+V VGGAML++  +G  I L  +  H    ++ +  S+T     +  G+
Sbjct: 128 KIHSQAKILGTIVTVGGAMLMTVVKGPLIPLPWANPH----DIHQDSSNTGVKQDLTKGA 187

Query: 189 LCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAWSLK 248
             + I  + WA +  +QA   + +P   + TA + F+ +I+   +A+ +E    +AW++ 
Sbjct: 188 SLIAIGCICWAGFINLQAITLKSYPVELSLTAYICFLGSIESTIVALFIERGNPSAWAIH 247

Query: 249 SSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEKLYV 308
              +L+ A+Y GV CSG+ + +    ++ +GP++ + F+P  +VIVAI       E +++
Sbjct: 248 LDSKLLAAVYGGVICSGIGYYVQGVIMKTRGPVFVTAFNPLSMVIVAILGSIILAEVMFL 307

Query: 309 GTVVGSLLIVVGLYAVLWGKTKE 331
           G ++G+++IV+GLY+VLWGK+K+
Sbjct: 308 GRILGAIVIVLGLYSVLWGKSKD 326

BLAST of ClCG01G010790 vs. Swiss-Prot
Match: WTR29_ARATH (WAT1-related protein At4g01440 OS=Arabidopsis thaliana GN=At4g01440 PE=2 SV=1)

HSP 1 Score: 235.3 bits (599), Expect = 1.0e-60
Identity = 119/355 (33.52%), Positives = 207/355 (58.31%), Query Frame = 1

Query: 5   GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGR 64
           G + P ++M+++       N   K  ++ G+N +V+  YR   +T+ +AP A+++ERK R
Sbjct: 6   GKWTPVIIMVMINSALGLANALVKKVLDGGVNHMVIATYRLAISTLFLAPIAFFWERKTR 65

Query: 65  PKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQES 124
           P +T  IL Q+   +L GA+  Q F+ +GL Y++ T+ACA  ++ PA TFV+A++FR E 
Sbjct: 66  PTLTLNILVQLFFSALVGASLTQYFFLLGLSYTSATLACAFISMTPAITFVMALIFRVEK 125

Query: 125 VGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTI-QLGESGIHWKYAELMRGGSSTNQGSS 184
           + +K+  G   VMG ++C+GGA+LL+ Y+G  + +L +   H    +L+    +    + 
Sbjct: 126 LNMKSKAGMGMVMGALICIGGALLLTMYKGVPLTKLRKLETH----QLINNNHAMKPENW 185

Query: 185 IWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAA 244
           I G + L   S  + +W +IQA+VNEK+P  Y+ST +++F  TIQC  +++ ++ +   A
Sbjct: 186 IIGCVLLFAGSSCFGSWMLIQAKVNEKYPCQYSSTVVLSFFGTIQCALLSL-IKSRDITA 245

Query: 245 WSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQE 304
           W L   + +V  +YAG    G+    TSW I+K+GP++ S+F+P  L+   +F +     
Sbjct: 246 WILTDKLDIVTIVYAGAVAQGICTVGTSWCIRKRGPIFTSIFTPVGLIFATLFDFLILHR 305

Query: 305 KLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQ 359
           ++++G+VVGS +++ GLY  L GK + +K +         E KL    N+D +E+
Sbjct: 306 QIFLGSVVGSGVVIFGLYIFLLGKVRLMKEE--------CEKKLPCRFNEDDQEE 347

BLAST of ClCG01G010790 vs. TrEMBL
Match: A0A0A0LW49_CUCSA (WAT1-related protein OS=Cucumis sativus GN=Csa_1G181350 PE=3 SV=1)

HSP 1 Score: 703.4 bits (1814), Expect = 1.5e-199
Identity = 356/371 (95.96%), Positives = 363/371 (97.84%), Query Frame = 1

Query: 1   MAAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60
           MAA GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE
Sbjct: 1   MAAAGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60

Query: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120
           RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF
Sbjct: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120

Query: 121 RQESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQ 180
           RQESVGIKTSPGAAKV+GTVVCVGGAMLLSFYRGQTI+LG+SGIHWKYAELMRG SS+NQ
Sbjct: 121 RQESVGIKTSPGAAKVIGTVVCVGGAMLLSFYRGQTIELGKSGIHWKYAELMRGESSSNQ 180

Query: 181 GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT 240
           GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT
Sbjct: 181 GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT 240

Query: 241 AAAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAF 300
            AAWSLKSSIRLVGALYAGVACSG+AFCLTSWSIQK+GPLYASVFSPFLLVIVAIFSWAF
Sbjct: 241 LAAWSLKSSIRLVGALYAGVACSGMAFCLTSWSIQKRGPLYASVFSPFLLVIVAIFSWAF 300

Query: 301 FQEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNK-DLEEQS 360
           FQEKLYVGTVVGSLLIVVGLY+VLWGKTKEVKLQQHIEM AAAEAKLDD NNK DLEEQS
Sbjct: 301 FQEKLYVGTVVGSLLIVVGLYSVLWGKTKEVKLQQHIEMTAAAEAKLDDYNNKEDLEEQS 360

Query: 361 YVVSNATILHK 371
           YVVSNA I HK
Sbjct: 361 YVVSNANIPHK 371

BLAST of ClCG01G010790 vs. TrEMBL
Match: M5W6P9_PRUPE (WAT1-related protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018179mg PE=3 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 3.0e-123
Identity = 229/362 (63.26%), Positives = 278/362 (76.80%), Query Frame = 1

Query: 1   MAAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60
           MA  GD+LP L M++VQ  YAG+NI SKLA+E  MNPLVLVAYRQ+FAT++IAP AYW E
Sbjct: 1   MAGYGDYLPFLAMVLVQMSYAGMNIISKLAIESDMNPLVLVAYRQVFATLSIAPFAYWME 60

Query: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120
            K RP+IT PILFQ  LCSLTGATANQVFY+VGLK STPTIACALTN LPA TF+LA++F
Sbjct: 61  WKTRPRITMPILFQTFLCSLTGATANQVFYFVGLKTSTPTIACALTNTLPAMTFILALIF 120

Query: 121 RQESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELM-RGGSSTN 180
           RQES  IK+ PG +KVMGTVVCV GAMLLSFY G  I LGES IHW YA+ M    +S++
Sbjct: 121 RQESAKIKSKPGLSKVMGTVVCVSGAMLLSFYHGHIIGLGESKIHWAYAQRMGEQANSSS 180

Query: 181 QGSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHK 240
            GSS  G LC+IIS++ WA WF+IQA+V E FPAPYTST LM  MA+ +CG IAV  +HK
Sbjct: 181 NGSSFVGPLCVIISTLGWAFWFIIQAKVGENFPAPYTSTTLMCLMASFECGIIAVIADHK 240

Query: 241 TAAAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWA 300
             +AWSLK+ +RL+ ALY G+  S +AF L+SWSIQ+KGPLY SVFSP LL+IVAI SWA
Sbjct: 241 -VSAWSLKNPMRLISALYCGILGSALAFFLSSWSIQRKGPLYVSVFSPLLLIIVAISSWA 300

Query: 301 FFQEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQ---QHIEMAAAAEAKLDDCNNKDLE 359
             +EKLY+GT +GS+LIV GLY VLWGK KE +++   +  +   A  AK  + N+ +L+
Sbjct: 301 LLEEKLYLGTAIGSILIVCGLYLVLWGKNKETEVEKPTKETDTTKADHAKEYERNDLELQ 360

BLAST of ClCG01G010790 vs. TrEMBL
Match: A0A061F4Z0_THECC (WAT1-related protein OS=Theobroma cacao GN=TCM_026803 PE=3 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 1.9e-122
Identity = 229/364 (62.91%), Positives = 279/364 (76.65%), Query Frame = 1

Query: 5   GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGR 64
           GDF+P L  ++VQ  YAG+NITSKLAME GM PL+LVAYRQ+FAT+AIAP AY+ ERK R
Sbjct: 3   GDFVPFLANVLVQLGYAGMNITSKLAMESGMKPLILVAYRQIFATLAIAPFAYFLERKTR 62

Query: 65  PKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQES 124
           PKITK ILFQI LCSLTGATANQVFY+VGL+ S+ T+ACAL NVLPAATF LA L RQE+
Sbjct: 63  PKITKHILFQIFLCSLTGATANQVFYFVGLENSSATVACALNNVLPAATFALAALCRQEA 122

Query: 125 VGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSI 184
           VGIK + G AKV+GT+VCVGGAMLLSFY G TI +G+S IHW YA+ M   SS+N  +  
Sbjct: 123 VGIKKASGQAKVLGTLVCVGGAMLLSFYHGHTIGIGDSSIHWAYADKMTSKSSSNGSNFF 182

Query: 185 WGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAW 244
            G   ++ S+VAWA W +IQ + ++ FPAPYT TALM FMA+I+C  I +  +HK  +AW
Sbjct: 183 LGPFLVMASAVAWAVWLIIQGQTSKNFPAPYTCTALMCFMASIECTIIGIFSDHK-ISAW 242

Query: 245 SLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEK 304
           SL SS+RL+ ALYAG+ C+ + FC+ SWSIQKKGPLY SVFSP LLVIVA+ SWA  +EK
Sbjct: 243 SLSSSMRLIAALYAGIVCNAMTFCVLSWSIQKKGPLYVSVFSPLLLVIVAVLSWALLREK 302

Query: 305 LYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEA-KLDDCNNK-DLEEQSYVV 364
           LYVGTVVGS+LIV GLYAVLWGK +E+K  +  E+  A EA K  + + K DLE Q +  
Sbjct: 303 LYVGTVVGSVLIVGGLYAVLWGKDREIKQMKSNELETAEEATKAREKDGKDDLELQLHPQ 362

Query: 365 SNAT 367
           +  T
Sbjct: 363 TKGT 365

BLAST of ClCG01G010790 vs. TrEMBL
Match: A0A067JUZ0_JATCU (WAT1-related protein OS=Jatropha curcas GN=JCGZ_23192 PE=3 SV=1)

HSP 1 Score: 441.4 bits (1134), Expect = 1.1e-120
Identity = 232/365 (63.56%), Positives = 272/365 (74.52%), Query Frame = 1

Query: 5   GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGR 64
           GD LP L M++VQF +AG+NI SKLAM+ GM PLVLVAYRQ+FATIA+ P AY+FE K R
Sbjct: 5   GDLLPFLAMVLVQFGFAGMNIISKLAMDSGMKPLVLVAYRQIFATIAMVPFAYFFEWKTR 64

Query: 65  PKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQES 124
           PKITK +LFQI +CSLTGAT NQVFY++GL+ STPTI CALTN+LPA TF+LAVL RQES
Sbjct: 65  PKITKELLFQIFICSLTGATGNQVFYFIGLQNSTPTIGCALTNILPAVTFILAVLLRQES 124

Query: 125 VGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSS- 184
           VGIK   G AK++GT +CVGGAMLLSFY G  I +GES IHWKYA+ + G  ++N GS  
Sbjct: 125 VGIKKLSGQAKLLGTAICVGGAMLLSFYHGSRINVGESSIHWKYADDI-GSQNSNDGSKS 184

Query: 185 --IWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTA 244
             I G L ++ S+V WA WF IQA+V+EKFPAPYTST L+ FM +IQC  I  G  H+ A
Sbjct: 185 NFILGPLFILASAVCWAIWFTIQAKVSEKFPAPYTSTFLLCFMGSIQCVLIGFGANHE-A 244

Query: 245 AAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF 304
           A WSL+   RLV ALYA + CS +AF LTSWSIQKKG LY SVFSP LLVIVA+ SWA  
Sbjct: 245 ADWSLRDPGRLVAALYAAIVCSALAFSLTSWSIQKKGALYVSVFSPLLLVIVAVLSWALL 304

Query: 305 QEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQSYV 364
           +EKLYVGTVVGS LIV GLYAVLWGK KE+KL+   E+ A  +         DLE Q   
Sbjct: 305 REKLYVGTVVGSALIVAGLYAVLWGKDKEMKLKVIEEIEARKQI-------NDLELQLPA 360

Query: 365 VSNAT 367
            SN T
Sbjct: 365 KSNGT 360

BLAST of ClCG01G010790 vs. TrEMBL
Match: K7MZS7_SOYBN (WAT1-related protein OS=Glycine max GN=GLYMA_19G227000 PE=3 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 2.6e-119
Identity = 218/361 (60.39%), Positives = 279/361 (77.29%), Query Frame = 1

Query: 4   GGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKG 63
           G   +  L+M++VQ +YA +NITSKLA+E GM+PLVLVAYRQ+FAT++IAP AYW E   
Sbjct: 2   GAGLVAFLLMVLVQLVYAVMNITSKLAIESGMSPLVLVAYRQLFATVSIAPFAYWLEWNT 61

Query: 64  RPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQE 123
            P+IT+ ++ QIL  SLTG T NQ+ Y+VGLKYS+ TIACALTN+LPA TF+LAVLFRQE
Sbjct: 62  LPRITQRLMIQILFSSLTGVTGNQMLYFVGLKYSSATIACALTNLLPAFTFILAVLFRQE 121

Query: 124 SVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSS 183
           ++GIK   G AKV GT++CV GA+LLSFY G+TI LG+S IHW+YAE M G SS+ +G+ 
Sbjct: 122 NLGIKKRAGLAKVFGTILCVSGALLLSFYHGKTIGLGQSSIHWRYAEKMEGTSSSGKGNM 181

Query: 184 IWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAA 243
             G L +I+S++ WAAWF+IQ  +++ FPAPYTST LM FMA+ QC  IAV V+H+ A+A
Sbjct: 182 FLGPLVVILSTLVWAAWFIIQKDISKTFPAPYTSTGLMCFMASFQCVIIAVCVDHR-ASA 241

Query: 244 WSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQE 303
           WSL +++RL  ALYAG+ C+G+A+CL SW+I++KGPLY SVF+P  LV+ AI SWA  +E
Sbjct: 242 WSLHNAMRLSSALYAGIFCTGLAYCLMSWTIERKGPLYVSVFTPLQLVLTAILSWALLRE 301

Query: 304 KLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQSYVVS 363
           KLYVGT VGSLLIV+GLY+VLWGK++EV     IE  A  EA  D  N  D+E QSYV S
Sbjct: 302 KLYVGTAVGSLLIVLGLYSVLWGKSEEVNKGDGIEEDAVKEAVKDSKN--DMELQSYVPS 359

Query: 364 N 365
           N
Sbjct: 362 N 359

BLAST of ClCG01G010790 vs. TAIR10
Match: AT1G09380.1 (AT1G09380.1 nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 419.1 bits (1076), Expect = 2.9e-117
Identity = 211/364 (57.97%), Positives = 269/364 (73.90%), Query Frame = 1

Query: 3   AGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERK 62
           A  D LP L M++VQ  YAG+NITSK+AME GM PL+LVAYRQ+FATIA  P A++ ERK
Sbjct: 2   AKSDMLPFLAMVLVQIGYAGMNITSKMAMEAGMKPLILVAYRQIFATIATFPVAFFLERK 61

Query: 63  GRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQ 122
            RPKIT  IL Q+  CS+TGAT NQV Y+VGL+ S+PTIACALTN+LPA TF+LA +FRQ
Sbjct: 62  TRPKITLRILVQVFFCSITGATGNQVLYFVGLQNSSPTIACALTNLLPAVTFLLAAIFRQ 121

Query: 123 ESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAE-LMRGGSSTNQG 182
           E+VGIK + G AKV+GT+VCV GAM+LSFY G TI +GES IHW YAE + + GSS+   
Sbjct: 122 ETVGIKKASGQAKVIGTLVCVIGAMVLSFYHGHTIGIGESKIHWAYAENITKHGSSSGHS 181

Query: 183 SSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTA 242
           +   G   ++ ++V+WAAWF+IQ +++E F APYTST LM  M +IQCGAIA+  +H T 
Sbjct: 182 NFFLGPFLIMAAAVSWAAWFIIQTKMSETFAAPYTSTLLMCLMGSIQCGAIALISDH-TI 241

Query: 243 AAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF 302
           + WSL S +R + ALYAGV  S +AFCL SW++Q+KGPLY SVFSP LLV+VAIFSWA  
Sbjct: 242 SDWSLSSPLRFISALYAGVVASALAFCLMSWAMQRKGPLYVSVFSPLLLVVVAIFSWALL 301

Query: 303 QEKLYVGTVVGSLLIVVGLYAVLWGKTKEV-KLQQHIEMAAAAEAKLDDCNNKDLEEQSY 362
           +EKLY GT +GS L+V+GLY VLWGK +EV + ++  E       K+   +N+D+E +  
Sbjct: 302 EEKLYTGTFMGSALVVIGLYGVLWGKDREVSEKEEEREKVKQQNHKVKSESNEDIESRLP 361

Query: 363 VVSN 365
           V S+
Sbjct: 362 VASS 364

BLAST of ClCG01G010790 vs. TAIR10
Match: AT5G07050.1 (AT5G07050.1 nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 261.5 bits (667), Expect = 7.6e-70
Identity = 137/327 (41.90%), Positives = 210/327 (64.22%), Query Frame = 1

Query: 9   PPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKIT 68
           P   M+ +QF YAG+NI +K+++  GM+  VLV YR   AT  IAP A++FERK +PKIT
Sbjct: 18  PYFAMISLQFGYAGMNIITKISLNTGMSHYVLVVYRHAIATAVIAPFAFFFERKAQPKIT 77

Query: 69  KPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIK 128
             I  Q+ +  L G   +Q FYY+GLKY++PT +CA++N+LPA TF+LAVLFR E + +K
Sbjct: 78  FSIFMQLFILGLLGPVIDQNFYYMGLKYTSPTFSCAMSNMLPAMTFILAVLFRMEMLDLK 137

Query: 129 TSPGAAKVMGTVVCVGGAMLLSFYRGQTIQL-GESGIHWKYAELMRGGSSTNQGSS---I 188
                AK+ GTVV V GAML++ Y+G  ++L     +H + +      SS N  S    +
Sbjct: 138 KLWCQAKIAGTVVTVAGAMLMTIYKGPIVELFWTKYMHIQDSSHANTTSSKNSSSDKEFL 197

Query: 189 WGSLCLIISSVAWAAWFVIQARVNEKFPAPYTS-TALMTFMATIQCGAIAVGVEHKTAAA 248
            GS+ LI +++AWA+ FV+QA++ + +     S T L+ F+ T+Q  A+   +EH   +A
Sbjct: 198 KGSILLIFATLAWASLFVLQAKILKTYAKHQLSLTTLICFIGTLQAVAVTFVMEH-NPSA 257

Query: 249 WSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQE 308
           W +   + L+ A Y+G+  S +++ +    ++K+GP++A+ FSP ++VIVA+       E
Sbjct: 258 WRIGWDMNLLAAAYSGIVASSISYYVQGIVMKKRGPVFATAFSPLMMVIVAVMGSFVLAE 317

Query: 309 KLYVGTVVGSLLIVVGLYAVLWGKTKE 331
           K+++G V+G++LIV+GLYAVLWGK KE
Sbjct: 318 KIFLGGVIGAVLIVIGLYAVLWGKQKE 343

BLAST of ClCG01G010790 vs. TAIR10
Match: AT3G30340.1 (AT3G30340.1 nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 240.0 bits (611), Expect = 2.4e-63
Identity = 111/322 (34.47%), Positives = 202/322 (62.73%), Query Frame = 1

Query: 11  LVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKITKP 70
           L+M ++    + +N+  K  ++ G+N +V   YR    T+ + P A + ER  RPK+T  
Sbjct: 13  LMMSMINIGLSVVNVMFKKMIDEGLNRMVATTYRLAVGTLFLIPFAIFLERHNRPKLTGR 72

Query: 71  ILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIKTS 130
           IL  +   +L G +  Q F+ +GL+Y++ T + A +N++P+ TF LA++FRQE++ IK++
Sbjct: 73  ILCSLFFSALLGTSLVQYFFLIGLEYTSSTFSLAFSNMVPSVTFALALVFRQETLNIKSN 132

Query: 131 PGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSIW--GSL 190
            G AK++GT++C+ GA++L+ Y+G  +    S  H  + E      ST   +  W  GS+
Sbjct: 133 VGRAKLLGTMICICGALVLTLYKGTAL----SREHSTHMETHTRTDSTGAMTQKWAMGSI 192

Query: 191 CLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAWSLKS 250
            L+IS + W++WF++QA+++  +P  YTST +++F   IQ   +++ +  ++ + W +K 
Sbjct: 193 MLVISIIIWSSWFIVQAKISRVYPCQYTSTTILSFFGVIQSALLSL-ISERSTSMWVVKD 252

Query: 251 SIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEKLYVG 310
             +++  LY+G+  SG+ +   SW ++++G ++ S F P + V  AIFS++F  E++Y G
Sbjct: 253 KFQVLALLYSGIVGSGLCYVGMSWCLRQRGAVFTSSFIPLIQVFAAIFSFSFLHEQIYCG 312

Query: 311 TVVGSLLIVVGLYAVLWGKTKE 331
           +V+GS++I+VGLY +LWGK+K+
Sbjct: 313 SVIGSMVIIVGLYILLWGKSKD 329

BLAST of ClCG01G010790 vs. TAIR10
Match: AT2G39510.1 (AT2G39510.1 nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 239.6 bits (610), Expect = 3.1e-63
Identity = 122/323 (37.77%), Positives = 199/323 (61.61%), Query Frame = 1

Query: 9   PPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRPKIT 68
           P + ++ +QF YAGL+I +K A+  GM+P VL +YR + ATI IAP AY+ +RK RPK+T
Sbjct: 8   PFITVVSLQFGYAGLSIIAKFALNQGMSPHVLASYRHIVATIFIAPFAYFLDRKIRPKMT 67

Query: 69  KPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESVGIK 128
             I F+ILL  L   T +Q  YY G+KY++ T   A+TNVLPA  F++A +FR E V +K
Sbjct: 68  LSIFFKILLLGLLEPTIDQNLYYTGMKYTSATFTAAMTNVLPAFAFIMAWIFRLEKVNVK 127

Query: 129 TSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSIW-GS 188
                AK++GT+V VGGAML++  +G  I L  +  H    ++ +  S+T     +  G+
Sbjct: 128 KIHSQAKILGTIVTVGGAMLMTVVKGPLIPLPWANPH----DIHQDSSNTGVKQDLTKGA 187

Query: 189 LCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAWSLK 248
             + I  + WA +  +QA   + +P   + TA + F+ +I+   +A+ +E    +AW++ 
Sbjct: 188 SLIAIGCICWAGFINLQAITLKSYPVELSLTAYICFLGSIESTIVALFIERGNPSAWAIH 247

Query: 249 SSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEKLYV 308
              +L+ A+Y GV CSG+ + +    ++ +GP++ + F+P  +VIVAI       E +++
Sbjct: 248 LDSKLLAAVYGGVICSGIGYYVQGVIMKTRGPVFVTAFNPLSMVIVAILGSIILAEVMFL 307

Query: 309 GTVVGSLLIVVGLYAVLWGKTKE 331
           G ++G+++IV+GLY+VLWGK+K+
Sbjct: 308 GRILGAIVIVLGLYSVLWGKSKD 326

BLAST of ClCG01G010790 vs. TAIR10
Match: AT4G01440.1 (AT4G01440.1 nodulin MtN21 /EamA-like transporter family protein)

HSP 1 Score: 235.3 bits (599), Expect = 5.9e-62
Identity = 119/355 (33.52%), Positives = 207/355 (58.31%), Query Frame = 1

Query: 5   GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGR 64
           G + P ++M+++       N   K  ++ G+N +V+  YR   +T+ +AP A+++ERK R
Sbjct: 6   GKWTPVIIMVMINSALGLANALVKKVLDGGVNHMVIATYRLAISTLFLAPIAFFWERKTR 65

Query: 65  PKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQES 124
           P +T  IL Q+   +L GA+  Q F+ +GL Y++ T+ACA  ++ PA TFV+A++FR E 
Sbjct: 66  PTLTLNILVQLFFSALVGASLTQYFFLLGLSYTSATLACAFISMTPAITFVMALIFRVEK 125

Query: 125 VGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTI-QLGESGIHWKYAELMRGGSSTNQGSS 184
           + +K+  G   VMG ++C+GGA+LL+ Y+G  + +L +   H    +L+    +    + 
Sbjct: 126 LNMKSKAGMGMVMGALICIGGALLLTMYKGVPLTKLRKLETH----QLINNNHAMKPENW 185

Query: 185 IWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAA 244
           I G + L   S  + +W +IQA+VNEK+P  Y+ST +++F  TIQC  +++ ++ +   A
Sbjct: 186 IIGCVLLFAGSSCFGSWMLIQAKVNEKYPCQYSSTVVLSFFGTIQCALLSL-IKSRDITA 245

Query: 245 WSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQE 304
           W L   + +V  +YAG    G+    TSW I+K+GP++ S+F+P  L+   +F +     
Sbjct: 246 WILTDKLDIVTIVYAGAVAQGICTVGTSWCIRKRGPIFTSIFTPVGLIFATLFDFLILHR 305

Query: 305 KLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQ 359
           ++++G+VVGS +++ GLY  L GK + +K +         E KL    N+D +E+
Sbjct: 306 QIFLGSVVGSGVVIFGLYIFLLGKVRLMKEE--------CEKKLPCRFNEDDQEE 347

BLAST of ClCG01G010790 vs. NCBI nr
Match: gi|449443520|ref|XP_004139525.1| (PREDICTED: WAT1-related protein At1g09380-like [Cucumis sativus])

HSP 1 Score: 703.4 bits (1814), Expect = 2.2e-199
Identity = 356/371 (95.96%), Positives = 363/371 (97.84%), Query Frame = 1

Query: 1   MAAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60
           MAA GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE
Sbjct: 1   MAAAGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60

Query: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120
           RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF
Sbjct: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120

Query: 121 RQESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQ 180
           RQESVGIKTSPGAAKV+GTVVCVGGAMLLSFYRGQTI+LG+SGIHWKYAELMRG SS+NQ
Sbjct: 121 RQESVGIKTSPGAAKVIGTVVCVGGAMLLSFYRGQTIELGKSGIHWKYAELMRGESSSNQ 180

Query: 181 GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT 240
           GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT
Sbjct: 181 GSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKT 240

Query: 241 AAAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAF 300
            AAWSLKSSIRLVGALYAGVACSG+AFCLTSWSIQK+GPLYASVFSPFLLVIVAIFSWAF
Sbjct: 241 LAAWSLKSSIRLVGALYAGVACSGMAFCLTSWSIQKRGPLYASVFSPFLLVIVAIFSWAF 300

Query: 301 FQEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNK-DLEEQS 360
           FQEKLYVGTVVGSLLIVVGLY+VLWGKTKEVKLQQHIEM AAAEAKLDD NNK DLEEQS
Sbjct: 301 FQEKLYVGTVVGSLLIVVGLYSVLWGKTKEVKLQQHIEMTAAAEAKLDDYNNKEDLEEQS 360

Query: 361 YVVSNATILHK 371
           YVVSNA I HK
Sbjct: 361 YVVSNANIPHK 371

BLAST of ClCG01G010790 vs. NCBI nr
Match: gi|659127231|ref|XP_008463595.1| (PREDICTED: WAT1-related protein At1g09380-like [Cucumis melo])

HSP 1 Score: 699.5 bits (1804), Expect = 3.1e-198
Identity = 354/370 (95.68%), Positives = 359/370 (97.03%), Query Frame = 1

Query: 2   AAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFER 61
           AA  DFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFER
Sbjct: 3   AAAADFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFER 62

Query: 62  KGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFR 121
           KGRPKITKPILFQILLCSLTGAT NQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFR
Sbjct: 63  KGRPKITKPILFQILLCSLTGATGNQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFR 122

Query: 122 QESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQG 181
           QESV IKTSPGAAKV+GTVVCVGGAMLLSFYRGQTI+LGESGIHWKYA LMRGGSS+NQG
Sbjct: 123 QESVRIKTSPGAAKVIGTVVCVGGAMLLSFYRGQTIELGESGIHWKYAGLMRGGSSSNQG 182

Query: 182 SSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTA 241
           SSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVG+EHKT 
Sbjct: 183 SSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGIEHKTL 242

Query: 242 AAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF 301
           AAWSLKSSIRLVGALYAGVACSG+AFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF
Sbjct: 243 AAWSLKSSIRLVGALYAGVACSGMAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFF 302

Query: 302 QEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNK-DLEEQSY 361
           QEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEM AAAEAKLDD NNK DLEEQSY
Sbjct: 303 QEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMTAAAEAKLDDYNNKEDLEEQSY 362

Query: 362 VVSNATILHK 371
           VVSN  ILHK
Sbjct: 363 VVSNGNILHK 372

BLAST of ClCG01G010790 vs. NCBI nr
Match: gi|595841545|ref|XP_007208228.1| (hypothetical protein PRUPE_ppa018179mg, partial [Prunus persica])

HSP 1 Score: 449.9 bits (1156), Expect = 4.3e-123
Identity = 229/362 (63.26%), Positives = 278/362 (76.80%), Query Frame = 1

Query: 1   MAAGGDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFE 60
           MA  GD+LP L M++VQ  YAG+NI SKLA+E  MNPLVLVAYRQ+FAT++IAP AYW E
Sbjct: 1   MAGYGDYLPFLAMVLVQMSYAGMNIISKLAIESDMNPLVLVAYRQVFATLSIAPFAYWME 60

Query: 61  RKGRPKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLF 120
            K RP+IT PILFQ  LCSLTGATANQVFY+VGLK STPTIACALTN LPA TF+LA++F
Sbjct: 61  WKTRPRITMPILFQTFLCSLTGATANQVFYFVGLKTSTPTIACALTNTLPAMTFILALIF 120

Query: 121 RQESVGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELM-RGGSSTN 180
           RQES  IK+ PG +KVMGTVVCV GAMLLSFY G  I LGES IHW YA+ M    +S++
Sbjct: 121 RQESAKIKSKPGLSKVMGTVVCVSGAMLLSFYHGHIIGLGESKIHWAYAQRMGEQANSSS 180

Query: 181 QGSSIWGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHK 240
            GSS  G LC+IIS++ WA WF+IQA+V E FPAPYTST LM  MA+ +CG IAV  +HK
Sbjct: 181 NGSSFVGPLCVIISTLGWAFWFIIQAKVGENFPAPYTSTTLMCLMASFECGIIAVIADHK 240

Query: 241 TAAAWSLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWA 300
             +AWSLK+ +RL+ ALY G+  S +AF L+SWSIQ+KGPLY SVFSP LL+IVAI SWA
Sbjct: 241 -VSAWSLKNPMRLISALYCGILGSALAFFLSSWSIQRKGPLYVSVFSPLLLIIVAISSWA 300

Query: 301 FFQEKLYVGTVVGSLLIVVGLYAVLWGKTKEVKLQ---QHIEMAAAAEAKLDDCNNKDLE 359
             +EKLY+GT +GS+LIV GLY VLWGK KE +++   +  +   A  AK  + N+ +L+
Sbjct: 301 LLEEKLYLGTAIGSILIVCGLYLVLWGKNKETEVEKPTKETDTTKADHAKEYERNDLELQ 360

BLAST of ClCG01G010790 vs. NCBI nr
Match: gi|743890239|ref|XP_011038989.1| (PREDICTED: WAT1-related protein At1g09380 [Populus euphratica])

HSP 1 Score: 449.1 bits (1154), Expect = 7.3e-123
Identity = 225/360 (62.50%), Positives = 274/360 (76.11%), Query Frame = 1

Query: 6   DFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGRP 65
           D LP L M +VQF YAG+NITSKLAM+ GM PLVLV YRQ+FATIA+ P AY+FE + RP
Sbjct: 5   DVLPFLAMAIVQFGYAGMNITSKLAMDSGMKPLVLVGYRQIFATIAMVPFAYFFEWRTRP 64

Query: 66  KITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQESV 125
           KIT  +L QI +CSLTG T NQVFY++GL+ STPTI CALTN+LPA TF+LAVLFRQESV
Sbjct: 65  KITMSLLLQIFICSLTGVTGNQVFYFIGLENSTPTIGCALTNILPAVTFILAVLFRQESV 124

Query: 126 GIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSS-I 185
           GIK + G AK++GT+VCVGGAMLLSFY G  I +GES IHW YA+   G S+TN+ S+ +
Sbjct: 125 GIKKASGQAKLLGTIVCVGGAMLLSFYHGHMINIGESSIHWNYAD-STGNSTTNKKSNFV 184

Query: 186 WGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAW 245
            GSLC+I S+++WA WF +QA+V+ KFPAPYT T LM FM +I+C  I +G  HK  + W
Sbjct: 185 LGSLCIIASAISWAIWFTVQAKVSLKFPAPYTCTLLMCFMGSIECVVIGIGANHK-VSEW 244

Query: 246 SLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEK 305
           SL+S  RL+ ALYAG+ CS +AF LTSWSIQ+KG LY SVFSP LLVIVA+ SWA   EK
Sbjct: 245 SLRSPGRLIAALYAGIVCSALAFSLTSWSIQRKGALYVSVFSPLLLVIVAVLSWALLHEK 304

Query: 306 LYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEAKLDDCNNKDLEEQSYVVSN 365
           +YVGT VGS+LIV GLYAVLWGK KE+K  + IE     +    + NN DLE Q + +SN
Sbjct: 305 IYVGTAVGSILIVAGLYAVLWGKDKELK--EEIEETKVMKLGNKEWNNHDLELQLHAISN 360

BLAST of ClCG01G010790 vs. NCBI nr
Match: gi|590644918|ref|XP_007031215.1| (Nodulin MtN21 /EamA-like transporter family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 447.2 bits (1149), Expect = 2.8e-122
Identity = 229/364 (62.91%), Positives = 279/364 (76.65%), Query Frame = 1

Query: 5   GDFLPPLVMLVVQFLYAGLNITSKLAMEFGMNPLVLVAYRQMFATIAIAPCAYWFERKGR 64
           GDF+P L  ++VQ  YAG+NITSKLAME GM PL+LVAYRQ+FAT+AIAP AY+ ERK R
Sbjct: 3   GDFVPFLANVLVQLGYAGMNITSKLAMESGMKPLILVAYRQIFATLAIAPFAYFLERKTR 62

Query: 65  PKITKPILFQILLCSLTGATANQVFYYVGLKYSTPTIACALTNVLPAATFVLAVLFRQES 124
           PKITK ILFQI LCSLTGATANQVFY+VGL+ S+ T+ACAL NVLPAATF LA L RQE+
Sbjct: 63  PKITKHILFQIFLCSLTGATANQVFYFVGLENSSATVACALNNVLPAATFALAALCRQEA 122

Query: 125 VGIKTSPGAAKVMGTVVCVGGAMLLSFYRGQTIQLGESGIHWKYAELMRGGSSTNQGSSI 184
           VGIK + G AKV+GT+VCVGGAMLLSFY G TI +G+S IHW YA+ M   SS+N  +  
Sbjct: 123 VGIKKASGQAKVLGTLVCVGGAMLLSFYHGHTIGIGDSSIHWAYADKMTSKSSSNGSNFF 182

Query: 185 WGSLCLIISSVAWAAWFVIQARVNEKFPAPYTSTALMTFMATIQCGAIAVGVEHKTAAAW 244
            G   ++ S+VAWA W +IQ + ++ FPAPYT TALM FMA+I+C  I +  +HK  +AW
Sbjct: 183 LGPFLVMASAVAWAVWLIIQGQTSKNFPAPYTCTALMCFMASIECTIIGIFSDHK-ISAW 242

Query: 245 SLKSSIRLVGALYAGVACSGVAFCLTSWSIQKKGPLYASVFSPFLLVIVAIFSWAFFQEK 304
           SL SS+RL+ ALYAG+ C+ + FC+ SWSIQKKGPLY SVFSP LLVIVA+ SWA  +EK
Sbjct: 243 SLSSSMRLIAALYAGIVCNAMTFCVLSWSIQKKGPLYVSVFSPLLLVIVAVLSWALLREK 302

Query: 305 LYVGTVVGSLLIVVGLYAVLWGKTKEVKLQQHIEMAAAAEA-KLDDCNNK-DLEEQSYVV 364
           LYVGTVVGS+LIV GLYAVLWGK +E+K  +  E+  A EA K  + + K DLE Q +  
Sbjct: 303 LYVGTVVGSVLIVGGLYAVLWGKDREIKQMKSNELETAEEATKAREKDGKDDLELQLHPQ 362

Query: 365 SNAT 367
           +  T
Sbjct: 363 TKGT 365

The following BLAST results are available for this feature:
Match Name    E-value     Identity (%)    Description
WTR2_ARATH    5.1e-116    57.97           WAT1-related protein At1g09380 OS=Arabidopsis thaliana GN=At1g09380 PE=2 SV=1
WTR38_ARATH   1.4e-68     41.90           WAT1-related protein At5g07050 OS=Arabidopsis thaliana GN=At5g07050 PE=2 SV=1
WTR24_ARATH   4.2e-62     34.47           WAT1-related protein At3g30340 OS=Arabidopsis thaliana GN=At3g30340 PE=2 SV=1
WTR14_ARATH   5.5e-62     37.77           WAT1-related protein At2g39510 OS=Arabidopsis thaliana GN=At2g39510 PE=2 SV=1
WTR29_ARATH   1.0e-60     33.52           WAT1-related protein At4g01440 OS=Arabidopsis thaliana GN=At4g01440 PE=2 SV=1

Match Name          E-value     Identity (%)    Description
A0A0A0LW49_CUCSA    1.5e-199    95.96           WAT1-related protein OS=Cucumis sativus GN=Csa_1G181350 PE=3 SV=1
M5W6P9_PRUPE        3.0e-123    63.26           WAT1-related protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018179mg PE=3 SV=1
A0A061F4Z0_THECC    1.9e-122    62.91           WAT1-related protein OS=Theobroma cacao GN=TCM_026803 PE=3 SV=1
A0A067JUZ0_JATCU    1.1e-120    63.56           WAT1-related protein OS=Jatropha curcas GN=JCGZ_23192 PE=3 SV=1
K7MZS7_SOYBN        2.6e-119    60.39           WAT1-related protein OS=Glycine max GN=GLYMA_19G227000 PE=3 SV=1

Match Name     E-value     Identity (%)    Description
AT1G09380.1    2.9e-117    57.97           nodulin MtN21 /EamA-like transporter family protein
AT5G07050.1    7.6e-70     41.90           nodulin MtN21 /EamA-like transporter family protein
AT3G30340.1    2.4e-63     34.47           nodulin MtN21 /EamA-like transporter family protein
AT2G39510.1    3.1e-63     37.77           nodulin MtN21 /EamA-like transporter family protein
AT4G01440.1    5.9e-62     33.52           nodulin MtN21 /EamA-like transporter family protein

Match Name                          E-value     Identity (%)    Description
gi|449443520|ref|XP_004139525.1|    2.2e-199    95.96           PREDICTED: WAT1-related protein At1g09380-like [Cucumis sativus]
gi|659127231|ref|XP_008463595.1|    3.1e-198    95.68           PREDICTED: WAT1-related protein At1g09380-like [Cucumis melo]
gi|595841545|ref|XP_007208228.1|    4.3e-123    63.26           hypothetical protein PRUPE_ppa018179mg, partial [Prunus persica]
gi|743890239|ref|XP_011038989.1|    7.3e-123    62.50           PREDICTED: WAT1-related protein At1g09380 [Populus euphratica]
gi|590644918|ref|XP_007031215.1|    2.8e-122    62.91           Nodulin MtN21 /EamA-like transporter family protein isoform 1 [Theobroma cacao]
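
Hit summaries like these are usually ranked by E-value (smaller is better) and sometimes filtered by percent identity. The sketch below, which is illustrative and not part of the original page, does both for the five Arabidopsis (AT...) hits listed above. The names `rank_by_evalue` and `filter_by_identity` and the 40% cutoff are assumptions chosen for the example, not thresholds used by the database.

```python
# (match name, E-value as printed, percent identity), copied from the
# Arabidopsis AT... rows of the summary above.
TAIR10_HITS = [
    ("AT1G09380.1", "2.9e-117", 57.97),
    ("AT5G07050.1", "7.6e-70", 41.90),
    ("AT3G30340.1", "2.4e-63", 34.47),
    ("AT2G39510.1", "3.1e-63", 37.77),
    ("AT4G01440.1", "5.9e-62", 33.52),
]


def rank_by_evalue(hits):
    """Return hits sorted from most to least significant (ascending E-value)."""
    return sorted(hits, key=lambda hit: float(hit[1]))


def filter_by_identity(hits, min_identity):
    """Keep hits whose percent identity is at least min_identity."""
    return [hit for hit in hits if hit[2] >= min_identity]


ranked = rank_by_evalue(TAIR10_HITS)
best_hit = ranked[0][0]

# Hypothetical cutoff, for illustration only.
confident = [name for name, _evalue, _identity in filter_by_identity(ranked, 40.0)]

for name, evalue, identity in ranked:
    print(f"{name}\t{evalue}\t{identity:.2f}")
```

Note that ranking by E-value and ranking by identity need not agree: AT3G30340.1 has a smaller E-value than AT2G39510.1 but a lower percent identity.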
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR000620    EamA_dom
Vocabulary: Cellular Component
Term          Definition
GO:0016021    integral component of membrane
GO:0016020    membrane
Vocabulary: Molecular Function
Term          Definition
GO:0022857    transmembrane transporter activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055085 transmembrane transport
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005886 plasma membrane
molecular_function GO:0022857 transmembrane transporter activity
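
The GO assignment rows above have a regular shape (category, accession, term name), so they can be grouped by category with a few lines of code. The following is a small sketch under that assumption. The function name `group_go_terms` is hypothetical, and the rows are copied verbatim from the table.

```python
from collections import defaultdict

# Rows copied from the GO assignment table: category, accession, term name.
GO_ROWS = """\
biological_process GO:0055085 transmembrane transport
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005886 plasma membrane
molecular_function GO:0022857 transmembrane transporter activity
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split only on the first two gaps.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)
for category, terms in go_by_category.items():
    print(category, [accession for accession, _name in terms])
```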

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG01G010790.1    ClCG01G010790.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term     IPR Description     Source     Source Term       Source Description                              Alignment
IPR000620    EamA domain         PFAM       PF00892           EamA                                            coord: 185..324, score: 9.1E-18; coord: 11..150, score: 4.8
None         No IPR available    unknown    Coil              Coil                                            coord: 333..353, scor
None         No IPR available    PANTHER    PTHR31218:SF11    SUBFAMILY NOT NAMED                             coord: 1..358, score: 9.9E
None         No IPR available    unknown    SSF103481         Multidrug resistance efflux transporter EmrE    coord: 242..328, score: 3.53E-10; coord: 43..152, score: 2.2

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                     Block
ClCG01G010790    Cucurbita maxima (Rimu)      cmawcgB341
ClCG01G010790    Cucurbita moschata (Rifu)    cmowcgB345
ClCG01G010790    Silver-seed gourd            carwcgB0053