Sgr021434.1 (mRNA) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021434.1
Type: mRNA
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: WAT1-related protein
Location: tig00153699: 183655 .. 195228 (+)
Sequence length: 933
RNA-Seq Expression: Sgr021434.1
Synteny: Sgr021434.1
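
The Location field lists a contig name, a start and end coordinate, and a strand. A minimal sketch of parsing it and computing the genomic span, assuming the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page); `parse_location` and `genomic_span` are hypothetical helper names:

```python
import re

# Pattern for a location string such as "tig00153699: 183655 .. 195228 (+)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (contig, start, end, strand) from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def genomic_span(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153699: 183655 .. 195228 (+)")
print(contig, strand, genomic_span(start, end))
```

Under that assumption the feature spans far more genomic sequence than the 933 nt listed as its sequence length, which is consistent with the long introns in the gene sequence.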
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAGGAGCTGGTTCTACAAGGACATTGTGCCATTCGCCGCCATGGTGGCGGCGGAGTGCGCCACCGTGGGCTCCAATACGGGCTTCAAAGCCGCCACCGCCCGAGGCATGAGCTACTACGTCTTCACCCTCTACGTCTGCGTCGCCGCCGCCGTAGCCCTCCTTCCATTCGCCCTCATCTTCCGCAGGTCTCATCCATCGATCTCTAATCAATCAATTTCCCCCATCCCCTTTCACCACCGTCTTTTTCTGCAGGTCGCCGGAACTTCCTCCCCACAAGATATCCTTCTTCCTCAGAATCGTCTGCCTCTCAGCGCTAGGGTAAGACAAATTTGATCCCTCTTTCACATTTCTAAAACTTCGACGACAACGAGATTTACAGAGGAAGCCCTAATTGGTTGGAAATTGGAATTATTGCCAAGGCTGTCTTGCCAGTTGCTTGGGAATAAAGGTCTCCAATACAGCTCGCCGACTCTTTCTTCCGCCATTAGCAACCTCATCCCAGCTTTCACTTTCATTCTCGCCGTCGTTTTCCGGTAAGCTTTCATTTTCTCCTCCCCCTGAACAAAACTTCTATGGATAAGCTTCTTGTTCTATCACTCCTCATATCAGATAAGATCTAATGCTTAATTTCTCCAAAATTAACTACTTATTTTATTACGTTTGAGTGATTTTTTTCGATTTAGTCCTTGTTATCTTAAAAATTTTAATTTAATTTTTTTGGTTTTAATAATTTTTTAATTATATTATTTTTATTAGTAAATATTCAAATTGATTGATGATAAAGTGTATTTCATAAAAATTTAAAGAGAATAAATTAGACCAAGAGAGATAAATTTTTAATTCTTCTTTCAAATTTTCTACAAATAAATGAAAAATTAATGAGAATAAAATAGTTGTTTGTATAAAAAGTTTTGAGTGAATATAACATTTGAAAGATTCAAGATAAAAACAAAACATTTAAAAACTTGAAAGACTAAAAAAGAATTAAAACTTTATTTTATTGTTTTTCTTTACTTTATATATATCCTTTTAAATCATATTTTGGTTCCAAATTTTTATGTTTTTACTATTTTACTCTTAAAACGTTTATGAATTTCATGTTGATTTTTAAATTTTTATGTTTATTTTATTTTAGTTCTTAAATTTTTAAATATTTTATTTTAATCTCTAAATTTTAAATAAAAAATTATTTTAATTTTTTTTATTAATTTTTTTAATAACTATTTAGTTGAAACTTAATATGGCACCTATTTTCTTTGTTTACCTCTTGGGTAGATGTTTATATATATATAAATATTTGAAGTCACATATGATCATGCGTGGTTTCAAAATATATTTAAATTTATGTTAAATGATTAATAAAAGATTAACATGGAAGACGAAAATAATCATTTATATAGAGTTAAGGGACTGAAATAAAAAATTTAAGAGTTGGTGGACTAAAAAGAAAAGTTGAGGGGCTTAAGGACCAAAACACACACAAATATATTTGGTGTTTTTTTATTAGAGTTTTTATTTTTCTAAAAAATAAAATATCTTCATAAACCTATCTTTTATAAATTATGTCACTTTTGCGGCCCCTATGAATAAAAAATTTTAAAAAAATCCAAAGTGAATCTTTTGATACGATCTACTTGAAAATCAAATCAAATGATATAATTATTAATTATAAATTAATAGAATTAAAAAATCTAGATAAATATCAAATGTTTGAACATTGGATGAACCGAATCTAACTTTTTTTAAAGAAAAATAAATCTGCTTACATCACACTTATAATTTACTAGTTTGTTACAACGTCTTATGGACGTTTGTATCTTTTAATTTATTTTATTTTATAATAAAATAAAATTAAAATATGATCGTTAAAAAATAAATAATTTGATGAAACTAATATATGTTTTAAATTTATAATGAGTTATATATATATATATATATATATTTCAAATAACATGAGTATTTTGTCATGTAGTTTTTACAATGATATAAATTAT
TGTAGAAAATGAGTAGACGTGAGTTATTTTATAATAGTAGAAAATAAAGAAGTTGCAAAATTTTTAAGATTTTAAGACTTTTGATCTTCTAATAATTAAATATAATTAAAGGTAATAAATTTTGAACTAATTTATGGAAACTAAAAGAAAATTTTTCATCTTCTATCATTTTAAATGACGCTTTATGATTTAAATTAAATATATAATTTATTAATTATTGATAATTAATTGGAAGTGACATTTTTGTAAATTTAAAATCTGATAGAGGGACAAAATGGTTCAAAAATGATTCTCCATCTTTCTCATTTTTAATATAGTAAAGATTATAGATTATAGATTATAAAGAATATGTTTGTTAAAAATTATTAATTTATTGGTTTTATTATTATTTTTTTAAAATACTTTATATATTCTTTTTTTAAAATTTCTAAATACTTTATATTGAGTAATTAATTTTAAGGCATTTTCAGAATTTATGGAGTTGTATATTTGGAATTAATAAATGATGTATGAAATTTATTAATTATTAATTTTTAAAGAATAAAAGGGGCGAGATGGTCAAATGAAAATGATATGTTTGTAATTCATGAAGAACACAAAGGGTAAGATGTTCTAAAAGTAAATAGTATATTTGTAATTCACTAAAAATAAAAAGGTAAAATGGTCTTAAAATGATTTTTAAAGATCCGTACTTTTTAATATATTATAGATATAATATTTTTGGTTCTTAAACTATTCGGTATTTTTTTTAATCTGGTCTTTGTTTTATTTATTTATTTATTTATTTTTATAGAATATTGATCTTAGATTATTGTTTAATTATAAATCGATTCTTTTGTTAAATAAGTTTAATTTATGTGTCAACCTGTGCATGATGTAATATTTTTAAAGAAGAAATTTTTTTAATTTATATGTGGTTAATTTAATATTATCATATCAATTATTTCACATGGATGACATAATGATGAAATTTAAATTAAATAATAATAATGACAACTTTCTAGAGGTTGCTTTGCTTGCAACCTCCACTTTGTTTATTTGCTCCATTATTATTTTATGGAAAATAATTGATAATTTTTAATACATTAAGAATAAAATTAAGAATTTTACAAGATTAAAGATCAATTTGATAATTATCTAAAAGATTGAGAGACTATTTTGTATATTTAATCTATTTTTGAATGTTTGATATTTATTTCACTCATATCTAATTTAAAATTTCAACTAACCTCACTCGAGTTAAACATTCACTCGACCCACATCATATTCACTTTCTCCACTATTCACATCTCAAGTAATTAAAAAAAAAAAAGAAAAAGAAGAAATGATGGCATGACTTCATGGGCAAATGAAAAATTCTTGGTGGGTTTTTTTTTTTTAGTTCAACAAATGTAGGGTGGAGATCAAATTTTTAATTTTAAAGAAGGTAATAGGTGCTTTTATCTATTAAGCTAAGTTTGGATTTGACAATCTTTTGTGGGTTTTAATTTAGGGTTGGTTGGATTTTAAAATTGAGTAGAGTACCTATTTAATATTTTCAAAAATGTGAGCGCTCATGAAATAATGTTTTGTTGATGGCATGTCGTTAATCTTTAGTTAAAAATTTTAGTAGCGTTATTGAATAGGGATTTAATTAAATTATAAATAAGAGACCAAGGGTAATGTAACCCTTTATAATTTCTTCAATATTATAGGAATGGAACTTTAAAAATCATAGTAAAAAATGAAAAGAAAATTGAAACATCTTTGGTCTTATTCAAAAATTGACTAAAACTGTAGGGAAATTTTAATGGGTGGATAATTTTCTAAGTCATTACAAATATGTATTTTTATAAAAGTAATTACAATTATAGAATATTTTTAAAAAATATATAATAATTTTTTAAAAATGTATTAATCACTGAAATATCTAAACGTATATATTGATAACAATAAATGATATATCAATATATTAATTAACTATACTCATTGATATATCTATCGATTTTCTTTTTTAAAGA
AATTATATCATAACTACGGGTGGAGATTCAAATCTACAACTTTTTAAAGGAAGTAGAAATTTTTTAATCACTGATTTATGCTCAAGTTGGTGATACACATTCGAATTGATTAATTATAATCACTGATATATCATTTGTATACATTGATTAAATGTAACATTAATATATGTGTGTTCTAATTTTTGCTATATTTGCAAATATTTTTTTAGGTTCTATTTTTTTAATATATTATGTAAATTATGATAGATTTGTATTTACTCCAAAACTCTATGTTATAATGTAAAAAACAATTAATCGTATTTTCGTCCCTCAACTTTCCTACTTTCAAAAGTTCAATTTTAATCTGCAAATTTTTATAAAAACTTGTTCGAATCTTAATCATTACGGCTCTATTTGATAACTATTTGGTTTTTCATTTTTAGTTTTTGAAAAGTAACACTACTCTCATGAATGTGTTTCTTTGTTTTGTTATCTAATTTTTTTATAAATGTTCTCAAATTTTAAGCCAAAATTTAAAAACTAAAAGAATGTAGTTTTTAAAAATTTGTTTATGTTTTTTAAATTTAACTAAGAACTCGAGTGTGTTTTTAAGAAAGGTGAAGAATATATCAAAGAAATTGTGAGAAAGAAAACACAATTTTTAAAAGAAAAAAACTAAAAATTAAGGACCCGTTTGATAACCATTTCTGTTTTCTGTTTTTGGTTTTTGGTTTTTGGTTTTTTGTTTTTTGTTTTTAGTTTTCTGTTTTTAAAAAATAAAAACCATACATATTTGATAACTGTTTTTTGTTTTTTGTTTTTTAAAAACAAAAAACCAAACATATTTGATAACTGCTTTTGGTTTTCTGTTATTTAAAAAGTTCGAGAGAGAGAAAAATATATGGAGGCAGGAAACTAAAAGTTTAGAGAGAAAAGTTAAAGAGAGAAATTTTAGAGAGTAACTTTGGAGAGAAAAAGAGAGATTTAGAGAGAGAAAAGTTAGAGAGAGAATAGTAGTTAGAGAAAAAGAAAAAAATGTTAGAGAGAAAGAAAGAAATTTTAGAGAGAGAAAACGAGAAATTTTATAGAGAGAAAAGTTAGAGAGAGAGTAAAAGACATGTTAGAGAAAAAAAGATAAATTAGAAAGAAAAGTTGTAGAGGGAGAAATTTTAGAGAGAGAGATGTTTTTTTTCAGATAGAGAGAGAAATATTAGAGAGAGAAATTTTAGAGAGAAGAAAGTTTGGGAAAAATATTAAAATTTTAGAGAAAGAGAAAAATTTAGAGAGGAAAAAAATTTAGAGAGAGAAAAGTTAGAGATATAAAAGTTAGAGAGAAAAATAAATGAGAGAAAGAAAGAAAAGTTATAGAGGCATAATTTTTAGAGAGAAAGTTTGAAAAAAATATTAAAATTTTAGAGAGAGGGAAGAGTGAGAGAAAAATAAATTAGAGAGAGAAAAAAGTTACACAAGGAAAAAATTTAGAGTGAGAATGTTGTCGAAAAATATTAAAATTTTAGAGAGAGAAAAGTGAGAGAGATATGTTAGAAGAGAAATAAATAAAAGTAATATAGAAATAAAAATTTTTCAAAGAGAGAAAATTTTTAGATAGAGAAATTTTAGAGAAGAAAAAAATTTAGAGATAGAAAAAAATTGCTAGAGAAAGATATGTCAAAGAGAAAAAAAAAAGAGATTTTGAAAAACAAAAATCTATTTTTAAAAACTACTTTTTTTAGTTTTTGAAATTTACTTATAATTTCAAAAATGTTTTTTAAAAACAGAAAACAGAAAACAGAATTGGTAATCAAACAACCCTGTTTTTTAAATATAAAAAACAAAAACTAAAAACAGAAAACAAAATAGTTATAAAACGGCCTTTAAATTAGTATCAATTGGGACCTAATTTTTATATTGTGTCTAATATAATTTTTAAGCGGCTTGTTTCCAAAATGTGAACAGTAAAATGTACTATTTAGTCAATTAAAACTATTTAAAATTTTTTTTATAGTGTTTTATTAGATAAT
AAAAAAAATATTTTTGACAATTTTAAAAAATAACTAAAAAAAACTTAAAAAATGCTTGGTAAGAAAGCACTTAATTAATATTTCTTCTAAGACCTGTTTACTTCAAGAAATCGGAATGAAAGAATGAGAATAAAAATTTTTATTTTCTTTGATGATGTTTACTAAAATGTAAGAATAGGATTGACCACCAATTTGGGAATCTCATTCTCATGTATTATTCTGTGGGCCCCACAACTATTTCATTTATTACTTTGCAACTTTTATACTTTCTCTGTCATTTAGTAGTGGGACCTAAAATTTTTTTATTAATTGTTTTAAAATCTTTACTATTTCTCTCTTATTTATACTACCAATAATTTTTCTTTTAATACAAAAACATCATACTAATTTTTTATTAGTACATGTGATTTTTTATAAAATTATAAATTTCTATTTAATTTGTTAATTATTTAATTTAAATTAATCATTGTTTATTAATTAATTTGTTAAAATAGAATATTGTTATTATATAATGTATACAAATAAAATTGATCTATATTTTCCACTAATAAAAAACTATACTAACATTTTGTTAGTTTATGTGATTTTATTATAAAATATTACAATCCTCTATTTAATTTTACTTTGCTATTATTAATTTTTATCAATTAAATTGTTAAAATATAATTCATTATTATCAAATATTATTACAATAAAATTTAATAATATTTATATCAATAATTGTAATTCTAATAAAAAATTATATTAATGTTTTTTTAGTTGATGTAATTAGATTTTAATGAGTTATTATTTGTTTATATCAAATAAAATGTTAAAAATATAATTCATTATTATTAAATATCATAACAATAATATTTAATAATATTTTATATTAATAATTGTTGATCTAAAAAAAATTATATTAATTTTTTCTAATTGATGTAATTTTATTAATTAGTTATTATTTGTTAATATCAAATACATTGTTAAAATAAAAAAATATTACTATCAAATAAAATAGTAATAAAATAAATTAAAAATTATTATTGATTATTTTTAATTTAATTAAAAATTATGTTAGCATACTATTTAGTTAATGTATTTTTAATTGACTAGAAATATAAAAGGTATTCTAGTAATTATTATGTCTCTGATTTCTATTTCGAGTCCAGACAATTTTTAGTAAACAACATCGAAAGGTTTCTCATTCGGTTTCCAGACAATTTCTAATAAACAACATCAAAGAAATAACATTTTGATTCTCATTGATCTTATTCCGATTGATTTTTATTTTGATTTTTTGTCATTCTGATTCTTGGTAAGTAAACGGGGCCTAAGAGTATTGATTGAAAACGTATTTTACTTATAAAAGGCATCTAATACTCGTTATAATTTCTTGTCACATAAAATTTTGTGAATTTAGTGGTAATTAAGGACCAAAAACATATTTGTATATAAAATTTAAGGATCAAAACAAGTATGATAGTTAAAAAGATATTGTCAACCAAGTATAGCTCAATAGTTAAGGGATTTCTAACTCTCTTAAGAAGTCGTAGATTCAAATCTACACTTTTGTATTTGCAATCTAATATTTCAAAAAAATTAAAAAAGAAAAAAAAATTTTAACTTGATTTGGGTTATATTTATTAGAGTAATAATTTTGAAACTACTCTCTATGGTCAGGATGGAAAAGGTAGCTTTGAAAAGAAGCAGCAGCACAGCCAAAATGGTGGGCTCTGCAGTGTCCATATCAGGTGCATTGGTAGTGGTTCTGTACAAAGGCCCAATTGTTATTTCAAACCCATATTCTGAAGGCCCAAAAGACCTGGCCCTTTTCCATCAACCTTTGGGCTCTTCACAATTACAGCCCAATTGGATCTTGGGTGGCCTTTGCTTTGTCTTTCAATACCTTTGCAACTCTTTTTGGTACATTCTTCAGGTAATTATACCCTAAAAAACAACTTCCAGCTAAATATATGGTTTAGTTTCTATTGCTTGGTATTTTTTCAATTTAATCGATA
TTGTTTAAAAATTTTCAATTTAATTTCCTACATTTTAACATTTTTTAATTATATCATTTTTGTTAGTGAATGTTAAATGTGATTGAAGGCAAGCTTGGCATAATACTTAGTAAGTTGGCAGAAATTTAGAAGAAAAATTAAGCTTCCTAGCAGAGAAAAAGTTGAAATTTTTGACAACTTTTTAGAAATGTGAAGGAATTGGAAAATTTTCTCTTAGGTGGTCTAATTCTTCGACTAAAATTCTTTCAACTCACTTTGTATCATGAAATATAAGCTTATCTTTAGCCTCATTTAACATCCACTAACAGAAAATGTATAATTGAAAAACGTTAAAACATAAATGACTAAATGGAAACTTTTAAAACAATAAAAACTAGACACATATATTTAGCCAAAACTTTTCAAATTATCCCTAAATTACAAAAGTTACTCTTGCATGGTCGCCCTGTATCTTTTTACTTTTAAATTTAAGACTAATATGTTGCATGTTAAAAGTCTATTTGACAATGATTTTGCTTTTAAGTTTTCCAATTTAAAAAAATGGGTAAGATTGTTTATATCTAATTCCTTTGTTTTATAGTTTACATTTTAGAAATGTTTTTTCTGAAATGCTAGATTTTAGAGACTAAAAAAAAAAATTTTGGGCCATATTTGACAAGGGTTCTTTTTTTTTTTTCTTTTGGTTTTTCTATTTTTGATTTTCTGAATTCTATTTTAAAAAACAAAAATAAGTACTTTTGATAATTTTATTTTGTTTTCTTTTTTTTTTCTAAAAATGAAAAGCCAGACAACAAATTTTTTGGTTTTTAATATTTATTTTTTAAAATATTTTTCAAAATTGAATGAATTATAAAAAAAAAAAAAAAAAAAACTTAAGAGGTTGTAGGCTATATTTTTAATTGTTTACATTTTCTTTTTTTGTTAAATAATCTCTACTTTCTCTAAGAAGTCATAAGTTTAACTCATTGTCCAGGTATTTATGATGGAAAAAAACAATACTTGTACACTTCAATCGATCATTAGTTCTATTATATCACATTATTTTTCATTAGTACTCACAAAAAGATTGACCCAAGCTCAAGAAACAAAAAAGTAAGAGTAATATACAGACCCGAGCTAAAGTATAAGATATATATACTTTGCAATTTACCCCATATGCTACTTGTTATTTACATTACAGACCCAAATCATAAAATTGTACCCAGATGAGATGAGTGTGGTGGCAGTGTACTACGTTATCCAGGCTCTGTTAACTGCACCAGTGTGTTTGTTATCCGAAAGAGACATGAGTGCTTGGAAACTCACCACTCCACTGAGTTATCTTCTAGTTTTAAACTCGGTAAGATCGAAAGTAAAAGGCTAAATTACAAGTTTAGTTCATGAACTTTTGATATTGTGTCTAATAGATTTTTGAACTTTAAAAAGTATGTAATAAGTTTCTAAACTTTTAATTTTATGTCTAATAGATTCCTAAACTTTAAAAGTGTCTAATTGGTCTCTAAATTTTTAATTGTGTTTGACCTATTAGACATTTTTAAAAATTTTCGAGACCTAAGTCTTGGAATTTAAGAAAAAAGTTTAATAGATCGGAGATCTATTAAATACAAAATTGAAAGTTGAAGAAGTTCATAGATCTATTAGATACAATATTAAAAGTTTAAGGACTTATTAGACATTTTTTAAAATTCTGAAACTTGTTAGTCACAACTCTGAAAATTGAGGAACTAAGCATGTAATTTAACGAAAGGAAAAACAGTCGGGAATTGGTATTCTGTTTGTTTCTCAAGAAAATGATATGAATGTGGAATTTTGTTCGAATAGGGTTTGATGGGTCAGTCGTTTGTGACTGTGATCCACACTTGGGGTCTGAACTTGAAGGGGCCTGTTTATGTGTCGAGTTTCAGGCCATTGTCGATTGCCATTGCTGCTGCTATGGGAGCCATTCTCCTTGGTGATGATCTCCATCTTGGAAGGTGATGATCTCTTGCTCTCTGTCTTTT
CTTAGTTATTATTTAGTTATTATTTAGTCATAGGAGAACTTGGACGGTGGTATCTATTTAGTTTCTAAATTAAAAAAGATTCAGTTACGTTTCTAAACTTTGAATTGTGATTTTATGTAATCTTAGTCATTAACATCGTTCATTGGTAATGACGTGACATACCTAGTGATTAGATATGCAGTTGGTTGATTTAATATGATGTGTTAGACAAATGGCAGACGAGATTTGAATAGGTTACTTAAAAAAGAGGGTATGTGTTGTGCCATTTTATTTTGTCTAGCTATATATCCACTTTTTTTGTCATTTTTCTAATACATCATGCCAAATTAGCCAACGGTTGTTTAAAAGGTATGTCACGTCATCAAATAGTTAAGAGTGTTAATGGTAGAGACATATAAAATCGTAATCAAAAGTTCAAGAAGTAGATAGAAATTTGGTAAATTTTAGGGATTAAATAAATGCAACCGTGACTAATTTTTTAATTTAATCTCATTGTTTAGTTTCCATTATTAAATTAAGCTTAAATTATAAAAATTAATCATTGAAGTTTGAGCGATGTGTGTATTTAATTCTTGAAACTTAAAATAGTTTTGAGGTTTAGATGTTATTTTGGTTCTTGAACTTTATGTTTATTTCATTTTAGTCTATAAACTTTCAAACATTCTATATTAGTCCGTAAATTTTTGCATAAAAAATAATTTTAATAATTTTAATCTTTTTTATAAATTATTTAATTGAAACCTAGAATAGACGCATTTATTCTTTATTCACCAAAAGAATGGTAAATGTGTACATGGACAAATATTTGAAGCTATATTTAATAGATCTCATTGTGTTGGATTTCAAGATGTATTCATATTTATGTCAAATAATTAATGGATAGTTAATAGTGCGCAAATAGTAAAATAATTGCTTATGCAAAATTAGAAACTAAAACCAAATATTTAAAAGTTTAGAGATCAAAATAAAATAAATATAAGTGTTAGAGAGACTAAAATAGAACATTTTTTAAAAAAAAAAAAAAATTGGGACCAAAACATGATAAGTATGAAAGTTTTTAGTACAAAATAGGTTTTAAACTGACATATCTCTAAACTTTGAATATAATGTTTTATCTAGCCAGTGTTGTTAATTAACACCATTAATCGAATGTTGATGTGATTTCTCCATTATCTATTGAAGTGTCTAATATATCACGCCAACTAAGGTATCTAACGGGCATGCTAAGTTAACGTTCAAATAATGATATTAATAGTAGGGATTACATAGAAGCATAATCTAAAATTTATCTATGTAATTAAAACTTTTAAAGTTTGTGGACTAAATAGACATAATGTTGCTTAAAGTTTAAGAACTAAATTTATCATTTAATCTTAGATTAATTTGTTGTTTTGATGACAATTTTCCTAAATATTTGCAGTATTATTGGAGCAATCATAATATCAGTTGGGTTCTATGGTATTTTGTGGGGAAAAGCAAAAGAAGAAGAATTGAAGGGATTAGAAGGTGCTTGTAGGTTGGAATCTTCATCCAAAGCTCCATTGCTCCAATATGATAAAGTTGAAGATGCATGA

mRNA sequence

ATGGAGAGGAGCTGGTTCTACAAGGACATTGTGCCATTCGCCGCCATGGTGGCGGCGGAGTGCGCCACCGTGGGCTCCAATACGGGCTTCAAAGCCGCCACCGCCCGAGGCATGAGCTACTACGTCTTCACCCTCTACGTCTGCGTCGCCGCCGCCGTAGCCCTCCTTCCATTCGCCCTCATCTTCCGCAGGTCGCCGGAACTTCCTCCCCACAAGATATCCTTCTTCCTCAGAATCGTCTGCCTCTCAGCGCTAGGGCTGTCTTGCCAGTTGCTTGGGAATAAAGGTCTCCAATACAGCTCGCCGACTCTTTCTTCCGCCATTAGCAACCTCATCCCAGCTTTCACTTTCATTCTCGCCGTCGTTTTCCGGATGGAAAAGGTAGCTTTGAAAAGAAGCAGCAGCACAGCCAAAATGGTGGGCTCTGCAGTGTCCATATCAGGTGCATTGGTAGTGGTTCTGTACAAAGGCCCAATTGTTATTTCAAACCCATATTCTGAAGGCCCAAAAGACCTGGCCCTTTTCCATCAACCTTTGGGCTCTTCACAATTACAGCCCAATTGGATCTTGGGTGGCCTTTGCTTTGTCTTTCAATACCTTTGCAACTCTTTTTGGTACATTCTTCAGGGTTTGATGGGTCAGTCGTTTGTGACTGTGATCCACACTTGGGGTCTGAACTTGAAGGGGCCTGTTTATGTGTCGAGTTTCAGGCCATTGTCGATTGCCATTGCTGCTGCTATGGGAGCCATTCTCCTTGGTGATGATCTCCATCTTGGAAGTATTATTGGAGCAATCATAATATCAGTTGGGTTCTATGGTATTTTGTGGGGAAAAGCAAAAGAAGAAGAATTGAAGGGATTAGAAGGTGCTTGTAGGTTGGAATCTTCATCCAAAGCTCCATTGCTCCAATATGATAAAGTTGAAGATGCATGA

Coding sequence (CDS)

ATGGAGAGGAGCTGGTTCTACAAGGACATTGTGCCATTCGCCGCCATGGTGGCGGCGGAGTGCGCCACCGTGGGCTCCAATACGGGCTTCAAAGCCGCCACCGCCCGAGGCATGAGCTACTACGTCTTCACCCTCTACGTCTGCGTCGCCGCCGCCGTAGCCCTCCTTCCATTCGCCCTCATCTTCCGCAGGTCGCCGGAACTTCCTCCCCACAAGATATCCTTCTTCCTCAGAATCGTCTGCCTCTCAGCGCTAGGGCTGTCTTGCCAGTTGCTTGGGAATAAAGGTCTCCAATACAGCTCGCCGACTCTTTCTTCCGCCATTAGCAACCTCATCCCAGCTTTCACTTTCATTCTCGCCGTCGTTTTCCGGATGGAAAAGGTAGCTTTGAAAAGAAGCAGCAGCACAGCCAAAATGGTGGGCTCTGCAGTGTCCATATCAGGTGCATTGGTAGTGGTTCTGTACAAAGGCCCAATTGTTATTTCAAACCCATATTCTGAAGGCCCAAAAGACCTGGCCCTTTTCCATCAACCTTTGGGCTCTTCACAATTACAGCCCAATTGGATCTTGGGTGGCCTTTGCTTTGTCTTTCAATACCTTTGCAACTCTTTTTGGTACATTCTTCAGGGTTTGATGGGTCAGTCGTTTGTGACTGTGATCCACACTTGGGGTCTGAACTTGAAGGGGCCTGTTTATGTGTCGAGTTTCAGGCCATTGTCGATTGCCATTGCTGCTGCTATGGGAGCCATTCTCCTTGGTGATGATCTCCATCTTGGAAGTATTATTGGAGCAATCATAATATCAGTTGGGTTCTATGGTATTTTGTGGGGAAAAGCAAAAGAAGAAGAATTGAAGGGATTAGAAGGTGCTTGTAGGTTGGAATCTTCATCCAAAGCTCCATTGCTCCAATATGATAAAGTTGAAGATGCATGA

Protein sequence

MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQGLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVEDA
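
The protein sequence can be checked against the CDS by translating it codon by codon. A minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is illustrative and not part of the database record:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt of the CDS above.
print(translate("ATGGAGAGGAGCTGGTTCTACAAGGAC"))  # MERSWFYKD
```

A 933-nt CDS is 311 codons; because the last codon here is a stop (TGA), the full translation should be 310 residues long.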
Homology
BLAST of Sgr021434.1 vs. NCBI nr
Match: XP_038905800.1 (WAT1-related protein At5g40240-like [Benincasa hispida])

HSP 1 Score: 499.6 bits (1285), Expect = 2.0e-137
Identity = 267/364 (73.35%), Positives = 287/364 (78.85%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MERS FYKD+VPFAAM+AAECATVGSNTGFKAATARG+SYYVFTLYVC+ AA  L+PFA+
Sbjct: 1   MERSLFYKDVVPFAAMIAAECATVGSNTGFKAATARGISYYVFTLYVCIIAATTLVPFAI 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF +SPELPP+KISFF +IVCLSALGLSCQLLGNKGL+YSSPTLSSAISNLIPAFTFI+A
Sbjct: 61  IFHKSPELPPNKISFFFKIVCLSALGLSCQLLGNKGLEYSSPTLSSAISNLIPAFTFIMA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALF-HQPL 180
           V FRMEK+ +KRSSS  K+VGSAVSISGALVVVLYKGPIVISNPYS GPK+L L+ + PL
Sbjct: 121 VFFRMEKIDMKRSSSIVKIVGSAVSISGALVVVLYKGPIVISNPYSHGPKELGLYQNPPL 180

Query: 181 GSSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------ 240
           GSS  QPNWI+GGLCFVFQYL NSFWYILQ                              
Sbjct: 181 GSSHPQPNWIMGGLCFVFQYLSNSFWYILQTQIIKIYPDEVSVVAVYYVIQAVLTAPICL 240

Query: 241 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 300
                                  GLMGQSFV  IHTWGLNLKGPVYVSSFRPLSIAIAAA
Sbjct: 241 IAETEISGWKLTNPISFLLILNSGLMGQSFVAAIHTWGLNLKGPVYVSSFRPLSIAIAAA 300

Query: 301 MGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDK 311
           MGAILLGDDLHLGSIIGAIIIS+GFYGILWGKAKEEE K LEG C LESSSKAPLLQY +
Sbjct: 301 MGAILLGDDLHLGSIIGAIIISIGFYGILWGKAKEEEWKELEGVCGLESSSKAPLLQYYR 360
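
The Identity and Positives percentages in each HSP header are the corresponding counts divided by the alignment length (which includes gap columns). A small sketch reproducing the figures for the hit above; `hsp_percent` is a hypothetical helper name:

```python
def hsp_percent(count, alignment_length):
    """Percentage of alignment columns that are identical or positive-scoring."""
    return round(100.0 * count / alignment_length, 2)


print(hsp_percent(267, 364))  # identity  -> 73.35
print(hsp_percent(287, 364))  # positives -> 78.85
```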

BLAST of Sgr021434.1 vs. NCBI nr
Match: KAA0053694.1 (WAT1-related protein [Cucumis melo var. makuwa])

HSP 1 Score: 488.8 bits (1257), Expect = 3.5e-134
Identity = 264/364 (72.53%), Positives = 282/364 (77.47%), Query Frame = 0

Query: 2   ERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALI 61
           +R  FYK++VPFAAMVAAECATVGSNTGFKAATARG+SYYVFTLYVC+ AA AL+PFA  
Sbjct: 3   QRRLFYKELVPFAAMVAAECATVGSNTGFKAATARGLSYYVFTLYVCIVAAAALIPFAFF 62

Query: 62  FRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAV 121
           F +S ELPP+KISFF +IVCLSALGLSCQLLGNKGL+YSSPTLSSAISNLIPAFTF+LAV
Sbjct: 63  FHKSAELPPNKISFFFQIVCLSALGLSCQLLGNKGLEYSSPTLSSAISNLIPAFTFMLAV 122

Query: 122 VFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH--QPL 181
            FRMEK+ LKRSSS  K+VGSAVSISGALVVVLYKGPIVISNPYS  PK L L +   PL
Sbjct: 123 FFRMEKIDLKRSSSIVKIVGSAVSISGALVVVLYKGPIVISNPYSPRPKQLGLLYPNPPL 182

Query: 182 GSSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------ 241
            SS  QPNWI+GGLCFVFQYLCNSFWYILQ                              
Sbjct: 183 DSSHPQPNWIMGGLCFVFQYLCNSFWYILQTQIIKVYPDEISVVAVYYLIQALLTAPICL 242

Query: 242 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 301
                                  GLMGQSFV  IHTWGLNLKGPVYVSSFRPLSIAIAAA
Sbjct: 243 IAETDMNAWKLTNPLIFLFIFNSGLMGQSFVAAIHTWGLNLKGPVYVSSFRPLSIAIAAA 302

Query: 302 MGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDK 311
           MGAILLGDDLHLGSIIGAIIIS+GFYGILWGKAKEEELKGLE AC LESSSKAPLLQY K
Sbjct: 303 MGAILLGDDLHLGSIIGAIIISIGFYGILWGKAKEEELKGLEDACGLESSSKAPLLQYYK 362

BLAST of Sgr021434.1 vs. NCBI nr
Match: XP_022989272.1 (WAT1-related protein At4g15540-like [Cucurbita maxima])

HSP 1 Score: 484.2 bits (1245), Expect = 8.5e-133
Identity = 268/366 (73.22%), Positives = 285/366 (77.87%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MERS FYKD+VPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAV L+PFAL
Sbjct: 1   MERSLFYKDVVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVVLIPFAL 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF RS +LPP+KIS F ++V LSALGLSCQLLGNKGL++SSPTLSSAISNLIPAFTFILA
Sbjct: 61  IFHRSQQLPPNKISLFFKLVALSALGLSCQLLGNKGLEFSSPTLSSAISNLIPAFTFILA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH---- 180
           V FRMEK+ LKRSSS AK+VGSAVSISGALVVVLYKGPIVISNPYS+GPK+L L +    
Sbjct: 121 VFFRMEKIDLKRSSSIAKIVGSAVSISGALVVVLYKGPIVISNPYSQGPKELGLLYHNHN 180

Query: 181 QPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ--------------------------- 240
           QPLGSS  QPNWILGGLCFVFQYL NSFWYILQ                           
Sbjct: 181 QPLGSSHPQPNWILGGLCFVFQYLSNSFWYILQTQIIKIYPDEVTVVAVYYSIQAVLTAP 240

Query: 241 --------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAI 300
                                     G+MGQ+FV  IHTWGLNLKGPVYVSSFRPLSIAI
Sbjct: 241 VCLLAETDMDAWKMTNALSFVFILNSGMMGQAFVAAIHTWGLNLKGPVYVSSFRPLSIAI 300

Query: 301 AAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQ 310
           AAAMGAILL DDLHLGSIIGAIIISVGFYGILWGKAKEEELK L+G  RL SSSKAPLLQ
Sbjct: 301 AAAMGAILLADDLHLGSIIGAIIISVGFYGILWGKAKEEELKELDGE-RLGSSSKAPLLQ 360

BLAST of Sgr021434.1 vs. NCBI nr
Match: KAG6588972.1 (WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 478.8 bits (1231), Expect = 3.6e-131
Identity = 264/366 (72.13%), Positives = 284/366 (77.60%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MERS FYKD++PFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAV L+PFAL
Sbjct: 1   MERSLFYKDVLPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVVLIPFAL 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF RS +LPP+KIS F ++V LSALGLSCQLLGNKGL++SSPTLSSAISNLIPAFTFILA
Sbjct: 61  IFHRSQQLPPNKISLFFKLVALSALGLSCQLLGNKGLEFSSPTLSSAISNLIPAFTFILA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH---- 180
           V FRMEK+ LKR SS AK+VGSAVSISGALVVVLYKGPIVISNPYS+GPK+L+L +    
Sbjct: 121 VFFRMEKIDLKRRSSIAKIVGSAVSISGALVVVLYKGPIVISNPYSQGPKELSLLYHNHN 180

Query: 181 QPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ--------------------------- 240
           QPLGSS  +PNWILGGLCFVFQYL NSFWYILQ                           
Sbjct: 181 QPLGSSHPRPNWILGGLCFVFQYLSNSFWYILQTQIIKTYPDEVTVVAVYYSIQAVLTAP 240

Query: 241 --------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAI 300
                                     GLMGQ+FV  IHTWGL+LKGPVYVSSFRPLSIAI
Sbjct: 241 VCLLAETDMDAWKMTNALSFVFILNSGLMGQAFVAAIHTWGLSLKGPVYVSSFRPLSIAI 300

Query: 301 AAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQ 310
           AAAMGAILL DDLHLGSIIGAIIISVGFYGILWGKAKEEELK L+G     SSSKAPLLQ
Sbjct: 301 AAAMGAILLADDLHLGSIIGAIIISVGFYGILWGKAKEEELKELDGV----SSSKAPLLQ 360

BLAST of Sgr021434.1 vs. NCBI nr
Match: XP_023529741.1 (WAT1-related protein At4g15540-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 475.3 bits (1222), Expect = 4.0e-130
Identity = 263/366 (71.86%), Positives = 281/366 (76.78%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MER  FYKD+VPFAAMVAAECATVGSNTGFKAA ARGMSYYVFTLYVCVAAAV L+PFAL
Sbjct: 1   MERRLFYKDVVPFAAMVAAECATVGSNTGFKAAIARGMSYYVFTLYVCVAAAVVLIPFAL 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF RS +LPP+KIS F ++V LSALGLSCQLLGNKGL++SSPTLSSAISNLIPAFTFILA
Sbjct: 61  IFHRSQQLPPNKISLFFKLVALSALGLSCQLLGNKGLEFSSPTLSSAISNLIPAFTFILA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH---- 180
           V FRMEK+ LKR SS AK+VGSAVSISGALVVVLYKGPIVISNPYS+GPK+L L +    
Sbjct: 121 VFFRMEKIDLKRRSSIAKIVGSAVSISGALVVVLYKGPIVISNPYSQGPKELGLLYHNHN 180

Query: 181 QPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ--------------------------- 240
           QPLGSS  +PNWILGGLCFVFQYL NSFWYILQ                           
Sbjct: 181 QPLGSSHPRPNWILGGLCFVFQYLSNSFWYILQTQIIKTYPDEVTVVAVYYSIQAVLTAP 240

Query: 241 --------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAI 300
                                     GLMGQ+FV   HTWGL+LKGPVYVSSFRPLSIAI
Sbjct: 241 VCLLAETDMNAWKMTNALSFVFILNSGLMGQAFVAATHTWGLSLKGPVYVSSFRPLSIAI 300

Query: 301 AAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQ 310
           AAAMGAILL DDLHLGSIIGAIIISVGFYGILWGKAKEEE K L+G  RLESSSKAPLL 
Sbjct: 301 AAAMGAILLADDLHLGSIIGAIIISVGFYGILWGKAKEEEWKELDGE-RLESSSKAPLLH 360

BLAST of Sgr021434.1 vs. ExPASy Swiss-Prot
Match: F4KHA8 (WAT1-related protein At5g40230 OS=Arabidopsis thaliana OX=3702 GN=At5g40230 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 5.8e-76
Identity = 164/359 (45.68%), Positives = 217/359 (60.45%), Query Frame = 0

Query: 4   SWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFR 63
           S+F +D+VPF AMVA EC TVGSNT FKAAT RG+S+YVF  Y  V A + LLP +LIF 
Sbjct: 13  SYFCRDVVPFTAMVAVECVTVGSNTLFKAATLRGLSFYVFVFYTYVVATLVLLPLSLIFG 72

Query: 64  RSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVF 123
           RS  LP  K   F  I  L+ +G    ++G KG++YSSPTL+SAISNL PAFTF LAV+F
Sbjct: 73  RSKRLPSAKTPVFFNIFLLALVGFMSLIVGCKGIEYSSPTLASAISNLTPAFTFTLAVIF 132

Query: 124 RMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQ 183
           RME++ L+ S++ AK++G+ VSISGALVV+LYKGP V+++     P      +Q L S  
Sbjct: 133 RMEQIVLRSSATQAKIIGTIVSISGALVVILYKGPKVLTDASLTPPSPTISLYQHLTS-- 192

Query: 184 LQPNWILGGLCFVFQYLCNSFWYILQ---------------------------------- 243
              +WI+GGL    QYL  S WYILQ                                  
Sbjct: 193 FDSSWIIGGLLLATQYLLVSVWYILQTRVMELYPEEITVVFLYNLCATLISAPVCLFAEK 252

Query: 244 -------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAI 303
                              G +  SF +VIHTWGL+LKGPVY+S F+PLSI IA AMG +
Sbjct: 253 DLNSFILKPGVSLASVMYSGGLVSSFGSVIHTWGLHLKGPVYISLFKPLSIVIAVAMGVM 312

Query: 304 LLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
            LGD L+LGS+IG++I+S+GFY ++WGKA+E+ +K + G      + ++PLL    +E+
Sbjct: 313 FLGDALYLGSVIGSLILSLGFYTVIWGKAREDSIKTVAG------TEQSPLLPSHTIEE 363

BLAST of Sgr021434.1 vs. ExPASy Swiss-Prot
Match: Q9FL08 (WAT1-related protein At5g40240 OS=Arabidopsis thaliana OX=3702 GN=At5g40240 PE=2 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 2.2e-75
Identity = 165/358 (46.09%), Positives = 214/358 (59.78%), Query Frame = 0

Query: 5   WFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRR 64
           +F +D+VPFAAM A ECATVGSNT FKAAT RG+S+YVF  Y  + + + LLP ++IF R
Sbjct: 13  YFTRDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGR 72

Query: 65  SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFR 124
           S  LP  K   F +I  L  +G   Q+ G KG+ YSSPTL+SAISNL PAFTF LAV+FR
Sbjct: 73  SRRLPAAKSPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFR 132

Query: 125 MEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQL 184
           ME+V L+ S++ AK++G+ +SISGALVVVLYKGP V+++            HQ L S  +
Sbjct: 133 MEQVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTS--I 192

Query: 185 QPNWILGGLCFVFQYLCNSFWYILQ----------------------------------- 244
           + +WI+GGL    QY   S WYILQ                                   
Sbjct: 193 ESSWIIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESN 252

Query: 245 ------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAIL 304
                             G+    F  + HTWGL+LKGPVY+S FRPLSIAIA AMGAI 
Sbjct: 253 LTSWVLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIF 312

Query: 305 LGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
           LGD LHLGS+IG++I+ +GFY ++WGKA+E+ +K + G      S ++PLL    +ED
Sbjct: 313 LGDALHLGSVIGSMILCIGFYTVIWGKAREDTIKTVAG------SEQSPLLLTHIIED 362

BLAST of Sgr021434.1 vs. ExPASy Swiss-Prot
Match: F4JK59 (WAT1-related protein At4g15540 OS=Arabidopsis thaliana OX=3702 GN=At4g15540 PE=2 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 4.8e-70
Identity = 158/358 (44.13%), Positives = 208/358 (58.10%), Query Frame = 0

Query: 5   WFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRR 64
           +F +D+VPF AM+A EC TVGS+  +KAAT RG S+YVF  Y  V A + LL  +LIF R
Sbjct: 10  YFKRDVVPFTAMIAIECTTVGSSILYKAATLRGFSFYVFVFYAYVGATLVLLLLSLIFGR 69

Query: 65  SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFR 124
           S  LP  K S F +I  L+ LGL+ ++ G KG++YSSPTLSSAISNL PAFTFILA+ FR
Sbjct: 70  SRSLPTAKSSLFFKIFLLALLGLTSRVAGCKGIEYSSPTLSSAISNLTPAFTFILAIFFR 129

Query: 125 MEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQL 184
           ME+V L+ S++ AK++G+ VSISGALV+VLYKGP ++                    +  
Sbjct: 130 MEQVMLRSSATQAKIIGTIVSISGALVIVLYKGPKLL---------------VAASFTSF 189

Query: 185 QPNWILGGLCFVFQYLCNSFWYILQ----------------------------------- 244
           + +WI+GGL    Q+L  S W+ILQ                                   
Sbjct: 190 ESSWIIGGLLLGLQFLLLSVWFILQTHIMEIYPEEIAVVFCYNLCATLISGTVCLLVEKD 249

Query: 245 ------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAIL 304
                             GL   S  +VIHTWGL++KGPVY+S F+PLSIAIA AM AI 
Sbjct: 250 LNSWQLKPGFSLASVIYSGLFDTSLGSVIHTWGLHVKGPVYISLFKPLSIAIAVAMAAIF 309

Query: 305 LGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
           LGD LHLGS+IG++I+S GFY ++WGKA+E+  K +      +S     L  +D+ ED
Sbjct: 310 LGDTLHLGSVIGSVILSFGFYTVIWGKAREDSTKTVS-----DSEQSLLLPSHDREED 347

BLAST of Sgr021434.1 vs. ExPASy Swiss-Prot
Match: Q94JU2 (WAT1-related protein At3g28050 OS=Arabidopsis thaliana OX=3702 GN=At3g28050 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 2.1e-65
Identity = 148/355 (41.69%), Positives = 195/355 (54.93%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           M R +F ++++P  A+V  ECA VG NT FKAAT +GMS++VF +Y    AA+ LLP   
Sbjct: 1   MARKYFQREVLPVTALVIMECANVGLNTLFKAATLKGMSFHVFIVYSYGLAALLLLPSLF 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
              RS  LPP   S   +IV L  +G    ++G  G+ YSSPTL+SAISNL PAFTF+LA
Sbjct: 61  CSFRSRTLPPMNFSILYKIVLLGIIGCCSNIMGYTGINYSSPTLASAISNLTPAFTFLLA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLG 180
           VVFRME V+ KR+SS AKM+G+ VSI GA +V LY GP+VI    ++ P  ++     L 
Sbjct: 121 VVFRMESVSFKRTSSVAKMLGTVVSIGGAFIVTLYNGPVVI----AKSPPSVS-----LR 180

Query: 181 SSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------- 240
           S    PNWILG      +Y C   WYI+Q                               
Sbjct: 181 SQSTNPNWILGAGFLAVEYFCVPLWYIVQTQIMREYPAEFTVVCFYSIGVSFWTALVTLF 240

Query: 241 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 300
                                  GL G      IHTW L +KGP++V+ F+PLSIAIA A
Sbjct: 241 TEGNDLGAWKIKPNIALVSIVCSGLFGSCINNTIHTWALRIKGPLFVAMFKPLSIAIAVA 300

Query: 301 MGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPL 302
           MG I L D L++GS+IGA +I++GFY ++WGKAKE  L   +     E +++A L
Sbjct: 301 MGVIFLRDSLYIGSLIGATVITIGFYTVMWGKAKEVALVEDDNKANHEEANEADL 346

BLAST of Sgr021434.1 vs. ExPASy Swiss-Prot
Match: Q56X95 (WAT1-related protein At3g28130 OS=Arabidopsis thaliana OX=3702 GN=At3g28130 PE=2 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.9e-50
Identity = 138/355 (38.87%), Positives = 188/355 (52.96%), Query Frame = 0

Query: 8   KDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRRSPE 67
           +D V   AM+A E   V  NT FKAAT++G++ Y F +Y  +  ++ LLP  +   RS  
Sbjct: 9   RDAVLLTAMLATETGNVAMNTLFKAATSKGLNSYTFLIYSYLIGSIVLLPSHIFSYRSRS 68

Query: 68  LPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFRMEK 127
           LP   +S   +I  L  LG +  + G  G++YS+PTL+SAISN+ PA TFILA++FRMEK
Sbjct: 69  LPSLSLSILCKIGVLGLLGSTYLITGFIGIEYSNPTLASAISNINPAITFILAIIFRMEK 128

Query: 128 VALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQLQPN 187
            + K  SS AKMVG+ VS+ GALVVVLY GP V + P S     L     PL SS    +
Sbjct: 129 ASFKEKSSVAKMVGTIVSLVGALVVVLYHGPRVFT-PSSPPFPQLRQLLLPLSSS--NSD 188

Query: 188 WILGG-----------LCFVFQY---------LCNSFWYILQGLMGQSFVTV-------- 247
           WI+GG           + F+ Q             SF+Y L   +  S + +        
Sbjct: 189 WIIGGCLLAIKDTLVPVAFILQAHIMKLYPAPFTVSFFYFLIASILTSLIGIVAEKNNPS 248

Query: 248 -------------------------IHTWGLNLKGPVYVSSFRPLSIAIAAAMGAILLGD 307
                                    IH W +  KGPVY++ FRPLSI IA  MGAI LGD
Sbjct: 249 IWIIHFDITLVCIVVGGIFNPGYYAIHLWAVRNKGPVYLAIFRPLSILIAVIMGAIFLGD 308

Query: 308 DLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
             +LGS++G I+IS+GFY ++WGKAKE + +       L  S + PLL  + ++D
Sbjct: 309 SFYLGSLVGGILISLGFYTVMWGKAKEGKTQ------FLSLSEETPLLD-ENIDD 353

BLAST of Sgr021434.1 vs. ExPASy TrEMBL
Match: A0A5A7UEP6 (WAT1-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold135G00680 PE=3 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 1.7e-134
Identity = 264/364 (72.53%), Positives = 282/364 (77.47%), Query Frame = 0

Query: 2   ERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALI 61
           +R  FYK++VPFAAMVAAECATVGSNTGFKAATARG+SYYVFTLYVC+ AA AL+PFA  
Sbjct: 3   QRRLFYKELVPFAAMVAAECATVGSNTGFKAATARGLSYYVFTLYVCIVAAAALIPFAFF 62

Query: 62  FRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAV 121
           F +S ELPP+KISFF +IVCLSALGLSCQLLGNKGL+YSSPTLSSAISNLIPAFTF+LAV
Sbjct: 63  FHKSAELPPNKISFFFQIVCLSALGLSCQLLGNKGLEYSSPTLSSAISNLIPAFTFMLAV 122

Query: 122 VFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH--QPL 181
            FRMEK+ LKRSSS  K+VGSAVSISGALVVVLYKGPIVISNPYS  PK L L +   PL
Sbjct: 123 FFRMEKIDLKRSSSIVKIVGSAVSISGALVVVLYKGPIVISNPYSPRPKQLGLLYPNPPL 182

Query: 182 GSSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------ 241
            SS  QPNWI+GGLCFVFQYLCNSFWYILQ                              
Sbjct: 183 DSSHPQPNWIMGGLCFVFQYLCNSFWYILQTQIIKVYPDEISVVAVYYLIQALLTAPICL 242

Query: 242 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 301
                                  GLMGQSFV  IHTWGLNLKGPVYVSSFRPLSIAIAAA
Sbjct: 243 IAETDMNAWKLTNPLIFLFIFNSGLMGQSFVAAIHTWGLNLKGPVYVSSFRPLSIAIAAA 302

Query: 302 MGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDK 311
           MGAILLGDDLHLGSIIGAIIIS+GFYGILWGKAKEEELKGLE AC LESSSKAPLLQY K
Sbjct: 303 MGAILLGDDLHLGSIIGAIIISIGFYGILWGKAKEEELKGLEDACGLESSSKAPLLQYYK 362

BLAST of Sgr021434.1 vs. ExPASy TrEMBL
Match: A0A6J1JFD0 (WAT1-related protein OS=Cucurbita maxima OX=3661 GN=LOC111486393 PE=3 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 4.1e-133
Identity = 268/366 (73.22%), Positives = 285/366 (77.87%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MERS FYKD+VPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAV L+PFAL
Sbjct: 1   MERSLFYKDVVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVVLIPFAL 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF RS +LPP+KIS F ++V LSALGLSCQLLGNKGL++SSPTLSSAISNLIPAFTFILA
Sbjct: 61  IFHRSQQLPPNKISLFFKLVALSALGLSCQLLGNKGLEFSSPTLSSAISNLIPAFTFILA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH---- 180
           V FRMEK+ LKRSSS AK+VGSAVSISGALVVVLYKGPIVISNPYS+GPK+L L +    
Sbjct: 121 VFFRMEKIDLKRSSSIAKIVGSAVSISGALVVVLYKGPIVISNPYSQGPKELGLLYHNHN 180

Query: 181 QPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ--------------------------- 240
           QPLGSS  QPNWILGGLCFVFQYL NSFWYILQ                           
Sbjct: 181 QPLGSSHPQPNWILGGLCFVFQYLSNSFWYILQTQIIKIYPDEVTVVAVYYSIQAVLTAP 240

Query: 241 --------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAI 300
                                     G+MGQ+FV  IHTWGLNLKGPVYVSSFRPLSIAI
Sbjct: 241 VCLLAETDMDAWKMTNALSFVFILNSGMMGQAFVAAIHTWGLNLKGPVYVSSFRPLSIAI 300

Query: 301 AAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQ 310
           AAAMGAILL DDLHLGSIIGAIIISVGFYGILWGKAKEEELK L+G  RL SSSKAPLLQ
Sbjct: 301 AAAMGAILLADDLHLGSIIGAIIISVGFYGILWGKAKEEELKELDGE-RLGSSSKAPLLQ 360

BLAST of Sgr021434.1 vs. ExPASy TrEMBL
Match: A0A6J1EKI9 (WAT1-related protein OS=Cucurbita moschata OX=3662 GN=LOC111435310 PE=3 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 4.3e-130
Identity = 262/366 (71.58%), Positives = 281/366 (76.78%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           MERS FYKD++PFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAV L+PFAL
Sbjct: 1   MERSLFYKDVLPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVVLIPFAL 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
           IF RS +LPP+KIS F ++V LSALGLSCQLLGNKGL++SSPTLSSAISNLIPAFTFILA
Sbjct: 61  IFHRSQQLPPNKISLFFKLVALSALGLSCQLLGNKGLEFSSPTLSSAISNLIPAFTFILA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH---- 180
           V F MEK+ LKRSSS AK+VGSAVSISGALVVVLYKGP+VISNPYS+GPK L L +    
Sbjct: 121 VFFGMEKIDLKRSSSIAKIVGSAVSISGALVVVLYKGPVVISNPYSQGPKALGLLYHNHN 180

Query: 181 QPLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ--------------------------- 240
           QPLGSS  QPNWILGGLCFVFQYL  SFWYILQ                           
Sbjct: 181 QPLGSSHPQPNWILGGLCFVFQYLSTSFWYILQTQIIKTYPDEVTVVAVYYSIQAVLTAP 240

Query: 241 --------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAI 300
                                     G+MGQ+FV  IHTWGL+LKGPVYVSSFRPLSIAI
Sbjct: 241 VCLLAETDMNAWRMTNALSFVFILNSGMMGQAFVAAIHTWGLSLKGPVYVSSFRPLSIAI 300

Query: 301 AAAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQ 310
           AAAMGAILL DDLHLGSIIGAIIISVGFYGILWGKAKEEELK L+G     SSSKAPLLQ
Sbjct: 301 AAAMGAILLADDLHLGSIIGAIIISVGFYGILWGKAKEEELKELDGG----SSSKAPLLQ 360

BLAST of Sgr021434.1 vs. ExPASy TrEMBL
Match: A0A6J1D9Q1 (WAT1-related protein OS=Momordica charantia OX=3673 GN=LOC111018334 PE=3 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 1.2e-129
Identity = 263/366 (71.86%), Positives = 274/366 (74.86%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           M   W YKD+VPFA MVAAECATVGSNTGFKAATA G++YYVFTLYV V AA ALLP AL
Sbjct: 1   MVERWIYKDVVPFAGMVAAECATVGSNTGFKAATAAGITYYVFTLYVAVVAAAALLPVAL 60

Query: 61  IFRR---SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTF 120
           IFRR   SP+LPPHKISFF +IVCLSALGLSCQL GNKGL+YSSP LSSAISNLIPAFTF
Sbjct: 61  IFRRRSPSPKLPPHKISFFFKIVCLSALGLSCQLFGNKGLEYSSPNLSSAISNLIPAFTF 120

Query: 121 ILAVVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQ 180
           +LA+ FRMEKVA K+SSS AKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKD      
Sbjct: 121 VLAIFFRMEKVAFKKSSSIAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKD-----H 180

Query: 181 PLGSSQLQPNWILGGLCFVFQYLCNSFWYILQ---------------------------- 240
            LGSSQ QPNWILGGLCFVFQY+CNS WYILQ                            
Sbjct: 181 TLGSSQSQPNWILGGLCFVFQYVCNSLWYILQTQIIKIYPDELSVLAVYYVIQALLTAPV 240

Query: 241 -------------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIA 300
                                    GL+GQ FV  IHTWGL LKGPVYVSSFRPLSIAIA
Sbjct: 241 CLLAETDINAWKLTTPLHFLLLLNSGLVGQCFVASIHTWGLGLKGPVYVSSFRPLSIAIA 300

Query: 301 AAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQY 311
           AAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELK LE    LESSSKAPLLQ 
Sbjct: 301 AAMGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKELEHVGTLESSSKAPLLQC 360

BLAST of Sgr021434.1 vs. ExPASy TrEMBL
Match: A0A1S4DUI7 (WAT1-related protein OS=Cucumis melo OX=3656 GN=LOC103487063 PE=3 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.9e-109
Identity = 222/325 (68.31%), Positives = 238/325 (73.23%), Query Frame = 0

Query: 2   ERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALI 61
           +R  FYK++VPFAAMVAAECATVGSNTGFKAATARG+SYYVFTLYVC+ AA AL+PFA  
Sbjct: 3   QRRLFYKELVPFAAMVAAECATVGSNTGFKAATARGLSYYVFTLYVCIVAAAALIPFAFF 62

Query: 62  FRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAV 121
           F +S ELPP+KISFF +IVCLSALGLSCQLLGNKGL+YSSPTLSSAISNLIPAFTF+LAV
Sbjct: 63  FHKSAELPPNKISFFFQIVCLSALGLSCQLLGNKGLEYSSPTLSSAISNLIPAFTFMLAV 122

Query: 122 VFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFH--QPL 181
            FRMEK+ LKRSSS  K+VGSAVSISGALVVVLYKGPIVISNPYS  PK L L +   PL
Sbjct: 123 FFRMEKIDLKRSSSIVKIVGSAVSISGALVVVLYKGPIVISNPYSPRPKQLGLLYPNPPL 182

Query: 182 GSSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------ 241
            SS  QPNWI+GGLCFVFQYLCNSFWYILQ                              
Sbjct: 183 DSSHPQPNWIMGGLCFVFQYLCNSFWYILQTQIIKVYPDEISVVAVYYLIQALLTAPICL 242

Query: 242 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 272
                                  GLMGQSFV  IHTWGLNLKGPVYVSSFRPLSIAIAAA
Sbjct: 243 IAETDMNAWKLTNPLIFLFIFNSGLMGQSFVAAIHTWGLNLKGPVYVSSFRPLSIAIAAA 302

BLAST of Sgr021434.1 vs. TAIR 10
Match: AT5G40230.1 (nodulin MtN21 /EamA-like transporter family protein )

HSP 1 Score: 285.8 bits (730), Expect = 4.1e-77
Identity = 164/359 (45.68%), Positives = 217/359 (60.45%), Query Frame = 0

Query: 4   SWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFR 63
           S+F +D+VPF AMVA EC TVGSNT FKAAT RG+S+YVF  Y  V A + LLP +LIF 
Sbjct: 13  SYFCRDVVPFTAMVAVECVTVGSNTLFKAATLRGLSFYVFVFYTYVVATLVLLPLSLIFG 72

Query: 64  RSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVF 123
           RS  LP  K   F  I  L+ +G    ++G KG++YSSPTL+SAISNL PAFTF LAV+F
Sbjct: 73  RSKRLPSAKTPVFFNIFLLALVGFMSLIVGCKGIEYSSPTLASAISNLTPAFTFTLAVIF 132

Query: 124 RMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQ 183
           RME++ L+ S++ AK++G+ VSISGALVV+LYKGP V+++     P      +Q L S  
Sbjct: 133 RMEQIVLRSSATQAKIIGTIVSISGALVVILYKGPKVLTDASLTPPSPTISLYQHLTS-- 192

Query: 184 LQPNWILGGLCFVFQYLCNSFWYILQ---------------------------------- 243
              +WI+GGL    QYL  S WYILQ                                  
Sbjct: 193 FDSSWIIGGLLLATQYLLVSVWYILQTRVMELYPEEITVVFLYNLCATLISAPVCLFAEK 252

Query: 244 -------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAI 303
                              G +  SF +VIHTWGL+LKGPVY+S F+PLSI IA AMG +
Sbjct: 253 DLNSFILKPGVSLASVMYSGGLVSSFGSVIHTWGLHLKGPVYISLFKPLSIVIAVAMGVM 312

Query: 304 LLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
            LGD L+LGS+IG++I+S+GFY ++WGKA+E+ +K + G      + ++PLL    +E+
Sbjct: 313 FLGDALYLGSVIGSLILSLGFYTVIWGKAREDSIKTVAG------TEQSPLLPSHTIEE 363

BLAST of Sgr021434.1 vs. TAIR 10
Match: AT5G40240.1 (nodulin MtN21 /EamA-like transporter family protein )

HSP 1 Score: 283.9 bits (725), Expect = 1.6e-76
Identity = 165/358 (46.09%), Positives = 214/358 (59.78%), Query Frame = 0

Query: 5   WFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRR 64
           +F +D+VPFAAM A ECATVGSNT FKAAT RG+S+YVF  Y  + + + LLP ++IF R
Sbjct: 13  YFTRDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGR 72

Query: 65  SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFR 124
           S  LP  K   F +I  L  +G   Q+ G KG+ YSSPTL+SAISNL PAFTF LAV+FR
Sbjct: 73  SRRLPAAKSPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFR 132

Query: 125 MEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQL 184
           ME+V L+ S++ AK++G+ +SISGALVVVLYKGP V+++            HQ L S  +
Sbjct: 133 MEQVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTS--I 192

Query: 185 QPNWILGGLCFVFQYLCNSFWYILQ----------------------------------- 244
           + +WI+GGL    QY   S WYILQ                                   
Sbjct: 193 ESSWIIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESN 252

Query: 245 ------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAIL 304
                             G+    F  + HTWGL+LKGPVY+S FRPLSIAIA AMGAI 
Sbjct: 253 LTSWVLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIF 312

Query: 305 LGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
           LGD LHLGS+IG++I+ +GFY ++WGKA+E+ +K + G      S ++PLL    +ED
Sbjct: 313 LGDALHLGSVIGSMILCIGFYTVIWGKAREDTIKTVAG------SEQSPLLLTHIIED 362

BLAST of Sgr021434.1 vs. TAIR 10
Match: AT5G40240.2 (nodulin MtN21 /EamA-like transporter family protein )

HSP 1 Score: 283.9 bits (725), Expect = 1.6e-76
Identity = 165/358 (46.09%), Positives = 214/358 (59.78%), Query Frame = 0

Query: 5   WFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRR 64
           +F +D+VPFAAM A ECATVGSNT FKAAT RG+S+YVF  Y  + + + LLP ++IF R
Sbjct: 27  YFTRDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGR 86

Query: 65  SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFR 124
           S  LP  K   F +I  L  +G   Q+ G KG+ YSSPTL+SAISNL PAFTF LAV+FR
Sbjct: 87  SRRLPAAKSPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFR 146

Query: 125 MEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQL 184
           ME+V L+ S++ AK++G+ +SISGALVVVLYKGP V+++            HQ L S  +
Sbjct: 147 MEQVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTS--I 206

Query: 185 QPNWILGGLCFVFQYLCNSFWYILQ----------------------------------- 244
           + +WI+GGL    QY   S WYILQ                                   
Sbjct: 207 ESSWIIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESN 266

Query: 245 ------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAIL 304
                             G+    F  + HTWGL+LKGPVY+S FRPLSIAIA AMGAI 
Sbjct: 267 LTSWVLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIF 326

Query: 305 LGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
           LGD LHLGS+IG++I+ +GFY ++WGKA+E+ +K + G      S ++PLL    +ED
Sbjct: 327 LGDALHLGSVIGSMILCIGFYTVIWGKAREDTIKTVAG------SEQSPLLLTHIIED 376

BLAST of Sgr021434.1 vs. TAIR 10
Match: AT4G15540.1 (EamA-like transporter family )

HSP 1 Score: 266.2 bits (679), Expect = 3.4e-71
Identity = 158/358 (44.13%), Positives = 208/358 (58.10%), Query Frame = 0

Query: 5   WFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFALIFRR 64
           +F +D+VPF AM+A EC TVGS+  +KAAT RG S+YVF  Y  V A + LL  +LIF R
Sbjct: 10  YFKRDVVPFTAMIAIECTTVGSSILYKAATLRGFSFYVFVFYAYVGATLVLLLLSLIFGR 69

Query: 65  SPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILAVVFR 124
           S  LP  K S F +I  L+ LGL+ ++ G KG++YSSPTLSSAISNL PAFTFILA+ FR
Sbjct: 70  SRSLPTAKSSLFFKIFLLALLGLTSRVAGCKGIEYSSPTLSSAISNLTPAFTFILAIFFR 129

Query: 125 MEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLGSSQL 184
           ME+V L+ S++ AK++G+ VSISGALV+VLYKGP ++                    +  
Sbjct: 130 MEQVMLRSSATQAKIIGTIVSISGALVIVLYKGPKLL---------------VAASFTSF 189

Query: 185 QPNWILGGLCFVFQYLCNSFWYILQ----------------------------------- 244
           + +WI+GGL    Q+L  S W+ILQ                                   
Sbjct: 190 ESSWIIGGLLLGLQFLLLSVWFILQTHIMEIYPEEIAVVFCYNLCATLISGTVCLLVEKD 249

Query: 245 ------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAAMGAIL 304
                             GL   S  +VIHTWGL++KGPVY+S F+PLSIAIA AM AI 
Sbjct: 250 LNSWQLKPGFSLASVIYSGLFDTSLGSVIHTWGLHVKGPVYISLFKPLSIAIAVAMAAIF 309

Query: 305 LGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPLLQYDKVED 310
           LGD LHLGS+IG++I+S GFY ++WGKA+E+  K +      +S     L  +D+ ED
Sbjct: 310 LGDTLHLGSVIGSVILSFGFYTVIWGKAREDSTKTVS-----DSEQSLLLPSHDREED 347

BLAST of Sgr021434.1 vs. TAIR 10
Match: AT3G28050.1 (nodulin MtN21 /EamA-like transporter family protein )

HSP 1 Score: 250.8 bits (639), Expect = 1.5e-66
Identity = 148/355 (41.69%), Positives = 195/355 (54.93%), Query Frame = 0

Query: 1   MERSWFYKDIVPFAAMVAAECATVGSNTGFKAATARGMSYYVFTLYVCVAAAVALLPFAL 60
           M R +F ++++P  A+V  ECA VG NT FKAAT +GMS++VF +Y    AA+ LLP   
Sbjct: 1   MARKYFQREVLPVTALVIMECANVGLNTLFKAATLKGMSFHVFIVYSYGLAALLLLPSLF 60

Query: 61  IFRRSPELPPHKISFFLRIVCLSALGLSCQLLGNKGLQYSSPTLSSAISNLIPAFTFILA 120
              RS  LPP   S   +IV L  +G    ++G  G+ YSSPTL+SAISNL PAFTF+LA
Sbjct: 61  CSFRSRTLPPMNFSILYKIVLLGIIGCCSNIMGYTGINYSSPTLASAISNLTPAFTFLLA 120

Query: 121 VVFRMEKVALKRSSSTAKMVGSAVSISGALVVVLYKGPIVISNPYSEGPKDLALFHQPLG 180
           VVFRME V+ KR+SS AKM+G+ VSI GA +V LY GP+VI    ++ P  ++     L 
Sbjct: 121 VVFRMESVSFKRTSSVAKMLGTVVSIGGAFIVTLYNGPVVI----AKSPPSVS-----LR 180

Query: 181 SSQLQPNWILGGLCFVFQYLCNSFWYILQ------------------------------- 240
           S    PNWILG      +Y C   WYI+Q                               
Sbjct: 181 SQSTNPNWILGAGFLAVEYFCVPLWYIVQTQIMREYPAEFTVVCFYSIGVSFWTALVTLF 240

Query: 241 -----------------------GLMGQSFVTVIHTWGLNLKGPVYVSSFRPLSIAIAAA 300
                                  GL G      IHTW L +KGP++V+ F+PLSIAIA A
Sbjct: 241 TEGNDLGAWKIKPNIALVSIVCSGLFGSCINNTIHTWALRIKGPLFVAMFKPLSIAIAVA 300

Query: 301 MGAILLGDDLHLGSIIGAIIISVGFYGILWGKAKEEELKGLEGACRLESSSKAPL 302
           MG I L D L++GS+IGA +I++GFY ++WGKAKE  L   +     E +++A L
Sbjct: 301 MGVIFLRDSLYIGSLIGATVITIGFYTVMWGKAKEVALVEDDNKANHEEANEADL 346
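
The percentages on each HSP summary line are the match count divided by the alignment length, for example 264/364 = 72.53%. The following is a minimal Python sketch for reading such a line and recomputing those values. The function names `parse_hsp_stats` and `percent` are illustrative only and are not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches a summary line such as:
#   Identity = 264/364 (72.53%), Positives = 282/364 (77.47%), Query Frame = 0
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


if __name__ == "__main__":
    line = "Identity = 264/364 (72.53%), Positives = 282/364 (77.47%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    for name, (count, length, reported) in stats.items():
        # Compare the reported percentage with the recomputed one.
        print(name, count, length, reported, percent(count, length))
```

Recomputing the percentage this way is a quick consistency check when the summary lines are copied into another table.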

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038905800.1  2.0e-137  73.35     WAT1-related protein At5g40240-like [Benincasa hispida]
KAA0053694.1    3.5e-134  72.53     WAT1-related protein [Cucumis melo var. makuwa]
XP_022989272.1  8.5e-133  73.22     WAT1-related protein At4g15540-like [Cucurbita maxima]
KAG6588972.1    3.6e-131  72.13     WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia]
XP_023529741.1  4.0e-130  71.86     WAT1-related protein At4g15540-like [Cucurbita pepo subsp. pepo]

Match Name  E-value  Identity  Description
F4KHA8      5.8e-76  45.68     WAT1-related protein At5g40230 OS=Arabidopsis thaliana OX=3702 GN=At5g40230 PE=3...
Q9FL08      2.2e-75  46.09     WAT1-related protein At5g40240 OS=Arabidopsis thaliana OX=3702 GN=At5g40240 PE=2...
F4JK59      4.8e-70  44.13     WAT1-related protein At4g15540 OS=Arabidopsis thaliana OX=3702 GN=At4g15540 PE=2...
Q94JU2      2.1e-65  41.69     WAT1-related protein At3g28050 OS=Arabidopsis thaliana OX=3702 GN=At3g28050 PE=2...
Q56X95      1.9e-50  38.87     WAT1-related protein At3g28130 OS=Arabidopsis thaliana OX=3702 GN=At3g28130 PE=2...

Match Name  E-value   Identity  Description
A0A5A7UEP6  1.7e-134  72.53     WAT1-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold135...
A0A6J1JFD0  4.1e-133  73.22     WAT1-related protein OS=Cucurbita maxima OX=3661 GN=LOC111486393 PE=3 SV=1
A0A6J1EKI9  4.3e-130  71.58     WAT1-related protein OS=Cucurbita moschata OX=3662 GN=LOC111435310 PE=3 SV=1
A0A6J1D9Q1  1.2e-129  71.86     WAT1-related protein OS=Momordica charantia OX=3673 GN=LOC111018334 PE=3 SV=1
A0A1S4DUI7  1.9e-109  68.31     WAT1-related protein OS=Cucumis melo OX=3656 GN=LOC103487063 PE=3 SV=1

Match Name   E-value  Identity  Description
AT5G40230.1  4.1e-77  45.68     nodulin MtN21 /EamA-like transporter family protein
AT5G40240.1  1.6e-76  46.09     nodulin MtN21 /EamA-like transporter family protein
AT5G40240.2  1.6e-76  46.09     nodulin MtN21 /EamA-like transporter family protein
AT4G15540.1  3.4e-71  44.13     EamA-like transporter family
AT3G28050.1  1.5e-66  41.69     nodulin MtN21 /EamA-like transporter family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description       Source       Source Term     Source Description                            Alignment
IPR000620  EamA domain           PFAM         PF00892         EamA                                          coord: 23..153; e-value: 8.0E-12; score: 45.5
None       No IPR available      PANTHER      PTHR31218:SF48  WAT1-RELATED PROTEIN                          coord: 214..284; coord: 10..208
None       No IPR available      SUPERFAMILY  103481          Multidrug resistance efflux transporter EmrE  coord: 47..156
None       No IPR available      SUPERFAMILY  103481          Multidrug resistance efflux transporter EmrE  coord: 202..280
IPR030184  WAT1-related protein  PANTHER      PTHR31218       WAT1-RELATED PROTEIN                          coord: 214..284; coord: 10..208
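
The `coord: start..end` entries give inclusive residue positions on the polypeptide, so the span of a hit is end - start + 1 (131 residues for the PFAM EamA hit at 23..153). The following is a minimal Python sketch for extracting these spans from an Alignment entry. The helper name `span_lengths` is hypothetical, and the inclusive, 1-based reading of the coordinates is an assumption.

```python
import re

# Matches entries such as "coord: 23..153".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Coordinates are assumed to be 1-based and inclusive, so the
    length is end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # Alignment entries copied from the InterPro table above.
    for entry in ("coord: 23..153", "coord: 214..284; coord: 10..208"):
        print(entry, "->", span_lengths(entry))
```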

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Sgr021434     Sgr021434    gene


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name        Type
Sgr021434.1.exon1  Sgr021434.1.exon1  exon
Sgr021434.1.exon2  Sgr021434.1.exon2  exon
Sgr021434.1.exon3  Sgr021434.1.exon3  exon
Sgr021434.1.exon4  Sgr021434.1.exon4  exon
Sgr021434.1.exon5  Sgr021434.1.exon5  exon
Sgr021434.1.exon6  Sgr021434.1.exon6  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name     Unique Name        Type
cds.Sgr021434.1  cds.Sgr021434.1    CDS
cds.Sgr021434.1  cds.Sgr021434.1_2  CDS
cds.Sgr021434.1  cds.Sgr021434.1_3  CDS
cds.Sgr021434.1  cds.Sgr021434.1_4  CDS
cds.Sgr021434.1  cds.Sgr021434.1_5  CDS
cds.Sgr021434.1  cds.Sgr021434.1_6  CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name          Type
Sgr021434.1   Sgr021434.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0022857      transmembrane transporter activity