Cp4.1LG01g21100 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g21100
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: WAT1-related protein
Location: Cp4.1LG01: 17901936 .. 17904480 (+)
RNA-Seq Expression: Cp4.1LG01g21100
Synteny: Cp4.1LG01g21100
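The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of splitting it apart follows. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location string like 'Cp4.1LG01: 17901936 .. 17904480 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01: 17901936 .. 17904480 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 2545 bases on the plus strand of Cp4.1LG01.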
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACACACACACACCCCCAAAACAAGCCAACAAACATGAGGAGCTTGGTTAGCTATGCAGAAGCAATGGAGGTCCACAAGCCATACATTGCTATGCTGTTCGTTCAGTGTGTGTACTCAGGAATGGCTTTGTTCTCAAAGGCAGCCATTTCTCAAGGCATGAACCCACCCATCTTCGTCTTCTACCGCCAAGCGTTCGCTACCATCGCCATGGCTCCGTTCGCCTTCTTCCTCGAAAGGTTCGCTTTCAAACTTTCTTTCAATCTTCCGCTCTCAATCGTTTCACTCCGTTACAATATCGAAATCGTTGCAGAAAGAAGGCAGTTCCTTTATCCTTCAAGTTTCTCTTCAAAGTGTTCTTGATTTCTCTAAGTGGGTATGTGAAATAATCTTCAAATTCTAATCTTTATAATTTGATCTTAAATTCATCACCAAAAAAATCACTTTTTTGTTTGTGTTTTTTTTTTTCAGAATCACCTTGAGTTTAAACCTTTATTACATAGCTATCAACCACATATCAGCAACATTTGCAGCCGCTACAACCAACACGATCCCCGCTATTACACTCCTCTTCGCTCTCCTCTTCAGGTAGTAATAACAGATCAAGCCCATTACTAGTAGATATTGTCTTTTTTGAACTCAAAGTTCCGTTCTCCTGTCCAACAGATATGAAGTGATTTCGGTGAGAAAGATGGAAGGGATAGCGAAATTGGTTGGGGGTGTGATAGGTTTTTCTGGGGCTTTGGTGTATGCTTTTGTGAAGGGTCCAGTGATGAAGTTCATGAATTGGTACCCACAAAACGCTCGTAATTCGTTCGAAGGGTACTCGGGTTCGGAATGGATTAAGGGTTCATTTGTTATGCTTTCAGCCAATATTGCTTGGTCTTTGTGGCTTGTTTTGCAGGTTTGATTCTGTTTGTTTTGGGTCAAATTTTTGAAGTTGTTTGTGGGGTTTTGGAGTTTTGATGGAATTTTGGTGTTTTGTTTTAAGGCCTCAATTGTGAAGGAATATCCAGCGAAGTTGAGGATTACAAGCTTGCAGTGCTTCTTCAGCTTGATACAATCAGGTTTGTGGGCTGTGGTGATGGAGAGAAAGGCTGATGCTTGGAAGCTTGGATGGAATCTTCAACTCTTCTCGGTTGCCTATTGCGTAAAAATCTCTCGAGTTTTCCCTCAATGTTTTAAAACGCGTCTTGTAGGGAGAGGTTTCCACACCCTTATAAACAAGGTTTCGCTCTCCCCAACCGATGTGAGATCTCACAATCCACCTCCTTTCAGGACCTAGCGTCGTTGCTGACACTCGTTCCTCTTTCTAGTCAATGTGAGGTCTCACAATCCACCCCTTGTTGGCGCTCGTTCCTCTCTCTAGTCAATGTAGAATCTCACAATCCACCTTCTTTCGGAATCTAGCGTCCTTGCTTACACACCGTCTGGTGTCCACCCTCTTTGGGGATCAGCGTTCTTGTTGGAACACTGCCCAGTGTCTGGCTTTGATACCAACTGTAACGGCCCAAACTCAAACCCACCGCTAACAGATATTGTTCTCTTTGAGTTTTCCCTTTTAGGCTTCCCCTCAAGGTTTTTAAAACGCGTCTACTAAGAAGAGGTTTTCACACACTTATAAAGAATGCTTCGTTCTCCTCACAGTTGGGGAGGAGAACGAAACACCCTTTATAATAGTGTGGAAATCTCTCCCTAGCATACGCGTTTTAAAAACCTTGAGGGTAAGCTCGAAAGGGAAAGTCCAAAGAGGACAATATCTAGTACAGTGGGCTTAGGCCGTTACGTCGACCGATATGGAATCTCACACTAATAGTCTCTCGTCTTAATCTACTCTCTTAAGCTTTCGAGTTTGACCGTCTATCTAACGAACGATTCAAAATTACAGGGTGTGATCGTGACGGGAATGACGTATTGGCTACAAATATGGACAGTAGAGAAGAAAGGACCAGTTTTCACAGCCATGTTTACACCATTAGCACTAATCATAACAGCAATCTTC
TCAGCATTAGTATGGAAGGAAGCCCTTCATTGGGGAAGGTGATCAATTGCTCTAACTTCTACCACTAAATATCATCATAAACTAACAAAACAAAGGGCAAAAAGAACACAAATCCATGAATTTTAGCAGAAAATAAGGTTAAAAATCAAATAAAAACCATGAAAAGACTGAATTTTTATTGAAATGTGGCAGTGTTGGTGGGGCTATATTGCTGGTGGTGGGGCTTTATTGTGTTCTATGGGGGAAGAACAAAGAAGAAGATATCAAAACTGAAGCAATTGAACAAAGAGTTGATATCAAAGAGGAAACCAATTCAGCCTCCATTTGCTAATTCATCCTATCATCATCACCATATTAATATCTAATAGATTATTCCATATGTCACTGTCATTAACCTATAGCCCTTGCTGTTTAGCTCTCTTATGATCTTAATGGATGACTTGGTCATCCACTGGATGACTTGGGTCTATGTCAATTGTAAAGAACAGACAAAGTAATTTAATATTCGGGGGTCGGGTTTCTATATGTGAAACGATCCCACTTAG

mRNA sequence

CACACACACACACCCCCAAAACAAGCCAACAAACATGAGGAGCTTGGTTAGCTATGCAGAAGCAATGGAGGTCCACAAGCCATACATTGCTATGCTGTTCGTTCAGTGTGTGTACTCAGGAATGGCTTTGTTCTCAAAGGCAGCCATTTCTCAAGGCATGAACCCACCCATCTTCGTCTTCTACCGCCAAGCGTTCGCTACCATCGCCATGGCTCCGTTCGCCTTCTTCCTCGAAAGAAAGAAGGCAGTTCCTTTATCCTTCAAGTTTCTCTTCAAAGTGTTCTTGATTTCTCTAAGTGGAATCACCTTGAGTTTAAACCTTTATTACATAGCTATCAACCACATATCAGCAACATTTGCAGCCGCTACAACCAACACGATCCCCGCTATTACACTCCTCTTCGCTCTCCTCTTCAGATATGAAGTGATTTCGGTGAGAAAGATGGAAGGGATAGCGAAATTGGTTGGGGGTGTGATAGGTTTTTCTGGGGCTTTGGTGTATGCTTTTGTGAAGGGTCCAGTGATGAAGTTCATGAATTGGTACCCACAAAACGCTCGTAATTCGTTCGAAGGGTACTCGGGTTCGGAATGGATTAAGGGTTCATTTGTTATGCTTTCAGCCAATATTGCTTGGTCTTTGTGGCTTGTTTTGCAGGGTGTGATCGTGACGGGAATGACGTATTGGCTACAAATATGGACAGTAGAGAAGAAAGGACCAGTTTTCACAGCCATGTTTACACCATTAGCACTAATCATAACAGCAATCTTCTCAGCATTAGTATGGAAGGAAGCCCTTCATTGGGGAAGTGTTGGTGGGGCTATATTGCTGGTGGTGGGGCTTTATTGTGTTCTATGGGGGAAGAACAAAGAAGAAGATATCAAAACTGAAGCAATTGAACAAAGAGTTGATATCAAAGAGGAAACCAATTCAGCCTCCATTTGCTAATTCATCCTATCATCATCACCATATTAATATCTAATAGATTATTCCATATGTCACTGTCATTAACCTATAGCCCTTGCTGTTTAGCTCTCTTATGATCTTAATGGATGACTTGGTCATCCACTGGATGACTTGGGTCTATGTCAATTGTAAAGAACAGACAAAGTAATTTAATATTCGGGGGTCGGGTTTCTATATGTGAAACGATCCCACTTAG

Coding sequence (CDS)

ATGAGGAGCTTGGTTAGCTATGCAGAAGCAATGGAGGTCCACAAGCCATACATTGCTATGCTGTTCGTTCAGTGTGTGTACTCAGGAATGGCTTTGTTCTCAAAGGCAGCCATTTCTCAAGGCATGAACCCACCCATCTTCGTCTTCTACCGCCAAGCGTTCGCTACCATCGCCATGGCTCCGTTCGCCTTCTTCCTCGAAAGAAAGAAGGCAGTTCCTTTATCCTTCAAGTTTCTCTTCAAAGTGTTCTTGATTTCTCTAAGTGGAATCACCTTGAGTTTAAACCTTTATTACATAGCTATCAACCACATATCAGCAACATTTGCAGCCGCTACAACCAACACGATCCCCGCTATTACACTCCTCTTCGCTCTCCTCTTCAGATATGAAGTGATTTCGGTGAGAAAGATGGAAGGGATAGCGAAATTGGTTGGGGGTGTGATAGGTTTTTCTGGGGCTTTGGTGTATGCTTTTGTGAAGGGTCCAGTGATGAAGTTCATGAATTGGTACCCACAAAACGCTCGTAATTCGTTCGAAGGGTACTCGGGTTCGGAATGGATTAAGGGTTCATTTGTTATGCTTTCAGCCAATATTGCTTGGTCTTTGTGGCTTGTTTTGCAGGGTGTGATCGTGACGGGAATGACGTATTGGCTACAAATATGGACAGTAGAGAAGAAAGGACCAGTTTTCACAGCCATGTTTACACCATTAGCACTAATCATAACAGCAATCTTCTCAGCATTAGTATGGAAGGAAGCCCTTCATTGGGGAAGTGTTGGTGGGGCTATATTGCTGGTGGTGGGGCTTTATTGTGTTCTATGGGGGAAGAACAAAGAAGAAGATATCAAAACTGAAGCAATTGAACAAAGAGTTGATATCAAAGAGGAAACCAATTCAGCCTCCATTTGCTAA

Protein sequence

MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGSEWIKGSFVMLSANIAWSLWLVLQGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC
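The coding sequence and the protein sequence above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last five codons of the CDS listed above.
print(translate("ATGAGGAGCTTGGTTAGC"))  # start of the protein
print(translate("GCCTCCATTTGCTAA"))     # end of the protein plus stop codon
```

The two fragments translate to `MRSLVS` and `ASIC*`, which agree with the start and end of the protein sequence shown above. Passing the full CDS string to `translate` and dropping the trailing `*` would check the whole protein the same way.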
Homology
BLAST of Cp4.1LG01g21100 vs. ExPASy Swiss-Prot
Match: Q6NMB7 (WAT1-related protein At1g43650 OS=Arabidopsis thaliana OX=3702 GN=At1g43650 PE=2 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 1.1e-76
Identity = 160/326 (49.08%), Positives = 207/326 (63.50%), Query Frame = 0

Query: 11  MEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKK 70
           M  HK  +AM+FVQ VY+GM L SK AISQG NP +FVFYRQAFA +A++PFAFFLE  K
Sbjct: 2   MMEHKANMAMVFVQIVYAGMPLLSKVAISQGTNPFVFVFYRQAFAALALSPFAFFLESSK 61

Query: 71  AVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYE 130
           + PLSF  L K+F ISL G+TLSLNLYY+AI + +ATFAAATTN IP+IT + ALLFR E
Sbjct: 62  SSPLSFILLLKIFFISLCGLTLSLNLYYVAIENTTATFAAATTNAIPSITFVLALLFRLE 121

Query: 131 VISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGSEWIKGS 190
            ++++K  G+AK+ G ++G  GALV+AFVKGP    +N Y  +   +    S    +KGS
Sbjct: 122 TVTLKKSHGVAKVTGSMVGMLGALVFAFVKGP--SLINHYNSSTIPNGTVPSTKNSVKGS 181

Query: 191 FVMLSANIAWSLWLVLQ------------------------------------------- 250
             ML+AN  W LW+++Q                                           
Sbjct: 182 ITMLAANTCWCLWIIMQSKVMKEYPAKLRLVALQCLFSCIQSAVWAVAVNRNPSVWKIEF 241

Query: 251 ----------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVWKEALHWG 284
                     G++VTG+TYWLQ+W +EKKGPVFTA++TPLALI+T I S+ ++KE  + G
Sbjct: 242 GLPLLSMAYCGIMVTGLTYWLQVWAIEKKGPVFTALYTPLALILTCIVSSFLFKETFYLG 301
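Each hit in this section carries a one-line HSP summary giving identity and positive counts over the aligned length. A minimal sketch of reading that line and recomputing the percentages follows. The helper name `parse_hsp_stats` is hypothetical, and the pattern only assumes the line layout seen on this page (it also tolerates the "Postives" spelling that appears in the raw page text).

```python
import re

# Matches e.g. "Identity = 160/326 (49.08%), Positives = 207/326 (63.50%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, positive, positive_aligned = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positive / positive_aligned, 2),
    }

stats = parse_hsp_stats(
    "Identity = 160/326 (49.08%), Positives = 207/326 (63.50%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the Q6NMB7 hit, 160 identical and 207 positive positions over 326 aligned columns recompute to the 49.08% and 63.50% reported in the summary line.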

BLAST of Cp4.1LG01g21100 vs. ExPASy Swiss-Prot
Match: Q9FGG3 (WAT1-related protein At5g64700 OS=Arabidopsis thaliana OX=3702 GN=At5g64700 PE=2 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 6.8e-61
Identity = 142/349 (40.69%), Positives = 202/349 (57.88%), Query Frame = 0

Query: 11  MEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKK 70
           ME  KPY+ +  +Q +Y+ M L SKA  + GMN  +FVFYRQAFATI +AP AFF ERK 
Sbjct: 3   MESKKPYLMVTIIQVIYTIMFLISKAVFNGGMNTFVFVFYRQAFATIFLAPLAFFFERKS 62

Query: 71  AVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYE 130
           A PLSF    K+F++SL G+TLSL+L  IA+++ SAT AAATT ++PAIT   ALLF  E
Sbjct: 63  APPLSFVTFIKIFMLSLFGVTLSLDLNGIALSYTSATLAAATTASLPAITFFLALLFGME 122

Query: 131 VISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMK------FMNWYPQNARNSFEGYSG- 190
            + V+ ++G AKLVG  +   G ++ A  KGP++K      F +      RN+    SG 
Sbjct: 123 RLKVKSIQGTAKLVGITVCMGGVIILAIYKGPLLKLPLCPHFYHGQEHPHRNNPGHVSGG 182

Query: 191 -SEWIKGSFVMLSANIAWSLWLVLQ----------------------------------- 250
            + W+KG  +M+++NI W LWLVLQ                                   
Sbjct: 183 STSWLKGCVLMITSNILWGLWLVLQGRVLKVYPSKLYFTTLHCLLSSIQSFVIAIALERD 242

Query: 251 ------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALV 299
                             G IVTG+ Y+LQ W +EK+GPVF +MFTPL+L+ T + SA++
Sbjct: 243 ISAWKLGWNLRLVAVIYCGFIVTGVAYYLQSWVIEKRGPVFLSMFTPLSLLFTLLSSAIL 302

BLAST of Cp4.1LG01g21100 vs. ExPASy Swiss-Prot
Match: Q8GXB4 (WAT1-related protein At1g09380 OS=Arabidopsis thaliana OX=3702 GN=At1g09380 PE=2 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 4.9e-43
Identity = 109/345 (31.59%), Positives = 181/345 (52.46%), Query Frame = 0

Query: 16  PYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKKAVPLS 75
           P++AM+ VQ  Y+GM + SK A+  GM P I V YRQ FATIA  P AFFLERK    ++
Sbjct: 8   PFLAMVLVQIGYAGMNITSKMAMEAGMKPLILVAYRQIFATIATFPVAFFLERKTRPKIT 67

Query: 76  FKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYEVISVR 135
            + L +VF  S++G T +  LY++ + + S T A A TN +PA+T L A +FR E + ++
Sbjct: 68  LRILVQVFFCSITGATGNQVLYFVGLQNSSPTIACALTNLLPAVTFLLAAIFRQETVGIK 127

Query: 136 KMEGIAKLVGGVIGFSGALVYAFVKGPVMKF----MNW-YPQNARNSFEGYSGSEWIKGS 195
           K  G AK++G ++   GA+V +F  G  +      ++W Y +N          S +  G 
Sbjct: 128 KASGQAKVIGTLVCVIGAMVLSFYHGHTIGIGESKIHWAYAENITKHGSSSGHSNFFLGP 187

Query: 196 FVMLSANIAWSLWLVLQ------------------------------------------- 255
           F++++A ++W+ W ++Q                                           
Sbjct: 188 FLIMAAAVSWAAWFIIQTKMSETFAAPYTSTLLMCLMGSIQCGAIALISDHTISDWSLSS 247

Query: 256 ----------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVWKEALHWG 299
                     GV+ + + + L  W +++KGP++ ++F+PL L++ AIFS  + +E L+ G
Sbjct: 248 PLRFISALYAGVVASALAFCLMSWAMQRKGPLYVSVFSPLLLVVVAIFSWALLEEKLYTG 307

BLAST of Cp4.1LG01g21100 vs. ExPASy Swiss-Prot
Match: F4HZQ7 (WAT1-related protein At1g21890 OS=Arabidopsis thaliana OX=3702 GN=At1g21890 PE=2 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 2.4e-42
Identity = 108/330 (32.73%), Positives = 167/330 (50.61%), Query Frame = 0

Query: 15  KPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKKAVPL 74
           KPY+AM+ +Q  Y+GM + +  ++  GMN  +   YR A AT  +APFA F ERK    +
Sbjct: 10  KPYLAMISMQFGYAGMYIITMVSLKHGMNHYVLAVYRHAIATAVIAPFALFHERKIRPKM 69

Query: 75  SFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYEVISV 134
           +F+   ++ L+      L  NLYY+ + + SATFA+AT N +PAIT + A++FR E ++ 
Sbjct: 70  TFRIFLQIALLGFIEPVLDQNLYYVGMTYTSATFASATANVLPAITFVLAIIFRLESVNF 129

Query: 135 RKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGS---------- 194
           +K+  IAK+VG VI  SGAL+    KGP++ F+ +       S +G  GS          
Sbjct: 130 KKVRSIAKVVGTVITVSGALLMTLYKGPIVDFIRFGGGGGGGS-DGAGGSHGGAGAAAMD 189

Query: 195 -EWIKGSFVMLSANIAWSLWLVLQ------------------------------------ 254
             WI G+ ++L     W+ + +LQ                                    
Sbjct: 190 KHWIPGTLMLLGRTFGWAGFFILQSFTLKQYPAELSLTTLICLMGTLEGTAVSLVTVRDL 249

Query: 255 -----------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVW 281
                            GVI +G+ Y++Q   + ++GPVF A F PL ++ITA    +V 
Sbjct: 250 SAWKIGFDSNLFAAAYSGVICSGVAYYVQGVVMRERGPVFVATFNPLCVVITAALGVVVL 309

BLAST of Cp4.1LG01g21100 vs. ExPASy Swiss-Prot
Match: Q9FL41 (WAT1-related protein At5g07050 OS=Arabidopsis thaliana OX=3702 GN=At5g07050 PE=2 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 1.2e-41
Identity = 105/344 (30.52%), Positives = 176/344 (51.16%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           M  + S    +   KPY AM+ +Q  Y+GM + +K +++ GM+  + V YR A AT  +A
Sbjct: 3   MEEISSCESFLTSSKPYFAMISLQFGYAGMNIITKISLNTGMSHYVLVVYRHAIATAVIA 62

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFF ERK    ++F    ++F++ L G  +  N YY+ + + S TF+ A +N +PA+T
Sbjct: 63  PFAFFFERKAQPKITFSIFMQLFILGLLGPVIDQNFYYMGLKYTSPTFSCAMSNMLPAMT 122

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVM-----KFMNWYPQ--- 180
            + A+LFR E++ ++K+   AK+ G V+  +GA++    KGP++     K+M+       
Sbjct: 123 FILAVLFRMEMLDLKKLWCQAKIAGTVVTVAGAMLMTIYKGPIVELFWTKYMHIQDSSHA 182

Query: 181 NARNSFEGYSGSEWIKGSFVMLSANIAWSLWLVLQ------------------------- 240
           N  +S    S  E++KGS +++ A +AW+   VLQ                         
Sbjct: 183 NTTSSKNSSSDKEFLKGSILLIFATLAWASLFVLQAKILKTYAKHQLSLTTLICFIGTLQ 242

Query: 241 -----------------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLA 283
                                        G++ + ++Y++Q   ++K+GPVF   F+PL 
Sbjct: 243 AVAVTFVMEHNPSAWRIGWDMNLLAAAYSGIVASSISYYVQGIVMKKRGPVFATAFSPLM 302

BLAST of Cp4.1LG01g21100 vs. NCBI nr
Match: XP_023546440.1 (WAT1-related protein At1g43650 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 560 bits (1443), Expect = 1.37e-199
Identity = 303/356 (85.11%), Positives = 303/356 (85.11%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT
Sbjct: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180
           LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG
Sbjct: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180

Query: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------------- 240
           YSGSEWIKGSFVMLSANIAWSLWLVLQ                                 
Sbjct: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGLWAVVME 240

Query: 241 --------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300
                               GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA
Sbjct: 241 RKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300

Query: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC 303
           LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC
Sbjct: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC 356

BLAST of Cp4.1LG01g21100 vs. NCBI nr
Match: XP_022963786.1 (WAT1-related protein At1g43650 [Cucurbita moschata])

HSP 1 Score: 557 bits (1435), Expect = 2.26e-198
Identity = 301/356 (84.55%), Positives = 302/356 (84.83%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFF ERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT
Sbjct: 61  PFAFFFERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180
           LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG
Sbjct: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180

Query: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------------- 240
           YSGSEWIKGSFVMLSANIAWSLWLVLQ                                 
Sbjct: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGLWAVAME 240

Query: 241 --------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300
                               GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA
Sbjct: 241 RKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300

Query: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC 303
           LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIK+EAIEQRVDIKEETNSASIC
Sbjct: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKSEAIEQRVDIKEETNSASIC 356

BLAST of Cp4.1LG01g21100 vs. NCBI nr
Match: KAG6602184.1 (WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7032868.1 WAT1-related protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 556 bits (1432), Expect = 6.47e-198
Identity = 300/356 (84.27%), Positives = 302/356 (84.83%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAM+VHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMDVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFF ERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT
Sbjct: 61  PFAFFFERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180
           LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG
Sbjct: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180

Query: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------------- 240
           YSGSEWIKGSFVMLSANIAWSLWLVLQ                                 
Sbjct: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGLWAVVME 240

Query: 241 --------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300
                               GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA
Sbjct: 241 RKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300

Query: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC 303
           LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIK+EAIEQRVDIKEETNSASIC
Sbjct: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKSEAIEQRVDIKEETNSASIC 356

BLAST of Cp4.1LG01g21100 vs. NCBI nr
Match: XP_022990390.1 (WAT1-related protein At1g43650 [Cucurbita maxima])

HSP 1 Score: 534 bits (1376), Expect = 2.72e-189
Identity = 292/362 (80.66%), Positives = 297/362 (82.04%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAIS+GMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISRGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFFLERKKAVPLSFKFLFKVFLISLSGIT SLNLYYIAINHISATFAAATTNTIPA+T
Sbjct: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITSSLNLYYIAINHISATFAAATTNTIPAVT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNAR----- 180
           LLFAL F YEVISVRKMEGIAKLVG VIGFSGALVYAFVKGP MKFMNWYPQNA+     
Sbjct: 121 LLFALFFGYEVISVRKMEGIAKLVGAVIGFSGALVYAFVKGPAMKFMNWYPQNAQTAKMA 180

Query: 181 -NSFEGYSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------- 240
            NSF+GYS SEWIKGSFVMLSANIAWSLWLVLQ                           
Sbjct: 181 SNSFQGYSDSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGL 240

Query: 241 --------------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII 300
                                     GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII
Sbjct: 241 WAVAMERKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII 300

Query: 301 TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSAS 303
           TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIK+EAIEQRVDIKEETNSAS
Sbjct: 301 TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKSEAIEQRVDIKEETNSAS 360

BLAST of Cp4.1LG01g21100 vs. NCBI nr
Match: XP_038884292.1 (WAT1-related protein At1g43650 isoform X2 [Benincasa hispida])

HSP 1 Score: 452 bits (1164), Expect = 5.94e-157
Identity = 253/359 (70.47%), Positives = 267/359 (74.37%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           M+S V Y EAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFAT+AMA
Sbjct: 1   MKSFVGYVEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATVAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           P AF  ERKKAVPL FKFL KVFL+SL GITLSLNLYYIAINH SATFAAATTNTIPAIT
Sbjct: 61  PLAFLFERKKAVPLCFKFLSKVFLVSLVGITLSLNLYYIAINHTSATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARN---- 180
           LL ALLFRYE I +RK+EG+AKL+G +IGFSGALV+AFVKGP MKFMNWYPQ   N    
Sbjct: 121 LLLALLFRYESICIRKVEGMAKLMGAMIGFSGALVFAFVKGPPMKFMNWYPQTNNNNNNN 180

Query: 181 -----SFEGYSGSEWIKGSFVMLSANIAWSLWLVLQG----------------------- 240
                SF+ YS  EWIKGSF MLSANIAWSLWLVLQG                       
Sbjct: 181 NSNSNSFQPYSTLEWIKGSFTMLSANIAWSLWLVLQGFIVKEYPAKLRITTLQCFFSLIQ 240

Query: 241 ------------------------------VIVTGMTYWLQIWTVEKKGPVFTAMFTPLA 297
                                         VIVTGMTYWLQIW VEKKGPVFTAMFTPLA
Sbjct: 241 SALWAVVMERKPQAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWCVEKKGPVFTAMFTPLA 300

BLAST of Cp4.1LG01g21100 vs. ExPASy TrEMBL
Match: A0A6J1HH16 (WAT1-related protein OS=Cucurbita moschata OX=3662 GN=LOC111463975 PE=3 SV=1)

HSP 1 Score: 557 bits (1435), Expect = 1.09e-198
Identity = 301/356 (84.55%), Positives = 302/356 (84.83%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFF ERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT
Sbjct: 61  PFAFFFERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180
           LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG
Sbjct: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEG 180

Query: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------------- 240
           YSGSEWIKGSFVMLSANIAWSLWLVLQ                                 
Sbjct: 181 YSGSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGLWAVAME 240

Query: 241 --------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300
                               GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA
Sbjct: 241 RKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSA 300

Query: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSASIC 303
           LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIK+EAIEQRVDIKEETNSASIC
Sbjct: 301 LVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKSEAIEQRVDIKEETNSASIC 356

BLAST of Cp4.1LG01g21100 vs. ExPASy TrEMBL
Match: A0A6J1JPZ3 (WAT1-related protein OS=Cucurbita maxima OX=3661 GN=LOC111487262 PE=3 SV=1)

HSP 1 Score: 534 bits (1376), Expect = 1.32e-189
Identity = 292/362 (80.66%), Positives = 297/362 (82.04%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAIS+GMNPPIFVFYRQAFATIAMA
Sbjct: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISRGMNPPIFVFYRQAFATIAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           PFAFFLERKKAVPLSFKFLFKVFLISLSGIT SLNLYYIAINHISATFAAATTNTIPA+T
Sbjct: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITSSLNLYYIAINHISATFAAATTNTIPAVT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNAR----- 180
           LLFAL F YEVISVRKMEGIAKLVG VIGFSGALVYAFVKGP MKFMNWYPQNA+     
Sbjct: 121 LLFALFFGYEVISVRKMEGIAKLVGAVIGFSGALVYAFVKGPAMKFMNWYPQNAQTAKMA 180

Query: 181 -NSFEGYSGSEWIKGSFVMLSANIAWSLWLVLQ--------------------------- 240
            NSF+GYS SEWIKGSFVMLSANIAWSLWLVLQ                           
Sbjct: 181 SNSFQGYSDSEWIKGSFVMLSANIAWSLWLVLQASIVKEYPAKLRITSLQCFFSLIQSGL 240

Query: 241 --------------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII 300
                                     GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII
Sbjct: 241 WAVAMERKADAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALII 300

Query: 301 TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKTEAIEQRVDIKEETNSAS 303
           TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIK+EAIEQRVDIKEETNSAS
Sbjct: 301 TAIFSALVWKEALHWGSVGGAILLVVGLYCVLWGKNKEEDIKSEAIEQRVDIKEETNSAS 360

BLAST of Cp4.1LG01g21100 vs. ExPASy TrEMBL
Match: A0A5D3DGQ0 (WAT1-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00750 PE=3 SV=1)

HSP 1 Score: 432 bits (1111), Expect = 3.29e-149
Identity = 240/355 (67.61%), Positives = 261/355 (73.52%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQ-GMNPPIFVFYRQAFATIAM 60
           M+S + Y EAM VHKPYIAMLFVQCVYSGMALFSKAAISQ GMNP IFVFYRQAFAT+AM
Sbjct: 1   MKSFLGYVEAMRVHKPYIAMLFVQCVYSGMALFSKAAISQKGMNPAIFVFYRQAFATVAM 60

Query: 61  APFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAI 120
           AP AF LERKK VPLSFKF  KVFL+SL G+TLSLNLYY+AINH SATFAAATTNTIPAI
Sbjct: 61  APLAFLLERKKEVPLSFKFHSKVFLVSLIGVTLSLNLYYVAINHTSATFAAATTNTIPAI 120

Query: 121 TLLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQ--NARNS 180
           TLL ALLFRYE I +RK+EG+AKL+G +IGFSGALV+AFVKGP MKFMNWYPQ  N  NS
Sbjct: 121 TLLLALLFRYESICIRKVEGMAKLMGAIIGFSGALVFAFVKGPPMKFMNWYPQTNNITNS 180

Query: 181 FEGYSGSEWIKGSFVMLSANIAWSLWLVLQ------------------------------ 240
           F+ YS  EWIKGSF MLSAN+AWS WLVLQ                              
Sbjct: 181 FQPYSTLEWIKGSFTMLSANLAWSFWLVLQASIVKEYPAKLRVTTLQCFFSLIQSALWAL 240

Query: 241 -----------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAI 295
                                  GVIVTGMTYWLQIW VEKKGPVFTAMFTPLALIITAI
Sbjct: 241 VMERNPQAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWCVEKKGPVFTAMFTPLALIITAI 300

BLAST of Cp4.1LG01g21100 vs. ExPASy TrEMBL
Match: A0A1S3B3L9 (WAT1-related protein OS=Cucumis melo OX=3656 GN=LOC103485619 PE=3 SV=1)

HSP 1 Score: 432 bits (1111), Expect = 3.29e-149
Identity = 240/355 (67.61%), Positives = 261/355 (73.52%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQ-GMNPPIFVFYRQAFATIAM 60
           M+S + Y EAM VHKPYIAMLFVQCVYSGMALFSKAAISQ GMNP IFVFYRQAFAT+AM
Sbjct: 1   MKSFLGYVEAMRVHKPYIAMLFVQCVYSGMALFSKAAISQKGMNPAIFVFYRQAFATVAM 60

Query: 61  APFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAI 120
           AP AF LERKK VPLSFKF  KVFL+SL G+TLSLNLYY+AINH SATFAAATTNTIPAI
Sbjct: 61  APLAFLLERKKEVPLSFKFHSKVFLVSLIGVTLSLNLYYVAINHTSATFAAATTNTIPAI 120

Query: 121 TLLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQ--NARNS 180
           TLL ALLFRYE I +RK+EG+AKL+G +IGFSGALV+AFVKGP MKFMNWYPQ  N  NS
Sbjct: 121 TLLLALLFRYESICIRKVEGMAKLMGAIIGFSGALVFAFVKGPPMKFMNWYPQTNNITNS 180

Query: 181 FEGYSGSEWIKGSFVMLSANIAWSLWLVLQ------------------------------ 240
           F+ YS  EWIKGSF MLSAN+AWS WLVLQ                              
Sbjct: 181 FQPYSTLEWIKGSFTMLSANLAWSFWLVLQASIVKEYPAKLRVTTLQCFFSLIQSALWAL 240

Query: 241 -----------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAI 295
                                  GVIVTGMTYWLQIW VEKKGPVFTAMFTPLALIITAI
Sbjct: 241 VMERNPQAWKLGWNLQLFSVAYCGVIVTGMTYWLQIWCVEKKGPVFTAMFTPLALIITAI 300

BLAST of Cp4.1LG01g21100 vs. ExPASy TrEMBL
Match: A0A6J1BVI1 (WAT1-related protein OS=Momordica charantia OX=3673 GN=LOC111006092 PE=3 SV=1)

HSP 1 Score: 427 bits (1098), Expect = 2.52e-147
Identity = 240/356 (67.42%), Positives = 258/356 (72.47%), Query Frame = 0

Query: 1   MRSLVSYAEAMEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMA 60
           M+S V   EAMEVHKPY+AMLFVQCVYSGMALFSKAAIS GMNPP+FVFYRQAFAT+AMA
Sbjct: 1   MKSFVGCVEAMEVHKPYVAMLFVQCVYSGMALFSKAAISAGMNPPVFVFYRQAFATLAMA 60

Query: 61  PFAFFLERKKAVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAIT 120
           P AF LERKKAVPLSFKFL KVFL+SL+GITLSLNLYYIAINH SATFAAATTNTIPAIT
Sbjct: 61  PLAFSLERKKAVPLSFKFLSKVFLVSLTGITLSLNLYYIAINHTSATFAAATTNTIPAIT 120

Query: 121 LLFALLFRYEVISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNS--- 180
           LL ALLFRYE I +RKM+GIAKL+G VIG SGALV+AFVKGP MKFMNWYP+   N    
Sbjct: 121 LLLALLFRYENIPIRKMQGIAKLMGAVIGLSGALVFAFVKGPPMKFMNWYPKTNNNDQIT 180

Query: 181 -FEGYSGSEWIKGSFVMLSANIAWSLWLVLQG---------------------------- 240
               YS  EWIKGS +M+SANIAWSLWLV QG                            
Sbjct: 181 PSASYSTLEWIKGSLMMISANIAWSLWLVFQGSIVKEYPAKLRVTTLQCFFSLIQSALWA 240

Query: 241 -------------------------VIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITA 297
                                    VIVTGMTYWLQIWTVEKKGPVF AMFTPLALIITA
Sbjct: 241 VAMERNPQAWKLGWNLQLISVAYCGVIVTGMTYWLQIWTVEKKGPVFIAMFTPLALIITA 300

BLAST of Cp4.1LG01g21100 vs. TAIR 10
Match: AT1G43650.1 (nodulin MtN21/EamA-like transporter family protein)

HSP 1 Score: 288.1 bits (736), Expect = 8.1e-78
Identity = 160/326 (49.08%), Positives = 207/326 (63.50%), Query Frame = 0

Query: 11  MEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKK 70
           M  HK  +AM+FVQ VY+GM L SK AISQG NP +FVFYRQAFA +A++PFAFFLE  K
Sbjct: 2   MMEHKANMAMVFVQIVYAGMPLLSKVAISQGTNPFVFVFYRQAFAALALSPFAFFLESSK 61

Query: 71  AVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYE 130
           + PLSF  L K+F ISL G+TLSLNLYY+AI + +ATFAAATTN IP+IT + ALLFR E
Sbjct: 62  SSPLSFILLLKIFFISLCGLTLSLNLYYVAIENTTATFAAATTNAIPSITFVLALLFRLE 121

Query: 131 VISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGSEWIKGS 190
            ++++K  G+AK+ G ++G  GALV+AFVKGP    +N Y  +   +    S    +KGS
Sbjct: 122 TVTLKKSHGVAKVTGSMVGMLGALVFAFVKGP--SLINHYNSSTIPNGTVPSTKNSVKGS 181

Query: 191 FVMLSANIAWSLWLVLQ------------------------------------------- 250
             ML+AN  W LW+++Q                                           
Sbjct: 182 ITMLAANTCWCLWIIMQSKVMKEYPAKLRLVALQCLFSCIQSAVWAVAVNRNPSVWKIEF 241

Query: 251 ----------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVWKEALHWG 284
                     G++VTG+TYWLQ+W +EKKGPVFTA++TPLALI+T I S+ ++KE  + G
Sbjct: 242 GLPLLSMAYCGIMVTGLTYWLQVWAIEKKGPVFTALYTPLALILTCIVSSFLFKETFYLG 301

BLAST of Cp4.1LG01g21100 vs. TAIR 10
Match: AT5G64700.1 (nodulin MtN21/EamA-like transporter family protein)

HSP 1 Score: 235.7 bits (600), Expect = 4.8e-62
Identity = 142/349 (40.69%), Positives = 202/349 (57.88%), Query Frame = 0

Query: 11  MEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKK 70
           ME  KPY+ +  +Q +Y+ M L SKA  + GMN  +FVFYRQAFATI +AP AFF ERK 
Sbjct: 3   MESKKPYLMVTIIQVIYTIMFLISKAVFNGGMNTFVFVFYRQAFATIFLAPLAFFFERKS 62

Query: 71  AVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYE 130
           A PLSF    K+F++SL G+TLSL+L  IA+++ SAT AAATT ++PAIT   ALLF  E
Sbjct: 63  APPLSFVTFIKIFMLSLFGVTLSLDLNGIALSYTSATLAAATTASLPAITFFLALLFGME 122

Query: 131 VISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMK------FMNWYPQNARNSFEGYSG- 190
            + V+ ++G AKLVG  +   G ++ A  KGP++K      F +      RN+    SG 
Sbjct: 123 RLKVKSIQGTAKLVGITVCMGGVIILAIYKGPLLKLPLCPHFYHGQEHPHRNNPGHVSGG 182

Query: 191 -SEWIKGSFVMLSANIAWSLWLVLQ----------------------------------- 250
            + W+KG  +M+++NI W LWLVLQ                                   
Sbjct: 183 STSWLKGCVLMITSNILWGLWLVLQGRVLKVYPSKLYFTTLHCLLSSIQSFVIAIALERD 242

Query: 251 ------------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALV 299
                             G IVTG+ Y+LQ W +EK+GPVF +MFTPL+L+ T + SA++
Sbjct: 243 ISAWKLGWNLRLVAVIYCGFIVTGVAYYLQSWVIEKRGPVFLSMFTPLSLLFTLLSSAIL 302

BLAST of Cp4.1LG01g21100 vs. TAIR 10
Match: AT1G43650.2 (nodulin MtN21/EamA-like transporter family protein)

HSP 1 Score: 205.7 bits (522), Expect = 5.3e-53
Identity = 110/197 (55.84%), Positives = 142/197 (72.08%), Query Frame = 0

Query: 11  MEVHKPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKK 70
           M  HK  +AM+FVQ VY+GM L SK AISQG NP +FVFYRQAFA +A++PFAFFLE  K
Sbjct: 2   MMEHKANMAMVFVQIVYAGMPLLSKVAISQGTNPFVFVFYRQAFAALALSPFAFFLESSK 61

Query: 71  AVPLSFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYE 130
           + PLSF  L K+F ISL G+TLSLNLYY+AI + +ATFAAATTN IP+IT + ALLFR E
Sbjct: 62  SSPLSFILLLKIFFISLCGLTLSLNLYYVAIENTTATFAAATTNAIPSITFVLALLFRLE 121

Query: 131 VISVRKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGSEWIKGS 190
            ++++K  G+AK+ G ++G  GALV+AFVKGP    +N Y  +   +    S    +KGS
Sbjct: 122 TVTLKKSHGVAKVTGSMVGMLGALVFAFVKGP--SLINHYNSSTIPNGTVPSTKNSVKGS 181

Query: 191 FVMLSANIAWSLWLVLQ 208
             ML+AN  W LW+++Q
Sbjct: 182 ITMLAANTCWCLWIIMQ 196

BLAST of Cp4.1LG01g21100 vs. TAIR 10
Match: AT1G09380.1 (nodulin MtN21/EamA-like transporter family protein)

HSP 1 Score: 176.4 bits (446), Expect = 3.5e-44
Identity = 109/345 (31.59%), Positives = 181/345 (52.46%), Query Frame = 0

Query: 16  PYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKKAVPLS 75
           P++AM+ VQ  Y+GM + SK A+  GM P I V YRQ FATIA  P AFFLERK    ++
Sbjct: 8   PFLAMVLVQIGYAGMNITSKMAMEAGMKPLILVAYRQIFATIATFPVAFFLERKTRPKIT 67

Query: 76  FKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYEVISVR 135
            + L +VF  S++G T +  LY++ + + S T A A TN +PA+T L A +FR E + ++
Sbjct: 68  LRILVQVFFCSITGATGNQVLYFVGLQNSSPTIACALTNLLPAVTFLLAAIFRQETVGIK 127

Query: 136 KMEGIAKLVGGVIGFSGALVYAFVKGPVMKF----MNW-YPQNARNSFEGYSGSEWIKGS 195
           K  G AK++G ++   GA+V +F  G  +      ++W Y +N          S +  G 
Sbjct: 128 KASGQAKVIGTLVCVIGAMVLSFYHGHTIGIGESKIHWAYAENITKHGSSSGHSNFFLGP 187

Query: 196 FVMLSANIAWSLWLVLQ------------------------------------------- 255
           F++++A ++W+ W ++Q                                           
Sbjct: 188 FLIMAAAVSWAAWFIIQTKMSETFAAPYTSTLLMCLMGSIQCGAIALISDHTISDWSLSS 247

Query: 256 ----------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVWKEALHWG 299
                     GV+ + + + L  W +++KGP++ ++F+PL L++ AIFS  + +E L+ G
Sbjct: 248 PLRFISALYAGVVASALAFCLMSWAMQRKGPLYVSVFSPLLLVVVAIFSWALLEEKLYTG 307

BLAST of Cp4.1LG01g21100 vs. TAIR 10
Match: AT1G21890.1 (nodulin MtN21 /EamA-like transporter family protein )

HSP 1 Score: 174.1 bits (440), Expect = 1.7e-43
Identity = 108/330 (32.73%), Positives = 167/330 (50.61%), Query Frame = 0

Query: 15  KPYIAMLFVQCVYSGMALFSKAAISQGMNPPIFVFYRQAFATIAMAPFAFFLERKKAVPL 74
           KPY+AM+ +Q  Y+GM + +  ++  GMN  +   YR A AT  +APFA F ERK    +
Sbjct: 10  KPYLAMISMQFGYAGMYIITMVSLKHGMNHYVLAVYRHAIATAVIAPFALFHERKIRPKM 69

Query: 75  SFKFLFKVFLISLSGITLSLNLYYIAINHISATFAAATTNTIPAITLLFALLFRYEVISV 134
           +F+   ++ L+      L  NLYY+ + + SATFA+AT N +PAIT + A++FR E ++ 
Sbjct: 70  TFRIFLQIALLGFIEPVLDQNLYYVGMTYTSATFASATANVLPAITFVLAIIFRLESVNF 129

Query: 135 RKMEGIAKLVGGVIGFSGALVYAFVKGPVMKFMNWYPQNARNSFEGYSGS---------- 194
           +K+  IAK+VG VI  SGAL+    KGP++ F+ +       S +G  GS          
Sbjct: 130 KKVRSIAKVVGTVITVSGALLMTLYKGPIVDFIRFGGGGGGGS-DGAGGSHGGAGAAAMD 189

Query: 195 -EWIKGSFVMLSANIAWSLWLVLQ------------------------------------ 254
             WI G+ ++L     W+ + +LQ                                    
Sbjct: 190 KHWIPGTLMLLGRTFGWAGFFILQSFTLKQYPAELSLTTLICLMGTLEGTAVSLVTVRDL 249

Query: 255 -----------------GVIVTGMTYWLQIWTVEKKGPVFTAMFTPLALIITAIFSALVW 281
                            GVI +G+ Y++Q   + ++GPVF A F PL ++ITA    +V 
Sbjct: 250 SAWKIGFDSNLFAAAYSGVICSGVAYYVQGVVMRERGPVFVATFNPLCVVITAALGVVVL 309

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q6NMB7      1.1e-76   49.08     WAT1-related protein At1g43650 OS=Arabidopsis thaliana OX=3702 GN=At1g43650 PE=2...
Q9FGG3      6.8e-61   40.69     WAT1-related protein At5g64700 OS=Arabidopsis thaliana OX=3702 GN=At5g64700 PE=2...
Q8GXB4      4.9e-43   31.59     WAT1-related protein At1g09380 OS=Arabidopsis thaliana OX=3702 GN=At1g09380 PE=2...
F4HZQ7      2.4e-42   32.73     WAT1-related protein At1g21890 OS=Arabidopsis thaliana OX=3702 GN=At1g21890 PE=2...
Q9FL41      1.2e-41   30.52     WAT1-related protein At5g07050 OS=Arabidopsis thaliana OX=3702 GN=At5g07050 PE=2...
Match Name      E-value    Identity  Description
XP_023546440.1  1.37e-199  85.11     WAT1-related protein At1g43650 [Cucurbita pepo subsp. pepo]
XP_022963786.1  2.26e-198  84.55     WAT1-related protein At1g43650 [Cucurbita moschata]
KAG6602184.1    6.47e-198  84.27     WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG703286...
XP_022990390.1  2.72e-189  80.66     WAT1-related protein At1g43650 [Cucurbita maxima]
XP_038884292.1  5.94e-157  70.47     WAT1-related protein At1g43650 isoform X2 [Benincasa hispida]
Match Name  E-value    Identity  Description
A0A6J1HH16  1.09e-198  84.55     WAT1-related protein OS=Cucurbita moschata OX=3662 GN=LOC111463975 PE=3 SV=1
A0A6J1JPZ3  1.32e-189  80.66     WAT1-related protein OS=Cucurbita maxima OX=3661 GN=LOC111487262 PE=3 SV=1
A0A5D3DGQ0  3.29e-149  67.61     WAT1-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold116...
A0A1S3B3L9  3.29e-149  67.61     WAT1-related protein OS=Cucumis melo OX=3656 GN=LOC103485619 PE=3 SV=1
A0A6J1BVI1  2.52e-147  67.42     WAT1-related protein OS=Momordica charantia OX=3673 GN=LOC111006092 PE=3 SV=1
Match Name   E-value  Identity  Description
AT1G43650.1  8.1e-78  49.08     nodulin MtN21 /EamA-like transporter family protein
AT5G64700.1  4.8e-62  40.69     nodulin MtN21 /EamA-like transporter family protein
AT1G43650.2  5.3e-53  55.84     nodulin MtN21 /EamA-like transporter family protein
AT1G09380.1  3.5e-44  31.59     nodulin MtN21 /EamA-like transporter family protein
AT1G21890.1  1.7e-43  32.73     nodulin MtN21 /EamA-like transporter family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description       Source       Source Term      Source Description                            Alignment
IPR000620  EamA domain           PFAM         PF00892          EamA                                          coord: 15..149; e-value: 6.8E-17; score: 61.9
IPR000620  EamA domain           PFAM         PF00892          EamA                                          coord: 202..273; e-value: 8.4E-8; score: 32.5
IPR030184  WAT1-related protein  PANTHER      PTHR31218        WAT1-RELATED PROTEIN                          coord: 13..210
IPR030184  WAT1-related protein  PANTHER      PTHR31218        WAT1-RELATED PROTEIN                          coord: 208..287
None       No IPR available      PANTHER      PTHR31218:SF183  WAT1-RELATED PROTEIN                          coord: 208..287
None       No IPR available      PANTHER      PTHR31218:SF183  WAT1-RELATED PROTEIN                          coord: 13..210
None       No IPR available      SUPERFAMILY  103481           Multidrug resistance efflux transporter EmrE  coord: 50..150
None       No IPR available      SUPERFAMILY  103481           Multidrug resistance efflux transporter EmrE  coord: 193..278

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g21100.1  Cp4.1LG01g21100.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0055085      transmembrane transport
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0022857      transmembrane transporter activity