CmoCh20G011490 (gene) Cucurbita moschata (Rifu)

Name: CmoCh20G011490
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Major facilitator superfamily protein
Location: Cmo_Chr20 : 11277072 .. 11280681 (-)
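
The location record can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF3 convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr20 : 11277072 .. 11280681 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 3610 bp on the minus strand of Cmo_Chr20.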
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TCATCAGCCGCATTCATCTTTGTTGACCCCAACAAATTTCTTATACCCATAGTTCAAACGGCCTGCTGGATCAACACACAACTACTCCGATTCGAATGCCAGGTAATCGATTTCATTGATGAAAAAAATTTGAAAAATCGTTGTCACAAGGACGATTTTGAATCCCGGAATCGTCTGCTTGTTTACCTGAGATTGGACGAATTTTCTTCTCGAATATTAAAATGGCAATGTCAGAATACACATTTGAATGTTGTTCCCGATGAGATTCAAGCTTCTTCTGGGGTTTCTTCACGCTGAAATTGGGTGCCCCAGATCCGCCGTTCATTCGATTGGATTTCTGAATCTTGACGTGATTCCTCAGCTTCTCACTTCCCTCCTCTGATCTTCGATTGACTTTGGTTCCGAGTTTCCTGGAAAATATGAAGTTAGTACCGTGTTCTCTTGCTCTTTTATCTCCTTTTAATCTGTTCTGAATTTGAAGATGTCAGTTAGTGGTTCTGTTTTGTATTGGATGTGTTCTAACAGTTGGATTGTTCCTGCTTGAGAAATGGATTTGTTTTGTGCTGGTTTAATTTATTCTCTTCATTTCAAGTGGTGGGTTGGATCTATCTGCTTTTGATGATGTTCTGTATTCATTTGTTCAGTTTTGTTCCAAATTTGGTTGAAATAGTTTCTTATTTATGCAGAAATATGAAGAAAACATATTCTGTTTTAATTACTTCAATTATTAGTCTTCCTAGTTTCAAAATGTTCTCTGTTTTGTTAAATTGTGCTCATGTGGGATCCCATATCGGTTGGGGAGGAGAACGAAGCATTCTTTATAAGGGTGTGGAAACCTCTTCCTAGCAGACGTGTTTTAAAACCTTGAGTGAAAGTTTGGAAGGGAAAACCCAAAAAGAACAATATCTACTAGCGGTGGGCTTGAGCTGTAACAAACGGTATCAGAGCCAGACACCAGGAGATCTACCAGCAATGAGGCTGAGCCTCAAAGAGGAGTAGACATGAGGCGGTGTGCCAGCAAGGACGCTGGACCTCGAAGGGAGGTGGATTGGGGGTCCATATCGATTGGAGAAGGGAACGAGTGCCAGCGAGGACGCTGGGCCCCAAAAGGGGTGGATTGTGAGAGCCCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCATACACGTTTTTAAACCTTGAGGGGAAGCTTGGAAGGGAAAGTTCGAAGAGGAATATCCACCAGCAGTGGGCTTAGGTTGTTACATGGAAGGAAAAGTCTAAAGACGACAATATCTACTGAACATTGACTTGGGCGGTTATAGCTCATTTACTATCTTTTTCTTTATTTTCTCTTGTAATCATTTATTTGTTTTTGAGACGTAGGTAAGATTTAATGTTTAAGGAAGTACTATATTAGGCAATACTGTTTTATATTGAATGCTTTTACTCTTAGGTTGCCAACATTTCCATCCAACTCATAATGGCGTTGTGATAGAAACACTATATTTGGTCGCTGAATTCACCAACACTGTTATTCAATTCCTTACACCATCTACTAATGCACAGGAACAAAAAAATGTATGGGGTCTCCATCTCTTTGCTTCTTATCAACCTGGCAGCTATAATGGAACGCGCCGATGAGAATCTCCTACCGTCGGTTTACAAGGAAGTCAGCGAAGCTTTCAACGCCAGCCCATCTGATCTAGGATATCTTACATTCATAAGGAACTTTGTGCAGGGATTATGTTCACCCTTGGCAGGAATCCTAGTTCTTAAATACGACCGTCCTAAAGTTCTTGCAATGGGGACGTTTTGTTGGGCCCTTTCAACTGCTGCAGTCGGTATCAGTCTCGAGTTTAAGCAAGTTGCATTCTGGAGAGCCCTGAATGGCTTTGGCTTGGCTATTGTGATTCCAGCACTCCAATCTTTCATTGCTGATAGCTACACGGACGGTGTTCGGGGTATGGGATTCGGCTTGTTAAGCCTCATC
GGTTCACTGGGAGGCATCGGAGGTGGTGTCCTTGCAACCGTTATGGCTGGTCAACAGTATTTCGGCATACAAGGATGGCGCTGTGCCTTCATTCTCATGGCTACATTGAGTGCAATAATCGGTATCCTTGTTTACATGTTTGTAGTTGATCCTAGAAAAACAATTAGCAACATTCAGGACAGTTCAGATAGGTACCGCCTGAGGTTTGTTTGTAGTTCATTTGATCTCTTTTCACAAGTCTATGTTCTATTATCTAACTTGCTTGGTATCTGGCTACAGGGATAGTTTGATAAACCGAACCTCGTCTAATTCATCGTCGATATGGTTCGAGTCTTGGAATGCTATGAAAGCCGTTATGAAAGTGCGTACATTTCAAGTCATTGTCCTGCAGGGGATAGTTGGATCACTGCCTTGGACAGCCATGGTGTTCTTCACTATGTGGTTTGAATTGATCGGTGAGTATTGTTTTCGATTTCCCCGTCTCCAATGAACCGTATAATATAATCTTGTGACCGAAAAGTTAGTTAGTATGCTCGCTGTATGTTGTTATAAATATGACATTCCTTGTCGGGTAGGTTCCGACCTGCACGAAACCCCGGTTTCGTTGCTACCGCTGGCTTTTGAAGATATTCCTTCTGTTCTTTCAGGTTTCAGTCATAACAGTACCGCAGTTCTACTTAGTCTTTTTGCAGTGGGGTGTGCATTGGGGTCTCTCATGGGCGGTTTAATTGCGGATAGGTTGTCAAAGATCTATCCTCATTCTGGCAGAATCATGTGTGCTCAATTCAGTGCTAGTATGGGCATACCATTCTCATTGTTCCTCCTCCGAGTCGTTCCGCAGTCTATCGATAGCTATCTCGTCTTCGCCGTTACTCTCTTATTGATGGGATTAACTATCAGCTGGAATGGCACTGCTGTCAATGCTCCCATTTTCGCCGAGGTTGTCCCAATGAAACACCGAACCATGATATATGCGTTTGACCGTGCGTTTGAAGGTTCGTTTTCATCGTTTGCTGCTCCTCTGGTTGGTATTCTCTCGGAGAAAATGTTCGGTTACGACAACGCAGCTATAGGTTCGTTACCGAAGGCTCTTGCATTGTCGAAGGGACTTCTTGCAATGATGGCAGTTCCTTTCGGCGTTTGTTGTTTGTTCTACACGCCATTGTATATATATTTCAGGCTTGACCGCGAAAATGCTCAAATGCAGAGTTCTAAAGGTACTAAACTCATTGATGACCTTTAAGGAACGGTCTGAGATCTCACATCGGTTGGGGAGGAAAACGAAACATTCTTTATAAGGGTGTAGAAACCTCTCTCTAGCAGACGCGTTTTAAAATCTTTGAGAGAAATCTCGAAAGAAAAAGCCTAAAGAGGATAGTATCTGCTAGCAGTGGACTTGGGCCGTTACAAACTGTGTAGGAATGTAGTAGTGTAATTCCACTGCCATTATAGTTTCTTTTTCCCTTTGAAGCCCATTCATTTTTCCTTTTAGAATGTTCTGCATATGAACATGACTCAACATTTCAGGATAAATTGGGCATTTTGTTGAGAACCATGAAGGATTAGAGACATAACTGTGCCTTCATTCTTTCTTTTTACTCGCATTGGTT

mRNA sequence

TCATCAGCCGCATTCATCTTTGTTGACCCCAACAAATTTCTTATACCCATAGTTCAAACGGCCTGCTGGATCAACACACAACTACTCCGATTCGAATGCCAGGTAATCGATTTCATTGATGAAAAAAATTTGAAAAATCGTTGTCACAAGGACGATTTTGAATCCCGGAATCGTCTGCTTGTTTACCTGAGATTGGACGAATTTTCTTCTCGAATATTAAAATGGCAATGTCAGAATACACATTTGAATGTTGTTCCCGATGAGATTCAAGCTTCTTCTGGGGTTTCTTCACGCTGAAATTGGGTGCCCCAGATCCGCCGTTCATTCGATTGGATTTCTGAATCTTGACGTGATTCCTCAGCTTCTCACTTCCCTCCTCTGATCTTCGATTGACTTTGGTTCCGAGTTTCCTGGAAAATATGAAGAACAAAAAAATGTATGGGGTCTCCATCTCTTTGCTTCTTATCAACCTGGCAGCTATAATGGAACGCGCCGATGAGAATCTCCTACCGTCGGTTTACAAGGAAGTCAGCGAAGCTTTCAACGCCAGCCCATCTGATCTAGGATATCTTACATTCATAAGGAACTTTGTGCAGGGATTATGTTCACCCTTGGCAGGAATCCTAGTTCTTAAATACGACCGTCCTAAAGTTCTTGCAATGGGGACGTTTTGTTGGGCCCTTTCAACTGCTGCAGTCGGTATCAGTCTCGAGTTTAAGCAAGTTGCATTCTGGAGAGCCCTGAATGGCTTTGGCTTGGCTATTGTGATTCCAGCACTCCAATCTTTCATTGCTGATAGCTACACGGACGGTGTTCGGGGTATGGGATTCGGCTTGTTAAGCCTCATCGGTTCACTGGGAGGCATCGGAGGTGGTGTCCTTGCAACCGTTATGGCTGGTCAACAGTATTTCGGCATACAAGGATGGCGCTGTGCCTTCATTCTCATGGCTACATTGAGTGCAATAATCGGTATCCTTGTTTACATGTTTGTAGTTGATCCTAGAAAAACAATTAGCAACATTCAGGACAGTTCAGATAGGTACCGCCTGAGGGATAGTTTGATAAACCGAACCTCGTCTAATTCATCGTCGATATGGTTCGAGTCTTGGAATGCTATGAAAGCCGTTATGAAAGTGCGTACATTTCAAGTCATTGTCCTGCAGGGGATAGTTGGATCACTGCCTTGGACAGCCATGGTGTTCTTCACTATGTGGTTTGAATTGATCGGTTTCAGTCATAACAGTACCGCAGTTCTACTTAGTCTTTTTGCAGTGGGGTGTGCATTGGGGTCTCTCATGGGCGGTTTAATTGCGGATAGGTTGTCAAAGATCTATCCTCATTCTGGCAGAATCATGTGTGCTCAATTCAGTGCTAGTATGGGCATACCATTCTCATTGTTCCTCCTCCGAGTCGTTCCGCAGTCTATCGATAGCTATCTCGTCTTCGCCGTTACTCTCTTATTGATGGGATTAACTATCAGCTGGAATGGCACTGCTGTCAATGCTCCCATTTTCGCCGAGGTTGTCCCAATGAAACACCGAACCATGATATATGCGTTTGACCGTGCGTTTGAAGGTTCGTTTTCATCGTTTGCTGCTCCTCTGGTTGGTATTCTCTCGGAGAAAATGTTCGGTTACGACAACGCAGCTATAGGTTCGTTACCGAAGGCTCTTGCATTGTCGAAGGGACTTCTTGCAATGATGGCAGTTCCTTTCGGCGTTTGTTGTTTGTTCTACACGCCATTGTATATATATTTCAGGCTTGACCGCGAAAATGCTCAAATGCAGAGTTCTAAAGGTACTAAACTCATTGATGACCTTTAAGGAACGGTCTGAGATCTCACATCGGTTGGGGAGGAAAACGAAACATTCTTTATAAGGGTGTAGAAACCTCTCTCTAGCAGACGCGTTTTAAAATCTTTGAGAGAAATCTCGAAAGAAAAAGCCTAAAGAGGATAGTATCTGCTAGCAGTGGACTTGGGCCGTTACAAACTGTGTAGG
AATGTAGTAGTGTAATTCCACTGCCATTATAGTTTCTTTTTCCCTTTGAAGCCCATTCATTTTTCCTTTTAGAATGTTCTGCATATGAACATGACTCAACATTTCAGGATAAATTGGGCATTTTGTTGAGAACCATGAAGGATTAGAGACATAACTGTGCCTTCATTCTTTCTTTTTACTCGCATTGGTT

Coding sequence (CDS)

ATGAAGAACAAAAAAATGTATGGGGTCTCCATCTCTTTGCTTCTTATCAACCTGGCAGCTATAATGGAACGCGCCGATGAGAATCTCCTACCGTCGGTTTACAAGGAAGTCAGCGAAGCTTTCAACGCCAGCCCATCTGATCTAGGATATCTTACATTCATAAGGAACTTTGTGCAGGGATTATGTTCACCCTTGGCAGGAATCCTAGTTCTTAAATACGACCGTCCTAAAGTTCTTGCAATGGGGACGTTTTGTTGGGCCCTTTCAACTGCTGCAGTCGGTATCAGTCTCGAGTTTAAGCAAGTTGCATTCTGGAGAGCCCTGAATGGCTTTGGCTTGGCTATTGTGATTCCAGCACTCCAATCTTTCATTGCTGATAGCTACACGGACGGTGTTCGGGGTATGGGATTCGGCTTGTTAAGCCTCATCGGTTCACTGGGAGGCATCGGAGGTGGTGTCCTTGCAACCGTTATGGCTGGTCAACAGTATTTCGGCATACAAGGATGGCGCTGTGCCTTCATTCTCATGGCTACATTGAGTGCAATAATCGGTATCCTTGTTTACATGTTTGTAGTTGATCCTAGAAAAACAATTAGCAACATTCAGGACAGTTCAGATAGGTACCGCCTGAGGGATAGTTTGATAAACCGAACCTCGTCTAATTCATCGTCGATATGGTTCGAGTCTTGGAATGCTATGAAAGCCGTTATGAAAGTGCGTACATTTCAAGTCATTGTCCTGCAGGGGATAGTTGGATCACTGCCTTGGACAGCCATGGTGTTCTTCACTATGTGGTTTGAATTGATCGGTTTCAGTCATAACAGTACCGCAGTTCTACTTAGTCTTTTTGCAGTGGGGTGTGCATTGGGGTCTCTCATGGGCGGTTTAATTGCGGATAGGTTGTCAAAGATCTATCCTCATTCTGGCAGAATCATGTGTGCTCAATTCAGTGCTAGTATGGGCATACCATTCTCATTGTTCCTCCTCCGAGTCGTTCCGCAGTCTATCGATAGCTATCTCGTCTTCGCCGTTACTCTCTTATTGATGGGATTAACTATCAGCTGGAATGGCACTGCTGTCAATGCTCCCATTTTCGCCGAGGTTGTCCCAATGAAACACCGAACCATGATATATGCGTTTGACCGTGCGTTTGAAGGTTCGTTTTCATCGTTTGCTGCTCCTCTGGTTGGTATTCTCTCGGAGAAAATGTTCGGTTACGACAACGCAGCTATAGGTTCGTTACCGAAGGCTCTTGCATTGTCGAAGGGACTTCTTGCAATGATGGCAGTTCCTTTCGGCGTTTGTTGTTTGTTCTACACGCCATTGTATATATATTTCAGGCTTGACCGCGAAAATGCTCAAATGCAGAGTTCTAAAGGTACTAAACTCATTGATGACCTTTAA
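
As a quick consistency check, the start and end of the CDS can be translated and compared with the query protein shown in the BLAST alignments on this page. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical, and only the first 60 and last 21 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))

cds_head = "ATGAAGAACAAAAAAATGTATGGGGTCTCCATCTCTTTGCTTCTTATCAACCTGGCAGCT"
cds_tail = "AAACTCATTGATGACCTTTAA"

print(translate(cds_head))  # MKNKKMYGVSISLLLINLAA
print(translate(cds_tail))  # KLIDDL*
```

Both fragments agree with the first and last residues of the query sequence in the alignments, and the CDS ends in a TAA stop codon.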
BLAST of CmoCh20G011490 vs. TrEMBL
Match: A0A0A0LQ36_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G296060 PE=4 SV=1)

HSP 1 Score: 830.1 bits (2143), Expect = 1.3e-237
Identity = 425/467 (91.01%), Positives = 444/467 (95.07%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MKN+K+YGVSISL+LINLAAIMERADENLLPSVYKEVSE FNASPSDLGYLTFIRNFVQG
Sbjct: 1   MKNRKIYGVSISLILINLAAIMERADENLLPSVYKEVSETFNASPSDLGYLTFIRNFVQG 60

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           LCSPLAGILVL YDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRA+NGFGLAIVIPAL
Sbjct: 61  LCSPLAGILVLSYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRAVNGFGLAIVIPAL 120

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFG++GWRCAFILMATLS
Sbjct: 121 QSFIADSYMDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGVEGWRCAFILMATLS 180

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           AIIGILVYMFVVDPRKTI+NIQ+SSDRY  RD+LI+RT  NSSSIWFESW+AMKAVMKV 
Sbjct: 181 AIIGILVYMFVVDPRKTINNIQESSDRYLRRDNLIDRTLPNSSSIWFESWSAMKAVMKVH 240

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGFSHN TAVLLSLFAVGCALGSL+GGLIADR
Sbjct: 241 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFSHNGTAVLLSLFAVGCALGSLLGGLIADR 300

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           LSKIYPHSGRIMCAQFSASMGIPFSL LLRV+PQS+DS L+F VTL LMGLTISWNGTAV
Sbjct: 301 LSKIYPHSGRIMCAQFSASMGIPFSLLLLRVIPQSVDSLLIFGVTLFLMGLTISWNGTAV 360

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAIGSLPKALAL 420
           NAPIFAEVVP+KHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYD+ A  SL KALAL
Sbjct: 361 NAPIFAEVVPIKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDDTAGASLLKALAL 420

Query: 421 SKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLIDDL 468
           SKGLL MM VPFGVCCL YTPLY YFRLDRENA+MQ SKGTK IDDL
Sbjct: 421 SKGLLTMMTVPFGVCCLCYTPLYKYFRLDRENARMQGSKGTKSIDDL 467
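
The percentages in the HSP header above follow directly from the match counts divided by the alignment length. The sketch below recomputes them from the header text; the `hsp_percentages` helper is hypothetical and the regular expression tolerates both the "Postives" spelling found in some reports and the correct "Positives".

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    ident = re.search(r"Identity = (\d+)/(\d+)", header)
    posit = re.search(r"Posi?tives = (\d+)/(\d+)", header)
    if ident is None or posit is None:
        raise ValueError("header does not contain identity/positives counts")

    def pct(match):
        hits, length = int(match.group(1)), int(match.group(2))
        return round(100.0 * hits / length, 2)

    return pct(ident), pct(posit)

header = "Identity = 425/467 (91.01%), Postives = 444/467 (95.07%), Query Frame = 1"
print(hsp_percentages(header))  # (91.01, 95.07)
```

The same function can be applied to any of the other HSP headers on this page to confirm that the reported percentages match their counts.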

BLAST of CmoCh20G011490 vs. TrEMBL
Match: B9SGI0_RICCO (Carbohydrate transporter, putative OS=Ricinus communis GN=RCOM_0553180 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 3.7e-203
Identity = 363/466 (77.90%), Positives = 415/466 (89.06%), Query Frame = 1

Query: 2   KNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQGL 61
           + +K++GVS+SL+LINLAAIMERADENLLP+VYKEVSE FNA PSDLGYLTFIRNFVQGL
Sbjct: 20  RTRKIFGVSLSLILINLAAIMERADENLLPAVYKEVSETFNAGPSDLGYLTFIRNFVQGL 79

Query: 62  CSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPALQ 121
            SPLAG+LV+ YDRP VLAMGTFCWALSTAAVG S  F QVAFWR +NGFGLAIVIPALQ
Sbjct: 80  SSPLAGVLVINYDRPTVLAMGTFCWALSTAAVGASHHFLQVAFWRGVNGFGLAIVIPALQ 139

Query: 122 SFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLSA 181
           SFIADSY +GVRG GFGL++LIG+LGGIGGGVLATVMAGQQY+GI GWRCAFI+MATLS+
Sbjct: 140 SFIADSYMEGVRGAGFGLVNLIGNLGGIGGGVLATVMAGQQYWGIPGWRCAFIMMATLSS 199

Query: 182 IIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVRT 241
           IIG LV++FV+DPRKTIS  +D+ + +  RD LI R+SS++SS+W ESW AM+AV+KV+T
Sbjct: 200 IIGFLVFLFVIDPRKTISIPRDTRESFE-RDELIERSSSSASSVWTESWTAMQAVIKVKT 259

Query: 242 FQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADRL 301
           FQ+IVLQGIVGSLPWTAMVFF MWFELIGFSHNSTA LLSLFAVGCALGSL+GGLIADRL
Sbjct: 260 FQIIVLQGIVGSLPWTAMVFFAMWFELIGFSHNSTAFLLSLFAVGCALGSLIGGLIADRL 319

Query: 302 SKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAVN 361
           S  YPHSGRIMCAQFSA MGIPFS FLL+ +P S+ SY  FAVT+ +MGLTISWNGTA N
Sbjct: 320 SHTYPHSGRIMCAQFSALMGIPFSWFLLKEIPLSVSSYHTFAVTIFMMGLTISWNGTAAN 379

Query: 362 APIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPKA 421
           AP+FAEVVP+KHRTMIYAFDRAFEGS SSFAAPLVGILSEKMFGYD+ +I    GS+ +A
Sbjct: 380 APMFAEVVPVKHRTMIYAFDRAFEGSLSSFAAPLVGILSEKMFGYDSKSIDPVKGSVQEA 439

Query: 422 LALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKL 464
            ALSKGLL+MMAVPFG+CCLFYTPLY +FR DRENA++ S+K  ++
Sbjct: 440 SALSKGLLSMMAVPFGLCCLFYTPLYKFFRQDRENARIASAKEAEM 484

BLAST of CmoCh20G011490 vs. TrEMBL
Match: B9HEN5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s12360g PE=4 SV=2)

HSP 1 Score: 699.9 bits (1805), Expect = 2.1e-198
Identity = 356/468 (76.07%), Positives = 404/468 (86.32%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK +K+ GVS+S+ LIN+AAIMERADENLLP+VYKEVSEAFNA PSDLGYLTFIRNFVQG
Sbjct: 30  MKGRKILGVSLSIFLINMAAIMERADENLLPAVYKEVSEAFNAGPSDLGYLTFIRNFVQG 89

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAGILV+ + RP VLAMGT CWALSTAAVG S  F Q AFWRA+NGFGLAIVIPAL
Sbjct: 90  LSSPLAGILVINHARPTVLAMGTLCWALSTAAVGASQHFSQAAFWRAVNGFGLAIVIPAL 149

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRG GFGLLS IG+LGGIGGGVLATVMAGQQY+G+QGWR AFI+MA+LS
Sbjct: 150 QSFIADSYQDGVRGTGFGLLSFIGNLGGIGGGVLATVMAGQQYWGVQGWRFAFIMMASLS 209

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
            +IG+LV++FVVDPRKTI           +   L+ + +S   SIW ESW A KAVMKV+
Sbjct: 210 LLIGLLVFLFVVDPRKTIG----------VNHELVEKGNSYELSIWTESWTATKAVMKVK 269

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGF+HN TA LLS FAVGC+LGSL+GG+IADR
Sbjct: 270 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFNHNKTAALLSFFAVGCSLGSLLGGIIADR 329

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           +S IYPHSGRIMCAQFSA MGIPFS FLL+V+PQS+ SY  FAVTL +MGLTISWNGTAV
Sbjct: 330 MSHIYPHSGRIMCAQFSAFMGIPFSWFLLKVIPQSVSSYSTFAVTLFMMGLTISWNGTAV 389

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPK 420
           NAPIFAEVVP+KHRTMIYA+DRAFEGSFSSFAAPLVGILSE+MFGYD+ ++    GS+ +
Sbjct: 390 NAPIFAEVVPVKHRTMIYAYDRAFEGSFSSFAAPLVGILSEQMFGYDSKSVDPIKGSVRE 449

Query: 421 ALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLI 465
           A ALSKGLL+MMA+PFG+CCLFYTPLY YFR DRENA+M  SK  +++
Sbjct: 450 ASALSKGLLSMMAIPFGLCCLFYTPLYRYFRQDRENARMAGSKALEIM 487

BLAST of CmoCh20G011490 vs. TrEMBL
Match: A0A061DGI1_THECC (Major facilitator superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_000452 PE=4 SV=1)

HSP 1 Score: 699.1 bits (1803), Expect = 3.6e-198
Identity = 352/468 (75.21%), Positives = 409/468 (87.39%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK ++++GVS+SLLLINLAAIMERADENLLPSVYKEVSEAFNA PSDLGYLTFIRNFVQG
Sbjct: 27  MKARRVFGVSLSLLLINLAAIMERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQG 86

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPL G+LV+ YDRP VLA+GT CWALSTAAVG S +F QVA WRA+NGFGLAIVIPAL
Sbjct: 87  LASPLTGVLVINYDRPTVLAIGTLCWALSTAAVGASQQFLQVALWRAVNGFGLAIVIPAL 146

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSYTDGVRG GFGLLS +G+LGGIGGGV+AT+MAGQQ++G+ GWRCAFILMATLS
Sbjct: 147 QSFIADSYTDGVRGAGFGLLSFVGTLGGIGGGVVATIMAGQQFWGMPGWRCAFILMATLS 206

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           ++IG LV++FVVDPRKT+    D+++ +  RD LI + ++ +SS+WFESW A +AV+KV 
Sbjct: 207 SLIGFLVFLFVVDPRKTVGVNHDAANSFD-RDELIEKGNTGASSVWFESWMATRAVIKVP 266

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGF HNSTA LLSLFA+GCA+GS +GGLIAD+
Sbjct: 267 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFDHNSTAALLSLFAIGCAMGSFLGGLIADK 326

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           +S+IYPHSGRIMCAQFSA MGIPFS FLL+V+PQS+ SY  FAVTL LMGLTISWN TA 
Sbjct: 327 ISQIYPHSGRIMCAQFSAFMGIPFSWFLLKVIPQSVSSYYTFAVTLFLMGLTISWNATAA 386

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPK 420
           N P+FAEVVP KHRTMIYAFDRAFEGSFSSFAAPLVGILSE+MFGYD+ +I    GS  +
Sbjct: 387 NGPMFAEVVPAKHRTMIYAFDRAFEGSFSSFAAPLVGILSEQMFGYDSKSIDPINGSPRE 446

Query: 421 ALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLI 465
           A ALS+GLLAMMA+PFG+C LFYTPLY  FR DR+N ++ + K  ++I
Sbjct: 447 AFALSRGLLAMMAIPFGLCSLFYTPLYNIFRRDRDNVRLANLKEEEMI 493

BLAST of CmoCh20G011490 vs. TrEMBL
Match: A0A0D2TIS2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G206200 PE=4 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.4e-197
Identity = 354/469 (75.48%), Positives = 407/469 (86.78%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK +K++GVS+S+LLINLAAIMERADENLLPSVYKEVSEAFNA PSDLGYLTFIRNFVQG
Sbjct: 27  MKTRKIFGVSLSILLINLAAIMERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQG 86

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAG+LV+ YDRP VLA+GT CWALSTAAVG S +F QVA WRA+NGFGLAIVIPAL
Sbjct: 87  LASPLAGVLVINYDRPTVLAIGTMCWALSTAAVGGSQQFHQVAMWRAVNGFGLAIVIPAL 146

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRG GFGLLSL+G+LGGIGGGV+ATVMAGQQ++G  GWRCAFILMATLS
Sbjct: 147 QSFIADSYMDGVRGAGFGLLSLVGTLGGIGGGVVATVMAGQQFWGTPGWRCAFILMATLS 206

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRL-RDSLINRTSSNSSSIWFESWNAMKAVMKV 240
           + IG+LV++FVVDPRKT+S   D+  R  L RD L+ + +   SS+W ESW A KAV+K+
Sbjct: 207 SFIGLLVFLFVVDPRKTVSVNHDA--RINLERDELVEKGNGRVSSVWSESWMATKAVIKI 266

Query: 241 RTFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIAD 300
            TFQ+IVLQGI+GSLPWTAMVFFTMWFELIGF HNSTA LLSLFA+GCA+GS +GG+IAD
Sbjct: 267 PTFQIIVLQGIIGSLPWTAMVFFTMWFELIGFDHNSTAALLSLFAIGCAMGSFLGGVIAD 326

Query: 301 RLSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTA 360
           RLS+IYPHSGRI+CAQFSA MGIPFSLFLL+V+PQS+ SY  FA+TL +MGLTISWN TA
Sbjct: 327 RLSQIYPHSGRIICAQFSAFMGIPFSLFLLKVIPQSVSSYYTFAITLFMMGLTISWNATA 386

Query: 361 VNAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLP 420
            N P+FAEVVP KHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYD+ +I    GS  
Sbjct: 387 ANGPMFAEVVPSKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDSKSIDPINGSPR 446

Query: 421 KALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLI 465
           +A ALS+GLL+MMA+PFG+CCL YTPLY  FR DREN ++ + K  ++I
Sbjct: 447 EAFALSRGLLSMMAIPFGLCCLCYTPLYKLFRRDRENVRLAAVKEEEMI 493

BLAST of CmoCh20G011490 vs. TAIR10
Match: AT4G36790.1 (AT4G36790.1 Major facilitator superfamily protein)

HSP 1 Score: 648.3 bits (1671), Expect = 3.7e-186
Identity = 333/469 (71.00%), Positives = 393/469 (83.80%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           +K     GVSISL+LINLAAIMERADENLLPSVYKEVSEAFNA PSDLGYLTF+RNFVQG
Sbjct: 33  IKTGTFLGVSISLILINLAAIMERADENLLPSVYKEVSEAFNAGPSDLGYLTFVRNFVQG 92

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAG+LV+ YDRP VLA+GTFCWALSTAAVG S  F QVA WRA+NGFGLAIVIPAL
Sbjct: 93  LASPLAGVLVITYDRPIVLAIGTFCWALSTAAVGASSYFIQVALWRAVNGFGLAIVIPAL 152

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DG RG GFG+L+LIG++GGIGGGV+ATVMAG +++GI GWRCAFI+MA LS
Sbjct: 153 QSFIADSYKDGARGAGFGMLNLIGTIGGIGGGVVATVMAGSEFWGIPGWRCAFIMMAALS 212

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           A+IG+LV++FVVDPRK I            R+ L+     NS+S+W +S  A K+V+KV 
Sbjct: 213 AVIGLLVFLFVVDPRKNIE-----------REELMAH-KMNSNSVWNDSLAAAKSVVKVS 272

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IV QGI+GS PWTAMVFFTMWFELIGF HN TA LL +FA G A+G+LMGG+IAD+
Sbjct: 273 TFQIIVAQGIIGSFPWTAMVFFTMWFELIGFDHNQTAALLGVFATGGAIGTLMGGIIADK 332

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           +S+IYP+SGR+MCAQFSA MGIPFS+ LL+V+PQS  SY +F++TL LMGLTI+W G+AV
Sbjct: 333 MSRIYPNSGRVMCAQFSAFMGIPFSIILLKVIPQSTSSYSIFSITLFLMGLTITWCGSAV 392

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI-----GSLP 420
           NAP+FAEVVP +HRTMIYAFDRAFEGSFSSFAAPLVGILSEK+FGYD+  I      S+ 
Sbjct: 393 NAPMFAEVVPPRHRTMIYAFDRAFEGSFSSFAAPLVGILSEKLFGYDSRGIDPLKGSSVR 452

Query: 421 KALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLI 465
           +A ALSKGLL+MMAVPFG+CCL YTPL+  F+ DRENA++ SSK T++I
Sbjct: 453 EADALSKGLLSMMAVPFGLCCLCYTPLHFVFQKDRENAKIASSKETEMI 489

BLAST of CmoCh20G011490 vs. TAIR10
Match: AT2G18590.1 (AT2G18590.1 Major facilitator superfamily protein)

HSP 1 Score: 529.6 bits (1363), Expect = 1.9e-150
Identity = 278/471 (59.02%), Positives = 352/471 (74.73%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK+  +YG+SISL++INLA +M+RADE L+PS  KE+ EAF+A  SD+G L+FIRN VQG
Sbjct: 3   MKSGTIYGISISLIIINLATMMQRADEKLIPSTAKELKEAFHAKLSDIGLLSFIRNIVQG 62

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAG+  + YDRP V A G+F W  ST A G+S  F QV    A NG G AIV P L
Sbjct: 63  LASPLAGLFAISYDRPTVFAFGSFFWVSSTVATGVSRYFIQVTLGVAFNGVGHAIVYPVL 122

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QS IADS+ +  RG GFGL +LIG++GGIGG V+ TVMAG  +FGI GWRCAFIL ATLS
Sbjct: 123 QSIIADSFKESSRGFGFGLWNLIGTVGGIGGTVVPTVMAGHDFFGISGWRCAFILSATLS 182

Query: 181 AIIGILVYMFVVDPR--KTISNIQDSSDRYRLRDSLINRT--SSNSSSIWFESWNAMKAV 240
            I+GILV+ FV DPR  KT S I    D++  RD     T   S SSS+W ESW A+K V
Sbjct: 183 TIVGILVFFFVSDPREKKTSSVIVHHDDQHE-RDENNGGTMMESPSSSVWKESWVAIKDV 242

Query: 241 MKVRTFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGL 300
            K+RTFQ+IVLQGIVGS+PW AM+F+TMWFELIGF HN  A+L  +FA G A+GSL+GG+
Sbjct: 243 TKLRTFQIIVLQGIVGSVPWNAMLFWTMWFELIGFDHNQAALLNGIFATGQAIGSLVGGI 302

Query: 301 IADRLSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWN 360
           IAD++S+++P+SGR++CAQFS  MG  FS+ LLR++PQS++S+ +F VTL LMGLTI+W 
Sbjct: 303 IADKMSRVFPNSGRLICAQFSVFMGAMFSIVLLRMIPQSVNSFYIFLVTLFLMGLTITWC 362

Query: 361 GTAVNAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----G 420
           G A+N+PI AE+VP KHRTM+YAFDRA E +FSSF APLVGI+SEK+FG+D   I     
Sbjct: 363 GPAINSPILAEIVPAKHRTMVYAFDRALEVTFSSFGAPLVGIMSEKLFGFDAKGIDHVND 422

Query: 421 SLPKALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKL 464
           S  +A AL KG++ MMA+PFG+CCL YTPL+  FR DR+  +  SS+  ++
Sbjct: 423 SGREAEALGKGIMWMMALPFGLCCLCYTPLHFLFRKDRKIDRTTSSREVEM 472

BLAST of CmoCh20G011490 vs. TAIR10
Match: AT5G10190.1 (AT5G10190.1 Major facilitator superfamily protein)

HSP 1 Score: 362.5 bits (929), Expect = 4.0e-100
Identity = 200/465 (43.01%), Positives = 280/465 (60.22%), Query Frame = 1

Query: 6   MYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQGLCSPL 65
           M   +++L+L+ LA IMERADE+LLP VYKEV +A +  P+ LG LT  R+ VQ  C PL
Sbjct: 1   MKSETLTLVLVYLAGIMERADESLLPGVYKEVGDALHVDPTALGTLTLFRSIVQSSCYPL 60

Query: 66  AGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPALQSFIA 125
           A  L  +++R  V+A+G F WA +T  V +S  F QVA  R LNG GLAIV PA+QS +A
Sbjct: 61  AAYLSSRHNRAHVIALGAFLWATATFLVAVSTTFFQVAVSRGLNGIGLAIVTPAIQSLVA 120

Query: 126 DSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLSAIIGI 185
           DS  D  RGM FG L    ++G I G V + + A + + G+ GWR AF+L+A +S I+GI
Sbjct: 121 DSTDDYNRGMAFGWLGFTSNIGSILGYVCSILFASKSFNGVAGWRIAFLLVAVVSVIVGI 180

Query: 186 LVYMFVVDP----RKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVRT 245
           LV +F  DP    RK   +++D      +RD L                   K V+K+ +
Sbjct: 181 LVRLFATDPHYSDRKITKHVKDKPFWSDIRDLL----------------KEAKMVIKIPS 240

Query: 246 FQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADRL 305
           FQ+ V QG+ GS PW+A+ F  +W ELIGFSH +TAVL++LF + C+LG L GG + D L
Sbjct: 241 FQIFVAQGVSGSFPWSALAFAPLWLELIGFSHKTTAVLVTLFTISCSLGGLFGGYMGDTL 300

Query: 306 SKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAVN 365
           +K +P+ GRI  +Q S+   IP +  LL  +P    +     + L++MGL ISWNG A N
Sbjct: 301 AKKFPNGGRIFLSQVSSGSAIPLAAILLIGLPDDPSTAFSHGLVLVIMGLCISWNGAATN 360

Query: 366 APIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAIGSL------- 425
            PIFAE+VP + RT IYA DR+FE   +SFA P+VG+L++ ++GY     GS        
Sbjct: 361 GPIFAEIVPERARTSIYALDRSFESILASFAPPIVGMLAQNIYGYKPIPEGSTSSVKIDT 420

Query: 426 --PKALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQS 458
               A +L+K L   + +P  +CC  Y+ LY  +  DR+ A+MQ+
Sbjct: 421 DRANAASLAKALYTSIGIPMVICCTIYSFLYCTYPRDRDRAKMQA 449

BLAST of CmoCh20G011490 vs. TAIR10
Match: AT1G78130.1 (AT1G78130.1 Major facilitator superfamily protein)

HSP 1 Score: 354.4 bits (908), Expect = 1.1e-97
Identity = 197/461 (42.73%), Positives = 276/461 (59.87%), Query Frame = 1

Query: 6   MYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQGLCSPL 65
           M   +++LLL+NLA IMERADE+LLP VYKEV  A +  P+ LG LT +R+ VQ  C PL
Sbjct: 1   MKAETMTLLLVNLAGIMERADESLLPGVYKEVGLALHTDPTGLGSLTLLRSMVQAACYPL 60

Query: 66  AGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPALQSFIA 125
           A  + ++++R  V+A+G F W+ +T  V  S  F QVA  RALNG GLA+V PA+QS +A
Sbjct: 61  AAYMAIRHNRAHVIALGAFLWSAATFLVAFSSTFFQVAVSRALNGIGLALVAPAIQSLVA 120

Query: 126 DSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLSAIIGI 185
           DS  D  RG  FG L L  ++G I GG+ + ++A   + GI GWR AF ++  +S I+G+
Sbjct: 121 DSTDDANRGTAFGWLQLTANIGSILGGLCSVLIAPLTFMGIPGWRVAFHIVGVISVIVGV 180

Query: 186 LVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVRTFQVI 245
           LV +F  DP      + D S++   R                +       V+K+R+FQ+I
Sbjct: 181 LVRVFANDPHFVKDGV-DVSNQPGSRKPFCTEVK--------DLVREADTVIKIRSFQII 240

Query: 246 VLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADRLSKIY 305
           V QG+ GS PW+A+ F  MW ELIGFSH  TA L+ LF    +LG L GG + D LS   
Sbjct: 241 VAQGVTGSFPWSALSFAPMWLELIGFSHGKTAFLMGLFVAASSLGGLFGGKMGDFLSTRL 300

Query: 306 PHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAVNAPIF 365
           P+SGRI+ AQ S++  IP +  LL V+P    +  +  + L+L+GL +SWN  A N PIF
Sbjct: 301 PNSGRIILAQISSASAIPLAAILLLVLPDDPSTAAIHGLILVLLGLFVSWNAPATNNPIF 360

Query: 366 AEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAIGS---------LPK 425
           AE+VP K RT +YA D++FE   SSFA P+VGIL++ ++GY     GS            
Sbjct: 361 AEIVPEKSRTSVYALDKSFESILSSFAPPIVGILAQHVYGYKPIPEGSSRSTEIATDREN 420

Query: 426 ALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQS 458
           A +L+K L   + +P   CC  Y+ LY  + LDR+ A+M++
Sbjct: 421 AASLAKALYTSIGLPMAACCFIYSFLYRSYPLDRDRARMEA 452

BLAST of CmoCh20G011490 vs. NCBI nr
Match: gi|449461421|ref|XP_004148440.1| (PREDICTED: protein spinster [Cucumis sativus])

HSP 1 Score: 830.1 bits (2143), Expect = 1.9e-237
Identity = 425/467 (91.01%), Positives = 444/467 (95.07%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MKN+K+YGVSISL+LINLAAIMERADENLLPSVYKEVSE FNASPSDLGYLTFIRNFVQG
Sbjct: 1   MKNRKIYGVSISLILINLAAIMERADENLLPSVYKEVSETFNASPSDLGYLTFIRNFVQG 60

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           LCSPLAGILVL YDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRA+NGFGLAIVIPAL
Sbjct: 61  LCSPLAGILVLSYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRAVNGFGLAIVIPAL 120

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFG++GWRCAFILMATLS
Sbjct: 121 QSFIADSYMDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGVEGWRCAFILMATLS 180

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           AIIGILVYMFVVDPRKTI+NIQ+SSDRY  RD+LI+RT  NSSSIWFESW+AMKAVMKV 
Sbjct: 181 AIIGILVYMFVVDPRKTINNIQESSDRYLRRDNLIDRTLPNSSSIWFESWSAMKAVMKVH 240

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGFSHN TAVLLSLFAVGCALGSL+GGLIADR
Sbjct: 241 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFSHNGTAVLLSLFAVGCALGSLLGGLIADR 300

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           LSKIYPHSGRIMCAQFSASMGIPFSL LLRV+PQS+DS L+F VTL LMGLTISWNGTAV
Sbjct: 301 LSKIYPHSGRIMCAQFSASMGIPFSLLLLRVIPQSVDSLLIFGVTLFLMGLTISWNGTAV 360

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAIGSLPKALAL 420
           NAPIFAEVVP+KHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYD+ A  SL KALAL
Sbjct: 361 NAPIFAEVVPIKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDDTAGASLLKALAL 420

Query: 421 SKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLIDDL 468
           SKGLL MM VPFGVCCL YTPLY YFRLDRENA+MQ SKGTK IDDL
Sbjct: 421 SKGLLTMMTVPFGVCCLCYTPLYKYFRLDRENARMQGSKGTKSIDDL 467

BLAST of CmoCh20G011490 vs. NCBI nr
Match: gi|659116082|ref|XP_008457892.1| (PREDICTED: putative glycerol-3-phosphate transporter 2 [Cucumis melo])

HSP 1 Score: 821.2 bits (2120), Expect = 9.0e-235
Identity = 418/467 (89.51%), Positives = 442/467 (94.65%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MKN+K+YGVSISL+LINLAAIMERADENLLPSVYKEVSE FNASPSDLGYLTFIRNFVQG
Sbjct: 1   MKNRKIYGVSISLILINLAAIMERADENLLPSVYKEVSETFNASPSDLGYLTFIRNFVQG 60

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           LCSPLAGILVL YDRPKVLAMGTFCWALSTAAVGI LEFKQVAFWRA+NGFGLAIVIPAL
Sbjct: 61  LCSPLAGILVLSYDRPKVLAMGTFCWALSTAAVGICLEFKQVAFWRAVNGFGLAIVIPAL 120

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFG++GWRCAFILMATLS
Sbjct: 121 QSFIADSYMDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGVEGWRCAFILMATLS 180

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           AIIGILVYMFVVDPRKTI++IQ+SSDRY+ RD+LI+RT  NSSSIWFESW AMKAVMKV 
Sbjct: 181 AIIGILVYMFVVDPRKTINHIQESSDRYQRRDNLIDRTLPNSSSIWFESWTAMKAVMKVH 240

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGFSHN TAVLLSLFAVGCALG L+GGLIADR
Sbjct: 241 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFSHNGTAVLLSLFAVGCALGCLLGGLIADR 300

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           LSKIYPHSGRIMCAQFSASMGIPFSL LLR++PQS+DS L+F VTL LMGLTISWNGTAV
Sbjct: 301 LSKIYPHSGRIMCAQFSASMGIPFSLLLLRLIPQSVDSLLIFGVTLFLMGLTISWNGTAV 360

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAIGSLPKALAL 420
           NAPIFAEVVP+KHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYD+ A  S+ KALAL
Sbjct: 361 NAPIFAEVVPIKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDDTAGASILKALAL 420

Query: 421 SKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLIDDL 468
           SKGLL MM VPFG+CCL YTPLY YFRLDRENA+MQ SKG+K IDDL
Sbjct: 421 SKGLLTMMTVPFGICCLCYTPLYKYFRLDRENARMQGSKGSKSIDDL 467

BLAST of CmoCh20G011490 vs. NCBI nr
Match: gi|1000953742|ref|XP_002525099.2| (PREDICTED: uncharacterized protein LOC8264184 isoform X1 [Ricinus communis])

HSP 1 Score: 718.4 bits (1853), Expect = 8.2e-204
Identity = 365/467 (78.16%), Positives = 416/467 (89.08%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK +K++GVS+SL+LINLAAIMERADENLLP+VYKEVSE FNA PSDLGYLTFIRNFVQG
Sbjct: 30  MKTRKIFGVSLSLILINLAAIMERADENLLPAVYKEVSETFNAGPSDLGYLTFIRNFVQG 89

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAG+LV+ YDRP VLAMGTFCWALSTAAVG S  F QVAFWR +NGFGLAIVIPAL
Sbjct: 90  LSSPLAGVLVINYDRPTVLAMGTFCWALSTAAVGASHHFLQVAFWRGVNGFGLAIVIPAL 149

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY +GVRG GFGL++LIG+LGGIGGGVLATVMAGQQY+GI GWRCAFI+MATLS
Sbjct: 150 QSFIADSYMEGVRGAGFGLVNLIGNLGGIGGGVLATVMAGQQYWGIPGWRCAFIMMATLS 209

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
           +IIG LV++FV+DPRKTIS  +D+ + +  RD LI R+SS++SS+W ESW AM+AV+KV+
Sbjct: 210 SIIGFLVFLFVIDPRKTISIPRDTRESFE-RDELIERSSSSASSVWTESWTAMQAVIKVK 269

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFF MWFELIGFSHNSTA LLSLFAVGCALGSL+GGLIADR
Sbjct: 270 TFQIIVLQGIVGSLPWTAMVFFAMWFELIGFSHNSTAFLLSLFAVGCALGSLIGGLIADR 329

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           LS  YPHSGRIMCAQFSA MGIPFS FLL+ +P S+ SY  FAVT+ +MGLTISWNGTA 
Sbjct: 330 LSHTYPHSGRIMCAQFSALMGIPFSWFLLKEIPLSVSSYHTFAVTIFMMGLTISWNGTAA 389

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPK 420
           NAP+FAEVVP+KHRTMIYAFDRAFEGS SSFAAPLVGILSEKMFGYD+ +I    GS+ +
Sbjct: 390 NAPMFAEVVPVKHRTMIYAFDRAFEGSLSSFAAPLVGILSEKMFGYDSKSIDPVKGSVQE 449

Query: 421 ALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKL 464
           A ALSKGLL+MMAVPFG+CCLFYTPLY +FR DRENA++ S+K  ++
Sbjct: 450 ASALSKGLLSMMAVPFGLCCLFYTPLYKFFRQDRENARIASAKEAEM 495

BLAST of CmoCh20G011490 vs. NCBI nr
Match: gi|223535558|gb|EEF37226.1| (carbohydrate transporter, putative [Ricinus communis])

HSP 1 Score: 715.7 bits (1846), Expect = 5.3e-203
Identity = 363/466 (77.90%), Positives = 415/466 (89.06%), Query Frame = 1

Query: 2   KNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQGL 61
           + +K++GVS+SL+LINLAAIMERADENLLP+VYKEVSE FNA PSDLGYLTFIRNFVQGL
Sbjct: 20  RTRKIFGVSLSLILINLAAIMERADENLLPAVYKEVSETFNAGPSDLGYLTFIRNFVQGL 79

Query: 62  CSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPALQ 121
            SPLAG+LV+ YDRP VLAMGTFCWALSTAAVG S  F QVAFWR +NGFGLAIVIPALQ
Sbjct: 80  SSPLAGVLVINYDRPTVLAMGTFCWALSTAAVGASHHFLQVAFWRGVNGFGLAIVIPALQ 139

Query: 122 SFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLSA 181
           SFIADSY +GVRG GFGL++LIG+LGGIGGGVLATVMAGQQY+GI GWRCAFI+MATLS+
Sbjct: 140 SFIADSYMEGVRGAGFGLVNLIGNLGGIGGGVLATVMAGQQYWGIPGWRCAFIMMATLSS 199

Query: 182 IIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVRT 241
           IIG LV++FV+DPRKTIS  +D+ + +  RD LI R+SS++SS+W ESW AM+AV+KV+T
Sbjct: 200 IIGFLVFLFVIDPRKTISIPRDTRESFE-RDELIERSSSSASSVWTESWTAMQAVIKVKT 259

Query: 242 FQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADRL 301
           FQ+IVLQGIVGSLPWTAMVFF MWFELIGFSHNSTA LLSLFAVGCALGSL+GGLIADRL
Sbjct: 260 FQIIVLQGIVGSLPWTAMVFFAMWFELIGFSHNSTAFLLSLFAVGCALGSLIGGLIADRL 319

Query: 302 SKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAVN 361
           S  YPHSGRIMCAQFSA MGIPFS FLL+ +P S+ SY  FAVT+ +MGLTISWNGTA N
Sbjct: 320 SHTYPHSGRIMCAQFSALMGIPFSWFLLKEIPLSVSSYHTFAVTIFMMGLTISWNGTAAN 379

Query: 362 APIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPKA 421
           AP+FAEVVP+KHRTMIYAFDRAFEGS SSFAAPLVGILSEKMFGYD+ +I    GS+ +A
Sbjct: 380 APMFAEVVPVKHRTMIYAFDRAFEGSLSSFAAPLVGILSEKMFGYDSKSIDPVKGSVQEA 439

Query: 422 LALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKL 464
            ALSKGLL+MMAVPFG+CCLFYTPLY +FR DRENA++ S+K  ++
Sbjct: 440 SALSKGLLSMMAVPFGLCCLFYTPLYKFFRQDRENARIASAKEAEM 484

BLAST of CmoCh20G011490 vs. NCBI nr
Match: gi|743838454|ref|XP_011025716.1| (PREDICTED: uncharacterized protein LOC105126529 [Populus euphratica])

HSP 1 Score: 712.2 bits (1837), Expect = 5.9e-202
Identity = 360/468 (76.92%), Positives = 411/468 (87.82%), Query Frame = 1

Query: 1   MKNKKMYGVSISLLLINLAAIMERADENLLPSVYKEVSEAFNASPSDLGYLTFIRNFVQG 60
           MK +K+ GVS+S+ LIN+AAIMERADENLLP+VYKEVSEAFNA PSDLGYLTFIRNFVQG
Sbjct: 30  MKGRKILGVSLSIFLINMAAIMERADENLLPAVYKEVSEAFNAGPSDLGYLTFIRNFVQG 89

Query: 61  LCSPLAGILVLKYDRPKVLAMGTFCWALSTAAVGISLEFKQVAFWRALNGFGLAIVIPAL 120
           L SPLAGILV+ + RP VLAMGT CWALSTAAVG S  F Q AFWRA+NGFGLAIVIPAL
Sbjct: 90  LSSPLAGILVINHARPTVLAMGTLCWALSTAAVGASQHFSQAAFWRAVNGFGLAIVIPAL 149

Query: 121 QSFIADSYTDGVRGMGFGLLSLIGSLGGIGGGVLATVMAGQQYFGIQGWRCAFILMATLS 180
           QSFIADSY DGVRG GFGLLS IG+LGGIGGGVLATVMAGQQY+G+QGWR AFI+MA+LS
Sbjct: 150 QSFIADSYKDGVRGTGFGLLSFIGNLGGIGGGVLATVMAGQQYWGVQGWRFAFIMMASLS 209

Query: 181 AIIGILVYMFVVDPRKTISNIQDSSDRYRLRDSLINRTSSNSSSIWFESWNAMKAVMKVR 240
            +IG+LV++FVVDPRKTI   +D S+ +  RD L+ + +S+  SIW ESW A KAVMKV+
Sbjct: 210 LLIGLLVFLFVVDPRKTIGVNRDISENFE-RDELVEKGNSHELSIWTESWTATKAVMKVK 269

Query: 241 TFQVIVLQGIVGSLPWTAMVFFTMWFELIGFSHNSTAVLLSLFAVGCALGSLMGGLIADR 300
           TFQ+IVLQGIVGSLPWTAMVFFTMWFELIGF+HN TA LLS FAVGC+LGSL+GG+IADR
Sbjct: 270 TFQIIVLQGIVGSLPWTAMVFFTMWFELIGFNHNKTAALLSFFAVGCSLGSLLGGIIADR 329

Query: 301 LSKIYPHSGRIMCAQFSASMGIPFSLFLLRVVPQSIDSYLVFAVTLLLMGLTISWNGTAV 360
           +S IYPHSGRIMCAQFSA MGIPFS FLL V+PQS+ SY  FAVTL +MGLTISWNGTAV
Sbjct: 330 MSHIYPHSGRIMCAQFSAFMGIPFSWFLLNVIPQSVSSYFTFAVTLFMMGLTISWNGTAV 389

Query: 361 NAPIFAEVVPMKHRTMIYAFDRAFEGSFSSFAAPLVGILSEKMFGYDNAAI----GSLPK 420
           NAPIFAEVVP+KHRTMIYA+DRAFEGSFSSFAAPLVGILSE+MFGYD+ ++    GS+ +
Sbjct: 390 NAPIFAEVVPVKHRTMIYAYDRAFEGSFSSFAAPLVGILSEQMFGYDSKSVDPIKGSVRE 449

Query: 421 ALALSKGLLAMMAVPFGVCCLFYTPLYIYFRLDRENAQMQSSKGTKLI 465
           A ALSKGLL+MMA+PFG+CCLFYTPLYI+FR DRENA+M  SK  +++
Sbjct: 450 ASALSKGLLSMMAIPFGLCCLFYTPLYIFFRQDRENARMAGSKALEIM 496
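
The percentages in the HSP summary lines follow directly from the counts: matching positions divided by alignment length, rounded to two decimals. A minimal sketch that reproduces them from the three HSPs shown above (the helper name `percent` and the list `hsp_counts` are illustrative, not part of any BLAST tool):

```python
def percent(part: int, total: int) -> float:
    """Return part/total as a percentage rounded to two decimals."""
    return round(100.0 * part / total, 2)


# (identical positions, positive positions, alignment length),
# copied from the three HSP summary lines above.
hsp_counts = [
    (365, 416, 467),
    (363, 415, 466),
    (360, 411, 468),
]

for identical, positive, length in hsp_counts:
    print(
        f"Identity = {identical}/{length} ({percent(identical, length):.2f}%), "
        f"Positives = {positive}/{length} ({percent(positive, length):.2f}%)"
    )
```

The alignment length in the denominator includes gap columns, which is why it can exceed the number of query residues covered by the HSP.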

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0LQ36_CUCSA  1.3e-237  91.01         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G296060 PE=4 SV=1
B9SGI0_RICCO      3.7e-203  77.90         Carbohydrate transporter, putative OS=Ricinus communis GN=RCOM_0553180 PE=4 SV=1
B9HEN5_POPTR      2.1e-198  76.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s12360g PE=4 SV=2
A0A061DGI1_THECC  3.6e-198  75.21         Major facilitator superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_000452...
A0A0D2TIS2_GOSRA  1.4e-197  75.48         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G206200 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G36790.1  3.7e-186  71.00         Major facilitator superfamily protein
AT2G18590.1  1.9e-150  59.02         Major facilitator superfamily protein
AT5G10190.1  4.0e-100  43.01         Major facilitator superfamily protein
AT1G78130.1  1.1e-97   42.73         Major facilitator superfamily protein

Match Name                         E-value   Identity (%)  Description
gi|449461421|ref|XP_004148440.1|   1.9e-237  91.01         PREDICTED: protein spinster [Cucumis sativus]
gi|659116082|ref|XP_008457892.1|   9.0e-235  89.51         PREDICTED: putative glycerol-3-phosphate transporter 2 [Cucumis melo]
gi|1000953742|ref|XP_002525099.2|  8.2e-204  78.16         PREDICTED: uncharacterized protein LOC8264184 isoform X1 [Ricinus communis]
gi|223535558|gb|EEF37226.1|        5.3e-203  77.90         carbohydrate transporter, putative [Ricinus communis]
gi|743838454|ref|XP_011025716.1|   5.9e-202  76.92         PREDICTED: uncharacterized protein LOC105126529 [Populus euphratica]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR011701  MFS
IPR020846  MFS_dom

Vocabulary: Cellular Component
Term        Definition
GO:0016021  integral component of membrane

Vocabulary: Biological Process
Term        Definition
GO:0055085  transmembrane transport

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055085 transmembrane transport
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh20G011490.1  CmoCh20G011490.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                       Source   Source Term         Source Description                 Alignment
IPR011701  Major facilitator superfamily         PFAM     PF07690             MFS_1                              coord: 18..391, score: 4.6
IPR020846  Major facilitator superfamily domain  PROFILE  PS50850             MFS                                coord: 12..434, score: 21
IPR020846  Major facilitator superfamily domain  unknown  SSF103473           MFS general substrate transporter  coord: 12..447, score: 2.09
None       No IPR available                      GENE3D   G3DSA:1.20.1250.20  -                                  coord: 248..424, score: 1.1E-10
                                                                                                                 coord: 13..197, score: 3.2
None       No IPR available                      PANTHER  PTHR23505           FAMILY NOT NAMED                   coord: 1..459, score: 3.8E
None       No IPR available                      PANTHER  PTHR23505:SF7       SUBFAMILY NOT NAMED                coord: 1..459, score: 3.8E
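
Each `coord: start..end` entry above gives the region of the protein covered by a signature hit. As a rough sketch, the span length of each hit can be recovered with a small parser; this assumes the coordinates are 1-based and inclusive (the page does not state this), and the names `COORD_RE` and `span_lengths` are hypothetical helpers, not part of InterProScan.

```python
import re

# Matches entries such as "coord: 18..391" (format as shown on this page).
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text: str) -> list[tuple[int, int, int]]:
    """Return (start, end, length) for every coord entry found in text.

    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD_RE.findall(text)
    ]


# Alignment fields copied from the Pfam and Gene3D rows above.
pfam = "coord: 18..391 score: 4.6"
gene3d = "coord: 248..424 score: 1.1E-10 coord: 13..197 score: 3.2"

print(span_lengths(pfam))
print(span_lengths(gene3d))
```

Under the same assumption, the two Gene3D hits cover separate N-terminal and C-terminal regions of the protein rather than one continuous span.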

The following gene(s) are paralogous to this gene:

None