Cp4.1LG01g15040 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g15040
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Major facilitator superfamily protein
Location: Cp4.1LG01 : 7194981 .. 7197900 (+)
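
The Location field above gives a sequence identifier, a start and end coordinate, and a strand. The following is a minimal sketch of how such a string could be parsed; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates (the GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 7194981 .. 7197900 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2920 bases on the forward strand of Cp4.1LG01.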
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGAAGAAAAATCCCCCGCCCCTCTCCTCCACCTCCCCACCAAACTTCCCGATCACCGGAACCACTCCGACCGCCCCTCCCAACCACCCGCCGGCGGGTGGAAAGCTGCAATCTTTGTCATATGTAAAATTTCATACTTCTTTTTCCTTTTCTGGGTCTCTCAATTTTCTGACAAATTTTTCATTTCTGACAGTCGTCGAGGTCGCCGAGCAATTTGCTTTCATCGGCTTGTCCAGCAATCTCATCATGTACTTCACCACCGTCTTCCATGAACCCACCGCCACCGCCGCCAAGATGGTCAACAACTGGTCCGGCGTCTCCGCCGTCTTCCCAATACTCGGCGCCTTCGTCGCCGACTCACTTCTCGGCCGGTTCAAAACCATAATATTTTCCTCTCTTATATACTGCCTTGGAATGGTGCTGTTGACGCTATCGGCGACGGTGATCGGAGCACCTCACCGGAAACCCGTGTTCTTTTTTGCACTGTACATTTTGTCGGTCGGAGAAGGCGGCCACCGTCCTTGTGTACAGACGTTCGCCGCCGATCAATTCGACGAGGAGACGCCGGAGCAGAGGAAGAGGAAAAGCTCGTTCTTTAATTGGTGGTATGTGGGCCTTGTGGTGGGGTCCACTTTAGCTGTCTTTCTGGTTATTTACGTTCAGGTATTGTTTTATTTTTTATTTATTTATTTATTTAGAAGTGGTAACGTTTTATTATTTATTATGGCTGCAAGAAAAAGACAAAATATCACCGAAAAAAATCAATTAAACCGGCTCAATTTGATACCGGCTTATTAAAATCGGTTCTGATCGACCGGCTTCCGGTTTGATTAAAAATAGGTTCAAATATGGGATGACCAAAATTATGCTTTTTTTTTTTTTTTATTAAAAATTTTACTTTAAACTTTTTATTTATGTGGGGACAAAAAAAAAAAAAGAAATTTTTTTTGCTTTAATTCTTTAAATAGGCATTGGGATATGAATAGCGAACAATTTTAGAAAAAAAAAGAAAAAGAAAAGGAAATATTATAAAATTTAAAATTTGTTTAATATTTAAATATCTAATTACCTCAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNGATGGGGACTGAGTTTTGGAATTCTAGCCGGAGTTTTGGCGGCGGCGTTACTACTGTTTTTGTGCGGAGTGAAGGTGTACAGACGCCATATTCCTGTCGGAAGTCCGTTGACCAGAATAGCACAGGTGGTGGTGGCGGCGGCCAGGAAGTGGCGCGTGGATAATACGCGCCAGGAATGGAGAGTGTGTTACGAGGAAGATAGTCATGCCAAAAATGAGGATGAAGGTCAACACAAGCCCATGACTTTGGATCGCAGAAGCCAATTCGGGTATGTCTTAATTTTCATTCATAGGTCAACTCTCTTTTTTTTCAAAAAATAATAATATTTTAATATTTATTGTTTTTTTTTTTTTTTTTAAATTTTAATTTTGTTTTTTAGCCTAAGTTTCCATGTTATTTAATGAAAATTTTAATTTAATAAATTCTTAAATATTAAAAATATTTCATAAATATATAAAATTTGAATATTGTGTTATAATGACCTAAAGGTCGCTTTTTTATCGGCCACGTTTAATGTTGTATTAAACATGATCTCGTTTGTAGTAGGATTTTGGATAAGGCGACTTTGATAGATGATGAGGACAAGGCGAGGAAGAAGCGAGACCCGTGGCGACTAAGCACGGTGGCGGAGGTGGAGGAGGTAAAGATGTTGGGACGACTAATTCCGGTGTGGTTTAGTTGTTTAATGTTTGCAGTGGTACAAGCTCAGATCCACACTTTCTTCACAAAGCAAGGCTCCACCATGCTCCGCTCCATCGGCCCCCACTTTCAGATCCCGCCCGCGTCTCTTCAAGGCGTCGTCGGCCTCACCATCCTTCTCACCATTCTTTTCTACGACCGAGTTTTTGTCCCCTCCGCCCGAAAGTTTACTGGC
CACCACTCCGGTAAGTACTGAATATAAGTCGAAATTAGGTCCAAATTATAGGTTAAGGAAATGAATATACGTCGATTATTAATGTTTTGTGTAGGCATAACGGTACTACAAAGAATAGGAATAGGCCTATTCATCTCAATCCTCAACATGGTCGCCTCGGCTTTGGTAGAGGCCAAAAGGGTCGCTGTGGCCGCCGAACACGGCCTCATCGACACTCCAAAAGCGACGGTTCCGATGACAATCTGGTGGCTAATTCCACAATACATGCTTTGTGGCGTCTCCGATGCCTTCACCGTCGTCGGTCTCCAAGAACTTTTCTACGACCAAATGCCCGAATCCATGAGGAGCATAGGATCTGCGGCGTACATCAGCGTGATCGGAATGGGGAACTTTCTGAGCACCGCCGTCATCTCGGCAGTCCAGGCGGCGAGCCGCCACAAATGGCTGGTGGACAATCTGAACCGTTCAAAACTGCAGTACTTCTACTGGGTTTTGGCTGGTTTGAGCGGTTTGAATTTGTGTTGCTACATTTGGGTTGCCAATGGTTATGTGTATAAGAGAGTTGGAGGGAAAGATGGTGAAGATCATGGGAAGAACAGCAACAAAGGAGTCTATGGAGATGACATGATTTGAAGATTAATCATTATCATATGATTAGAGGGAGAACATATAATTATGAGATTAATCTCTCTAATCATCATTAATCTTTGTTGTCTTCTAATAATTTGTAGTTAAAAGTTATTCTAATGGTGTTTTCTTTTTTTCTCTTTTTTTTNGGATGTGATAATATTTATATTAAATGAAAAGATTAGTAAAATGAGTCAAATTCTATAAAATAATTACATTATTGGTTTAATTAAAACGGGATAGCAGTGTGTGACACTTAGTTGAACCCAGCAAACAAGTTAGTCGTGTAAATT

mRNA sequence

ATGGAGGAAGAAAAATCCCCCGCCCCTCTCCTCCACCTCCCCACCAAACTTCCCGATCACCGGAACCACTCCGACCGCCCCTCCCAACCACCCGCCGGCGGGTGGAAAGCTGCAATCTTTGTCATATTCGTCGAGGTCGCCGAGCAATTTGCTTTCATCGGCTTGTCCAGCAATCTCATCATGTACTTCACCACCGTCTTCCATGAACCCACCGCCACCGCCGCCAAGATGGTCAACAACTGGTCCGGCGTCTCCGCCGTCTTCCCAATACTCGGCGCCTTCGTCGCCGACTCACTTCTCGGCCGGTTCAAAACCATAATATTTTCCTCTCTTATATACTGCCTTGGAATGGTGCTGTTGACGCTATCGGCGACGGTGATCGGAGCACCTCACCGGAAACCCGTGTTCTTTTTTGCACTGTACATTTTGTCGGTCGGAGAAGGCGGCCACCGTCCTTGTGTACAGACGTTCGCCGCCGATCAATTCGACGAGGAGACGCCGGAGCAGAGGAAGAGGAAAAGCTCGTTCTTTAATTGGTGGTATGTGGGCCTTGTGGTGGGGTCCACTTTAGCTGTCTTTCTGGTTATTTACGTTCAGGTGTACAGACGCCATATTCCTGTCGGAAGTCCGTTGACCAGAATAGCACAGGTGGTGGTGGCGGCGGCCAGGAAGTGGCGCGTGGATAATACGCGCCAGGAATGGAGAGTGTGTTACGAGGAAGATAGTCATGCCAAAAATGAGGATGAAGGTCAACACAAGCCCATGACTTTGGATCGCAGAAGCCAATTCGGGATTTTGGATAAGGCGACTTTGATAGATGATGAGGACAAGGCGAGGAAGAAGCGAGACCCGTGGCGACTAAGCACGGTGGCGGAGGTGGAGGAGGTAAAGATGTTGGGACGACTAATTCCGGTGTGGTTTAGTTGTTTAATGTTTGCAGTGGTACAAGCTCAGATCCACACTTTCTTCACAAAGCAAGGCTCCACCATGCTCCGCTCCATCGGCCCCCACTTTCAGATCCCGCCCGCGTCTCTTCAAGGCGTCGTCGGCCTCACCATCCTTCTCACCATTCTTTTCTACGACCGAGTTTTTGTCCCCTCCGCCCGAAAGTTTACTGGCCACCACTCCGGCATAACGGTACTACAAAGAATAGGAATAGGCCTATTCATCTCAATCCTCAACATGGTCGCCTCGGCTTTGGTAGAGGCCAAAAGGGTCGCTGTGGCCGCCGAACACGGCCTCATCGACACTCCAAAAGCGACGGTTCCGATGACAATCTGGTGGCTAATTCCACAATACATGCTTTGTGGCGTCTCCGATGCCTTCACCGTCGTCGGTCTCCAAGAACTTTTCTACGACCAAATGCCCGAATCCATGAGGAGCATAGGATCTGCGGCGTACATCAGCGTGATCGGAATGGGGAACTTTCTGAGCACCGCCGTCATCTCGGCAGTCCAGGCGGCGAGCCGCCACAAATGGCTGGTGGACAATCTGAACCGTTCAAAACTGCAGTACTTCTACTGGGTTTTGGCTGGTTTGAGCGGTTTGAATTTGTGTTGCTACATTTGGGTTGCCAATGGTTATGTGTATAAGAGAGTTGGAGGGAAAGATGGTGAAGATCATGGGAAGAACAGCAACAAAGGAGTCTATGGAGATGACATGATTTGAAGATTAATCATTATCATATGATTAGAGGGAGAACATATAATTATGAGATTAATCTCTCTAATCATCATTAATCTTTGTTGTCTTCTAATAATTTGTAGTTAAAAGTTATTCTAATGGTGTTTTCTTTTTTTCTCTTTTTTTTNGGATGTGATAATATTTATATTAAATGAAAAGATTAGTAAAATGAGTCAAATTCTATAAAATAATTACATTATTGGTTTAATTAAAACGGGATAGCAGTGTGTGACACTTAGTTGAACCCAGCAAACAAGTTAGTCGTGTAAATT

Coding sequence (CDS)

ATGGAGGAAGAAAAATCCCCCGCCCCTCTCCTCCACCTCCCCACCAAACTTCCCGATCACCGGAACCACTCCGACCGCCCCTCCCAACCACCCGCCGGCGGGTGGAAAGCTGCAATCTTTGTCATATTCGTCGAGGTCGCCGAGCAATTTGCTTTCATCGGCTTGTCCAGCAATCTCATCATGTACTTCACCACCGTCTTCCATGAACCCACCGCCACCGCCGCCAAGATGGTCAACAACTGGTCCGGCGTCTCCGCCGTCTTCCCAATACTCGGCGCCTTCGTCGCCGACTCACTTCTCGGCCGGTTCAAAACCATAATATTTTCCTCTCTTATATACTGCCTTGGAATGGTGCTGTTGACGCTATCGGCGACGGTGATCGGAGCACCTCACCGGAAACCCGTGTTCTTTTTTGCACTGTACATTTTGTCGGTCGGAGAAGGCGGCCACCGTCCTTGTGTACAGACGTTCGCCGCCGATCAATTCGACGAGGAGACGCCGGAGCAGAGGAAGAGGAAAAGCTCGTTCTTTAATTGGTGGTATGTGGGCCTTGTGGTGGGGTCCACTTTAGCTGTCTTTCTGGTTATTTACGTTCAGGTGTACAGACGCCATATTCCTGTCGGAAGTCCGTTGACCAGAATAGCACAGGTGGTGGTGGCGGCGGCCAGGAAGTGGCGCGTGGATAATACGCGCCAGGAATGGAGAGTGTGTTACGAGGAAGATAGTCATGCCAAAAATGAGGATGAAGGTCAACACAAGCCCATGACTTTGGATCGCAGAAGCCAATTCGGGATTTTGGATAAGGCGACTTTGATAGATGATGAGGACAAGGCGAGGAAGAAGCGAGACCCGTGGCGACTAAGCACGGTGGCGGAGGTGGAGGAGGTAAAGATGTTGGGACGACTAATTCCGGTGTGGTTTAGTTGTTTAATGTTTGCAGTGGTACAAGCTCAGATCCACACTTTCTTCACAAAGCAAGGCTCCACCATGCTCCGCTCCATCGGCCCCCACTTTCAGATCCCGCCCGCGTCTCTTCAAGGCGTCGTCGGCCTCACCATCCTTCTCACCATTCTTTTCTACGACCGAGTTTTTGTCCCCTCCGCCCGAAAGTTTACTGGCCACCACTCCGGCATAACGGTACTACAAAGAATAGGAATAGGCCTATTCATCTCAATCCTCAACATGGTCGCCTCGGCTTTGGTAGAGGCCAAAAGGGTCGCTGTGGCCGCCGAACACGGCCTCATCGACACTCCAAAAGCGACGGTTCCGATGACAATCTGGTGGCTAATTCCACAATACATGCTTTGTGGCGTCTCCGATGCCTTCACCGTCGTCGGTCTCCAAGAACTTTTCTACGACCAAATGCCCGAATCCATGAGGAGCATAGGATCTGCGGCGTACATCAGCGTGATCGGAATGGGGAACTTTCTGAGCACCGCCGTCATCTCGGCAGTCCAGGCGGCGAGCCGCCACAAATGGCTGGTGGACAATCTGAACCGTTCAAAACTGCAGTACTTCTACTGGGTTTTGGCTGGTTTGAGCGGTTTGAATTTGTGTTGCTACATTTGGGTTGCCAATGGTTATGTGTATAAGAGAGTTGGAGGGAAAGATGGTGAAGATCATGGGAAGAACAGCAACAAAGGAGTCTATGGAGATGACATGATTTGA

Protein sequence

MEEEKSPAPLLHLPTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQVYRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAASRHKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKRVGGKDGEDHGKNSNKGVYGDDMI
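
The protein sequence above should follow from the coding sequence under the standard genetic code. As an illustrative check (a sketch, not the database's own pipeline), the snippet below translates the first 30 nucleotides of the CDS and compares the result with the first 10 residues of the protein. The helper name `translate` is hypothetical, and using the standard nuclear codon table is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; 'X' marks codons with ambiguous bases."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)

# First 30 nt of the CDS and first 10 residues of the protein, copied from this page.
cds_prefix = "ATGGAGGAAGAAAAATCCCCCGCCCCTCTC"
protein_prefix = "MEEEKSPAPL"
print(translate(cds_prefix) == protein_prefix)
```

The same function can be applied to the full CDS; a trailing `*` in the output then corresponds to the terminal TGA stop codon.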
BLAST of Cp4.1LG01g15040 vs. Swiss-Prot
Match: PTR46_ARATH (Protein NRT1/ PTR FAMILY 5.4 OS=Arabidopsis thaliana GN=NPF5.4 PE=2 SV=1)

HSP 1 Score: 592.0 bits (1525), Expect = 6.6e-168
Identity = 289/538 (53.72%), Positives = 389/538 (72.30%), Query Frame = 1

Query: 33  GGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILG 92
           GGW AA+F+I VE+AE+FAF GL+SNLI + T    + TATAAK +N W GVS +FPILG
Sbjct: 14  GGWNAALFIIVVEIAERFAFYGLASNLITFLTNELGQSTATAAKNINTWIGVSCMFPILG 73

Query: 93  AFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGEGGHRP 152
           AF+ADS+LGRFKT++ +S IY LG+V+L LS TV+    R+ VFF ALY+++VGEGGH+P
Sbjct: 74  AFLADSILGRFKTVLLTSFIYLLGIVMLPLSVTVVARRMREKVFFMALYVMAVGEGGHKP 133

Query: 153 CVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQ------------- 212
           CV TFAADQF E   E++  K+SFFN+WY+ +V+ S++AV  +I++Q             
Sbjct: 134 CVMTFAADQFGEANAEEKAAKTSFFNYWYMAIVLASSIAVLALIFIQERVSWSLGFSIIA 193

Query: 213 ---------------VYRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHA 272
                           YR+ +PVGSP TR+AQV+VAA +KWR+ +TR  + +CYEE+   
Sbjct: 194 GSVVIAIVIFLIGIPKYRKQVPVGSPFTRVAQVMVAALKKWRLSSTRHHYGLCYEEEDEH 253

Query: 273 KNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIP 332
           K E    ++   L R +QF  LDKAT+ID+ D   K R+PWRL TV +VEEVK++ RLIP
Sbjct: 254 KLESTNSNQVYLLARTNQFRFLDKATIIDEIDH-NKNRNPWRLCTVNQVEEVKLILRLIP 313

Query: 333 VWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVF 392
           +W S +MF     Q++TFF KQGS M R+IG HF IPPA+ Q +VG+TIL+ I  YDRVF
Sbjct: 314 IWISLIMFCATLTQLNTFFLKQGSMMDRTIGNHFTIPPAAFQSIVGVTILILIPLYDRVF 373

Query: 393 VPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPM 452
           VP  RK T HHSGIT LQRIG+GLF++  NMV   LVEAKR+ VA +HGLID+PK  VPM
Sbjct: 374 VPMVRKITNHHSGITSLQRIGVGLFVATFNMVICGLVEAKRLKVARDHGLIDSPKEVVPM 433

Query: 453 TIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISA 512
           +  WL+PQY+L G+ D FT+VG+QELFYDQMPE+MRSIG+A +ISV+G+G+F+ST +IS 
Sbjct: 434 SSLWLLPQYILVGIGDVFTIVGMQELFYDQMPETMRSIGAAIFISVVGVGSFVSTGIIST 493

Query: 513 VQAASR---HKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKRVGGKDGE 540
           VQ  S+    +WLV+NLNR+ L Y+YW++A L+ ++LC Y+++AN ++YK++  KD +
Sbjct: 494 VQTISKSHGEEWLVNNLNRAHLDYYYWIIASLNAVSLCFYLFIANHFLYKKLQDKDDD 550
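
The percentages in an HSP summary line are the identical and positive (conservatively substituted) column counts divided by the alignment length. The sketch below re-derives them for the PTR46_ARATH hit above; the function name `parse_hsp_stats` is hypothetical, and the regular expression is an assumption about the line layout shown on this page rather than a general BLAST parser.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP summary line.

    The label pattern 'Pos\\w*' tolerates spelling variants of the
    Positives label that appear in some report formats.
    """
    match = re.search(
        r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)", line
    )
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, len_a, pos, len_b = map(int, match.groups())
    if len_a != len_b:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identities": ident,
        "positives": pos,
        "align_len": len_a,
        "pct_identity": round(100 * ident / len_a, 2),
        "pct_positive": round(100 * pos / len_a, 2),
    }

stats = parse_hsp_stats(
    "Identity = 289/538 (53.72%), Positives = 389/538 (72.30%), Query Frame = 1"
)
print(stats["pct_identity"], stats["pct_positive"])
```

The recomputed values agree with the reported 53.72% identity and 72.30% positives over 538 aligned columns.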

BLAST of Cp4.1LG01g15040 vs. Swiss-Prot
Match: PTR9_ARATH (Protein NRT1/ PTR FAMILY 5.10 OS=Arabidopsis thaliana GN=NPF5.10 PE=2 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.9e-122
Identity = 250/576 (43.40%), Positives = 352/576 (61.11%), Query Frame = 1

Query: 4   EKSPAPLLHLPTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYF 63
           +++  PLL +     D+RN     S   +GGW++A F+I VEVAE+FA+ G+SSNLI Y 
Sbjct: 8   DEAGTPLLAVTV---DYRNKPAVKSS--SGGWRSAGFIIGVEVAERFAYYGISSNLITYL 67

Query: 64  TTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLS 123
           T    + TA AA  VN WSG +++ P+LGAFVADS LGRF+TI+ +S +Y +G+ +LTLS
Sbjct: 68  TGPLGQSTAAAAANVNAWSGTASLLPLLGAFVADSFLGRFRTILAASALYIVGLGVLTLS 127

Query: 124 ATVIG-----------APHRKPVFFF-ALYILSVGEGGHRPCVQTFAADQFDEETPEQRK 183
           A +             +P  + + FF ALY++++ +GGH+PCVQ F ADQFDE+ PE+ K
Sbjct: 128 AMIPSDCKVSNLLSSCSPRFQVITFFSALYLVALAQGGHKPCVQAFGADQFDEKEPEECK 187

Query: 184 RKSSFFNWWYVGLVVGSTLAVFLVIYVQ----------------------------VYRR 243
            KSSFFNWWY G+  G+   ++++ Y+Q                             YR 
Sbjct: 188 AKSSFFNWWYFGMCFGTLTTLWVLNYIQDNLSWALGFGIPCIAMVVALVVLLLGTCTYRF 247

Query: 244 HI--PVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRS 303
            I     SP  RI  V VAA + W V             D  A  E  G    ++     
Sbjct: 248 SIRREDQSPFVRIGNVYVAAVKNWSVSAL----------DVAAAEERLGL---VSCSSSQ 307

Query: 304 QFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHT 363
           QF  L+KA +  +              ++ E+EE K + RL P+W +CL++AVV AQ  T
Sbjct: 308 QFSFLNKALVAKNGS-----------CSIDELEEAKSVLRLAPIWLTCLVYAVVFAQSPT 367

Query: 364 FFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVL 423
           FFTKQG+TM RSI P ++I PA+LQ  + L+I++ I  YDRV +P AR FT    GIT+L
Sbjct: 368 FFTKQGATMERSITPGYKISPATLQSFISLSIVIFIPIYDRVLIPIARSFTHKPGGITML 427

Query: 424 QRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDA 483
           QRIG G+F+S L MV +ALVE KR+  AA++GL+D+P ATVPM++WWL+PQY+L G++D 
Sbjct: 428 QRIGTGIFLSFLAMVVAALVEMKRLKTAADYGLVDSPDATVPMSVWWLVPQYVLFGITDV 487

Query: 484 FTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAAS----RHKWLVDN 534
           F +VGLQE FYDQ+P  +RS+G A Y+S+ G+GNFLS+ +IS ++ A+    +  W  +N
Sbjct: 488 FAMVGLQEFFYDQVPNELRSVGLALYLSIFGIGNFLSSFMISIIEKATSQSGQASWFANN 547

BLAST of Cp4.1LG01g15040 vs. Swiss-Prot
Match: PTR22_ARATH (Protein NRT1/ PTR FAMILY 5.14 OS=Arabidopsis thaliana GN=NPF5.14 PE=2 SV=2)

HSP 1 Score: 430.6 bits (1106), Expect = 2.5e-119
Identity = 243/564 (43.09%), Positives = 343/564 (60.82%), Query Frame = 1

Query: 15  TKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATA 74
           T   DHR  + R S    G W+AA+F+I VEVAE+FA+ G+ SNLI Y T    E TA A
Sbjct: 15  TDAVDHRGLAARRSN--TGRWRAALFIIGVEVAERFAYYGIGSNLISYLTGPLGESTAVA 74

Query: 75  AKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG------ 134
           A  VN WSG++ + P+LGAFVAD+ LGR++TII SSLIY LG+  LTLSA +I       
Sbjct: 75  AANVNAWSGIATLLPVLGAFVADAFLGRYRTIIISSLIYVLGLAFLTLSAFLIPNTTEVT 134

Query: 135 ---APHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLV 194
              +     +FFF+LY++++G+ GH+PCVQ F ADQFDE+  +++  +SSFFNWWY+ L 
Sbjct: 135 SSTSSFLNVLFFFSLYLVAIGQSGHKPCVQAFGADQFDEKDSQEKSDRSSFFNWWYLSLS 194

Query: 195 VGSTLAVFLVIYVQ----------------------------VY----RRHIPVGSPLTR 254
            G   A+ +V+Y+Q                            +Y    RRH    +P TR
Sbjct: 195 AGICFAILVVVYIQEEFSWAFGFGIPCVFMVISLVLFVSGRRIYRYSKRRHEEEINPFTR 254

Query: 255 IAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLID 314
           I +V   A +  R+ ++              K E E    P   +++S F   +KA L+ 
Sbjct: 255 IGRVFFVALKNQRLSSS-----------DLCKVELEANTSP---EKQSFF---NKALLVP 314

Query: 315 DEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRS 374
           ++    +       S  ++VE+   L RLIPVWF+ L +A+  AQ  TFFTKQG TM R+
Sbjct: 315 NDSSQGE-----NASKSSDVEDATALIRLIPVWFTTLAYAIPYAQYMTFFTKQGVTMDRT 374

Query: 375 IGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISIL 434
           I P  +IPPASLQ  +G++I+L +  YDRVFVP AR  T    GIT L+RIG G+ +S +
Sbjct: 375 ILPGVKIPPASLQVFIGISIVLFVPIYDRVFVPIARLITKEPCGITTLKRIGTGIVLSTI 434

Query: 435 NMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYD 494
            MV +ALVE KR+  A EHGLID P+AT+PM+IWWLIPQY+L G++D +T+VG+QE FY 
Sbjct: 435 TMVIAALVEFKRLETAKEHGLIDQPEATLPMSIWWLIPQYLLLGLADVYTLVGMQEFFYS 494

Query: 495 QMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAA----SRHKWLVDNLNRSKLQYFYWV 534
           Q+P  +RSIG A Y+S +G+G+ LS+ +IS +  A    + + W   NLNR+ L YFYW+
Sbjct: 495 QVPTELRSIGLALYLSALGVGSLLSSLLISLIDLATGGDAGNSWFNSNLNRAHLDYFYWL 554

BLAST of Cp4.1LG01g15040 vs. Swiss-Prot
Match: PTR23_ARATH (Protein NRT1/ PTR FAMILY 5.13 OS=Arabidopsis thaliana GN=NPF5.13 PE=2 SV=2)

HSP 1 Score: 413.7 bits (1062), Expect = 3.2e-114
Identity = 238/551 (43.19%), Positives = 333/551 (60.44%), Query Frame = 1

Query: 19  DHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMV 78
           DHR  S R S    G W+AA F+I VEVAE+FA  G+ SNLI Y T    + TA AA  V
Sbjct: 19  DHRGFSARRSI--TGRWRAAWFIIGVEVAERFANYGIGSNLISYLTGPLGQSTAVAAANV 78

Query: 79  NNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG---------A 138
           N WSG+S + P+LGAFVAD+ LGR+ TII +S IY LG+  LTLSA +I          +
Sbjct: 79  NAWSGISTILPLLGAFVADAFLGRYITIIIASFIYVLGLAFLTLSAFLIPNNTEVTSSPS 138

Query: 139 PHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGST 198
                +FFF+LY++++G+ GH+PCVQ F ADQFDE+ P++   +SSFFNWWY+ +  G  
Sbjct: 139 SFLNALFFFSLYLVAIGQSGHKPCVQAFGADQFDEKNPQENSDRSSFFNWWYLSMCAGIG 198

Query: 199 LAVFLVIYVQV---YRRHIPVGSPLTRIAQVVVAAARK-WRVDNTRQE---------WRV 258
           LA+ +V+Y+Q    +     +      I+ V+    RK +R   TRQE          RV
Sbjct: 199 LAILVVVYIQENVSWALGFGIPCVFMVISLVLFVLGRKSYRFSKTRQEEETNPFTRIGRV 258

Query: 259 CYEEDSHAKNEDEGQHKPMTLD-RRSQ-----FGILDKATLI----DDEDKARKKRDPWR 318
            +    + +       K   ++  RSQ        L+KA L+    D+ + A K RD   
Sbjct: 259 FFVAFKNQRLNSSDLCKVELIEANRSQESPEELSFLNKALLVPNDSDEGEVACKSRD--- 318

Query: 319 LSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQ 378
                 VE+   L RLIPVW + L +A+  AQ  TFFTKQG TM R+I P  +IPPASLQ
Sbjct: 319 ------VEDATALVRLIPVWLTTLAYAIPFAQYMTFFTKQGVTMERTIFPGVEIPPASLQ 378

Query: 379 GVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRV 438
            ++ ++I+L +  YDRV VP  R  T    GIT L+RIG G+ ++ L MV +ALVE+KR+
Sbjct: 379 VLISISIVLFVPIYDRVLVPIGRSITKDPCGITTLKRIGTGMVLATLTMVVAALVESKRL 438

Query: 439 AVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAA 498
             A E+GLID PK T+PM+IWWL PQYML G++D  T+VG+QE FY Q+P  +RS+G A 
Sbjct: 439 ETAKEYGLIDQPKTTLPMSIWWLFPQYMLLGLADVHTLVGMQEFFYSQVPTELRSLGLAI 498

Query: 499 YISVIGMGNFLSTAVISAVQAA----SRHKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 534
           Y+S +G+G+ LS+ +I  +  A    + + W   NLNR+ L YFYW+LA +S +    ++
Sbjct: 499 YLSAMGVGSLLSSLLIYLIDLATGGDAGNSWFNSNLNRAHLDYFYWLLAVVSAVGFFTFL 558

BLAST of Cp4.1LG01g15040 vs. Swiss-Prot
Match: PTR45_ARATH (Protein NRT1/ PTR FAMILY 5.7 OS=Arabidopsis thaliana GN=NPF5.7 PE=2 SV=2)

HSP 1 Score: 400.6 bits (1028), Expect = 2.8e-110
Identity = 219/557 (39.32%), Positives = 333/557 (59.78%), Query Frame = 1

Query: 27  PSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSA 86
           P +   G W+AA+F+I +E +E+ ++ G+S+NL++Y TT+ H+    A K  N WSGV+ 
Sbjct: 33  PLRAQTGAWRAALFIIGIEFSERLSYFGISTNLVVYLTTILHQDLKMAVKNTNYWSGVTT 92

Query: 87  VFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG--APHR----------KP 146
           + P+LG FVAD+ LGR+ T++ ++ IY +G++LLTLS  + G  A H           + 
Sbjct: 93  LMPLLGGFVADAYLGRYGTVLLATTIYLMGLILLTLSWFIPGLKACHEDMCVEPRKAHEI 152

Query: 147 VFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFL 206
            FF A+Y++S+G GGH+P +++F ADQF++  PE+RK K S+FNWW  GL  G   AV +
Sbjct: 153 AFFIAIYLISIGTGGHKPSLESFGADQFEDGHPEERKMKMSYFNWWNAGLCAGILTAVTV 212

Query: 207 VIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKWR 266
           ++Y++                             YR   P GSPLT + QV VAA  K  
Sbjct: 213 IVYIEDRIGWGVASIILTIVMATSFFIFRIGKPFYRYRAPSGSPLTPMLQVFVAAIAKRN 272

Query: 267 VDNTRQEWRVCYEEDS---HAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARK--K 326
           +         C  + S      NE+  + + ++  +  +F  LDKA +I+D ++  K  K
Sbjct: 273 LP--------CPSDSSLLHELTNEEYTKGRLLSSSKNLKF--LDKAAVIEDRNENTKAEK 332

Query: 327 RDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSI-GPHFQI 386
           + PWRL+TV +VEEVK+L  +IP+WF  L F V   Q  T F KQ   M R I G  F +
Sbjct: 333 QSPWRLATVTKVEEVKLLINMIPIWFFTLAFGVCATQSSTLFIKQAIIMDRHITGTSFIV 392

Query: 387 PPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASAL 446
           PPASL  ++ L+I++T+  Y+++ VP  R+ TG+  GI++LQRIG+G+  S+  M+ +AL
Sbjct: 393 PPASLFSLIALSIIITVTIYEKLLVPLLRRATGNERGISILQRIGVGMVFSLFAMIIAAL 452

Query: 447 VEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMR 506
           +E KR+  A EH +      T+ ++  WL PQ+++ GV+DAFT+VGLQE FYDQ+P+SMR
Sbjct: 453 IEKKRLDYAKEHHM----NKTMTLSAIWLAPQFLVLGVADAFTLVGLQEYFYDQVPDSMR 512

Query: 507 SIGSAAYISVIGMGNFLSTAVISA----VQAASRHKWLVDNLNRSKLQYFYWVLAGLSGL 534
           S+G A Y+SV+G  +F++  +I+      +  S   W   +LN S+L  FYW+LA L+  
Sbjct: 513 SLGIAFYLSVLGAASFVNNLLITVSDHLAEEISGKGWFGKDLNSSRLDRFYWMLAALTAA 572

BLAST of Cp4.1LG01g15040 vs. TrEMBL
Match: A0A0B0N5Q1_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_13158 PE=3 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.5e-187
Identity = 342/571 (59.89%), Positives = 413/571 (72.33%), Query Frame = 1

Query: 14  PTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTAT 73
           PTKL   +       +P  GGWKAA FVI VE+AE+FAF GL+ NLI Y T    EP  T
Sbjct: 18  PTKLSSQK-------KPSKGGWKAAFFVISVEMAERFAFYGLAGNLITYLTNNLGEPVIT 77

Query: 74  AAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRK 133
           AAK VN W GVSA+FP+LGAF+ADS LGRFKTI+ SS+IY LGMVLL+LS +VI    RK
Sbjct: 78  AAKNVNTWVGVSAIFPLLGAFIADSYLGRFKTILASSVIYFLGMVLLSLSVSVIPMHSRK 137

Query: 134 PVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVF 193
            VFF ALY+L++GEGGH+PCVQTFAADQFDE  PE++  KSSFFNWWY+G+V G+++A+ 
Sbjct: 138 AVFFTALYVLAIGEGGHKPCVQTFAADQFDENNPEEKAAKSSFFNWWYLGIVTGASVAIV 197

Query: 194 LVIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKW 253
           +VIY+Q                             YR+  P GSP T +AQV+VAAA+KW
Sbjct: 198 VVIYLQDNVSWAAGFGVLSGSLAVALAVFLIGIRKYRKQRPTGSPFTSVAQVLVAAAKKW 257

Query: 254 RVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPW 313
           RV  T     +CYE+D    +  +GQ     L R  QF  LDKA +IDD+D   K RDPW
Sbjct: 258 RVSETHGGRGICYEDDRRGGSHVKGQTMSRNLVRTKQFRFLDKAMIIDDKDTLSKTRDPW 317

Query: 314 RLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASL 373
           RL ++ +VEEVK++ RLIP+W  CLMF+ V  Q+HTFFTKQGSTMLRSIGP+FQ+PPA+L
Sbjct: 318 RLCSLNQVEEVKLVLRLIPIWLGCLMFSAVITQLHTFFTKQGSTMLRSIGPNFQVPPAAL 377

Query: 374 QGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKR 433
           Q +VGLTIL+ +  YDRVFVP ARK TGH SGIT+LQRIG GLFISILNMV + LVE  R
Sbjct: 378 QSLVGLTILIAVPIYDRVFVPIARKITGHPSGITMLQRIGTGLFISILNMVVAGLVETAR 437

Query: 434 VAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSA 493
           V  AA+HGLID PKA VPM++WWL+PQY+L G+ D FT+VGLQELFYDQMPE MRSIG+A
Sbjct: 438 VNTAAKHGLIDAPKAIVPMSVWWLLPQYVLTGLGDVFTIVGLQELFYDQMPEEMRSIGAA 497

Query: 494 AYISVIGMGNFLSTAVISAVQAAS-RH--KWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 553
           AYISV+G+G+F++TAVIS VQ  S RH  +WL DNLNR+KL YFYWVLAGLS  NLC YI
Sbjct: 498 AYISVVGVGSFINTAVISVVQVISLRHGNEWLGDNLNRAKLNYFYWVLAGLSAFNLCAYI 557

BLAST of Cp4.1LG01g15040 vs. TrEMBL
Match: A0A0D2VQ09_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G215200 PE=3 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 8.3e-186
Identity = 339/571 (59.37%), Positives = 411/571 (71.98%), Query Frame = 1

Query: 14  PTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTAT 73
           PTKL   +       +P  GGWKAA FVI VE+AE+FAF GL+ NLI Y T    EP  T
Sbjct: 18  PTKLSSQK-------KPSKGGWKAAFFVISVEMAERFAFYGLAGNLITYLTNNLGEPVVT 77

Query: 74  AAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRK 133
           AAK VN W GVSA+FP+LGAF+ADS LGRFKTI+ SS+IY LGMVLL+LS +VI    RK
Sbjct: 78  AAKNVNTWVGVSAIFPLLGAFIADSYLGRFKTILASSVIYFLGMVLLSLSVSVIPMHSRK 137

Query: 134 PVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVF 193
            VFF ALY+L++GEGGH+PCVQTFAADQFDE  PE++  KSSFFNWWY+G+V G+++A+ 
Sbjct: 138 AVFFTALYVLAIGEGGHKPCVQTFAADQFDENNPEEKAAKSSFFNWWYLGIVTGASVAIV 197

Query: 194 LVIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKW 253
           +VIY+Q                             YR+  P GSP T +AQV VAAA+KW
Sbjct: 198 VVIYLQDNVSWAAGFGVLSGSLAVALVVFLIGIRKYRKQRPTGSPFTSVAQVFVAAAKKW 257

Query: 254 RVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPW 313
           RV  T     +CYE+D    +  +GQ     L R  QF  LDKA +IDD+D   K RDPW
Sbjct: 258 RVSETHGGRGICYEDDRRGGSHVKGQTMGRDLVRTRQFRFLDKAMIIDDKDTLGKTRDPW 317

Query: 314 RLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASL 373
           RL ++ +VEEVK++ RLIP+W  CLMF+ V  Q+HTFFTKQGSTMLRSIGP+FQ+PPA+L
Sbjct: 318 RLCSLNQVEEVKLVLRLIPIWLGCLMFSAVITQLHTFFTKQGSTMLRSIGPNFQVPPAAL 377

Query: 374 QGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKR 433
           Q +VGLTIL+ +  YDRVFVP ARK TGH SGIT+LQRIG GLFISILNMV + LVE  R
Sbjct: 378 QSLVGLTILIAVPIYDRVFVPIARKITGHPSGITMLQRIGTGLFISILNMVVAGLVETAR 437

Query: 434 VAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSA 493
           V  A +HGL+D PKA VPM++WWL+PQY+L G+ D FT+VGLQELFYDQMPE MRSIG+A
Sbjct: 438 VNTATKHGLMDAPKAVVPMSVWWLLPQYVLTGLGDVFTIVGLQELFYDQMPEEMRSIGAA 497

Query: 494 AYISVIGMGNFLSTAVISAVQA-ASRH--KWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 553
           AYISV+G+G+F++TAVIS VQ  +SRH  +WL DNLNR+KL YFYWVLAGLS  NLC YI
Sbjct: 498 AYISVVGVGSFINTAVISVVQVISSRHGKEWLGDNLNRAKLNYFYWVLAGLSAFNLCAYI 557

BLAST of Cp4.1LG01g15040 vs. TrEMBL
Match: B9RGA9_RICCO (Peptide transporter, putative OS=Ricinus communis GN=RCOM_1452620 PE=3 SV=1)

HSP 1 Score: 657.1 bits (1694), Expect = 1.9e-185
Identity = 326/548 (59.49%), Positives = 404/548 (73.72%), Query Frame = 1

Query: 33  GGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILG 92
           GGW +AIF+I +EVAE+FAF G++ NLI Y T   H+PTA AAK +N W GVS++FPILG
Sbjct: 66  GGWNSAIFIILLEVAERFAFYGVAGNLITYLTNDLHQPTAAAAKNINTWIGVSSIFPILG 125

Query: 93  AFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGEGGHRP 152
           AF+ADS LGRFKTI+F+S+IY  GMVLLTLS +V+   HR+  FF ALYIL++GEGGH+P
Sbjct: 126 AFIADSFLGRFKTILFASIIYFTGMVLLTLSVSVVPTHHREMAFFIALYILAIGEGGHKP 185

Query: 153 CVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQV------------ 212
           CVQTFAADQFDEE PE++  KSSFFNWWY+ + VG+T+AVF+VIY+Q             
Sbjct: 186 CVQTFAADQFDEEKPEEKAAKSSFFNWWYLSIAVGATVAVFVVIYIQDNVGWTEGSAMLA 245

Query: 213 ----------------YRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHA 272
                           YR   P GSP T +AQV VAAA+K RV  TR+ W +C  ++   
Sbjct: 246 GVLVVALMVFLMGIKRYRIEAPCGSPYTTVAQVFVAAAKKQRVSETREGWGLCCVDEKDG 305

Query: 273 KNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIP 332
            ++ EGQ K   L R  QF  LDKA +ID+ D + K R+PWRL T+ +VEEVK + RLIP
Sbjct: 306 ADDLEGQPKVRVLARTKQFRFLDKAMIIDNIDISNKTRNPWRLCTLNQVEEVKQVLRLIP 365

Query: 333 VWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVF 392
           +W SCLMF  +  Q HTFF KQGSTM+RSIGPHF  PPASLQ + G+TIL+ +  YD+ F
Sbjct: 366 IWLSCLMFTAIIVQTHTFFIKQGSTMIRSIGPHFLFPPASLQSLTGMTILIAVPIYDKFF 425

Query: 393 VPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPM 452
           VP ARK TGH SGIT+LQRIGIGLF+SIL MV +ALVEAKRV+ A EHGLIDTPKA VPM
Sbjct: 426 VPIARKITGHPSGITMLQRIGIGLFLSILEMVVAALVEAKRVSTAFEHGLIDTPKAIVPM 485

Query: 453 TIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISA 512
            +WWL+PQYM+ G++D FTV+GLQELFYDQMP +MRS+G+AAYIS+IG+G+F+++AVIS 
Sbjct: 486 AVWWLLPQYMISGLADVFTVIGLQELFYDQMPVAMRSMGAAAYISIIGIGSFINSAVISV 545

Query: 513 VQAASR---HKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKRVGGKDGEDH 550
           VQA +    H WL DN+NRS L YFYWVLA LS LNLC Y+W+A+G+VYK+V G   E  
Sbjct: 546 VQAVTSRRGHVWLGDNVNRSHLDYFYWVLAVLSALNLCVYVWIASGFVYKKVEGDHEEKE 605

BLAST of Cp4.1LG01g15040 vs. TrEMBL
Match: A0A067KZV4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24527 PE=3 SV=1)

HSP 1 Score: 657.1 bits (1694), Expect = 1.9e-185
Identity = 331/547 (60.51%), Positives = 412/547 (75.32%), Query Frame = 1

Query: 28  SQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAV 87
           ++P  GGWKAA+F+IFVE+AE+FAF GL+ NLI Y T   H+PTATA K VN W GVSA+
Sbjct: 36  TKPSNGGWKAALFIIFVEMAERFAFYGLAGNLITYLTNELHQPTATAVKNVNTWIGVSAI 95

Query: 88  FPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGE 147
           FPI GA +ADS LGRF TI+ +S+IY +GMVLLT S ++I   +R+ VFF ALYIL++GE
Sbjct: 96  FPIFGAVLADSFLGRFTTILLASIIYFIGMVLLTFSVSIIPMHYREAVFFLALYILAIGE 155

Query: 148 GGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQV------- 207
           GGH+PCVQTFAADQF+EE PE++  KSSFFNWWY+G+V+G+T+AVFLVIYVQ        
Sbjct: 156 GGHKPCVQTFAADQFNEEKPEEKAAKSSFFNWWYLGIVIGATVAVFLVIYVQDNVGWTEG 215

Query: 208 ---------------------YRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYE 267
                                YR+  PVGSP T +AQV+VAA RK RV  T+Q W  C E
Sbjct: 216 LAILAGTLVVALIVFLVGMKRYRKEAPVGSPYTAVAQVLVAAVRKRRVSETQQGWGFCCE 275

Query: 268 EDSHAKNED-EGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKM 327
           +D      D EGQ K   L R   F +LDKA +ID+ D + K R+PWRL T+ +VEEVK+
Sbjct: 276 DDDKRVGADLEGQPKGKILCRTKHFRLLDKAMVIDNMDASSKTRNPWRLCTLNQVEEVKL 335

Query: 328 LGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTIL 387
           + RLIP+W SCL+F  +  Q HTFF KQGSTM+RSIGP+F +PPASLQ ++GLTIL+T+ 
Sbjct: 336 VLRLIPIWLSCLIFTSIIVQNHTFFVKQGSTMIRSIGPNFVLPPASLQCLIGLTILVTVP 395

Query: 388 FYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTP 447
            YD++FVP ARK TGH SGIT+LQRIGIGLF+SIL M  +ALVEAKR+++A EHGL+DTP
Sbjct: 396 VYDKLFVPLARKITGHPSGITMLQRIGIGLFLSILEMAVAALVEAKRISIAKEHGLLDTP 455

Query: 448 KATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLS 507
           KA VPM++WWL+PQYM+ G SDAF VVGLQELFYDQMPE+MRS+G+AAYIS+IG+G+F++
Sbjct: 456 KAIVPMSVWWLVPQYMISGFSDAFAVVGLQELFYDQMPEAMRSMGAAAYISIIGIGSFVN 515

Query: 508 TAVISAVQA-ASRH-----KWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKR 540
           TAVIS VQA  SRH      WL DNLN + L YFYWVLA LSGLNLC Y+W+A+G+ YK+
Sbjct: 516 TAVISVVQAVTSRHGGGGGVWLGDNLNLAHLDYFYWVLAVLSGLNLCLYVWIASGFEYKK 575

BLAST of Cp4.1LG01g15040 vs. TrEMBL
Match: W9RB31_9ROSA (Putative peptide/nitrate transporter OS=Morus notabilis GN=L484_023963 PE=3 SV=1)

HSP 1 Score: 649.4 bits (1674), Expect = 3.9e-183
Identity = 340/592 (57.43%), Positives = 427/592 (72.13%), Query Frame = 1

Query: 3   EEKSPAPLLHLPTKLPDHRNH------SDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLS 62
           +  S AP+      + +H N       + R  +   GGW AAIFVI VEVA++FAF GLS
Sbjct: 2   DSSSVAPMSIDNGHIEEHTNKELSISSTTRQIKASKGGWNAAIFVILVEVADRFAFYGLS 61

Query: 63  SNLIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLG 122
            NLIMY T V H+PT TAAK VN W GVS++FP+LGA VADS LGRFKTI+ SS++Y LG
Sbjct: 62  GNLIMYLTNVLHQPTVTAAKNVNMWVGVSSLFPLLGALVADSFLGRFKTILLSSIVYLLG 121

Query: 123 MVLLTLSATVIGAPHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSF 182
           MVLL  S + I A +RK VFF +LY+LSVGEGGH+PCVQTFAADQFDE+TPE+++ KSSF
Sbjct: 122 MVLLCFSVSAIPAGYRKAVFFVSLYVLSVGEGGHKPCVQTFAADQFDEDTPEEKEAKSSF 181

Query: 183 FNWWYVGLVVGSTLAVFLVIYVQ----------------------------VYRRHIPVG 242
           FNWWY+G+V G+  A+ +VIY+Q                             YRR +PVG
Sbjct: 182 FNWWYLGIVAGAAAAILVVIYIQDNVSWTVGFGMLAGVIGAALVLFLLGTRRYRRQVPVG 241

Query: 243 SPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNED--EGQHKPMTLDRRSQFGIL 302
           SP T +AQV+VAAARKWRV ++     V Y ++    +    E Q     L   ++F +L
Sbjct: 242 SPFTAVAQVLVAAARKWRVKDSGNNLGVFYGDERVLLHGHLVEAQPGARHLTLANKFRVL 301

Query: 303 DKATLIDDED-KARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTK 362
           DKATLID+ D  +   R+PWRL +V +VEEVK++ RLIP+W SCLMF VVQAQ+HTFFTK
Sbjct: 302 DKATLIDNHDGSSSHTRNPWRLCSVHQVEEVKLVIRLIPIWLSCLMFNVVQAQLHTFFTK 361

Query: 363 QGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIG 422
           QGSTM+RSIGPHF++PPAS+QG+VG+TIL+T++ YDRVFV  ARK TGH SGIT LQRIG
Sbjct: 362 QGSTMIRSIGPHFKLPPASVQGLVGVTILVTVMIYDRVFVAVARKVTGHPSGITTLQRIG 421

Query: 423 IGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVV 482
            GLF+SILNMV SA+VEAKRV VAA+HGL+D+PKA +PM IWWL PQYM+CG+SD F VV
Sbjct: 422 FGLFLSILNMVVSAIVEAKRVDVAAKHGLLDSPKALLPMKIWWLFPQYMICGLSDTFAVV 481

Query: 483 GLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAAS----RHKWLVDNLNRS 542
           GLQELFY +MP+SMRS+G+AA+IS++G+G+F+S+ +IS VQ  S       WL DNLNRS
Sbjct: 482 GLQELFYAEMPKSMRSLGAAAHISILGVGSFMSSWIISIVQLISYRLTGEGWLRDNLNRS 541

Query: 543 KLQYFYWVLAGLSGLNLCCYIWVANGYVYK-RVGGKDGEDHGKNSNKGVYGD 553
            L YFYWVLAGLS LNLC YIWV+ G+VYK   GG++      + +K + G+
Sbjct: 542 HLDYFYWVLAGLSALNLCVYIWVSKGFVYKANYGGRETTRVNDSGSKCMNGE 593

BLAST of Cp4.1LG01g15040 vs. TAIR10
Match: AT3G54450.1 (AT3G54450.1 Major facilitator superfamily protein)

HSP 1 Score: 592.0 bits (1525), Expect = 3.7e-169
Identity = 289/538 (53.72%), Positives = 389/538 (72.30%), Query Frame = 1

Query: 33  GGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILG 92
           GGW AA+F+I VE+AE+FAF GL+SNLI + T    + TATAAK +N W GVS +FPILG
Sbjct: 14  GGWNAALFIIVVEIAERFAFYGLASNLITFLTNELGQSTATAAKNINTWIGVSCMFPILG 73

Query: 93  AFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGEGGHRP 152
           AF+ADS+LGRFKT++ +S IY LG+V+L LS TV+    R+ VFF ALY+++VGEGGH+P
Sbjct: 74  AFLADSILGRFKTVLLTSFIYLLGIVMLPLSVTVVARRMREKVFFMALYVMAVGEGGHKP 133

Query: 153 CVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQ------------- 212
           CV TFAADQF E   E++  K+SFFN+WY+ +V+ S++AV  +I++Q             
Sbjct: 134 CVMTFAADQFGEANAEEKAAKTSFFNYWYMAIVLASSIAVLALIFIQERVSWSLGFSIIA 193

Query: 213 ---------------VYRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHA 272
                           YR+ +PVGSP TR+AQV+VAA +KWR+ +TR  + +CYEE+   
Sbjct: 194 GSVVIAIVIFLIGIPKYRKQVPVGSPFTRVAQVMVAALKKWRLSSTRHHYGLCYEEEDEH 253

Query: 273 KNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIP 332
           K E    ++   L R +QF  LDKAT+ID+ D   K R+PWRL TV +VEEVK++ RLIP
Sbjct: 254 KLESTNSNQVYLLARTNQFRFLDKATIIDEIDH-NKNRNPWRLCTVNQVEEVKLILRLIP 313

Query: 333 VWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVF 392
           +W S +MF     Q++TFF KQGS M R+IG HF IPPA+ Q +VG+TIL+ I  YDRVF
Sbjct: 314 IWISLIMFCATLTQLNTFFLKQGSMMDRTIGNHFTIPPAAFQSIVGVTILILIPLYDRVF 373

Query: 393 VPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPM 452
           VP  RK T HHSGIT LQRIG+GLF++  NMV   LVEAKR+ VA +HGLID+PK  VPM
Sbjct: 374 VPMVRKITNHHSGITSLQRIGVGLFVATFNMVICGLVEAKRLKVARDHGLIDSPKEVVPM 433

Query: 453 TIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISA 512
           +  WL+PQY+L G+ D FT+VG+QELFYDQMPE+MRSIG+A +ISV+G+G+F+ST +IS 
Sbjct: 434 SSLWLLPQYILVGIGDVFTIVGMQELFYDQMPETMRSIGAAIFISVVGVGSFVSTGIIST 493

Query: 513 VQAASR---HKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKRVGGKDGE 540
           VQ  S+    +WLV+NLNR+ L Y+YW++A L+ ++LC Y+++AN ++YK++  KD +
Sbjct: 494 VQTISKSHGEEWLVNNLNRAHLDYYYWIIASLNAVSLCFYLFIANHFLYKKLQDKDDD 550
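
The percentages in each HSP summary line are the identical and positive-scoring column counts divided by the alignment length (for example 289/538 = 53.72%). The following is a minimal Python sketch, not part of the original report, that parses such a line and recomputes both values. The function name `parse_hsp_summary` and the regular expression are illustrative assumptions based only on the line layout shown on this page.

```python
import re

# Pattern for the HSP summary line as printed on this page (an assumption
# about the layout, not an official BLAST format definition). "Posi?tives"
# also accepts the misspelling "Postives" seen in some report dumps.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + repr(line))
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 289/538 (53.72%), Positives = 389/538 (72.30%), Query Frame = 1"
    print(parse_hsp_summary(line))
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the report and in the summary tables further down the page.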

BLAST of Cp4.1LG01g15040 vs. TAIR10
Match: AT1G22540.1 (AT1G22540.1 Major facilitator superfamily protein)

HSP 1 Score: 441.0 bits (1133), Expect = 1.1e-123
Identity = 250/576 (43.40%), Positives = 352/576 (61.11%), Query Frame = 1

Query: 4   EKSPAPLLHLPTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYF 63
           +++  PLL +     D+RN     S   +GGW++A F+I VEVAE+FA+ G+SSNLI Y 
Sbjct: 8   DEAGTPLLAVTV---DYRNKPAVKSS--SGGWRSAGFIIGVEVAERFAYYGISSNLITYL 67

Query: 64  TTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLS 123
           T    + TA AA  VN WSG +++ P+LGAFVADS LGRF+TI+ +S +Y +G+ +LTLS
Sbjct: 68  TGPLGQSTAAAAANVNAWSGTASLLPLLGAFVADSFLGRFRTILAASALYIVGLGVLTLS 127

Query: 124 ATVIG-----------APHRKPVFFF-ALYILSVGEGGHRPCVQTFAADQFDEETPEQRK 183
           A +             +P  + + FF ALY++++ +GGH+PCVQ F ADQFDE+ PE+ K
Sbjct: 128 AMIPSDCKVSNLLSSCSPRFQVITFFSALYLVALAQGGHKPCVQAFGADQFDEKEPEECK 187

Query: 184 RKSSFFNWWYVGLVVGSTLAVFLVIYVQ----------------------------VYRR 243
            KSSFFNWWY G+  G+   ++++ Y+Q                             YR 
Sbjct: 188 AKSSFFNWWYFGMCFGTLTTLWVLNYIQDNLSWALGFGIPCIAMVVALVVLLLGTCTYRF 247

Query: 244 HI--PVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRS 303
            I     SP  RI  V VAA + W V             D  A  E  G    ++     
Sbjct: 248 SIRREDQSPFVRIGNVYVAAVKNWSVSAL----------DVAAAEERLGL---VSCSSSQ 307

Query: 304 QFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHT 363
           QF  L+KA +  +              ++ E+EE K + RL P+W +CL++AVV AQ  T
Sbjct: 308 QFSFLNKALVAKNGS-----------CSIDELEEAKSVLRLAPIWLTCLVYAVVFAQSPT 367

Query: 364 FFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVL 423
           FFTKQG+TM RSI P ++I PA+LQ  + L+I++ I  YDRV +P AR FT    GIT+L
Sbjct: 368 FFTKQGATMERSITPGYKISPATLQSFISLSIVIFIPIYDRVLIPIARSFTHKPGGITML 427

Query: 424 QRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDA 483
           QRIG G+F+S L MV +ALVE KR+  AA++GL+D+P ATVPM++WWL+PQY+L G++D 
Sbjct: 428 QRIGTGIFLSFLAMVVAALVEMKRLKTAADYGLVDSPDATVPMSVWWLVPQYVLFGITDV 487

Query: 484 FTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAAS----RHKWLVDN 534
           F +VGLQE FYDQ+P  +RS+G A Y+S+ G+GNFLS+ +IS ++ A+    +  W  +N
Sbjct: 488 FAMVGLQEFFYDQVPNELRSVGLALYLSIFGIGNFLSSFMISIIEKATSQSGQASWFANN 547

BLAST of Cp4.1LG01g15040 vs. TAIR10
Match: AT1G72120.1 (AT1G72120.1 Major facilitator superfamily protein)

HSP 1 Score: 430.6 bits (1106), Expect = 1.4e-120
Identity = 243/564 (43.09%), Positives = 343/564 (60.82%), Query Frame = 1

Query: 15  TKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATA 74
           T   DHR  + R S    G W+AA+F+I VEVAE+FA+ G+ SNLI Y T    E TA A
Sbjct: 15  TDAVDHRGLAARRSN--TGRWRAALFIIGVEVAERFAYYGIGSNLISYLTGPLGESTAVA 74

Query: 75  AKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG------ 134
           A  VN WSG++ + P+LGAFVAD+ LGR++TII SSLIY LG+  LTLSA +I       
Sbjct: 75  AANVNAWSGIATLLPVLGAFVADAFLGRYRTIIISSLIYVLGLAFLTLSAFLIPNTTEVT 134

Query: 135 ---APHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLV 194
              +     +FFF+LY++++G+ GH+PCVQ F ADQFDE+  +++  +SSFFNWWY+ L 
Sbjct: 135 SSTSSFLNVLFFFSLYLVAIGQSGHKPCVQAFGADQFDEKDSQEKSDRSSFFNWWYLSLS 194

Query: 195 VGSTLAVFLVIYVQ----------------------------VY----RRHIPVGSPLTR 254
            G   A+ +V+Y+Q                            +Y    RRH    +P TR
Sbjct: 195 AGICFAILVVVYIQEEFSWAFGFGIPCVFMVISLVLFVSGRRIYRYSKRRHEEEINPFTR 254

Query: 255 IAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLID 314
           I +V   A +  R+ ++              K E E    P   +++S F   +KA L+ 
Sbjct: 255 IGRVFFVALKNQRLSSS-----------DLCKVELEANTSP---EKQSFF---NKALLVP 314

Query: 315 DEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRS 374
           ++    +       S  ++VE+   L RLIPVWF+ L +A+  AQ  TFFTKQG TM R+
Sbjct: 315 NDSSQGE-----NASKSSDVEDATALIRLIPVWFTTLAYAIPYAQYMTFFTKQGVTMDRT 374

Query: 375 IGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISIL 434
           I P  +IPPASLQ  +G++I+L +  YDRVFVP AR  T    GIT L+RIG G+ +S +
Sbjct: 375 ILPGVKIPPASLQVFIGISIVLFVPIYDRVFVPIARLITKEPCGITTLKRIGTGIVLSTI 434

Query: 435 NMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYD 494
            MV +ALVE KR+  A EHGLID P+AT+PM+IWWLIPQY+L G++D +T+VG+QE FY 
Sbjct: 435 TMVIAALVEFKRLETAKEHGLIDQPEATLPMSIWWLIPQYLLLGLADVYTLVGMQEFFYS 494

Query: 495 QMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAA----SRHKWLVDNLNRSKLQYFYWV 534
           Q+P  +RSIG A Y+S +G+G+ LS+ +IS +  A    + + W   NLNR+ L YFYW+
Sbjct: 495 QVPTELRSIGLALYLSALGVGSLLSSLLISLIDLATGGDAGNSWFNSNLNRAHLDYFYWL 554

BLAST of Cp4.1LG01g15040 vs. TAIR10
Match: AT1G72125.1 (AT1G72125.1 Major facilitator superfamily protein)

HSP 1 Score: 413.7 bits (1062), Expect = 1.8e-115
Identity = 238/551 (43.19%), Positives = 333/551 (60.44%), Query Frame = 1

Query: 19  DHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMV 78
           DHR  S R S    G W+AA F+I VEVAE+FA  G+ SNLI Y T    + TA AA  V
Sbjct: 19  DHRGFSARRSI--TGRWRAAWFIIGVEVAERFANYGIGSNLISYLTGPLGQSTAVAAANV 78

Query: 79  NNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG---------A 138
           N WSG+S + P+LGAFVAD+ LGR+ TII +S IY LG+  LTLSA +I          +
Sbjct: 79  NAWSGISTILPLLGAFVADAFLGRYITIIIASFIYVLGLAFLTLSAFLIPNNTEVTSSPS 138

Query: 139 PHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGST 198
                +FFF+LY++++G+ GH+PCVQ F ADQFDE+ P++   +SSFFNWWY+ +  G  
Sbjct: 139 SFLNALFFFSLYLVAIGQSGHKPCVQAFGADQFDEKNPQENSDRSSFFNWWYLSMCAGIG 198

Query: 199 LAVFLVIYVQV---YRRHIPVGSPLTRIAQVVVAAARK-WRVDNTRQE---------WRV 258
           LA+ +V+Y+Q    +     +      I+ V+    RK +R   TRQE          RV
Sbjct: 199 LAILVVVYIQENVSWALGFGIPCVFMVISLVLFVLGRKSYRFSKTRQEEETNPFTRIGRV 258

Query: 259 CYEEDSHAKNEDEGQHKPMTLD-RRSQ-----FGILDKATLI----DDEDKARKKRDPWR 318
            +    + +       K   ++  RSQ        L+KA L+    D+ + A K RD   
Sbjct: 259 FFVAFKNQRLNSSDLCKVELIEANRSQESPEELSFLNKALLVPNDSDEGEVACKSRD--- 318

Query: 319 LSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQ 378
                 VE+   L RLIPVW + L +A+  AQ  TFFTKQG TM R+I P  +IPPASLQ
Sbjct: 319 ------VEDATALVRLIPVWLTTLAYAIPFAQYMTFFTKQGVTMERTIFPGVEIPPASLQ 378

Query: 379 GVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRV 438
            ++ ++I+L +  YDRV VP  R  T    GIT L+RIG G+ ++ L MV +ALVE+KR+
Sbjct: 379 VLISISIVLFVPIYDRVLVPIGRSITKDPCGITTLKRIGTGMVLATLTMVVAALVESKRL 438

Query: 439 AVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAA 498
             A E+GLID PK T+PM+IWWL PQYML G++D  T+VG+QE FY Q+P  +RS+G A 
Sbjct: 439 ETAKEYGLIDQPKTTLPMSIWWLFPQYMLLGLADVHTLVGMQEFFYSQVPTELRSLGLAI 498

Query: 499 YISVIGMGNFLSTAVISAVQAA----SRHKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 534
           Y+S +G+G+ LS+ +I  +  A    + + W   NLNR+ L YFYW+LA +S +    ++
Sbjct: 499 YLSAMGVGSLLSSLLIYLIDLATGGDAGNSWFNSNLNRAHLDYFYWLLAVVSAVGFFTFL 558

BLAST of Cp4.1LG01g15040 vs. TAIR10
Match: AT3G53960.1 (AT3G53960.1 Major facilitator superfamily protein)

HSP 1 Score: 400.6 bits (1028), Expect = 1.6e-111
Identity = 219/557 (39.32%), Positives = 333/557 (59.78%), Query Frame = 1

Query: 27  PSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSA 86
           P +   G W+AA+F+I +E +E+ ++ G+S+NL++Y TT+ H+    A K  N WSGV+ 
Sbjct: 33  PLRAQTGAWRAALFIIGIEFSERLSYFGISTNLVVYLTTILHQDLKMAVKNTNYWSGVTT 92

Query: 87  VFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIG--APHR----------KP 146
           + P+LG FVAD+ LGR+ T++ ++ IY +G++LLTLS  + G  A H           + 
Sbjct: 93  LMPLLGGFVADAYLGRYGTVLLATTIYLMGLILLTLSWFIPGLKACHEDMCVEPRKAHEI 152

Query: 147 VFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFL 206
            FF A+Y++S+G GGH+P +++F ADQF++  PE+RK K S+FNWW  GL  G   AV +
Sbjct: 153 AFFIAIYLISIGTGGHKPSLESFGADQFEDGHPEERKMKMSYFNWWNAGLCAGILTAVTV 212

Query: 207 VIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKWR 266
           ++Y++                             YR   P GSPLT + QV VAA  K  
Sbjct: 213 IVYIEDRIGWGVASIILTIVMATSFFIFRIGKPFYRYRAPSGSPLTPMLQVFVAAIAKRN 272

Query: 267 VDNTRQEWRVCYEEDS---HAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARK--K 326
           +         C  + S      NE+  + + ++  +  +F  LDKA +I+D ++  K  K
Sbjct: 273 LP--------CPSDSSLLHELTNEEYTKGRLLSSSKNLKF--LDKAAVIEDRNENTKAEK 332

Query: 327 RDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSI-GPHFQI 386
           + PWRL+TV +VEEVK+L  +IP+WF  L F V   Q  T F KQ   M R I G  F +
Sbjct: 333 QSPWRLATVTKVEEVKLLINMIPIWFFTLAFGVCATQSSTLFIKQAIIMDRHITGTSFIV 392

Query: 387 PPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASAL 446
           PPASL  ++ L+I++T+  Y+++ VP  R+ TG+  GI++LQRIG+G+  S+  M+ +AL
Sbjct: 393 PPASLFSLIALSIIITVTIYEKLLVPLLRRATGNERGISILQRIGVGMVFSLFAMIIAAL 452

Query: 447 VEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMR 506
           +E KR+  A EH +      T+ ++  WL PQ+++ GV+DAFT+VGLQE FYDQ+P+SMR
Sbjct: 453 IEKKRLDYAKEHHM----NKTMTLSAIWLAPQFLVLGVADAFTLVGLQEYFYDQVPDSMR 512

Query: 507 SIGSAAYISVIGMGNFLSTAVISA----VQAASRHKWLVDNLNRSKLQYFYWVLAGLSGL 534
           S+G A Y+SV+G  +F++  +I+      +  S   W   +LN S+L  FYW+LA L+  
Sbjct: 513 SLGIAFYLSVLGAASFVNNLLITVSDHLAEEISGKGWFGKDLNSSRLDRFYWMLAALTAA 572

BLAST of Cp4.1LG01g15040 vs. NCBI nr
Match: gi|659092217|ref|XP_008446958.1| (PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Cucumis melo])

HSP 1 Score: 872.5 bits (2253), Expect = 4.0e-250
Identity = 441/591 (74.62%), Positives = 487/591 (82.40%), Query Frame = 1

Query: 1   MEEEKSP--APLLHLPTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSN 60
           ME++KSP   P+++LP K PDH   S+ P+  P GGWKAAIF+IFVEVAEQFA IGLSSN
Sbjct: 1   MEKQKSPNNVPIINLPNKFPDHPTASNTPTVRPGGGWKAAIFIIFVEVAEQFASIGLSSN 60

Query: 61  LIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMV 120
           LIMYFTTVFHEP   AAK VNNW GVSAVFP+LGAFVADSLLGRFKTII +SL++  GMV
Sbjct: 61  LIMYFTTVFHEPLGVAAKQVNNWVGVSAVFPLLGAFVADSLLGRFKTIIIASLVFFFGMV 120

Query: 121 LLTLSATVIGAPHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFN 180
           +LT+SATV+G  HRK VFF  LYILSVG+GGHRPCVQTFAADQF+E TPE+RK+KSSFFN
Sbjct: 121 VLTVSATVVGDDHRKAVFFLGLYILSVGQGGHRPCVQTFAADQFEERTPEERKKKSSFFN 180

Query: 181 WWYVGLVVGSTLAVFLVIYVQ----------------------------VYRRHIPVGSP 240
           WWYVGLV GST AVF+VIYVQ                             YRR +PVGSP
Sbjct: 181 WWYVGLVGGSTFAVFVVIYVQDNIGWGLSFGILAGVLAAAIILFLAGVKKYRRQVPVGSP 240

Query: 241 LTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKAT 300
           LTRIAQVVVAAARKW VD TR EWRVCYEED+HAKNE EGQH  MTL R +QF ILDKAT
Sbjct: 241 LTRIAQVVVAAARKWGVDETRHEWRVCYEEDNHAKNEGEGQHNLMTLARTNQFRILDKAT 300

Query: 301 LIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTM 360
           LID ED+ARKKRDPWRLSTV EVEEVK++ RLIPVW SCLMFAVVQAQIHTFFTKQGSTM
Sbjct: 301 LIDKEDEARKKRDPWRLSTVGEVEEVKLVVRLIPVWVSCLMFAVVQAQIHTFFTKQGSTM 360

Query: 361 LRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFI 420
           LRS+GPHFQ+PPASLQGVVGLTILLT+LFYDRVFVP+AR FTGHHSGITVLQRIG+GLFI
Sbjct: 361 LRSVGPHFQLPPASLQGVVGLTILLTVLFYDRVFVPAARNFTGHHSGITVLQRIGMGLFI 420

Query: 421 SILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQEL 480
           SIL M  SALVEAKRV +AAEHGL DTPKATVPMTIWWLIPQYMLCGVSDAF +VGLQEL
Sbjct: 421 SILTMGVSALVEAKRVTIAAEHGLSDTPKATVPMTIWWLIPQYMLCGVSDAFAIVGLQEL 480

Query: 481 FYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAASRHKWLVDNLNRSKLQYFYWVL 540
           FYDQMP+ MRS+G+AAYIS+IG+GNFLS+A+IS VQA S  +WL DNLNRS L YFYWVL
Sbjct: 481 FYDQMPQFMRSLGAAAYISIIGVGNFLSSAIISVVQAGSGGRWLDDNLNRSNLHYFYWVL 540

Query: 541 AGLSGLNLCCYIWVANGYVYKRVGGK------DGEDHGKNSNKGVYGDDMI 556
           A LS LNLC Y+W+ANG+VYKRVGG       DG+    N+  G YGDDMI
Sbjct: 541 AALSALNLCGYVWIANGFVYKRVGGNRSINGDDGDVKNSNNINGCYGDDMI 591

BLAST of Cp4.1LG01g15040 vs. NCBI nr
Match: gi|778706817|ref|XP_011655920.1| (PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Cucumis sativus])

HSP 1 Score: 861.3 bits (2224), Expect = 9.3e-247
Identity = 437/592 (73.82%), Positives = 486/592 (82.09%), Query Frame = 1

Query: 1   MEEEKSP--APLLHLPTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSN 60
           ME++KSP   P+++LP K PDH   S+ P+  P GGWKAAIF+IFVEVAEQFA IGLSSN
Sbjct: 1   MEKQKSPNNVPIINLPNKFPDHPTASNSPTVRPGGGWKAAIFIIFVEVAEQFASIGLSSN 60

Query: 61  LIMYFTTVFHEPTATAAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMV 120
           LIMYFTTVFHEP   AAK VNNW GVSAVFP+LGAFVADSLLGRFKTII +SLI+ +GM+
Sbjct: 61  LIMYFTTVFHEPLGVAAKQVNNWVGVSAVFPLLGAFVADSLLGRFKTIIIASLIFFIGMM 120

Query: 121 LLTLSATVIGAPHRKPVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFN 180
           +LT+SATV+G   RK VFF  LYILSVG+GGHRPCVQTFAADQFDEE+PE+RK+KSSFFN
Sbjct: 121 VLTVSATVVGDNQRKAVFFLGLYILSVGQGGHRPCVQTFAADQFDEESPEERKKKSSFFN 180

Query: 181 WWYVGLVVGSTLAVFLVIYVQ----------------------------VYRRHIPVGSP 240
           WWYVGLV GST AVF+VIYVQ                             YRR +PVGSP
Sbjct: 181 WWYVGLVGGSTFAVFVVIYVQDNIGWGLSFGILAGVLAAAIILFLAGVKKYRRQVPVGSP 240

Query: 241 LTRIAQVVVAAARKWRVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKAT 300
           LTRIAQVVVAAARKWRVD TR  WR+CYEED+ AKN+ EG+H  MTL R +QF ILDKAT
Sbjct: 241 LTRIAQVVVAAARKWRVDETRNGWRICYEEDNRAKNDAEGEHNLMTLARTNQFRILDKAT 300

Query: 301 LIDDEDKARKKRDPWRLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTM 360
           LID ED+ARKKRDPWRLSTV EVEEVK++ RLIPVW SCLMFAVVQAQIHTFFTKQGSTM
Sbjct: 301 LIDKEDEARKKRDPWRLSTVEEVEEVKLVVRLIPVWVSCLMFAVVQAQIHTFFTKQGSTM 360

Query: 361 LRSIGPHFQIPPASLQGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFI 420
           LRS+GPHFQ+PPASLQGVVGLTILLT+LFYDRVFVP+AR FTGHHSGITVLQRIG+GLFI
Sbjct: 361 LRSVGPHFQLPPASLQGVVGLTILLTVLFYDRVFVPAARNFTGHHSGITVLQRIGMGLFI 420

Query: 421 SILNMVASALVEAKRVAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQEL 480
           SI  M  SALVEAKRV +AAEHGL DTPKATVPMTIWWLIPQYMLCGVSDAF ++GLQEL
Sbjct: 421 SIFTMGVSALVEAKRVTIAAEHGLSDTPKATVPMTIWWLIPQYMLCGVSDAFAIIGLQEL 480

Query: 481 FYDQMPESMRSIGSAAYISVIGMGNFLSTAVISAVQAASRHKWLVDNLNRSKLQYFYWVL 540
           FYDQMPE MRS+G+AAYIS+IG+GNFLS+A+IS VQA S  +WL DNLNRS L YFYWVL
Sbjct: 481 FYDQMPEFMRSLGAAAYISIIGVGNFLSSAIISVVQAGSGGRWLEDNLNRSNLHYFYWVL 540

Query: 541 AGLSGLNLCCYIWVANGYVYKRVGG-----KDGEDHGKNSN--KGVYGDDMI 556
           A LS LNLC Y+W+ANG+VYKR GG        +D  KNSN   G YGDDMI
Sbjct: 541 AALSALNLCGYVWIANGFVYKRAGGNRRSISGDDDDVKNSNNINGCYGDDMI 592

BLAST of Cp4.1LG01g15040 vs. NCBI nr
Match: gi|728830325|gb|KHG09768.1| (hypothetical protein F383_13158 [Gossypium arboreum])

HSP 1 Score: 664.1 bits (1712), Expect = 2.2e-187
Identity = 342/571 (59.89%), Positives = 413/571 (72.33%), Query Frame = 1

Query: 14  PTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTAT 73
           PTKL   +       +P  GGWKAA FVI VE+AE+FAF GL+ NLI Y T    EP  T
Sbjct: 18  PTKLSSQK-------KPSKGGWKAAFFVISVEMAERFAFYGLAGNLITYLTNNLGEPVIT 77

Query: 74  AAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRK 133
           AAK VN W GVSA+FP+LGAF+ADS LGRFKTI+ SS+IY LGMVLL+LS +VI    RK
Sbjct: 78  AAKNVNTWVGVSAIFPLLGAFIADSYLGRFKTILASSVIYFLGMVLLSLSVSVIPMHSRK 137

Query: 134 PVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVF 193
            VFF ALY+L++GEGGH+PCVQTFAADQFDE  PE++  KSSFFNWWY+G+V G+++A+ 
Sbjct: 138 AVFFTALYVLAIGEGGHKPCVQTFAADQFDENNPEEKAAKSSFFNWWYLGIVTGASVAIV 197

Query: 194 LVIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKW 253
           +VIY+Q                             YR+  P GSP T +AQV+VAAA+KW
Sbjct: 198 VVIYLQDNVSWAAGFGVLSGSLAVALAVFLIGIRKYRKQRPTGSPFTSVAQVLVAAAKKW 257

Query: 254 RVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPW 313
           RV  T     +CYE+D    +  +GQ     L R  QF  LDKA +IDD+D   K RDPW
Sbjct: 258 RVSETHGGRGICYEDDRRGGSHVKGQTMSRNLVRTKQFRFLDKAMIIDDKDTLSKTRDPW 317

Query: 314 RLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASL 373
           RL ++ +VEEVK++ RLIP+W  CLMF+ V  Q+HTFFTKQGSTMLRSIGP+FQ+PPA+L
Sbjct: 318 RLCSLNQVEEVKLVLRLIPIWLGCLMFSAVITQLHTFFTKQGSTMLRSIGPNFQVPPAAL 377

Query: 374 QGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKR 433
           Q +VGLTIL+ +  YDRVFVP ARK TGH SGIT+LQRIG GLFISILNMV + LVE  R
Sbjct: 378 QSLVGLTILIAVPIYDRVFVPIARKITGHPSGITMLQRIGTGLFISILNMVVAGLVETAR 437

Query: 434 VAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSA 493
           V  AA+HGLID PKA VPM++WWL+PQY+L G+ D FT+VGLQELFYDQMPE MRSIG+A
Sbjct: 438 VNTAAKHGLIDAPKAIVPMSVWWLLPQYVLTGLGDVFTIVGLQELFYDQMPEEMRSIGAA 497

Query: 494 AYISVIGMGNFLSTAVISAVQAAS-RH--KWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 553
           AYISV+G+G+F++TAVIS VQ  S RH  +WL DNLNR+KL YFYWVLAGLS  NLC YI
Sbjct: 498 AYISVVGVGSFINTAVISVVQVISLRHGNEWLGDNLNRAKLNYFYWVLAGLSAFNLCAYI 557

BLAST of Cp4.1LG01g15040 vs. NCBI nr
Match: gi|823249040|ref|XP_012457175.1| (PREDICTED: protein NRT1/ PTR FAMILY 5.4-like [Gossypium raimondii])

HSP 1 Score: 658.3 bits (1697), Expect = 1.2e-185
Identity = 339/571 (59.37%), Positives = 411/571 (71.98%), Query Frame = 1

Query: 14  PTKLPDHRNHSDRPSQPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTAT 73
           PTKL   +       +P  GGWKAA FVI VE+AE+FAF GL+ NLI Y T    EP  T
Sbjct: 18  PTKLSSQK-------KPSKGGWKAAFFVISVEMAERFAFYGLAGNLITYLTNNLGEPVVT 77

Query: 74  AAKMVNNWSGVSAVFPILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRK 133
           AAK VN W GVSA+FP+LGAF+ADS LGRFKTI+ SS+IY LGMVLL+LS +VI    RK
Sbjct: 78  AAKNVNTWVGVSAIFPLLGAFIADSYLGRFKTILASSVIYFLGMVLLSLSVSVIPMHSRK 137

Query: 134 PVFFFALYILSVGEGGHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVF 193
            VFF ALY+L++GEGGH+PCVQTFAADQFDE  PE++  KSSFFNWWY+G+V G+++A+ 
Sbjct: 138 AVFFTALYVLAIGEGGHKPCVQTFAADQFDENNPEEKAAKSSFFNWWYLGIVTGASVAIV 197

Query: 194 LVIYVQ----------------------------VYRRHIPVGSPLTRIAQVVVAAARKW 253
           +VIY+Q                             YR+  P GSP T +AQV VAAA+KW
Sbjct: 198 VVIYLQDNVSWAAGFGVLSGSLAVALVVFLIGIRKYRKQRPTGSPFTSVAQVFVAAAKKW 257

Query: 254 RVDNTRQEWRVCYEEDSHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPW 313
           RV  T     +CYE+D    +  +GQ     L R  QF  LDKA +IDD+D   K RDPW
Sbjct: 258 RVSETHGGRGICYEDDRRGGSHVKGQTMGRDLVRTRQFRFLDKAMIIDDKDTLGKTRDPW 317

Query: 314 RLSTVAEVEEVKMLGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASL 373
           RL ++ +VEEVK++ RLIP+W  CLMF+ V  Q+HTFFTKQGSTMLRSIGP+FQ+PPA+L
Sbjct: 318 RLCSLNQVEEVKLVLRLIPIWLGCLMFSAVITQLHTFFTKQGSTMLRSIGPNFQVPPAAL 377

Query: 374 QGVVGLTILLTILFYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKR 433
           Q +VGLTIL+ +  YDRVFVP ARK TGH SGIT+LQRIG GLFISILNMV + LVE  R
Sbjct: 378 QSLVGLTILIAVPIYDRVFVPIARKITGHPSGITMLQRIGTGLFISILNMVVAGLVETAR 437

Query: 434 VAVAAEHGLIDTPKATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSA 493
           V  A +HGL+D PKA VPM++WWL+PQY+L G+ D FT+VGLQELFYDQMPE MRSIG+A
Sbjct: 438 VNTATKHGLMDAPKAVVPMSVWWLLPQYVLTGLGDVFTIVGLQELFYDQMPEEMRSIGAA 497

Query: 494 AYISVIGMGNFLSTAVISAVQA-ASRH--KWLVDNLNRSKLQYFYWVLAGLSGLNLCCYI 553
           AYISV+G+G+F++TAVIS VQ  +SRH  +WL DNLNR+KL YFYWVLAGLS  NLC YI
Sbjct: 498 AYISVVGVGSFINTAVISVVQVISSRHGKEWLGDNLNRAKLNYFYWVLAGLSAFNLCAYI 557

BLAST of Cp4.1LG01g15040 vs. NCBI nr
Match: gi|657967258|ref|XP_008375319.1| (PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Malus domestica])

HSP 1 Score: 658.3 bits (1697), Expect = 1.2e-185
Identity = 335/552 (60.69%), Positives = 412/552 (74.64%), Query Frame = 1

Query: 29  QPPAGGWKAAIFVIFVEVAEQFAFIGLSSNLIMYFTTVFHEPTATAAKMVNNWSGVSAVF 88
           +P  GGW AAIFVIFVE+AE+FAF G+S NLIMY T   H+ TATAAK VN W GVS++F
Sbjct: 22  KPSRGGWNAAIFVIFVEIAERFAFYGVSGNLIMYLTKDLHQATATAAKNVNMWIGVSSLF 81

Query: 89  PILGAFVADSLLGRFKTIIFSSLIYCLGMVLLTLSATVIGAPHRKPVFFFALYILSVGEG 148
           P++GA VADS LGRF TIIFSS IY +GM+LL LS +VI   +R+ VFF ALYIL+VGEG
Sbjct: 82  PLVGAVVADSYLGRFVTIIFSSSIYLMGMLLLCLSVSVIPRHYREAVFFVALYILAVGEG 141

Query: 149 GHRPCVQTFAADQFDEETPEQRKRKSSFFNWWYVGLVVGSTLAVFLVIYVQ--------- 208
           GH+PCVQTFAADQFDE++ E++K KSSFFNWWY+ +VV +T+A  +VIY+Q         
Sbjct: 142 GHKPCVQTFAADQFDEDSEEEKKAKSSFFNWWYLAIVVAATVATLVVIYIQDNVSWVVGF 201

Query: 209 -------------------VYRRHIPVGSPLTRIAQVVVAAARKWRVDNTRQEWRVCYEE 268
                               YRR  P+GSP T +AQV VAAARKWRV  T  +W V   +
Sbjct: 202 GVLTVVVAAGLFLFLLGINRYRRQAPLGSPFTAVAQVFVAAARKWRVKET-HDWDVYCGD 261

Query: 269 D--SHAKNEDEGQHKPMTLDRRSQFGILDKATLIDDEDKARKKRDPWRLSTVAEVEEVKM 328
           D  S   N +  + K  TL R  QF  LDKA +ID+ D +   R+PWRL ++ +VEEVK+
Sbjct: 262 DERSGGSNMEVRKLKHRTLARTKQFRFLDKAMIIDNLDASSNVRNPWRLCSLKQVEEVKL 321

Query: 329 LGRLIPVWFSCLMFAVVQAQIHTFFTKQGSTMLRSIGPHFQIPPASLQGVVGLTILLTIL 388
           + RLIP+W SCLMF+ VQ+Q+HT++TKQGSTM+RSIGPHF +PPASL  +VG+ IL+T+ 
Sbjct: 322 VLRLIPIWLSCLMFSAVQSQLHTYYTKQGSTMIRSIGPHFLLPPASLGSLVGVVILITVP 381

Query: 389 FYDRVFVPSARKFTGHHSGITVLQRIGIGLFISILNMVASALVEAKRVAVAAEHGLIDTP 448
            YDR+FVP ARKFTGHHSGITVLQRIG GLF+SILNMV SALVEAKRV+VA +HG+ID P
Sbjct: 382 MYDRIFVPMARKFTGHHSGITVLQRIGFGLFLSILNMVVSALVEAKRVSVAEQHGIIDNP 441

Query: 449 KATVPMTIWWLIPQYMLCGVSDAFTVVGLQELFYDQMPESMRSIGSAAYISVIGMGNFLS 508
           KA VPM +WWL+PQY +CG+SD FT+VGLQELFYDQMPE+MRS G+AAY+S+IG+G+F+S
Sbjct: 442 KAVVPMRMWWLLPQYFICGLSDTFTIVGLQELFYDQMPEAMRSFGAAAYLSIIGVGSFIS 501

Query: 509 TAVISAVQAASR---HKWLVDNLNRSKLQYFYWVLAGLSGLNLCCYIWVANGYVYKRVGG 548
           +AVIS VQA S     +WL DNLNRS L YFYWVLAGLS L+LC Y+W A  +VYK V G
Sbjct: 502 SAVISVVQAISSKAGEEWLGDNLNRSHLDYFYWVLAGLSALSLCAYVWFAMSFVYKIVQG 561

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PTR46_ARATH  6.6e-168  53.72     Protein NRT1/ PTR FAMILY 5.4 OS=Arabidopsis thaliana GN=NPF5.4 PE=2 SV=1
PTR9_ARATH   1.9e-122  43.40     Protein NRT1/ PTR FAMILY 5.10 OS=Arabidopsis thaliana GN=NPF5.10 PE=2 SV=1
PTR22_ARATH  2.5e-119  43.09     Protein NRT1/ PTR FAMILY 5.14 OS=Arabidopsis thaliana GN=NPF5.14 PE=2 SV=2
PTR23_ARATH  3.2e-114  43.19     Protein NRT1/ PTR FAMILY 5.13 OS=Arabidopsis thaliana GN=NPF5.13 PE=2 SV=2
PTR45_ARATH  2.8e-110  39.32     Protein NRT1/ PTR FAMILY 5.7 OS=Arabidopsis thaliana GN=NPF5.7 PE=2 SV=2
Match Name        E-value   Identity  Description
A0A0B0N5Q1_GOSAR  1.5e-187  59.89     Uncharacterized protein OS=Gossypium arboreum GN=F383_13158 PE=3 SV=1
A0A0D2VQ09_GOSRA  8.3e-186  59.37     Uncharacterized protein OS=Gossypium raimondii GN=B456_011G215200 PE=3 SV=1
B9RGA9_RICCO      1.9e-185  59.49     Peptide transporter, putative OS=Ricinus communis GN=RCOM_1452620 PE=3 SV=1
A0A067KZV4_JATCU  1.9e-185  60.51     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24527 PE=3 SV=1
W9RB31_9ROSA      3.9e-183  57.43     Putative peptide/nitrate transporter OS=Morus notabilis GN=L484_023963 PE=3 SV=1
Match Name   E-value   Identity  Description
AT3G54450.1  3.7e-169  53.72     Major facilitator superfamily protein
AT1G22540.1  1.1e-123  43.40     Major facilitator superfamily protein
AT1G72120.1  1.4e-120  43.09     Major facilitator superfamily protein
AT1G72125.1  1.8e-115  43.19     Major facilitator superfamily protein
AT3G53960.1  1.6e-111  39.32     Major facilitator superfamily protein
Match Name                        E-value   Identity  Description
gi|659092217|ref|XP_008446958.1|  4.0e-250  74.62     PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Cucumis melo]
gi|778706817|ref|XP_011655920.1|  9.3e-247  73.82     PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Cucumis sativus]
gi|728830325|gb|KHG09768.1|       2.2e-187  59.89     hypothetical protein F383_13158 [Gossypium arboreum]
gi|823249040|ref|XP_012457175.1|  1.2e-185  59.37     PREDICTED: protein NRT1/ PTR FAMILY 5.4-like [Gossypium raimondii]
gi|657967258|ref|XP_008375319.1|  1.2e-185  60.69     PREDICTED: protein NRT1/ PTR FAMILY 5.4 [Malus domestica]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006857  oligopeptide transport
GO:0006810  transport
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
Vocabulary: Molecular Function
Term        Definition
GO:0005215  transporter activity
Vocabulary: INTERPRO
Term       Definition
IPR020846  MFS_dom
IPR018456  PTR2_symporter_CS
IPR000109  POT_fam
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006857 oligopeptide transport
biological_process GO:0006810 transport
cellular_component GO:0016020 membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005215 transporter activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g15040.1  Cp4.1LG01g15040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                            Source   Source Term         Source Description                 Alignment
IPR000109  Proton-dependent oligopeptide transporter family           PANTHER  PTHR11654           OLIGOPEPTIDE TRANSPORTER-RELATED   coord: 33..555, score:
IPR000109  Proton-dependent oligopeptide transporter family           PFAM     PF00854             PTR2                               coord: 103..199, score: 9.4E-25; coord: 200..489, score: 2.5
IPR018456  PTR2 family proton/oligopeptide symporter, conserved site  PROSITE  PS01022             PTR2_1                             coord: 92..116, score:
IPR020846  Major facilitator superfamily domain                       unknown  SSF103473           MFS general substrate transporter  coord: 286..526, score: 8.76E-29; coord: 23..241, score: 8.76
None       No IPR available                                           GENE3D   G3DSA:1.20.1250.20                                     coord: 38..198, score: 6.0
None       No IPR available                                           PANTHER  PTHR11654:SF133     PROTEIN NRT1/ PTR FAMILY 5.4       coord: 33..555, score:

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG01g15040  Cucurbita pepo (Zucchini)  cpecpeB040
Cp4.1LG01g15040  Cucurbita maxima (Rimu)    cmacpeB429
Cp4.1LG01g15040  Cucurbita moschata (Rifu)  cmocpeB392
Cp4.1LG01g15040  Cucumber (Gy14) v2         cgybcpeB648
Cp4.1LG01g15040  Silver-seed gourd          carcpeB1021
Cp4.1LG01g15040  Silver-seed gourd          carcpeB1177