Lsi04G004660 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi04G004660
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionSmall multi-drug export protein, putative
Locationchr04 : 4403484 .. 4406698 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAAAACTTCATCAAAGCAACTAACCATAAACACAATCCTTATGGAGAATAGAGGACAAATGAGAGAAACAAGAATCCCAATAGACCTCATACACTTTTTTACTAGAAAAAGAAAGTAGGGTACCAAAACAACTAAAAGGAGTGCGGATAAATGAGTGAGCCACAAAATTACATGAACGAATACAATTAACATAAGAACTAATGTATGTACGATGCACAAGCTCAAACATGACTTTAACCATATTTTTTATTTTAAAAAGATCAAATGACGGCTCATTGAGCGCACGAATAATCTCGATTGAATTAGATTTAACACTAAAGTGTTACTTCAAAAAACTTTGACATCTAAATAAAGCACGAGAGTCCCATTCTTTTTATCAATATGTATAACTATGTGTTGGTGACTTGGGATGTTGTAGTTAGAATACTCACTCTTATTGTACCTAAAAAAAATAAAAAAAAATAACATCAATTTCTTGTTACTCTTCGTACGGAGTCTTTCTTATCCAAGCCAAACCCAAAACCTCGATAAATCCTCCCTCTTGATGCTAATGCCGCTTCTTCCCACTTTCCTCCCTACCAAATTCATACTCTTCATAGCCTAAAATCAGCAATGCCTTTTCCGATTCATCCCAATCTCCAAGGACCCAATAACTCCGCTCGAACCCCAACGATTACCATTGGAGAAATCTCCTTTAAACCCTCTTCTTCTTCTATCATCATCCTCTGAATCAATTTCCAATGCCACAATCCCACTCCGTCCAAGTGTCTCGTCTAAATCCCGCTGGAGCAGAAGAGGAATTCTCTTGAACAATTCAACCCCCACTTGTGTTCTTCAGCTTCCGGTAAGCGATGACTACTTCTTTACCATTCACTTCACCATTAATCTCCGCATTTTCGCCGAGAAAGACCCTTTTCCCGCTCAAGCTTAATCGGCCCTCCATTAGTCAGAGTAAACTATCTCTTCACTCGTCGAGTCCATGTGTAAATGTTCGCCATTTCAACTGTTTTCATCCTGTTTTCTCGACTTCTCGGATCTTTCGTACTGTCACTCGAAGTTCGTCAAATGGGTTTCTCGAAGATGACGACATTATCCCCTCTTTTGAGGAGAAGCCGGTTAAAGTTCTGCTATTGGTTCTGTTTTGGGCATCTCTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCCAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGAGCATTGCAGAGCTCAGGCTGGCCTGCTGAGGCTGTAGTATTTGCCCTCGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTAGCCCTAACCGTTCTATCCGTACTTGGGTGAGTTTTCAAATTTTAACTTCATTTTTTAGATTCATCTTTAAAAATTAACGAATCTGCATTTGGTTTACTACACATAATATTTCTCCTCAACTGCCGCTGCCTCTTTTGTGTTTGTGTAATTTTCATTCACATCGACATCAATGTTTTAATACAAGGTGAAAGTTTCAGTTTAAATAAACAAGTAGTAATGGAATAGATTACTACAGGCTCATGCTTTATTGGTATTACAAATGTAAGTGGGCCTTTATGTTTATTGAGGAATAGATAGTAAGTTCCATTACACAAGATGTGGAGATTTCAAACCTCAATGTGTTCTCATGTAGACACAATCCTCCTATGAATCAATGAAGGCCTAAATCTTGAAGTATAAGCCAAATCCTTTAGGCATTGATGGATGTTTGAAGTGAACTTTGAAATGGTTCATAAGTTTTTTGGACCAGTGAGTGATGAATTTCTTCGGTTTTCTATCTTAATTTTAAAAACCACAAACTCTTCTGCATGAGAGATACTTAAAGTTTGATTAGATGAAATAAATAAATTGTGGGATTCAAACTGCAGGAACATGGTTCCTGTACCCTTTATCATACTCTACTTGAAGAAATTTGCTACTTTCCTAGCGGGAAGGAATGCTTCTGCCTCCCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCTGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGATATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTGGTCAACTTGTTGGTAAATCTTGGTATGAAGGAGGCCATTCTCACTGGAGTGATTCTTTTCATTATATCAACTTTCATGTGGAGCATTCTTCGACTGATAAGAAAAGCTGTCAGAAAATGAATTAATCGAAAAGGTTAGGTATTTCTATGTCTAATAACAACGAACAAGGTAAGGTCGTATCTGATCTTTAGTCTGATATATGTGGACTTAGACTTGTTACTTGTTAGTTTAAGTTGGCATTCTGATCAATTGATTACTTACTATTTTCTTGGTGGCGTAGATTTTTCTCTAATATCATATGAAGAATGAGTAGTTGGAAGTGTGGAGATGATTAACTGTAGATATAAAGTTTGTCTTGTGAACACTGTTCAGAAGAGAAACCATCTGCAAACAACTTTCATTTATCATACCAAGCCGAGAAATGTTGTGTAAATTCTGGTATTATGATTGTTATTTGCTCGTGTAGAATTGCAAATCTTGTTCTTGAGATGTCAGGTGCAAATATAGTTGTGCTTTGAAAATTGTAAAATCATGCTCGAATGTTGTGGTCTGGTTTGATTTGCCGGCAAGCATAGACATTACCTTAGTTGAGGAACAAAAATGGCAAGAAACTGAACCACCCAACACCATGGATTTGCCTTTAACTGCAATAATATACGAATTGTTGACTACTTCTGTAATAGTTGATATGGGCTCACCAATCCAACACATAGTTCAATGGTTGATATGGGGTCGTGAGTTTGACTCTTTGGTGTCTGTACTATGTGGCTATGATCTTGTTCTTAGGTTTGTCTGCCATTTTGTTGCATGTGTTTCTTGCTGCTGCACATCATTGGATTGGTTGTACTGCTCCAAAATGAAAGATACTGTATCAATTTGGATTTGTATTCTCAATGCCTTAGGACATCTTCCACTATTTTGGGTTGTTCCCAGTTTTTGTACTTAGAGTCAGATAATAAAGGTAGGGATTATTATTATTA

mRNA sequence

AGAAAACTTCATCAAAGCAACTAACCATAAACACAATCCTTATGGAGAATAGAGGACAAATGAGAGAAACAAGAATCCCAATAGACCTCATACACTTTTTTACTAGAAAAAGAAAGTAGGGTACCAAAACAACTAAAAGGAGTGCGGATAAATGAGTGAGCCACAAAATTACATGAACGAATACAATTAACATAAGAACTAATGTATGTACGATGCACAAGCTCAAACATGACTTTAACCATATTTTTTATTTTAAAAAGATCAAATGACGGCTCATTGAGCGCACGAATAATCTCGATTGAATTAGATTTAACACTAAAGTGTTACTTCAAAAAACTTTGACATCTAAATAAAGCACGAGAGTCCCATTCTTTTTATCAATATGTATAACTATGTGTTGGTGACTTGGGATGTTGTAGTTAGAATACTCACTCTTATTGTACCTAAAAAAAATAAAAAAAAATAACATCAATTTCTTGTTACTCTTCGTACGGAGTCTTTCTTATCCAAGCCAAACCCAAAACCTCGATAAATCCTCCCTCTTGATGCTAATGCCGCTTCTTCCCACTTTCCTCCCTACCAAATTCATACTCTTCATAGCCTAAAATCAGCAATGCCTTTTCCGATTCATCCCAATCTCCAAGGACCCAATAACTCCGCTCGAACCCCAACGATTACCATTGGAGAAATCTCCTTTAAACCCTCTTCTTCTTCTATCATCATCCTCTGAATCAATTTCCAATGCCACAATCCCACTCCGTCCAAGTGTCTCGTCTAAATCCCGCTGGAGCAGAAGAGGAATTCTCTTGAACAATTCAACCCCCACTTGTGTTCTTCAGCTTCCGGTAAGCGATGACTACTTCTTTACCATTCACTTCACCATTAATCTCCGCATTTTCGCCGAGAAAGACCCTTTTCCCGCTCAAGCTTAATCGGCCCTCCATTAGTCAGAGTAAACTATCTCTTCACTCGTCGAGTCCATGTGTAAATGTTCGCCATTTCAACTGTTTTCATCCTGTTTTCTCGACTTCTCGGATCTTTCGTACTGTCACTCGAAGTTCGTCAAATGGGTTTCTCGAAGATGACGACATTATCCCCTCTTTTGAGGAGAAGCCGGTTAAAGTTCTGCTATTGGTTCTGTTTTGGGCATCTCTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCCAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGAGCATTGCAGAGCTCAGGCTGGCCTGCTGAGGCTGTAGTATTTGCCCTCGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTAGCCCTAACCGTTCTATCCGTACTTGGGAACATGGTTCCTGTACCCTTTATCATACTCTACTTGAAGAAATTTGCTACTTTCCTAGCGGGAAGGAATGCTTCTGCCTCCCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCTGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGATATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTGGTCAACTTGTTGGTAAATCTTGGTATGAAGGAGGCCATTCTCACTGGAGTGATTCTTTTCATTATATCAACTTTCATGTGGAGCATTCTTCGACTGATAAGAAAAGCTGTCAGAAAATGAATTAATCGAAAAGGTTAGGTATTTCTATGTCTAATAACAACGAACAAGGTAAGGTCGTATCTGATCTTTAGTCTGATATATGTGGACTTAGACTTGTTACTTGTTAGTTTAAGTTGGCATTCTGATCAATTGATTACTTACTATTTTCTTGGTGGCGTAGATTTTTCTCTAATATCATATGAAGAATGAGTAGTTGGAAGTGTGGAGATGATTAACTGTAGATATAAAGTTTGTCTTGTGAACACTGTTCAGAAGAGAAACCATCTGCAAACAACTTTCATTTATCATACCAAGCCGAGAAATGTTGTGTAAATTCTGGTATTATGATTGTTATTTGCTCGTGTAGAATTGCAAATCTTGTTCTTGAGATGTCAGGTGCAAATATAGTTGTGCTTTGAAAATTGTAAAATCATGCTCGAATGTTGTGGTCTGGTTTGATTTGCCGGCAAGCATAGACATTACCTTAGTTGAGGAACAAAAATGGCAAGAAACTGAACCACCCAACACCATGGATTTGCCTTTAACTGCAATAATATACGAATTGTTGACTACTTCTGTAATAGTTGATATGGGCTCACCAATCCAACACATAGTTCAATGGTTGATATGGGGTCGTGAGTTTGACTCTTTGGTGTCTGTACTATGTGGCTATGATCTTGTTCTTAGGTTTGTCTGCCATTTTGTTGCATGTGTTTCTTGCTGCTGCACATCATTGGATTGGTTGTACTGCTCCAAAATGAAAGATACTGTATCAATTTGGATTTGTATTCTCAATGCCTTAGGACATCTTCCACTATTTTGGGTTGTTCCCAGTTTTTGTACTTAGAGTCAGATAATAAAGGTAGGGATTATTATTATTA

Coding sequence (CDS)

ATGACTACTTCTTTACCATTCACTTCACCATTAATCTCCGCATTTTCGCCGAGAAAGACCCTTTTCCCGCTCAAGCTTAATCGGCCCTCCATTAGTCAGAGTAAACTATCTCTTCACTCGTCGAGTCCATGTGTAAATGTTCGCCATTTCAACTGTTTTCATCCTGTTTTCTCGACTTCTCGGATCTTTCGTACTGTCACTCGAAGTTCGTCAAATGGGTTTCTCGAAGATGACGACATTATCCCCTCTTTTGAGGAGAAGCCGGTTAAAGTTCTGCTATTGGTTCTGTTTTGGGCATCTCTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCCAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGAGCATTGCAGAGCTCAGGCTGGCCTGCTGAGGCTGTAGTATTTGCCCTCGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTAGCCCTAACCGTTCTATCCGTACTTGGGAACATGGTTCCTGTACCCTTTATCATACTCTACTTGAAGAAATTTGCTACTTTCCTAGCGGGAAGGAATGCTTCTGCCTCCCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCTGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGATATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTGGTCAACTTGTTGGTAAATCTTGGTATGAAGGAGGCCATTCTCACTGGAGTGATTCTTTTCATTATATCAACTTTCATGTGGAGCATTCTTCGACTGATAAGAAAAGCTGTCAGAAAATGA

Protein sequence

MTTSLPFTSPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTSRIFRTVTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFMWSILRLIRKAVRK
BLAST of Lsi04G004660 vs. TrEMBL
Match: A0A0A0KQH3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604240 PE=4 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 2.0e-152
Identity = 288/313 (92.01%), Postives = 297/313 (94.89%), Query Frame = 1

Query: 1   MTTSLPFTSPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTS 60
           MTTSLP TSPL SAFSPRKTLF LKLNRPSI++S  SLH SSP VNV HFNCF PV  TS
Sbjct: 1   MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60

Query: 61  RIFRTVTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
           RI RTV RSSSNGFLEDD+IIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61  RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120

Query: 121 ASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
           ASNFGLKIA ALQ+SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180

Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
           VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240

Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFM 300
           TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLG+KEAI+TGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300

Query: 301 WSILRLIRKAVRK 314
           WSILR+I+K+  K
Sbjct: 301 WSILRMIKKSFEK 313

BLAST of Lsi04G004660 vs. TrEMBL
Match: M5XGS5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015818mg PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 1.5e-107
Identity = 221/313 (70.61%), Postives = 254/313 (81.15%), Query Frame = 1

Query: 8   TSPLISAFSPRKTLFPL--KLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTSRIFRT 67
           TSPL S  S  KT F    K  RPSI+ S     +S+  +N +  +  +P+ + S +   
Sbjct: 9   TSPLTSNLSLGKTRFRFSPKHGRPSIAHSIQPPFNSNADLNFQTLSPLNPLLANSPLSHA 68

Query: 68  VTRSSSNGFL---EDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAV----DS 127
            TR SS+GFL   E DDI+P FEE+PVK +  VL WAS+SLA FAASGDA AA     DS
Sbjct: 69  ATRVSSHGFLDKDEKDDILPVFEERPVKFVFWVLVWASVSLALFAASGDANAAAAAAADS 128

Query: 128 IRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLG 187
           IRAS+FGLKIA AL+ SGWP EAVVFALATLPVIELRGAIPVGYW+QLKPV LTVLSVLG
Sbjct: 129 IRASSFGLKIASALRGSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPVMLTVLSVLG 188

Query: 188 NMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPF 247
           NMVPVPFIILYLK+FA+FLAG+N +A++FLD+LF RAKEKA PVEEFQWLGLMLFVAVPF
Sbjct: 189 NMVPVPFIILYLKRFASFLAGKNKAAARFLDILFVRAKEKAGPVEEFQWLGLMLFVAVPF 248

Query: 248 PGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIIST 307
           PGTGAWTGAIIASILDMPFW+ VSANFFGVV+AGLLVNLLVNLG+K AI+TG+ILFIIST
Sbjct: 249 PGTGAWTGAIIASILDMPFWAAVSANFFGVVLAGLLVNLLVNLGLKYAIITGIILFIIST 308

Query: 308 FMWSILRLIRKAV 312
           FMWSILR +RK++
Sbjct: 309 FMWSILRNLRKSL 321

BLAST of Lsi04G004660 vs. TrEMBL
Match: A0A067JBU6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21708 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 2.4e-105
Identity = 214/310 (69.03%), Postives = 246/310 (79.35%), Query Frame = 1

Query: 9   SPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPV-FSTSRIFRTV- 68
           S L  +FS RKT F    NR +++   L  H     V       F  V    SR F TV 
Sbjct: 6   SILSLSFSFRKTHFKFLPNRANVNLCPLIAHKKRSLVKSNQILPFQAVDLLASRRFSTVP 65

Query: 69  ---TRSSSNGFL---EDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 128
              TR+SS+GF+   ED +I+P FEE+P K L  VL WAS SLAWFAASGDA AAVDSI+
Sbjct: 66  LTATRASSDGFIDLTEDKEILPLFEERPAKFLFWVLVWASFSLAWFAASGDANAAVDSIK 125

Query: 129 ASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 188
           AS+FGLKIA AL+S GWP E+VVFALATLPV+ELRGAIPVGYWMQLKP+ LTVLSV+GNM
Sbjct: 126 ASSFGLKIASALRSLGWPDESVVFALATLPVLELRGAIPVGYWMQLKPITLTVLSVIGNM 185

Query: 189 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 248
           VPVP I+LYLK+FA+FLAGRN SAS+FLD+LF+ AK+KAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 186 VPVPLIVLYLKRFASFLAGRNQSASRFLDILFENAKKKAAPVEEFQWLGLMLFVAVPFPG 245

Query: 249 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFM 308
           TGAWTGAI+ASILDMPFW  VSANF GVV+AGLLVNLLVNLG+K AI+TG+ILF+ISTFM
Sbjct: 246 TGAWTGAIVASILDMPFWPAVSANFCGVVLAGLLVNLLVNLGLKYAIVTGIILFLISTFM 305

Query: 309 WSILRLIRKA 311
           WSILR +R +
Sbjct: 306 WSILRSVRNS 315

BLAST of Lsi04G004660 vs. TrEMBL
Match: D7TNF2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01540 PE=4 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 2.7e-101
Identity = 209/311 (67.20%), Postives = 242/311 (77.81%), Query Frame = 1

Query: 1   MTTSLPFTSPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTS 60
           M  S+    PL    S R+T   + L+  S + ++  L   +P + +R+        + S
Sbjct: 1   MAASVSSPPPLPLTVSSRRTHLAIWLHSRS-ADNQHRLFKPNPSLALRNSRHSRHPLTIS 60

Query: 61  RIFRTVTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
               T  ++S + FL   D +  FE  PVK L  VLFWASLS+AWFAASGDA AA DSIR
Sbjct: 61  PPHSTPAQASPDEFL---DKVGDFEGPPVKFLFWVLFWASLSVAWFAASGDANAATDSIR 120

Query: 121 ASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
           AS+FGLK+A AL+SSGWP EAVV ALATLPVIELRGAIPVGYWMQLKP  LT+LSVLGNM
Sbjct: 121 ASSFGLKVASALRSSGWPDEAVVVALATLPVIELRGAIPVGYWMQLKPATLTILSVLGNM 180

Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
           +PVPFIILYLK+FATFLAG+N SAS+FLDMLF++AKEKA PVEEFQWLGLMLFVAVPFPG
Sbjct: 181 IPVPFIILYLKRFATFLAGKNKSASRFLDMLFEKAKEKAGPVEEFQWLGLMLFVAVPFPG 240

Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFM 300
           TGAWTGAIIASILDMPFW  VSANFFGVV+AGLLVNLLVNLG+K AI+TGVILF ISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWPAVSANFFGVVLAGLLVNLLVNLGLKYAIVTGVILFFISTFM 300

Query: 301 WSILRLIRKAV 312
           WS+LR + KA+
Sbjct: 301 WSVLRSLMKAL 307

BLAST of Lsi04G004660 vs. TrEMBL
Match: V4TPN6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032129mg PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 6.1e-101
Identity = 211/306 (68.95%), Postives = 242/306 (79.08%), Query Frame = 1

Query: 9   SPLISAFSPRKTLFPLKLNRP-SISQSKLSLHSSSPCVNVRHFNCF--HPVFSTSRIFRT 68
           +PLI+  +  ++LF    ++P S  QS+  L    P V+    +CF     FS+      
Sbjct: 28  NPLIATNNQIQSLFVFSKSKPFSTFQSRRHL---GPLVS----SCFPTRASFSSDMFPDN 87

Query: 69  VTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIRASNFG 128
           +T        E++ I+P  EE P+K LL V+FWASLSL WF+ SGDA AAVDSIRAS  G
Sbjct: 88  IT--------EEERILPVTEETPLKFLLWVVFWASLSLVWFSTSGDANAAVDSIRASAIG 147

Query: 129 LKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPF 188
           LKIA AL+ SGWP EAVVFALATLPV+ELRGAIPVGYWMQLKPV LTVLSVLGNMVPVPF
Sbjct: 148 LKIATALRRSGWPDEAVVFALATLPVLELRGAIPVGYWMQLKPVLLTVLSVLGNMVPVPF 207

Query: 189 IILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWT 248
           IILYLKKFA+FLAG+N SASQFLDMLF++AKEKA PVEEFQWLGLMLFVAVPFPGTGAWT
Sbjct: 208 IILYLKKFASFLAGKNRSASQFLDMLFQKAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWT 267

Query: 249 GAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFMWSILR 308
           GA IA+ILDMPFWS +SANFFGVV+AGLLVNLLVNLG+K AI+TG ILFIISTFMWS LR
Sbjct: 268 GAFIAAILDMPFWSALSANFFGVVIAGLLVNLLVNLGLKYAIVTGAILFIISTFMWSTLR 318

Query: 309 LIRKAV 312
            IRK++
Sbjct: 328 SIRKSL 318

BLAST of Lsi04G004660 vs. TAIR10
Match: AT2G02590.1 (AT2G02590.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 349.0 bits (894), Expect = 3.1e-96
Identity = 185/260 (71.15%), Postives = 215/260 (82.69%), Query Frame = 1

Query: 64  RTVTR--SSSNGFL-------EDDDII--PSFEEKPVKVLLLVLFWASLSLAWFAASGDA 123
           R  TR  SS +GFL       E ++II  PS    PVK  + V+ WAS SL WFA SGDA
Sbjct: 61  RNFTRFCSSPDGFLRNTKDDEEGNEIIQLPSIGVNPVKFAICVVLWASFSLLWFARSGDA 120

Query: 124 KAAVDSIRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALT 183
           KAA DSI++S+FGL+IA  L+  GWP EAVVFALATLPVIELRGAIPVGYWMQLKPV LT
Sbjct: 121 KAATDSIKSSSFGLRIASTLRRFGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPVVLT 180

Query: 184 VLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLML 243
             SVLGNMVPVPFI+LYLK FA+F+AG++ +AS+ LD+LFKRAKEKA PVEEF+WLGLML
Sbjct: 181 SFSVLGNMVPVPFIVLYLKTFASFVAGKSQTASKLLDILFKRAKEKAGPVEEFKWLGLML 240

Query: 244 FVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVI 303
           FVAVPFPGTGAWTGAIIASILDMPFWS VS+NF GVV+AGLLVNLLVNLG+K+AI+ G+ 
Sbjct: 241 FVAVPFPGTGAWTGAIIASILDMPFWSAVSSNFCGVVLAGLLVNLLVNLGLKQAIVAGIA 300

Query: 304 LFIISTFMWSILRLIRKAVR 313
           LF +STFMWS+LR IRK+++
Sbjct: 301 LFFVSTFMWSVLRNIRKSIK 320

BLAST of Lsi04G004660 vs. NCBI nr
Match: gi|449434831|ref|XP_004135199.1| (PREDICTED: uncharacterized protein LOC101204187 [Cucumis sativus])

HSP 1 Score: 546.6 bits (1407), Expect = 2.9e-152
Identity = 288/313 (92.01%), Postives = 297/313 (94.89%), Query Frame = 1

Query: 1   MTTSLPFTSPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTS 60
           MTTSLP TSPL SAFSPRKTLF LKLNRPSI++S  SLH SSP VNV HFNCF PV  TS
Sbjct: 1   MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60

Query: 61  RIFRTVTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
           RI RTV RSSSNGFLEDD+IIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61  RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120

Query: 121 ASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
           ASNFGLKIA ALQ+SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180

Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
           VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240

Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFM 300
           TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLG+KEAI+TGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300

Query: 301 WSILRLIRKAVRK 314
           WSILR+I+K+  K
Sbjct: 301 WSILRMIKKSFEK 313

BLAST of Lsi04G004660 vs. NCBI nr
Match: gi|659090985|ref|XP_008446308.1| (PREDICTED: uncharacterized protein LOC103489082 [Cucumis melo])

HSP 1 Score: 540.4 bits (1391), Expect = 2.0e-150
Identity = 285/313 (91.05%), Postives = 294/313 (93.93%), Query Frame = 1

Query: 1   MTTSLPFTSPLISAFSPRKTLFPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTS 60
           MTTSLP TSPL SAFSPRKTLF LKLNRPSI+QS  SLH SSP VNV H NC  PV STS
Sbjct: 1   MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITQSTHSLHFSSPFVNVPHSNCSDPVSSTS 60

Query: 61  RIFRTVTRSSSNGFLEDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
           RI RTV RSSSNGFLEDD+IIPSFEEKP+KVL+LVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61  RIIRTVPRSSSNGFLEDDEIIPSFEEKPIKVLILVLFWASLSLAWFAASGDAKAAVDSIR 120

Query: 121 ASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
           ASNFGLKIA ALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVTLTVLSVLGNM 180

Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
           VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240

Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIISTFM 300
           TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLG+KEAI+TG ILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGAILFIISTFM 300

Query: 301 WSILRLIRKAVRK 314
           WSILR+I+K+  K
Sbjct: 301 WSILRMIKKSFEK 313

BLAST of Lsi04G004660 vs. NCBI nr
Match: gi|1009153909|ref|XP_015894881.1| (PREDICTED: uncharacterized protein LOC107428807 [Ziziphus jujuba])

HSP 1 Score: 414.8 bits (1065), Expect = 1.3e-112
Identity = 227/317 (71.61%), Postives = 258/317 (81.39%), Query Frame = 1

Query: 1   MTTSL-PFTSPLISAFSPRKTL--FPLKLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVF 60
           M  SL P T+PL S FS  KTL  F  K +R      K  L SS+ C+N +      P+ 
Sbjct: 1   MAASLSPSTAPLTSKFSLGKTLLGFSPKFDRHCTPHGKQPLCSSNECLNFQTSRRLSPLV 60

Query: 61  STSRIFRTVTRSSSNGFL---EDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKA 120
           S S +  T TR+SSNGFL   EDDDI+PSFE +PVK +  VL WASLS+AWFAASGDAKA
Sbjct: 61  SNSPLPVTATRASSNGFLDTTEDDDIVPSFEVRPVKFVFWVLLWASLSVAWFAASGDAKA 120

Query: 121 AVDSIRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVL 180
           A DSI+AS+FGLKIA AL+  GWP EAVVFALATLPV+ELRGAIPVGYW+QLKP  LTVL
Sbjct: 121 AADSIKASSFGLKIASALRGLGWPDEAVVFALATLPVLELRGAIPVGYWLQLKPWILTVL 180

Query: 181 SVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFV 240
           SV+GNMVPVPFIILYLK+FATFLAGRN + ++FLDMLF RAKEKA PVEEFQWLGLMLFV
Sbjct: 181 SVIGNMVPVPFIILYLKRFATFLAGRNKAGAKFLDMLFVRAKEKAGPVEEFQWLGLMLFV 240

Query: 241 AVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILF 300
           AVPFPGTGAWTGAIIASILDMPFWS VSANFFGVV+AGLLVNLLVNLG+K AI+TG+ILF
Sbjct: 241 AVPFPGTGAWTGAIIASILDMPFWSAVSANFFGVVLAGLLVNLLVNLGLKYAIVTGIILF 300

Query: 301 IISTFMWSILRLIRKAV 312
            ISTFMWS+LR ++K++
Sbjct: 301 FISTFMWSVLRNLKKSL 317

BLAST of Lsi04G004660 vs. NCBI nr
Match: gi|645229931|ref|XP_008221691.1| (PREDICTED: uncharacterized protein LOC103321638 [Prunus mume])

HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-107
Identity = 222/313 (70.93%), Postives = 254/313 (81.15%), Query Frame = 1

Query: 8   TSPLISAFSPRKTLFPL--KLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTSRIFRT 67
           TSPL S  S  KT F    K  RPSI+QS     +S+  +N R  +  +P  + S +   
Sbjct: 9   TSPLTSNLSLGKTRFRFSPKHGRPSIAQSIQPPFNSNADLNFRTLSPLNPRLANSPLSHA 68

Query: 68  VTRSSSNGFL---EDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAV----DS 127
            TR SS+GFL   E DDI+P FEE+PVK +  VL WAS+SLA FAASGDA AA     DS
Sbjct: 69  ATRVSSHGFLDKDEKDDILPVFEERPVKFVFWVLVWASVSLALFAASGDANAAAAAAADS 128

Query: 128 IRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLG 187
           IRAS+FGLKIA AL+ SGWP EAVVFALATLPVIELRGAIPVGYW+QLKPV LTVLSVLG
Sbjct: 129 IRASSFGLKIASALRGSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPVMLTVLSVLG 188

Query: 188 NMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPF 247
           NMVPVPFIILYLK+FA+FLAG+N +A++FLD+LF RAKEKA PVEEFQWLGLMLFVAVPF
Sbjct: 189 NMVPVPFIILYLKRFASFLAGKNKAAARFLDILFVRAKEKAGPVEEFQWLGLMLFVAVPF 248

Query: 248 PGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIIST 307
           PGTGAWTGAIIASILDMPFW+ VSANFFGV++AGLLVNLLVNLG+K AI+TG+ILFIIST
Sbjct: 249 PGTGAWTGAIIASILDMPFWAAVSANFFGVLLAGLLVNLLVNLGLKYAIITGIILFIIST 308

Query: 308 FMWSILRLIRKAV 312
           FMWSILR +RK++
Sbjct: 309 FMWSILRNLRKSL 321

BLAST of Lsi04G004660 vs. NCBI nr
Match: gi|596224101|ref|XP_007224123.1| (hypothetical protein PRUPE_ppa015818mg [Prunus persica])

HSP 1 Score: 397.5 bits (1020), Expect = 2.1e-107
Identity = 221/313 (70.61%), Postives = 254/313 (81.15%), Query Frame = 1

Query: 8   TSPLISAFSPRKTLFPL--KLNRPSISQSKLSLHSSSPCVNVRHFNCFHPVFSTSRIFRT 67
           TSPL S  S  KT F    K  RPSI+ S     +S+  +N +  +  +P+ + S +   
Sbjct: 9   TSPLTSNLSLGKTRFRFSPKHGRPSIAHSIQPPFNSNADLNFQTLSPLNPLLANSPLSHA 68

Query: 68  VTRSSSNGFL---EDDDIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAV----DS 127
            TR SS+GFL   E DDI+P FEE+PVK +  VL WAS+SLA FAASGDA AA     DS
Sbjct: 69  ATRVSSHGFLDKDEKDDILPVFEERPVKFVFWVLVWASVSLALFAASGDANAAAAAAADS 128

Query: 128 IRASNFGLKIARALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLG 187
           IRAS+FGLKIA AL+ SGWP EAVVFALATLPVIELRGAIPVGYW+QLKPV LTVLSVLG
Sbjct: 129 IRASSFGLKIASALRGSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPVMLTVLSVLG 188

Query: 188 NMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPF 247
           NMVPVPFIILYLK+FA+FLAG+N +A++FLD+LF RAKEKA PVEEFQWLGLMLFVAVPF
Sbjct: 189 NMVPVPFIILYLKRFASFLAGKNKAAARFLDILFVRAKEKAGPVEEFQWLGLMLFVAVPF 248

Query: 248 PGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGMKEAILTGVILFIIST 307
           PGTGAWTGAIIASILDMPFW+ VSANFFGVV+AGLLVNLLVNLG+K AI+TG+ILFIIST
Sbjct: 249 PGTGAWTGAIIASILDMPFWAAVSANFFGVVLAGLLVNLLVNLGLKYAIITGIILFIIST 308

Query: 308 FMWSILRLIRKAV 312
           FMWSILR +RK++
Sbjct: 309 FMWSILRNLRKSL 321

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KQH3_CUCSA2.0e-15292.01Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604240 PE=4 SV=1[more]
M5XGS5_PRUPE1.5e-10770.61Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015818mg PE=4 SV=1[more]
A0A067JBU6_JATCU2.4e-10569.03Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21708 PE=4 SV=1[more]
D7TNF2_VITVI2.7e-10167.20Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01540 PE=4 SV=... [more]
V4TPN6_9ROSI6.1e-10168.95Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032129mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G02590.13.1e-9671.15 FUNCTIONS IN: molecular_function unknown[more]
Match NameE-valueIdentityDescription
gi|449434831|ref|XP_004135199.1|2.9e-15292.01PREDICTED: uncharacterized protein LOC101204187 [Cucumis sativus][more]
gi|659090985|ref|XP_008446308.1|2.0e-15091.05PREDICTED: uncharacterized protein LOC103489082 [Cucumis melo][more]
gi|1009153909|ref|XP_015894881.1|1.3e-11271.61PREDICTED: uncharacterized protein LOC107428807 [Ziziphus jujuba][more]
gi|645229931|ref|XP_008221691.1|1.3e-10770.93PREDICTED: uncharacterized protein LOC103321638 [Prunus mume][more]
gi|596224101|ref|XP_007224123.1|2.1e-10770.61hypothetical protein PRUPE_ppa015818mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR009577Sm_multidrug_ex
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006508 proteolysis
cellular_component GO:0009507 chloroplast
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0004177 aminopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi04G004660.1Lsi04G004660.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009577Putative small multi-drug exportPFAMPF06695Sm_multidrug_excoord: 153..273
score: 4.1
NoneNo IPR availablePANTHERPTHR36007FAMILY NOT NAMEDcoord: 3..313
score: 1.1E
NoneNo IPR availablePANTHERPTHR36007:SF1TRANSPORT PROTEIN-RELATEDcoord: 3..313
score: 1.1E

The following gene(s) are paralogous to this gene:

None