Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCCGAACCCGATTTCATCAATTTATGGTGCCGGTACGCCGCCGTGAACTGATGGACATCAACCATGAGTCATATGGTTCGGGCTCTCCGGCAGTGGCCGATGGTGCAGAAACATTGTCGTGGTTGCGCCGTACATCATTTTCTCTCCTCATCTCCGCCGTGGGTGGCCAAAAGAATCGACTCTCGTCGACTATCTTTAGCTACCGTTCATTCTGCTCGTGGCGAAGTCCAATATGGATCAAAAGGACTCAGATTACCCAAAGCTCCAGCACCAGCCAAATCCCAAGAAGATGAGAGCGTCGATGATGATTCGGATGCTAGAAAGAGCCGCAACCAGCTTAAACGGGAAGCTCGACGAGCCGTCCAATGGGGCATGGATCTTGCGGCCTTCTCCACTCCTCAAATTAAACGCATCCTTAGGTATTTATATCCTCTCGCAACATTATCCGACGACGCCTAATTCTTGTATAATGTTTATCACTGATACTCTTCGGGAAAGGGTTAAGGAGTTTGAGTTTCCATACCGACACTAATGTGAGATTTCTATTACTTTTCTGGGGTGATAGAGTGACGTCTCTCGAGAAAGATATCTTCGACGCAATAATGCTTGTGAAGGTGCTCCTTATTAGTGCTTTGATTTTGTGGTTTGTCGCATCGTTTTGCAATATGGCAACGCGTGAATTTTTTATGTAGCTAAGGTGTTTGATGTTTTGCCGCAGAGGTTTGGGAGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAATTACATTGGTAAGCTAATATTTGATATTTTCTTCCGTTGTGAAACGTACATTTGTGCAGTTCGTGTTAATAGTTAGAACCCCATTTGATAATGTAAGAAGATTGAATCTGAAGAACCCTGACATGATGGTTTATCAGGATCAATTGATGAAAATAATGTAAACTTGCTGTCTCTATGGTCCTTGAATATATATGAGCACGAGATCCGGTAAAATGTGAGGCACGAGGTCTTCCTTCGAAAAAACTACTCCTTATAAATCTAAAATGGAAGATTTCAATCTGTCAAATGGTGAATAGAATGGCTTAAATGGTCTGGGCCTTGAATTGATCATCACCCTGGTTAGAGTATTGTGATCAGCTTTGTTTATAGGTTGCCACTTATCCCACTAAACTGCATTCCGGTGACTACTCAACAAGTTGATGTTACTAATTCTAGGAATCTATAGGCTCAATTTAGTTGATTACTTTTATTAAGCTCACTTAATGAAATTGTGATAATACAATTTGCTAAGTAAAACGCTGAAATGCAACAATTTTGAACTGAGCTTACCATTGAATTTTCACCGGTTCCAGACTACAACCTAGGTCTCTAAATATATATTTATTATAATTCCTGAAGCGCTTTACACATAACCCTGGTCTTAAAAAATCCAACCAAAACAGAACATCATCTGTTTTTCTATGGACGTCGTACTTTATTTTTCAAAACATTTGTATTGTGTGGCTCTGATTTTTTTTTTTTTTTTTTTTTAGGCCGTGGCTCTGATAAGTTGATTTAATTTGTATAGGAAAACTGCTGAGGGAAGCACAACCTGACACTGAATTAATGGACATTTTAATACAGGCCACAAAAGCCGGTGACCATAAGATACTACAGAAATTGTGTGCTTCAGTAGATGATGAAGTTTCAAATTCTGTATACGAGGAGGAGGAAGAAGAGGTATTTTATGATATGATTCGTTTACGTATTACCCCTCCGTATTTGGTATAATAGACAAATTGCTCTAAATTTTGTTGAAGTTTATTTGGCTTGCTGACCAATTGGGAAAAATATACTACAGCGTGGAACAAATAAAATGAACAACTTCACTTAGAAAACGAGAGCCAGCGAGTATAACAACTCTTGCTTTTTTAGGAGTTAACTTTGATTGATATAGAGGAACAGCAACTTGTCAAAACTTCAAAAACCTATGTTTCTGGACTCAATTGAAGGGGCTCTATTGTTCATAAATATCATTTTATACCATGTTGTTTATTTTAATTTTATAACTTAATTTTGTTTTTCCTTTCCGTCCGTATCTTAACTAGGGTCCGCATGTGGACATCGCTACAAGATGGCTTGACGGGCTAGTCAGTAAGGACAACAACATTACAAATGAAATTTATTCACTACAAACTGTTGAATTTGACCGTCAGGTACTTGACTTACTGCATCCTGTTTTTTTTTTTTCCCTTTTCATTGCAATGGTATGTTACCCTTATCCATTTTTCTTATACCACCCTCCTGCTTCTTTAAGAAGTTGTTGACTCATTTTGGCTCATGCCACTTAAATCTTATGTAAGATTATTATTATTTCTTTAATAAGAAACATTTAGTTGATGAGATGAAATGACAAAAAGAGGTGAAACCCCCTTCCAACGATTCTAGGAAATTACAAAAAAAATATACTTCCGTTGGGCTATAACTGTAAGTGTAGATAAGCTACAGCTATGAAACTATGGCGAAAAAGGTTATCTAAGATTGTTGCAATCAAAAGAGAGCCTCTCTAGGCTTCTACCTGAAAATGTTCTAATTAGGCTTGTTTTGTCTACTTGATTAGCCAAGAAAAATAGGACAATGATTTTCTGAATGATTACAACTTTGAAGCTTTTATTAGCTAGTTTAGATGTTATATTCCCCTGCAACTCTGACCTACATGACAATCATTTATAGGAGCTGCGGAGACTTGTTCGAAAAGTTCATACGATTGAAGAACGCAAGGCAGCAATTGAAGAGAATGAGGATGAAGTCAATACCGCGATAACAAACGCCACAAAGCCCCTTGCTCGTTTCCTTTGTAGAATGGCGAAACAGTTGCCCTCCTATGAACTCTAGTATGAATTTATTTGTTCACTATATTGATTTCGCATTCTTTCCAATGCAAACTTATTTGCTCTTTACAACCTCGCAGACTGTTTATTCACACTTTCAACACCTACTTTTCCTTTTCTTTCACACCTCCTTTAAAGTCTACTTGACCCCAAGTAGTTGAGCGCTAACCAGGATTTTAGAGATTGTTTATCAGGTTCTGGGTGTAATATGTTTTTCAAGGTCTTTCTTGCTTCTTTTGCTGCCCATATCCAATACTCTTTTCTCTAGGAATGCCTTTTAATAGTGGGGGAATTTATTATAGACCCGAGATCTGAAAGGGAGAGGTAGAAAAGATGGACATTGTGGAAGAGTTTTGACCTATTGATGATGCAACTAGACGCATAATTGAAATTGTTAGGAAGCCAACAAAAGCTAGGTTACATTTAGGGATAGTCATGATATTGTTTAAAATTCTGGTCTTGGCCTTTTCTGCATCAGCTTATGCCAATTGAAGGTTGCATAAACCAACAAAACTTTTGCTTTAAAATTGCAATTTCTTCTCTATGATGGGAGATTAGTATCGAATGACGCCATTTCTTTTAGTGGTATTTACACGCTGATTGAGCAATGATTGGAGGTGACGCAGGCATCCACATCAATCCCTCAAACCAAAAGTCGAGACCTCTTTTTGAATTTTTCATGTCATTAACTTCAATATTTTCGGCCAAAACTTAATATTTGCTGAAATTTAGCCCATTGACATTGACAGGACGATCGCCAACCTGCTCTTAACTCAAATTGGACAAAAGGGTAAGCGAAATTTGAACAGAATTACTCTTGGACATCGTGTGAATTGTGGGAAGCAAATAAAACCCACCAAAATGGCGGTCCAGTTCACCGAAGAAGACAACTTCCTGATAAATAGGGAATTGTGCTTGAGTTGTCTTGTATGTTAATTTGTTCGTTTTTGTGATATTAATTTATTAGATGATTAATTGAATGTCGAATTACAAAAAATATATCCTTGAATTTTGTTGCTGGTTAAAAAAATATTCACATTCTTTCAAA
mRNA sequence
GCCGAACCCGATTTCATCAATTTATGGTGCCGGTACGCCGCCGTGAACTGATGGACATCAACCATGAGTCATATGGTTCGGGCTCTCCGGCAGTGGCCGATGGTGCAGAAACATTGTCGTGGTTGCGCCGTACATCATTTTCTCTCCTCATCTCCGCCGTGGGTGGCCAAAAGAATCGACTCTCGTCGACTATCTTTAGCTACCGTTCATTCTGCTCGTGGCGAAGTCCAATATGGATCAAAAGGACTCAGATTACCCAAAGCTCCAGCACCAGCCAAATCCCAAGAAGATGAGAGCGTCGATGATGATTCGGATGCTAGAAAGAGCCGCAACCAGCTTAAACGGGAAGCTCGACGAGCCGTCCAATGGGGCATGGATCTTGCGGCCTTCTCCACTCCTCAAATTAAACGCATCCTTAGAGTGACGTCTCTCGAGAAAGATATCTTCGACGCAATAATGCTTGTGAAGAGGTTTGGGAGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAATTACATTGGATCAATTGATGAAAATAATGTAAACTTGCTGTCTCTATGGTCCTTGAATATATATGAGCACGAGATCCGGTTGCCACTTATCCCACTAAACTGCATTCCGGTGACTACTCAACAAGTTGATGCCACAAAAGCCGGTGACCATAAGATACTACAGAAATTGTGTGCTTCAGTAGATGATGAAGTTTCAAATTCTGTATACGAGGAGGAGGAAGAAGAGGGTCCGCATGTGGACATCGCTACAAGATGGCTTGACGGGCTAGTCAGTAAGGACAACAACATTACAAATGAAATTTATTCACTACAAACTGTTGAATTTGACCGTCAGGAGCTGCGGAGACTTGTTCGAAAAGTTCATACGATTGAAGAACGCAAGGCAGCAATTGAAGAGAATGAGGATGAAGTCAATACCGCGATAACAAACGCCACAAAGCCCCTTGCTCGTTTCCTTTGTAGAATGGCGAAACAGTTGCCCTCCTATGAACTCTAGACGATCGCCAACCTGCTCTTAACTCAAATTGGACAAAAGGGTAAGCGAAATTTGAACAGAATTACTCTTGGACATCGTGTGAATTGTGGGAAGCAAATAAAACCCACCAAAATGGCGGTCCAGTTCACCGAAGAAGACAACTTCCTGATAAATAGGGAATTGTGCTTGAGTTGTCTTGTATGTTAATTTGTTCGTTTTTGTGATATTAATTTATTAGATGATTAATTGAATGTCGAATTACAAAAAATATATCCTTGAATTTTGTTGCTGGTTAAAAAAATATTCACATTCTTTCAAA
Coding sequence (CDS)
ATGAGTCATATGGTTCGGGCTCTCCGGCAGTGGCCGATGGTGCAGAAACATTGTCGTGGTTGCGCCGTACATCATTTTCTCTCCTCATCTCCGCCGTGGGTGGCCAAAAGAATCGACTCTCGTCGACTATCTTTAGCTACCGTTCATTCTGCTCGTGGCGAAGTCCAATATGGATCAAAAGGACTCAGATTACCCAAAGCTCCAGCACCAGCCAAATCCCAAGAAGATGAGAGCGTCGATGATGATTCGGATGCTAGAAAGAGCCGCAACCAGCTTAAACGGGAAGCTCGACGAGCCGTCCAATGGGGCATGGATCTTGCGGCCTTCTCCACTCCTCAAATTAAACGCATCCTTAGAGTGACGTCTCTCGAGAAAGATATCTTCGACGCAATAATGCTTGTGAAGAGGTTTGGGAGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAATTACATTGGATCAATTGATGAAAATAATGTAAACTTGCTGTCTCTATGGTCCTTGAATATATATGAGCACGAGATCCGGTTGCCACTTATCCCACTAAACTGCATTCCGGTGACTACTCAACAAGTTGATGCCACAAAAGCCGGTGACCATAAGATACTACAGAAATTGTGTGCTTCAGTAGATGATGAAGTTTCAAATTCTGTATACGAGGAGGAGGAAGAAGAGGGTCCGCATGTGGACATCGCTACAAGATGGCTTGACGGGCTAGTCAGTAAGGACAACAACATTACAAATGAAATTTATTCACTACAAACTGTTGAATTTGACCGTCAGGAGCTGCGGAGACTTGTTCGAAAAGTTCATACGATTGAAGAACGCAAGGCAGCAATTGAAGAGAATGAGGATGAAGTCAATACCGCGATAACAAACGCCACAAAGCCCCTTGCTCGTTTCCTTTGTAGAATGGCGAAACAGTTGCCCTCCTATGAACTCTAG
Protein sequence
MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVYEEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
Homology
BLAST of Lsi04G017530 vs. ExPASy TrEMBL
Match:
A0A0A0LH61 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G872730 PE=4 SV=1)
HSP 1 Score: 444.5 bits (1142), Expect = 3.7e-121
Identity = 245/318 (77.04%), Postives = 259/318 (81.45%), Query Frame = 0
Query: 1 MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGS 60
MSHMVRALRQWP MVQKHC GCAVHHFL SSPPWVAKRI SRRLSLATVHSAR EVQY S
Sbjct: 1 MSHMVRALRQWPSMVQKHCCGCAVHHFLFSSPPWVAKRIYSRRLSLATVHSARREVQYES 60
Query: 61 KGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRIL 120
KGLRL KAPA AKSQE ES+ DDD D RKSRNQLKREARRAVQWGMDLA FST QIKRIL
Sbjct: 61 KGLRLSKAPALAKSQEHESINDDDLDVRKSRNQLKREARRAVQWGMDLATFSTSQIKRIL 120
Query: 121 RVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLP 180
VTSLEKD+FDAIMLVKR G+DVREGKRRQFNYIG + L + + L
Sbjct: 121 SVTSLEKDVFDAIMLVKRLGNDVREGKRRQFNYIGKL------------LRDAQPDTELM 180
Query: 181 LIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVY--EEEEEEGPHVDIATRWL 240
+ + +TKAGDHKILQ+LCASVDDEVS VY EEEEEEGPHVDIATRWL
Sbjct: 181 DV----------LIQSTKAGDHKILQRLCASVDDEVSKYVYEEEEEEEEGPHVDIATRWL 240
Query: 241 DGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATK 300
DGL+SK+N IT EIYSLQTVEFDRQELRRLVRKVH +EERKAAIEEN DEVNTA+TNA K
Sbjct: 241 DGLISKNNIITKEIYSLQTVEFDRQELRRLVRKVHMVEERKAAIEENGDEVNTAVTNARK 296
Query: 301 PLARFLCRMAKQLPSYEL 315
PLARFLCRMAKQLPS EL
Sbjct: 301 PLARFLCRMAKQLPSDEL 296
BLAST of Lsi04G017530 vs. ExPASy TrEMBL
Match:
A0A6J1FF35 (uncharacterized protein LOC111443507 OS=Cucurbita moschata OX=3662 GN=LOC111443507 PE=4 SV=1)
HSP 1 Score: 438.0 bits (1125), Expect = 3.4e-119
Identity = 237/317 (74.76%), Postives = 251/317 (79.18%), Query Frame = 0
Query: 1 MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSK 60
M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSK
Sbjct: 1 MGHMVRALRHWPMLQNHCFGCTVHHFL-SSPPWVAKRIDSRRLTLATVHSARREVQYGSK 60
Query: 61 GLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
GLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV
Sbjct: 61 GLRLSKAQAPAEFQEDESVDEDLDVRKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
Query: 121 TSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLI 180
SLEKD+FDAIMLVKR G DVREGKRRQF+YIG + L + E+ LI
Sbjct: 121 ASLEKDVFDAIMLVKRLGRDVREGKRRQFSYIGKL------------LRDVQPELMDSLI 180
Query: 181 PLNCIPVTTQQVDATKAGDHKILQKLCASV---DDEVSNSVYEEEEEEGPHVDIATRWLD 240
ATK GDH +LQ L SV DDE ++S YEEEEEEGPHVDI TRWLD
Sbjct: 181 ------------QATKDGDHSMLQTLSGSVAVDDDEDTDSEYEEEEEEGPHVDIVTRWLD 240
Query: 241 GLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKP 300
GLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KP
Sbjct: 241 GLVSKDKNVTNEIYSLQTVEFDRQELRRLVRKVHMVEERKAATEENEDEVNVAITTAKKP 292
Query: 301 LARFLCRMAKQLPSYEL 315
LARFLCRMAKQLP YEL
Sbjct: 301 LARFLCRMAKQLPPYEL 292
BLAST of Lsi04G017530 vs. ExPASy TrEMBL
Match:
A0A1S4E5J2 (UPF0307 protein plu4061 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503870 PE=4 SV=1)
HSP 1 Score: 437.6 bits (1124), Expect = 4.5e-119
Identity = 243/321 (75.70%), Postives = 258/321 (80.37%), Query Frame = 0
Query: 1 MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGS 60
MSHMVRALRQW PM+QKHC GCAVHHFLS SPPWVAKRI SRRLSLATVHSAR EVQY S
Sbjct: 1 MSHMVRALRQWPPMLQKHCCGCAVHHFLSLSPPWVAKRIYSRRLSLATVHSARREVQYES 60
Query: 61 KGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRIL 120
KGLRL KAPA AKSQEDES+ DDDSD RKSRNQLKREARRAVQWGMDLA FST QIKRIL
Sbjct: 61 KGLRLSKAPALAKSQEDESINDDDSDVRKSRNQLKREARRAVQWGMDLATFSTSQIKRIL 120
Query: 121 RVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSI---DENNVNLLSLWSLNIYEHEI 180
VTSLEKD+FDAIMLVKR G+DVREG+RRQFNYIG + + + LL +
Sbjct: 121 SVTSLEKDVFDAIMLVKRLGNDVREGRRRQFNYIGKLLRDAQPDTELLDI---------- 180
Query: 181 RLPLIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVY--EEEEEEGPHVDIAT 240
+ ATKAGDHKILQ+LCASVDDEVS SV+ EEEEEEGPHVD+AT
Sbjct: 181 ---------------LIQATKAGDHKILQRLCASVDDEVSKSVHEEEEEEEEGPHVDVAT 240
Query: 241 RWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITN 300
RW DGL+SKDN IT EIYS QTVEFDRQELRRLVRKVH +EERKAAIEEN DEVN AITN
Sbjct: 241 RWFDGLISKDNIITKEIYS-QTVEFDRQELRRLVRKVHMVEERKAAIEENGDEVNAAITN 295
Query: 301 ATKPLARFLCRMAKQLPSYEL 315
A KPLARFL RMAKQLPS EL
Sbjct: 301 ARKPLARFLYRMAKQLPSDEL 295
BLAST of Lsi04G017530 vs. ExPASy TrEMBL
Match:
A0A1S4E5I7 (UPF0307 protein Asuc_0809 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503870 PE=4 SV=1)
HSP 1 Score: 437.2 bits (1123), Expect = 5.8e-119
Identity = 243/322 (75.47%), Postives = 258/322 (80.12%), Query Frame = 0
Query: 1 MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGS 60
MSHMVRALRQW PM+QKHC GCAVHHFLS SPPWVAKRI SRRLSLATVHSAR EVQY S
Sbjct: 1 MSHMVRALRQWPPMLQKHCCGCAVHHFLSLSPPWVAKRIYSRRLSLATVHSARREVQYES 60
Query: 61 KGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRIL 120
KGLRL KAPA AKSQEDES+ DDDSD RKSRNQLKREARRAVQWGMDLA FST QIKRIL
Sbjct: 61 KGLRLSKAPALAKSQEDESINDDDSDVRKSRNQLKREARRAVQWGMDLATFSTSQIKRIL 120
Query: 121 RVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSI---DENNVNLLSLWSLNIYEHEI 180
VTSLEKD+FDAIMLVKR G+DVREG+RRQFNYIG + + + LL +
Sbjct: 121 SVTSLEKDVFDAIMLVKRLGNDVREGRRRQFNYIGKLLRDAQPDTELLDI---------- 180
Query: 181 RLPLIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVY---EEEEEEGPHVDIA 240
+ ATKAGDHKILQ+LCASVDDEVS SV+ EEEEEEGPHVD+A
Sbjct: 181 ---------------LIQATKAGDHKILQRLCASVDDEVSKSVHEEEEEEEEEGPHVDVA 240
Query: 241 TRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAIT 300
TRW DGL+SKDN IT EIYS QTVEFDRQELRRLVRKVH +EERKAAIEEN DEVN AIT
Sbjct: 241 TRWFDGLISKDNIITKEIYS-QTVEFDRQELRRLVRKVHMVEERKAAIEENGDEVNAAIT 296
Query: 301 NATKPLARFLCRMAKQLPSYEL 315
NA KPLARFL RMAKQLPS EL
Sbjct: 301 NARKPLARFLYRMAKQLPSDEL 296
BLAST of Lsi04G017530 vs. ExPASy TrEMBL
Match:
A0A6J1IM05 (uncharacterized protein LOC111476812 OS=Cucurbita maxima OX=3661 GN=LOC111476812 PE=4 SV=1)
HSP 1 Score: 434.1 bits (1115), Expect = 5.0e-118
Identity = 236/317 (74.45%), Postives = 250/317 (78.86%), Query Frame = 0
Query: 1 MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSK 60
M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDS RL+LATVHSAR EVQ+GSK
Sbjct: 1 MGHMVRALRHWPMLQNHCFGCTVHHFL-SSPPWVAKRIDSLRLTLATVHSARREVQHGSK 60
Query: 61 GLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
GLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV
Sbjct: 61 GLRLSKAQAPAEFQEDESVDEDLDVRKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
Query: 121 TSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLI 180
SLEKD+FDAIMLVKR G DVREGKRRQF+YIG + L + E+ LI
Sbjct: 121 ASLEKDVFDAIMLVKRLGRDVREGKRRQFSYIGKL------------LRDVQPELMDSLI 180
Query: 181 PLNCIPVTTQQVDATKAGDHKILQKLCASV---DDEVSNSVYEEEEEEGPHVDIATRWLD 240
ATK GDH LQ L SV DDE ++S YEEEEEEGPHVDIATRWLD
Sbjct: 181 ------------QATKDGDHSTLQTLSGSVAVDDDEDTDSEYEEEEEEGPHVDIATRWLD 240
Query: 241 GLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKP 300
GLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KP
Sbjct: 241 GLVSKDKNVTNEIYSLQTVEFDRQELRRLVRKVHMVEERKAATEENEDEVNVAITTAKKP 292
Query: 301 LARFLCRMAKQLPSYEL 315
LARFLCRMAKQLP YEL
Sbjct: 301 LARFLCRMAKQLPPYEL 292
BLAST of Lsi04G017530 vs. NCBI nr
Match:
XP_038897014.1 (UPF0307 protein ECA0281 isoform X1 [Benincasa hispida])
HSP 1 Score: 447.2 bits (1149), Expect = 1.2e-121
Identity = 243/314 (77.39%), Postives = 255/314 (81.21%), Query Frame = 0
Query: 1 MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSK 60
MSHMVRALRQWPM+QKH GCAV H S PWV KR DSRRLSLATVHSAR EVQ SK
Sbjct: 1 MSHMVRALRQWPMLQKHYCGCAVRHLFPSCLPWVPKRTDSRRLSLATVHSARREVQ-ESK 60
Query: 61 GLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
GLRLPKAPAPAKSQEDESV+DDSD RKSRNQLKREARRAVQWGMDLAAFS PQIKRIL V
Sbjct: 61 GLRLPKAPAPAKSQEDESVNDDSDVRKSRNQLKREARRAVQWGMDLAAFSIPQIKRILSV 120
Query: 121 TSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLI 180
TSLEKDIFDAIMLVKR GSDVREGKRRQFNYIG + L + + L I
Sbjct: 121 TSLEKDIFDAIMLVKRLGSDVREGKRRQFNYIGKL------------LRDAQPDTELMDI 180
Query: 181 PLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVYEEEEEEGPHVDIATRWLDGLV 240
+ ATK GDHKILQKLCASVDD+VS SVYEEEEEEGPHV+IATRWLDGL+
Sbjct: 181 ----------LIQATKVGDHKILQKLCASVDDQVSKSVYEEEEEEGPHVEIATRWLDGLI 240
Query: 241 SKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLAR 300
SKDNNITNEIYSLQTVEFDRQELRRLVRKV IE++KAAIEEN DEVN ITNA KPLA
Sbjct: 241 SKDNNITNEIYSLQTVEFDRQELRRLVRKVRMIEKQKAAIEENGDEVNMTITNARKPLAH 291
Query: 301 FLCRMAKQLPSYEL 315
FLCR+AKQLPSYEL
Sbjct: 301 FLCRIAKQLPSYEL 291
BLAST of Lsi04G017530 vs. NCBI nr
Match:
XP_004136378.1 (uncharacterized protein LOC101214378 [Cucumis sativus] >KGN60042.1 hypothetical protein Csa_001476 [Cucumis sativus])
HSP 1 Score: 444.5 bits (1142), Expect = 7.6e-121
Identity = 245/318 (77.04%), Postives = 259/318 (81.45%), Query Frame = 0
Query: 1 MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGS 60
MSHMVRALRQWP MVQKHC GCAVHHFL SSPPWVAKRI SRRLSLATVHSAR EVQY S
Sbjct: 1 MSHMVRALRQWPSMVQKHCCGCAVHHFLFSSPPWVAKRIYSRRLSLATVHSARREVQYES 60
Query: 61 KGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRIL 120
KGLRL KAPA AKSQE ES+ DDD D RKSRNQLKREARRAVQWGMDLA FST QIKRIL
Sbjct: 61 KGLRLSKAPALAKSQEHESINDDDLDVRKSRNQLKREARRAVQWGMDLATFSTSQIKRIL 120
Query: 121 RVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLP 180
VTSLEKD+FDAIMLVKR G+DVREGKRRQFNYIG + L + + L
Sbjct: 121 SVTSLEKDVFDAIMLVKRLGNDVREGKRRQFNYIGKL------------LRDAQPDTELM 180
Query: 181 LIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVY--EEEEEEGPHVDIATRWL 240
+ + +TKAGDHKILQ+LCASVDDEVS VY EEEEEEGPHVDIATRWL
Sbjct: 181 DV----------LIQSTKAGDHKILQRLCASVDDEVSKYVYEEEEEEEEGPHVDIATRWL 240
Query: 241 DGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATK 300
DGL+SK+N IT EIYSLQTVEFDRQELRRLVRKVH +EERKAAIEEN DEVNTA+TNA K
Sbjct: 241 DGLISKNNIITKEIYSLQTVEFDRQELRRLVRKVHMVEERKAAIEENGDEVNTAVTNARK 296
Query: 301 PLARFLCRMAKQLPSYEL 315
PLARFLCRMAKQLPS EL
Sbjct: 301 PLARFLCRMAKQLPSDEL 296
BLAST of Lsi04G017530 vs. NCBI nr
Match:
XP_022937103.1 (uncharacterized protein LOC111443507 [Cucurbita moschata])
HSP 1 Score: 438.0 bits (1125), Expect = 7.1e-119
Identity = 237/317 (74.76%), Postives = 251/317 (79.18%), Query Frame = 0
Query: 1 MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSK 60
M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSK
Sbjct: 1 MGHMVRALRHWPMLQNHCFGCTVHHFL-SSPPWVAKRIDSRRLTLATVHSARREVQYGSK 60
Query: 61 GLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
GLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV
Sbjct: 61 GLRLSKAQAPAEFQEDESVDEDLDVRKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
Query: 121 TSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLI 180
SLEKD+FDAIMLVKR G DVREGKRRQF+YIG + L + E+ LI
Sbjct: 121 ASLEKDVFDAIMLVKRLGRDVREGKRRQFSYIGKL------------LRDVQPELMDSLI 180
Query: 181 PLNCIPVTTQQVDATKAGDHKILQKLCASV---DDEVSNSVYEEEEEEGPHVDIATRWLD 240
ATK GDH +LQ L SV DDE ++S YEEEEEEGPHVDI TRWLD
Sbjct: 181 ------------QATKDGDHSMLQTLSGSVAVDDDEDTDSEYEEEEEEGPHVDIVTRWLD 240
Query: 241 GLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKP 300
GLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KP
Sbjct: 241 GLVSKDKNVTNEIYSLQTVEFDRQELRRLVRKVHMVEERKAATEENEDEVNVAITTAKKP 292
Query: 301 LARFLCRMAKQLPSYEL 315
LARFLCRMAKQLP YEL
Sbjct: 301 LARFLCRMAKQLPPYEL 292
BLAST of Lsi04G017530 vs. NCBI nr
Match:
XP_016903494.1 (PREDICTED: UPF0307 protein plu4061 isoform X2 [Cucumis melo])
HSP 1 Score: 437.6 bits (1124), Expect = 9.2e-119
Identity = 243/321 (75.70%), Postives = 258/321 (80.37%), Query Frame = 0
Query: 1 MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGS 60
MSHMVRALRQW PM+QKHC GCAVHHFLS SPPWVAKRI SRRLSLATVHSAR EVQY S
Sbjct: 1 MSHMVRALRQWPPMLQKHCCGCAVHHFLSLSPPWVAKRIYSRRLSLATVHSARREVQYES 60
Query: 61 KGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRIL 120
KGLRL KAPA AKSQEDES+ DDDSD RKSRNQLKREARRAVQWGMDLA FST QIKRIL
Sbjct: 61 KGLRLSKAPALAKSQEDESINDDDSDVRKSRNQLKREARRAVQWGMDLATFSTSQIKRIL 120
Query: 121 RVTSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSI---DENNVNLLSLWSLNIYEHEI 180
VTSLEKD+FDAIMLVKR G+DVREG+RRQFNYIG + + + LL +
Sbjct: 121 SVTSLEKDVFDAIMLVKRLGNDVREGRRRQFNYIGKLLRDAQPDTELLDI---------- 180
Query: 181 RLPLIPLNCIPVTTQQVDATKAGDHKILQKLCASVDDEVSNSVY--EEEEEEGPHVDIAT 240
+ ATKAGDHKILQ+LCASVDDEVS SV+ EEEEEEGPHVD+AT
Sbjct: 181 ---------------LIQATKAGDHKILQRLCASVDDEVSKSVHEEEEEEEEGPHVDVAT 240
Query: 241 RWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITN 300
RW DGL+SKDN IT EIYS QTVEFDRQELRRLVRKVH +EERKAAIEEN DEVN AITN
Sbjct: 241 RWFDGLISKDNIITKEIYS-QTVEFDRQELRRLVRKVHMVEERKAAIEENGDEVNAAITN 295
Query: 301 ATKPLARFLCRMAKQLPSYEL 315
A KPLARFL RMAKQLPS EL
Sbjct: 301 ARKPLARFLYRMAKQLPSDEL 295
BLAST of Lsi04G017530 vs. NCBI nr
Match:
XP_023535041.1 (uncharacterized protein LOC111796585 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 437.6 bits (1124), Expect = 9.2e-119
Identity = 237/317 (74.76%), Postives = 252/317 (79.50%), Query Frame = 0
Query: 1 MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSK 60
M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSK
Sbjct: 1 MGHMVRALRHWPMLQNHCFGCTVHHFL-SSPPWVAKRIDSRRLTLATVHSARREVQYGSK 60
Query: 61 GLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
GLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV
Sbjct: 61 GLRLSKAQAPAEFQEDESVDEDLDVRKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRV 120
Query: 121 TSLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLI 180
SLEKD+FDAIMLVKR G DVREGKRRQF+YIG + L + E+ LI
Sbjct: 121 ASLEKDVFDAIMLVKRLGRDVREGKRRQFSYIGKL------------LRDVQPELMDSLI 180
Query: 181 PLNCIPVTTQQVDATKAGDHKILQKLCASV---DDEVSNSVYEEEEEEGPHVDIATRWLD 240
ATK GDH +LQ L SV DDE ++S YEEEEE+GPHVDIATRWLD
Sbjct: 181 ------------QATKDGDHTMLQTLSGSVAVDDDEDTDSEYEEEEEKGPHVDIATRWLD 240
Query: 241 GLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKP 300
GLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KP
Sbjct: 241 GLVSKDKNVTNEIYSLQTVEFDRQELRRLVRKVHMVEERKAATEENEDEVNVAITTARKP 292
Query: 301 LARFLCRMAKQLPSYEL 315
LARFLCRMAKQLP YEL
Sbjct: 301 LARFLCRMAKQLPPYEL 292
BLAST of Lsi04G017530 vs. TAIR 10
Match:
AT4G24175.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0307 (InterPro:IPR006839); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 192.2 bits (487), Expect = 6.3e-49
Identity = 117/262 (44.66%), Postives = 157/262 (59.92%), Query Frame = 0
Query: 65 PKAPAPAKSQEDESVDDD---SDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRVT 124
P+A P +E D+D SD+ +SRNQ KR+ARRAV+WGM+LA+FS Q+K+IL+
Sbjct: 62 PEALKPTVIVAEEDGDNDGYESDSLRSRNQRKRDARRAVKWGMELASFSGDQVKQILKAA 121
Query: 125 SLEKDIFDAIMLVKRFGSDVREGKRRQFNYIGSIDENNVNLLSLWSLNIYEHEIRLPLIP 184
SL ++++DA+ML KR GSDVREGKRR FNYIG + L E ++ LI
Sbjct: 122 SLGEEVYDALMLAKRLGSDVREGKRRHFNYIGKL------------LREVEPDLMDTLI- 181
Query: 185 LNCIPVTTQQVDATKAGDHKILQKLCASV-----------DDEVSNSVYEEEEEEGPHVD 244
+ATK GDH LQ L +S DD+ +EEE +
Sbjct: 182 -----------NATKQGDHSTLQTLISSAKDVADDVGDSYDDDTETESEDEEEGSDEYTA 241
Query: 245 IATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTI-EERKAAIEENEDEVNT 304
+A RW DGL+S++ +T E+YSLQ+V+FDRQELR+LVRKV + E+RK EE + EV
Sbjct: 242 MAARWFDGLISQNVELTKEVYSLQSVDFDRQELRKLVRKVQLVHEQRKGTTEEKQKEVEA 299
Query: 305 AITNATKPLARFLCRMAKQLPS 312
A+ A K L +FLC MAKQ+ S
Sbjct: 302 ALVTAEKSLNQFLCSMAKQVHS 299
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0LH61 | 3.7e-121 | 77.04 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G872730 PE=4 SV=1 | [more] |
A0A6J1FF35 | 3.4e-119 | 74.76 | uncharacterized protein LOC111443507 OS=Cucurbita moschata OX=3662 GN=LOC1114435... | [more] |
A0A1S4E5J2 | 4.5e-119 | 75.70 | UPF0307 protein plu4061 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503870 PE=4 ... | [more] |
A0A1S4E5I7 | 5.8e-119 | 75.47 | UPF0307 protein Asuc_0809 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503870 PE=... | [more] |
A0A6J1IM05 | 5.0e-118 | 74.45 | uncharacterized protein LOC111476812 OS=Cucurbita maxima OX=3661 GN=LOC111476812... | [more] |
Match Name | E-value | Identity | Description | |
XP_038897014.1 | 1.2e-121 | 77.39 | UPF0307 protein ECA0281 isoform X1 [Benincasa hispida] | [more] |
XP_004136378.1 | 7.6e-121 | 77.04 | uncharacterized protein LOC101214378 [Cucumis sativus] >KGN60042.1 hypothetical ... | [more] |
XP_022937103.1 | 7.1e-119 | 74.76 | uncharacterized protein LOC111443507 [Cucurbita moschata] | [more] |
XP_016903494.1 | 9.2e-119 | 75.70 | PREDICTED: UPF0307 protein plu4061 isoform X2 [Cucumis melo] | [more] |
XP_023535041.1 | 9.2e-119 | 74.76 | uncharacterized protein LOC111796585 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
AT4G24175.1 | 6.3e-49 | 44.66 | unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0... | [more] |