Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide
TTTAGATCCATGGTCGGATGCTTTGGCGCAGAAGCCCTAATTAACCATCGTCGGTGCCCGTTTCGATTAGAGAATGAACGCAACGTTGAATTCTGGAATTGTTTTCTCTGAAGGTACAGACAGAGATGTTTTTTGTGTTTTCTGATCTTTCATGTGATTCAATTTTGTTGAGGTTCTTTGGTGATGATGTTTCTTGCAAGCATAGCCGACGGAAGTGATTCGGATACGAATTCTGCTGAAGGATCAGATTACTACGAGCCGATCTCGGCCATTGATGGTGAAGAATCCGATGAAGCTGGATCAGAGGACGAAACCTACAGCTCCGATCCTCATTTCTACCGTATACCTAACGGATGCCGTTTAGAGAATGCAATTTCGTCTCTTAGCTTGAACGACGATGTGGAGAGAAGGTGTAGTGATGACGAAGAGGAGGAGAGGATGAGAGAGGCTTCTGATTCGGCGATAAGAATGGCGTTTAGAGAGGATGAGAGTCGGCGGAATGCTCCGCTGTCGCCGGAGAATGCAACGAGGATCATGGAGGCCATGCGCGGCATCTCGTTTGGCGGCTCTGCTCCAGATTGGACTCGAATTGTTTCCGAGGATCGTTGGATTGATCAACTTCGAAGGCTTAGGCAAATGCCTAGCGTTTCCAATAATTACGGGAACTGA
mRNA sequence
TTTAGATCCATGGTCGGATGCTTTGGCGCAGAAGCCCTAATTAACCATCGTCGGTGCCCGTTTCGATTAGAGAATGAACGCAACGTTGAATTCTGGAATTGTTTTCTCTGAAGCCGACGGAAGTGATTCGGATACGAATTCTGCTGAAGGATCAGATTACTACGAGCCGATCTCGGCCATTGATGGTGAAGAATCCGATGAAGCTGGATCAGAGGACGAAACCTACAGCTCCGATCCTCATTTCTACCGTATACCTAACGGATGCCGTTTAGAGAATGCAATTTCGTCTCTTAGCTTGAACGACGATGTGGAGAGAAGGTGTAGTGATGACGAAGAGGAGGAGAGGATGAGAGAGGCTTCTGATTCGGCGATAAGAATGGCGTTTAGAGAGGATGAGAGTCGGCGGAATGCTCCGCTGTCGCCGGAGAATGCAACGAGGATCATGGAGGCCATGCGCGGCATCTCGTTTGGCGGCTCTGCTCCAGATTGGACTCGAATTGTTTCCGAGGATCGTTGGATTGATCAACTTCGAAGGCTTAGGCAAATGCCTAGCGTTTCCAATAATTACGGGAACTGA
Coding sequence (CDS)
ATGAACGCAACGTTGAATTCTGGAATTGTTTTCTCTGAAGCCGACGGAAGTGATTCGGATACGAATTCTGCTGAAGGATCAGATTACTACGAGCCGATCTCGGCCATTGATGGTGAAGAATCCGATGAAGCTGGATCAGAGGACGAAACCTACAGCTCCGATCCTCATTTCTACCGTATACCTAACGGATGCCGTTTAGAGAATGCAATTTCGTCTCTTAGCTTGAACGACGATGTGGAGAGAAGGTGTAGTGATGACGAAGAGGAGGAGAGGATGAGAGAGGCTTCTGATTCGGCGATAAGAATGGCGTTTAGAGAGGATGAGAGTCGGCGGAATGCTCCGCTGTCGCCGGAGAATGCAACGAGGATCATGGAGGCCATGCGCGGCATCTCGTTTGGCGGCTCTGCTCCAGATTGGACTCGAATTGTTTCCGAGGATCGTTGGATTGATCAACTTCGAAGGCTTAGGCAAATGCCTAGCGTTTCCAATAATTACGGGAACTGA
Protein sequence
MNATLNSGIVFSEADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFYRIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPENATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN
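The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0. The names `translate`, `cds_start`, and `cds_end` are illustrative and not part of the database record. Only the first 30 and last 18 nucleotides of the CDS are used, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Excerpts copied from the CDS above (hypothetical variable names).
cds_start = "ATGAACGCAACGTTGAATTCTGGAATTGTT"  # first 30 nt
cds_end = "AATAATTACGGGAACTGA"                # last 18 nt, including the TGA stop

print(translate(cds_start))  # MNATLNSGIV
print(translate(cds_end))    # NNYGN
```

The two outputs match the first ten and last five residues of the protein sequence listed above. Passing the full CDS string to `translate` would be the natural next step for a complete check.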
Homology
BLAST of CaUC03G050300 vs. NCBI nr
Match:
XP_038894769.1 (uncharacterized protein LOC120083197 [Benincasa hispida])
HSP 1 Score: 302.4 bits (773), Expect = 2.5e-78
Identity = 156/169 (92.31%), Positives = 163/169 (96.45%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNATLNSGIVFSE ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSE+ETYSSDPHF+
Sbjct: 1 MNATLNSGIVFSEEIADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSENETYSSDPHFH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
+ NGC +ENA+SSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 HLSNGCGVENAVSSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ PSVSNN+GN
Sbjct: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPSVSNNFGN 169
BLAST of CaUC03G050300 vs. NCBI nr
Match:
XP_022970483.1 (uncharacterized protein LOC111469451 [Cucurbita maxima] >XP_022970484.1 uncharacterized protein LOC111469451 [Cucurbita maxima] >XP_022970485.1 uncharacterized protein LOC111469451 [Cucurbita maxima])
HSP 1 Score: 291.6 bits (745), Expect = 4.4e-75
Identity = 149/169 (88.17%), Positives = 160/169 (94.67%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNA+LNSG+VFSE ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGS+DETYSSD H +
Sbjct: 1 MNASLNSGMVFSEDIADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSDDETYSSDTHLH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
+PNGCR+ENA+SSLSLNDDVERRCSD+EEEE MREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 HLPNGCRVENAVSSLSLNDDVERRCSDEEEEESMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+ NN+GN
Sbjct: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTSPNNFGN 169
BLAST of CaUC03G050300 vs. NCBI nr
Match:
XP_022964640.1 (uncharacterized protein LOC111464656 [Cucurbita moschata])
HSP 1 Score: 289.3 bits (739), Expect = 2.2e-74
Identity = 148/169 (87.57%), Positives = 159/169 (94.08%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNA+LNSG+VFSE ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGS+DE YSSD H +
Sbjct: 1 MNASLNSGMVFSEDIADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSDDEPYSSDTHLH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
+PNGCR+ENA+SSLSLNDDVERRCSD+EEEE MREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 HLPNGCRVENAVSSLSLNDDVERRCSDEEEEESMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+ NN+GN
Sbjct: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTSPNNFGN 169
BLAST of CaUC03G050300 vs. NCBI nr
Match:
XP_004153363.1 (uncharacterized protein LOC101214045 [Cucumis sativus] >KGN65844.1 hypothetical protein Csa_023255 [Cucumis sativus])
HSP 1 Score: 287.7 bits (735), Expect = 6.3e-74
Identity = 149/169 (88.17%), Positives = 159/169 (94.08%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNATLNSGIVF+E ADGSDSDTNSAEGSDYYEPISAIDGEESD A SEDETYSSDPHF+
Sbjct: 1 MNATLNSGIVFTEDIADGSDSDTNSAEGSDYYEPISAIDGEESDIAESEDETYSSDPHFH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
++PNGC +ENA+SSL+LNDDVERRCSDDEEEERMR ASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 QLPNGCGVENAVSSLTLNDDVERRCSDDEEEERMRVASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
N TRIMEAMRGISF GSAPDWTRIVSEDRWIDQLRRLRQ P+VSN+ GN
Sbjct: 121 NTTRIMEAMRGISFDGSAPDWTRIVSEDRWIDQLRRLRQTPTVSNSLGN 169
BLAST of CaUC03G050300 vs. NCBI nr
Match:
XP_008457323.1 (PREDICTED: uncharacterized protein LOC103497044 [Cucumis melo])
HSP 1 Score: 286.6 bits (732), Expect = 1.4e-73
Identity = 149/169 (88.17%), Positives = 158/169 (93.49%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNATLNSGIVFSE DGSDSDTNSAEGSDYYEPISAIDGEESD A SEDETYSSD HF+
Sbjct: 1 MNATLNSGIVFSEDIVDGSDSDTNSAEGSDYYEPISAIDGEESDIAESEDETYSSDTHFH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
++PNGC +ENA+SSL+LNDDVERRCSDDEEEERMREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 QLPNGCGVENAVSSLTLNDDVERRCSDDEEEERMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
N TRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+VSN+ N
Sbjct: 121 NTTRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTVSNSIRN 169
BLAST of CaUC03G050300 vs. ExPASy TrEMBL
Match:
A0A6J1I3Z1 (uncharacterized protein LOC111469451 OS=Cucurbita maxima OX=3661 GN=LOC111469451 PE=4 SV=1)
HSP 1 Score: 291.6 bits (745), Expect = 2.1e-75
Identity = 149/169 (88.17%), Positives = 160/169 (94.67%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNA+LNSG+VFSE ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGS+DETYSSD H +
Sbjct: 1 MNASLNSGMVFSEDIADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSDDETYSSDTHLH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
+PNGCR+ENA+SSLSLNDDVERRCSD+EEEE MREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 HLPNGCRVENAVSSLSLNDDVERRCSDEEEEESMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+ NN+GN
Sbjct: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTSPNNFGN 169
BLAST of CaUC03G050300 vs. ExPASy TrEMBL
Match:
A0A6J1HLE1 (uncharacterized protein LOC111464656 OS=Cucurbita moschata OX=3662 GN=LOC111464656 PE=4 SV=1)
HSP 1 Score: 289.3 bits (739), Expect = 1.0e-74
Identity = 148/169 (87.57%), Positives = 159/169 (94.08%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNA+LNSG+VFSE ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGS+DE YSSD H +
Sbjct: 1 MNASLNSGMVFSEDIADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSDDEPYSSDTHLH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
+PNGCR+ENA+SSLSLNDDVERRCSD+EEEE MREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 HLPNGCRVENAVSSLSLNDDVERRCSDEEEEESMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+ NN+GN
Sbjct: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTSPNNFGN 169
BLAST of CaUC03G050300 vs. ExPASy TrEMBL
Match:
A0A0A0LXQ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G533510 PE=4 SV=1)
HSP 1 Score: 287.7 bits (735), Expect = 3.0e-74
Identity = 149/169 (88.17%), Positives = 159/169 (94.08%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNATLNSGIVF+E ADGSDSDTNSAEGSDYYEPISAIDGEESD A SEDETYSSDPHF+
Sbjct: 1 MNATLNSGIVFTEDIADGSDSDTNSAEGSDYYEPISAIDGEESDIAESEDETYSSDPHFH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
++PNGC +ENA+SSL+LNDDVERRCSDDEEEERMR ASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 QLPNGCGVENAVSSLTLNDDVERRCSDDEEEERMRVASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
N TRIMEAMRGISF GSAPDWTRIVSEDRWIDQLRRLRQ P+VSN+ GN
Sbjct: 121 NTTRIMEAMRGISFDGSAPDWTRIVSEDRWIDQLRRLRQTPTVSNSLGN 169
BLAST of CaUC03G050300 vs. ExPASy TrEMBL
Match:
A0A1S3C5C0 (uncharacterized protein LOC103497044 OS=Cucumis melo OX=3656 GN=LOC103497044 PE=4 SV=1)
HSP 1 Score: 286.6 bits (732), Expect = 6.8e-74
Identity = 149/169 (88.17%), Positives = 158/169 (93.49%), Query Frame = 0
Query: 1 MNATLNSGIVFSE--ADGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFY 60
MNATLNSGIVFSE DGSDSDTNSAEGSDYYEPISAIDGEESD A SEDETYSSD HF+
Sbjct: 1 MNATLNSGIVFSEDIVDGSDSDTNSAEGSDYYEPISAIDGEESDIAESEDETYSSDTHFH 60
Query: 61 RIPNGCRLENAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPE 120
++PNGC +ENA+SSL+LNDDVERRCSDDEEEERMREASDSAIRMAFREDE+RRNAPLSPE
Sbjct: 61 QLPNGCGVENAVSSLTLNDDVERRCSDDEEEERMREASDSAIRMAFREDETRRNAPLSPE 120
Query: 121 NATRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
N TRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQ P+VSN+ N
Sbjct: 121 NTTRIMEAMRGISFGGSAPDWTRIVSEDRWIDQLRRLRQTPTVSNSIRN 169
BLAST of CaUC03G050300 vs. ExPASy TrEMBL
Match:
A0A5A7UY79 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00660 PE=4 SV=1)
HSP 1 Score: 268.1 bits (684), Expect = 2.5e-68
Identity = 136/153 (88.89%), Positives = 145/153 (94.77%), Query Frame = 0
Query: 15 DGSDSDTNSAEGSDYYEPISAIDGEESDEAGSEDETYSSDPHFYRIPNGCRLENAISSLS 74
DGSDSDTNSAEGSDYYEPISAIDGEESD A SEDETYSSD HF+++PNGC +ENA+SSL+
Sbjct: 51 DGSDSDTNSAEGSDYYEPISAIDGEESDIAESEDETYSSDTHFHQLPNGCGVENAVSSLT 110
Query: 75 LNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPENATRIMEAMRGISFGG 134
LNDDVERRCSDDEEEERMREASDSAIRMAFREDE+RRNAPLSPEN TRIMEAMRGISFGG
Sbjct: 111 LNDDVERRCSDDEEEERMREASDSAIRMAFREDETRRNAPLSPENTTRIMEAMRGISFGG 170
Query: 135 SAPDWTRIVSEDRWIDQLRRLRQMPSVSNNYGN 168
SAPDWTRIVSEDRWIDQLRRLRQ P+VSN+ N
Sbjct: 171 SAPDWTRIVSEDRWIDQLRRLRQTPTVSNSIRN 203
BLAST of CaUC03G050300 vs. TAIR 10
Match:
AT1G07020.1 (unknown protein; Has 39 Blast hits to 39 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 3; Plants - 28; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 110.9 bits (276), Expect = 9.8e-25
Identity = 78/149 (52.35%), Positives = 94/149 (63.09%), Query Frame = 0
Query: 16 GSDSDTNSAEGS-DYYEPISAIDGEESDEAGSEDETY-------SSDPHFYRIPNGCRLE 75
GSDSD+NS E S DYYEPISA+D S++ E+++Y S+ H IP+ E
Sbjct: 8 GSDSDSNSVEDSQDYYEPISAVDLYNSND--DEEDSYLPIGGDGLSNGH-CMIPDA---E 67
Query: 76 NAISSLSLNDDVERRCSDDEEEERMREASDSAIRMAFREDESRRNAPLSPENATRIMEAM 135
ISS+S+ND+ D EEE E IR AF EDE RR +PL ENA R+MEAM
Sbjct: 68 VGISSISINDNT------DSEEETETETGPE-IRRAFEEDERRRRSPLVEENAVRVMEAM 127
Query: 136 RGISFGGSAPDWTRIVSEDRWIDQLRRLR 157
R ISF G+APDW V+EDRWIDQLRRLR
Sbjct: 128 RAISFPGTAPDWASDVNEDRWIDQLRRLR 143
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038894769.1 | 2.5e-78 | 92.31 | uncharacterized protein LOC120083197 [Benincasa hispida]
XP_022970483.1 | 4.4e-75 | 88.17 | uncharacterized protein LOC111469451 [Cucurbita maxima] >XP_022970484.1 uncharac...
XP_022964640.1 | 2.2e-74 | 87.57 | uncharacterized protein LOC111464656 [Cucurbita moschata]
XP_004153363.1 | 6.3e-74 | 88.17 | uncharacterized protein LOC101214045 [Cucumis sativus] >KGN65844.1 hypothetical ...
XP_008457323.1 | 1.4e-73 | 88.17 | PREDICTED: uncharacterized protein LOC103497044 [Cucumis melo]
Match Name | E-value | Identity (%) | Description
A0A6J1I3Z1 | 2.1e-75 | 88.17 | uncharacterized protein LOC111469451 OS=Cucurbita maxima OX=3661 GN=LOC111469451...
A0A6J1HLE1 | 1.0e-74 | 87.57 | uncharacterized protein LOC111464656 OS=Cucurbita moschata OX=3662 GN=LOC1114646...
A0A0A0LXQ6 | 3.0e-74 | 88.17 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G533510 PE=4 SV=1
A0A1S3C5C0 | 6.8e-74 | 88.17 | uncharacterized protein LOC103497044 OS=Cucumis melo OX=3656 GN=LOC103497044 PE=...
A0A5A7UY79 | 2.5e-68 | 88.89 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
Match Name | E-value | Identity (%) | Description
AT1G07020.1 | 9.8e-25 | 52.35 | unknown protein; Has 39 Blast hits to 39 proteins in 17 species: Archae - 0; Bac...