CmaCh04G012870 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G012870
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Gag-Pol polyprotein/retrotransposon
Location: Cma_Chr04: 6560104 .. 6564768 (+)
RNA-Seq Expression: CmaCh04G012870
Synteny: CmaCh04G012870
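
As a rough illustration of how the Location field can be read, the sketch below parses the location string and derives the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Hypothetical helper: split a location string such as
# "Cma_Chr04: 6560104 .. 6564768 (+)" into its parts.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text: str):
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr04: 6560104 .. 6564768 (+)")
gene_span = span_length(start, end)
print(chrom, strand, gene_span)
```

Under that assumption the gene spans 4665 bp on the forward strand of Cma_Chr04.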
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTCTCTCTCTCTCTCTTTCAAATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGTGGCGGTAAAAGCTTCACTCGATTTTGAAAATTCTGCGACATTTTAGAATCTGAATGCGTGAACATTTTAATTTCTATTATCATCTTCCCGAATTGTTATTCTTCAGGGTTCGAACGAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCGCCTTTTCGCGTCTCAAAGCCGAATAACATTTTTGTTAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGTAAGTGTTTCTCAGTGAATTGGCAAAACAGCAATCTCCTTCACTTTCTGCAGTTTAATTCTTTCGTTCTATGGTGTTTTGTCCACTTGCTTCAGAATCTACGCCCTAATTTTTGGGATGGTTTCTGTGTTGTTTTGCTTATTGATGTCTCTTGCCAATTGAATTGTTCGGTCCGCGGAGCGATAATAATGTTACAAACTTCTAATTTCCCTGATTTTGCACCCTCTGAGTACCATCATGCCTTTCTTTTGTCTAATTGGTAATGAATGCTTTGGTATGCTTGGTTCTATGCATTCCATTTGGCCCCATCACTATTTTCGTTTGAATTGTGTTATTATTTCTTATATACTTCAGAACTGACACTTCTATGAAGTCGCTGGAAGACTATTTCTATGAAGATATTTTGCATGTTTAACTGTTGGTTTACTTTTATTGACTTCAGTTGTTTATATAAATAGAATGTGGCAGTAAATGAAATTTTAGGCATGGATGTTCCTGCATTTGGACAGGCGTATGTATTACTGATTAGATACTGTCAATGTCCTGTCGTAGGCCAATGATCCATTGAAATCGGAAAGTGGCTTTTCCAATCATGAAACTGAAGGTATGGCATTAAAAAACCGAACCTATGAACTCATGGTTGAGTCTTGATGCATTCAGTGAGCTTTAAATTTTTAAATGTCATATTCTTGATTAAGAACTTACGAATTCTCGTCCCTTTCATTCTCTCTCTTGTTAGGTTCGATGGAAAAGAATGAAAATCATAAAAAACATCCGCGAAAATCGATTGAGTACTACTACTTGCCTTGCATATATTTCGTGAATTCTAAGGCTACTGCTTTTTTCTGTTGATAATGCCATGGAGGTGGTGAGACCTTCCTTTTTCTGCTCAATCTCTCTCTACTCTCTAGTTTCTAATCAAAGTCATACTTGTACAACAAAGCTTAGCAACAAAAATAGTCTACTTCATGTTCCACTAATATTTGTACTGTTTTGTTGGTTAATGTCATTTTTAGTTTATATTGGTATATTATTTTACATTTAAGAACAAAGGGGGTCTCAGAATATATTTGATCCCCTAAGAATAATAAGTCATATCAAAGAGTAAGTCACAGCCCTCAGACAGGCAATTTTTCATATTTATTTTTTATAATATCATTACTTATTGGTCAAGTCAGCTTTGCGATTCTGGAATTTTGTAGTACATTAATGTTGGGTACAAAATACATAAAGTTATAAATTGAAAAAGAAACAAAGTTTCCGTTTAGCAACCACATGAAATGAAACAGAGATCTTCTTGGCTTTATCAAAATGTTACAAAGATCTTCTTGGCTCTATTGAAAGGAAACGAAGAACTCTTCTTGCCTTCACCGAAATGAAACAAAATCTGCTTGACTTTATCGAAAGGAAACAAAGTTCTTCTTGGCCTTATTGAAATGAAACAAACATCTTATTGGACGCATCAGTGCACACTTCATATTAGGCATACATCAGTCTTGGAGACATTACATTTCATTTTCGCTCATTTATTCTAGTACACGTTCATCTTGTAGGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTGTCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTGAGTCCATTTCTATATTTACTACTTAGTTACTAC
TATTATGAATTACTACTACGTAGGAGAGACTTAAAGGCTTCATAAGTTTAGGAGGTTATTTGGTATGCGTGTTGGTGATTAGAATTGGATGTTCGGGGATGGAAATGATAGGAACCAGATGATTAGGCTTGGAAATTGTATAGAAAGATAGGTAAGAATTAAGAAACTACAGTAAGATGTTTGAAAGCAATTCTTGGAAGTTGGATGCTTCCACCTCCCTCAACCCCCCGTGTGAGTTTCAACCTTGAACAATCACAATATTGCTACGTCTTGTTGGTTAATTCCGCTCGTTACTATTCTCTCTGACAAACTACACTGCCAATTACCAAATTACCTGTACTCTCTAATTCTATCAAGTAACTATTTTGCGGAACTTTTTTTCACGTAATGACTGTAGCTAATGTTGATATTGATCTTGCCTTTTGGGCAGTGGTCAACATGGGGTTCCATACCTTTGGGGACTGTATTGGGATAATGGGTAGGTGATTAGAATTGGATGTTGGAGAACTCTTGAGGATCTTTGTTCATAATGGGTAGATGGTTCAGAGGAGGAAAGGCTCTTTTTTCAGGATATCCCGAGATTTTGGTTCCAATGGAGATTGGGTAGCGCTTATAGCTCAGAGCATTAGAGGACGTGGTTGTTAACTATAGTGCACACTACTTGCACCTAGATAAAAAGACCTTATGGAACTCTACTTTCGAAGAGCTAGAATTTCAGAATGTACGGGCTAAGGGTTTGGACATGATTCTTGCTTTCAATCAATATCAATTTAGTGGGACTCTGATTAAAAACACCTTTACTACCGATCAGCGGGCGATAACTTATTTTATGTTATTATTATTTAAAAATAATAATAATAACACTCCTTATGTCATACTATCGTTTGTCTTCTGTTTATTTCGTCTCTCTTCCATTAGCAGTGTACTTAGTCATGTTATCATGTTATTTATTTGCATTTTAGGTTCTACATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCAGCTGCTGGAAGGTGAAAAACTTTATTGTAGGAGGCCCTTTTCGTATGGCTAGTTGCTTTACTCCTTTTGTTGAGCTATCATTTTGGTGAACGTTTTCTTTCATTTTTCTCAATGAAAGCTTGGTTTTATATTGAAAGAAATATTTGTGTAGGATGGAGTTTTATACTAAGACATCGACACATTATTACCTTGTAGATTTCTCAAGATAATGGCTACAATCTGGGCTGGAAGCCAAGTTACTAAGCTGGCAAGAGCAGCAGGGTGAGCTCTTATATTCCTGCTTTTTATCTAGTACTTTTCTGATACGTCATTCTGCTCATCATTAAACAACCCAACTATATGAACTGTAAGGTTCAACTTGTTTAGTGGAAAGGGCATTCACTTGTCAAATGGTTGCCTTTCTTTTCCAACTTCCATAACTTAGAAAGGCCTACTTGATAACATCTTTGTTTTCAGTTTTCTATATTTATAATTTAGCTCATGTGAGAGCCAAAACCCATCGCTAGCAGATACTGTTCTCTTTGGTCTTTTTCTTTCGGGCTTCCCCTCAAGGTTTTTAAAATGCGTCTTCTAGGGAGAAGTTTTCACACCCTTATAAATAATGCTTCGTTCTCCTCCCCAACCGATATGGGATCTCACAATCCACCCCCTTCGGGGCCCAGTATCCTCGCTGACATTCATTCCCTTCTCCAATCGATGTGGGACCCCTCAATCCACCCCTCTTCAGAGCCCAGCGTCCTTGTTGGCACACCGCCTCTTGTCCACCCCCTTTGAAGTTCAACTTTCTCGCTGGCACATCGCCCAGTGTCTAGCTCTGAAACCATTTGTAACAACTCAAGCCCACCGCTAGCAGATATTGTCCTCTTTGGACTTTCCCTTTCGGGCTTTCCCTCAAGGTTTTTAAAACGAATCTGCTAGGGTGAGGTTTCCACACCTTTATAAAAAATGTTTTGTTCTACTTCCCAACTCCCCAACTGATATGGGATCTCACAACT
CATTTGCTAACTTAATCTTTGCTTTCAAATCTATTTTCAAGATTGATTTTGTTAAAAAAAATATTTAAAATCTCGTCGGTGTTTTAAAAAAGGGAAGCTTTACAAGTTTCTCAAGAACAGATCGAAAGTCGTTGGATATTAAATCTCACTCCCATTTCAATTCCTGAAACGAGGAAGCAACCTGTTAAACCAAAGCTTCTCATTCATGCAGAGCTCTTGCTATGGCGCCGTTCGTCGACAGAGGATTGTCGTGGTTCACAGTCAAATACAACTTCAAGTCTCAGGGGAAGGTTACTAGAAACGAAACTTCTATACATTCATTGCGTTTTTTGTTTCATTTCATTGATGTTATGCTCAAAGTTAACCCATCTCCAGGCAGTTGTGGCGATTGTTGGATTCTGCTTAGGGTTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAAGACAGGTTCTTCCCTTGGGAAAGTAAGTACTTTTTTCTCTCACATCATTCATTACCCTATTCCCAATGAGTGTTTCCTTGTCCTCAATATTGAAGGCTATGACTTTTTTTCTTCCTTAAAATTAAAATTTCCAATATTTTAAGAGATAGGGTGCAGGTTTAACCAAAATTTTATCAATGAGATTGTCGTTATTGTCGAGAATATGTTACACG

mRNA sequence

TCTCTCTCTCTCTCTCTCTTTCAAATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGTGGCGGGTTCGAACGAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCGCCTTTTCGCGTCTCAAAGCCGAATAACATTTTTGTTAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGCCAATGATCCATTGAAATCGGAAAGTGGCTTTTCCAATCATGAAACTGAAGGTTCGATGGAAAAGAATGAAAATCATAAAAAACATCCGCGAAAATCGATTGAGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTGTCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTCTACATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCAGCTGCTGGAAGATTTCTCAAGATAATGGCTACAATCTGGGCTGGAAGCCAAGTTACTAAGCTGGCAAGAGCAGCAGGAGCTCTTGCTATGGCGCCGTTCGTCGACAGAGGATTGTCGTGGTTCACAGTCAAATACAACTTCAAGTCTCAGGGGAAGGCAGTTGTGGCGATTGTTGGATTCTGCTTAGGGTTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAAGACAGGTTCTTCCCTTGGGAAAGTAAGTACTTTTTTCTCTCACATCATTCATTACCCTATTCCCAATGAGTGTTTCCTTGTCCTCAATATTGAAGGCTATGACTTTTTTTCTTCCTTAAAATTAAAATTTCCAATATTTTAAGAGATAGGGTGCAGGTTTAACCAAAATTTTATCAATGAGATTGTCGTTATTGTCGAGAATATGTTACACG

Coding sequence (CDS)

ATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGTGGCGGGTTCGAACGAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCGCCTTTTCGCGTCTCAAAGCCGAATAACATTTTTGTTAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGCCAATGATCCATTGAAATCGGAAAGTGGCTTTTCCAATCATGAAACTGAAGGTTCGATGGAAAAGAATGAAAATCATAAAAAACATCCGCGAAAATCGATTGAGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTGTCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTCTACATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCAGCTGCTGGAAGATTTCTCAAGATAATGGCTACAATCTGGGCTGGAAGCCAAGTTACTAAGCTGGCAAGAGCAGCAGGAGCTCTTGCTATGGCGCCGTTCGTCGACAGAGGATTGTCGTGGTTCACAGTCAAATACAACTTCAAGTCTCAGGGGAAGGCAGTTGTGGCGATTGTTGGATTCTGCTTAGGGTTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAA

Protein sequence

MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
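
The relationship between the CDS and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed above.
cds_prefix = "ATGCCAACTCCACTTAAGCTCACCAATGGC"
print(translate(cds_prefix))  # MPTPLKLTNG
```

The output matches the first ten residues of the protein sequence; running the same function on the full CDS should reproduce the whole polypeptide, provided the standard code applies.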
Homology
BLAST of CmaCh04G012870 vs. ExPASy TrEMBL
Match: A0A6J1J0R3 (uncharacterized protein LOC111482401 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482401 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.9e-116
Identity = 221/221 (100.00%), Positives = 221/221 (100.00%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221

BLAST of CmaCh04G012870 vs. ExPASy TrEMBL
Match: A0A6J1GY06 (uncharacterized protein LOC111458238 OS=Cucurbita moschata OX=3662 GN=LOC111458238 PE=4 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 2.4e-111
Identity = 212/221 (95.93%), Positives = 217/221 (98.19%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIF ASPPFRVSKP NIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNKLQPKTIFIASPPFRVSKPINIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPPPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
            LSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 ALSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221
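
The Identity figures in these HSP summaries are counts of identical aligned columns divided by the alignment length. The sketch below recomputes them for illustration; `count_identities` and `percent` are hypothetical helper names, and the two strings are the first alignment row (positions 1 to 60) of the Cucurbita moschata hit above.

```python
def count_identities(query: str, subject: str) -> int:
    """Count aligned columns where both rows carry the same residue (gaps never match)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))

def percent(matches: int, alignment_length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP summary lines."""
    return round(100 * matches / alignment_length, 2)

# First row of the alignment against A0A6J1GY06 (positions 1..60).
query_row = "MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA"
sbjct_row = "MPTPLKLTNGAIPCVVFPVAGSNKLQPKTIFIASPPFRVSKPINIFVSRNPSIRQCLNNA"

print(count_identities(query_row, sbjct_row))  # 57 of 60 columns identical
print(percent(212, 221))                       # 95.93, the reported identity
print(percent(217, 221))                       # 98.19, the reported positives
```

Summing the per-row identity counts over all rows of an HSP should give the numerator of the Identity field; the positives count additionally needs a substitution matrix, which this sketch does not model.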

BLAST of CmaCh04G012870 vs. ExPASy TrEMBL
Match: A0A6J1J900 (uncharacterized protein LOC111482401 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482401 PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 7.5e-105
Identity = 196/196 (100.00%), Positives = 196/196 (100.00%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGK 197
           GLSWFTVKYNFKSQGK
Sbjct: 181 GLSWFTVKYNFKSQGK 196

BLAST of CmaCh04G012870 vs. ExPASy TrEMBL
Match: A0A5A7SW01 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G00960 PE=4 SV=1)

HSP 1 Score: 303.9 bits (777), Expect = 5.4e-79
Identity = 169/222 (76.13%), Positives = 179/222 (80.63%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPP-FRVSKPNNIFVSRNPSIRQCLNN 60
           M TP KLTNG IPCV FP   S ELQ K+I RASPP     +PN                
Sbjct: 1   MLTPPKLTNGNIPCVAFP--DSIELQSKSISRASPPVLSPPEPN---------------- 60

Query: 61  AEISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNT 120
               ANDPLKSE  FSNHETEGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNT
Sbjct: 61  ---KANDPLKSEDDFSNHETEGSMEKNENRQKHPQKSNEVLDKLRRYGLSGILSYGLLNT 120

Query: 121 VYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVD 180
            YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVD
Sbjct: 121 AYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVD 180

Query: 181 RGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           RGLSWFTV YNF+SQGKA +AIVGFCLGL+LLLFI VTLLSA
Sbjct: 181 RGLSWFTVNYNFESQGKAFMAIVGFCLGLALLLFIVVTLLSA 201

BLAST of CmaCh04G012870 vs. ExPASy TrEMBL
Match: A0A1S3BDD7 (uncharacterized protein LOC103488806 isoform X5 OS=Cucumis melo OX=3656 GN=LOC103488806 PE=4 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 2.1e-78
Identity = 159/189 (84.13%), Positives = 171/189 (90.48%), Query Frame = 0

Query: 36  PFRVSKPNN---IFVSRNPSIRQCLNNAEISANDPLKSESGFSNHETEGSMEKNENHKKH 95
           P R S P +     VSRNPS+R CL+NA+ISANDPLKSE  FSNHETEGSMEKNEN +KH
Sbjct: 24  PRRFSPPQSRIRFSVSRNPSVRLCLSNAKISANDPLKSEDDFSNHETEGSMEKNENRQKH 83

Query: 96  PRKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKI 155
           P+KS EVLDKLRRYG+SGILSYGLLNT YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKI
Sbjct: 84  PQKSNEVLDKLRRYGLSGILSYGLLNTAYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKI 143

Query: 156 MATIWAGSQVTKLARAAGALAMAPFVDRGLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLL 215
           MAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKA +AIVGFCLGL+LLL
Sbjct: 144 MATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKAFMAIVGFCLGLALLL 203

Query: 216 FIAVTLLSA 222
           FI VTLLSA
Sbjct: 204 FIVVTLLSA 212

BLAST of CmaCh04G012870 vs. NCBI nr
Match: XP_022983937.1 (uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima])

HSP 1 Score: 428.3 bits (1100), Expect = 3.9e-116
Identity = 221/221 (100.00%), Positives = 221/221 (100.00%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221

BLAST of CmaCh04G012870 vs. NCBI nr
Match: XP_023534257.1 (uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 423.3 bits (1087), Expect = 1.3e-114
Identity = 217/221 (98.19%), Positives = 220/221 (99.55%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNKLQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSE+GFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSENGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPPPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221

BLAST of CmaCh04G012870 vs. NCBI nr
Match: KAG6601085.1 (hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 414.8 bits (1065), Expect = 4.5e-112
Identity = 213/221 (96.38%), Positives = 219/221 (99.10%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIFRASPPFRVSKP NIFVSRNPSIRQCL+NA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLDNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPPPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221

BLAST of CmaCh04G012870 vs. NCBI nr
Match: XP_022956520.1 (uncharacterized protein LOC111458238 [Cucurbita moschata])

HSP 1 Score: 411.4 bits (1056), Expect = 5.0e-111
Identity = 212/221 (95.93%), Positives = 217/221 (98.19%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSN+LQPKTIF ASPPFRVSKP NIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNKLQPKTIFIASPPFRVSKPINIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPPPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
            LSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA
Sbjct: 181 ALSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 221

BLAST of CmaCh04G012870 vs. NCBI nr
Match: XP_022983929.1 (uncharacterized protein LOC111482401 isoform X1 [Cucurbita maxima])

HSP 1 Score: 389.8 bits (1000), Expect = 1.6e-104
Identity = 196/196 (100.00%), Positives = 196/196 (100.00%), Query Frame = 0

Query: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60
           MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA
Sbjct: 1   MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA 60

Query: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120
           EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV
Sbjct: 61  EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 120

Query: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180
           YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR
Sbjct: 121 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 180

Query: 181 GLSWFTVKYNFKSQGK 197
           GLSWFTVKYNFKSQGK
Sbjct: 181 GLSWFTVKYNFKSQGK 196

BLAST of CmaCh04G012870 vs. TAIR 10
Match: AT2G38695.1 (unknown protein; Has 65 Blast hits to 65 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 193.4 bits (490), Expect = 2.0e-49
Identity = 106/161 (65.84%), Positives = 126/161 (78.26%), Query Frame = 0

Query: 62  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 121
           +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTV
Sbjct: 52  LSHNVSNKSDAEAERSCDEGEMLDKNRISKKNPFVSEELLKKLKRYGLSGILSYGLLNTV 111

Query: 122 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR 181
           YY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGSQVTKL R  GA+A+AP VDR
Sbjct: 112 YYSTAFLLVWFYVAPAPGKMGYLAAAERFLKVMAMVWAGSQVTKLIRIGGAVALAPIVDR 171

Query: 182 GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA 222
           GLSWFTVK NF+SQGKA  A+VG CLG++L+LFI VTLL A
Sbjct: 172 GLSWFTVKCNFESQGKAFGALVGICLGMALMLFIVVTLLWA 212

BLAST of CmaCh04G012870 vs. TAIR 10
Match: AT2G38695.3 (unknown protein; Has 56 Blast hits to 54 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 170.6 bits (431), Expect = 1.4e-42
Identity = 106/209 (50.72%), Positives = 126/209 (60.29%), Query Frame = 0

Query: 62  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 121
           +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTV
Sbjct: 52  LSHNVSNKSDAEAERSCDEGEMLDKNRISKKNPFVSEELLKKLKRYGLSGILSYGLLNTV 111

Query: 122 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAG---------- 181
           YY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGSQVTKL R  G          
Sbjct: 112 YYSTAFLLVWFYVAPAPGKMGYLAAAERFLKVMAMVWAGSQVTKLIRIGGEHVKFLGDKH 171

Query: 182 --------------------------------------ALAMAPFVDRGLSWFTVKYNFK 222
                                                 A+A+AP VDRGLSWFTVK NF+
Sbjct: 172 SRWMQDSRTCTCICSFDCYCRPYLVVMLLPMKNVDGCRAVALAPIVDRGLSWFTVKCNFE 231

BLAST of CmaCh04G012870 vs. TAIR 10
Match: AT2G38695.2 (unknown protein; Has 54 Blast hits to 54 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 124.8 bits (312), Expect = 8.7e-29
Identity = 69/110 (62.73%), Positives = 82/110 (74.55%), Query Frame = 0

Query: 62  ISANDPLKSESGFSNHETEGSM-EKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV 121
           +S N   KS++       EG M +KN   KK+P  S E+L KL+RYG+SGILSYGLLNTV
Sbjct: 52  LSHNVSNKSDAEAERSCDEGEMLDKNRISKKNPFVSEELLKKLKRYGLSGILSYGLLNTV 111

Query: 122 YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAG 171
           YY T FL+VWFY+APAP KMGY+AAA RFLK+MA +WAGSQVTKL R  G
Sbjct: 112 YYSTAFLLVWFYVAPAPGKMGYLAAAERFLKVMAMVWAGSQVTKLIRIGG 161

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
A0A6J1J0R3      1.9e-116  100.00    uncharacterized protein LOC111482401 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1GY06      2.4e-111  95.93     uncharacterized protein LOC111458238 OS=Cucurbita moschata OX=3662 GN=LOC1114582...
A0A6J1J900      7.5e-105  100.00    uncharacterized protein LOC111482401 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A5A7SW01      5.4e-79   76.13     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3BDD7      2.1e-78   84.13     uncharacterized protein LOC103488806 isoform X5 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name      E-value   Identity  Description
XP_022983937.1  3.9e-116  100.00    uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]
XP_023534257.1  1.3e-114  98.19     uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]
KAG6601085.1    4.5e-112  96.38     hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sorori...
XP_022956520.1  5.0e-111  95.93     uncharacterized protein LOC111458238 [Cucurbita moschata]
XP_022983929.1  1.6e-104  100.00    uncharacterized protein LOC111482401 isoform X1 [Cucurbita maxima]

Match Name      E-value   Identity  Description
AT2G38695.1     2.0e-49   65.84     unknown protein; Has 65 Blast hits to 65 proteins in 18 species: Archae - 0; Bac...
AT2G38695.3     1.4e-42   50.72     unknown protein; Has 56 Blast hits to 54 proteins in 13 species: Archae - 0; Bac...
AT2G38695.2     8.7e-29   62.73     unknown protein; Has 54 Blast hits to 54 proteins in 13 species: Archae - 0; Bac...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 67..93
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 78..93
None      No IPR available  PANTHER      PTHR34370      OS04G0600100 PROTEIN                 coord: 30..221
None      No IPR available  PANTHER      PTHR34370:SF2  GAG-POL POLYPROTEIN/RETROTRANSPOSON  coord: 30..221
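
To see which residues an annotation covers, the `coord:` ranges can be mapped back onto the protein sequence. The sketch below assumes the ranges are 1-based and inclusive protein positions (not stated explicitly on this page); `region` is a hypothetical helper, and the sequence is the protein listed under Sequences, split into chunks of 60 residues for readability.

```python
# Protein sequence of CmaCh04G012870, as listed in the Sequences section.
PROTEIN = (
    "MPTPLKLTNGAIPCVVFPVAGSNELQPKTIFRASPPFRVSKPNNIFVSRNPSIRQCLNNA"
    "EISANDPLKSESGFSNHETEGSMEKNENHKKHPRKSIEVLDKLRRYGVSGILSYGLLNTV"
    "YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATIWAGSQVTKLARAAGALAMAPFVDR"
    "GLSWFTVKYNFKSQGKAVVAIVGFCLGLSLLLFIAVTLLSA"
)

def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

disordered = region(PROTEIN, 67, 93)      # mobidb-lite disorder prediction
panther_match = region(PROTEIN, 30, 221)  # PANTHER PTHR34370 match

print(disordered)
print(len(panther_match))
```

Under this reading, the predicted disordered stretch is a 27-residue segment and the PANTHER match covers 192 of the 221 residues.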

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G012870.1  CmaCh04G012870.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane