Cp4.1LG12g04290 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG12g04290
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: Cp4.1LG12: 3226324 .. 3227572 (-)
RNA-Seq Expression: Cp4.1LG12g04290
Synteny: Cp4.1LG12g04290
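
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this record does not state explicitly); the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG12: 3226324 .. 3227572 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 1249 bp on the minus strand of Cp4.1LG12.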
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCTTCTTCATACTAGACCAAACCCATGAGCTAGCCGAAGCACTGTACGTAATCGCCAGCGTTACTGACATAGCCTATCTCAAATTCTCAATGGAAAAAATGTCCATAATGGGCAGTGCCTCACCAGAAACTGATTGCATCATAGCAATACAACTCTGCCCTCCATACTTCCAAAATTATACCTGCAACCATCTTCATTATTCATGGATTAACATCCACAAAATCTTCCCTCTCATGTCCGATCTCAACCGAGCCGGCTTCTCTTCCTTGTCCTTCTCTTTTACAGCCCCGAATCTTGCCAACCTCGTATTCGAGGGCCCATCCCGTTTACGTGAAGCCGATTTCGAGTTAAATCCGACCAATGGTTCGAGGGATATCGGGCCATATGACTACTCAACTTTTGTCACCTTGGAATCAAAAGAGTTCATAAACATAATCACAGACTTTGAATTTTATCGTTATGGTAATACCCCCTTTAATCAAAAGATACACATTTTATGTTGCTAACATGTTCTTGAATTTCTTCATGTTTGTGGCAGTTCTTGTTACTCTCACGAGCTCGCAAGTCAAATTCTCGTACACGAGGTTTCAGACTATTATTACTCGTGAGGTATTATTATTTGTTGAGGATTGTTAGGAAATGATCATGGGTTTACAAGTCGTTATTAAAATTTGGAATTTATTGCATGTAGAGTGGCCGATGCGTTATTGGAGGTATTGAAGAATCGGGTTTGATTAAATATATAATCACTCTTCATCCAATGCATACTTTCTGCAATTTGGTTCGTCAAACTGAAAGGGTATGGCTATTCAAGTCCGATGAAGCTGCCAAAGGTGTAATTGCTGCCCCTCTAGGACTGCATGCTCGATTTGTGACCTATTTTCCTGATGATTGGAAAGAATGAAATGGGAAAGTTTGTGTAGCCTTACAAAACTATGATCAAATGCCATTTCTTGACATTCTTTTTTTTTTTTTAATTGTTCTACTCTTGAATTTGTTATAATCACGTGTTTCAACGTTGATCAATTCGGAAAATGAATCATCGAATGGAGATAGTATTACTTCGATGATCTTTTTCTTGATATGGGACTACTCTTAATAATTCTGAACCGTCAGTGCCATATAGCCACTGCTAGGATTGAGTTTTCTTTCTAAAAATTTGACATGAATACTTTGATTCTCTAAGTAAGTTGGTTTTGAGTTATTGTTTCAGTACCCTAGTTCTCTTGTTTCGATCAAGATAAT

mRNA sequence

ATGTTCTTCTTCATACTAGACCAAACCCATGAGCTAGCCGAAGCACTGTACGTAATCGCCAGCGTTACTGACATAGCCTATCTCAAATTCTCAATGGAAAAAATGTCCATAATGGGCAGTGCCTCACCAGAAACTGATTGCATCATAGCAATACAACTCTGCCCTCCATACTTCCAAAATTATACCTGCAACCATCTTCATTATTCATGGATTAACATCCACAAAATCTTCCCTCTCATGTCCGATCTCAACCGAGCCGGCTTCTCTTCCTTGTCCTTCTCTTTTACAGCCCCGAATCTTGCCAACCTCAGTGGCCGATGCGTTATTGGAGGTATTGAAGAATCGGGTTTGATTAAATATATAATCACTCTTCATCCAATGCATACTTTCTGCAATTTGGTTCGTCAAACTGAAAGGGTATGGCTATTCAAGTCCGATGAAGCTGCCAAAGGTGTAATTGCTGCCCCTCTAGGACTGCATGCTCGATTTGTGACCTATTTTCCTGATGATTGGAAAGAATGAAATGGGAAAGTTTGTGTAGCCTTACAAAACTATGATCAAATGCCATTTCTTGACATTCTTTTTTTTTTTTTAATTGTTCTACTCTTGAATTTGTTATAATCACGTGTTTCAACGTTGATCAATTCGGAAAATGAATCATCGAATGGAGATAGTATTACTTCGATGATCTTTTTCTTGATATGGGACTACTCTTAATAATTCTGAACCGTCAGTGCCATATAGCCACTGCTAGGATTGAGTTTTCTTTCTAAAAATTTGACATGAATACTTTGATTCTCTAAGTAAGTTGGTTTTGAGTTATTGTTTCAGTACCCTAGTTCTCTTGTTTCGATCAAGATAAT
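
The gene and mRNA sequences differ only by spliced-out intronic sequence. The sketch below shows one way to locate the removed region by comparing the two strings, assuming exactly one intron and otherwise identical sequences. The function name `locate_single_intron` and the toy sequences are hypothetical and are not taken from this record.

```python
def locate_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return 0-based, half-open (start, end) offsets of the one region
    present in `gene` but absent from `mrna`.

    ASSUMPTION: the mRNA equals the gene with a single contiguous block removed.
    """
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")
    # Longest common prefix (exon 1 plus any ambiguous shared bases).
    p = 0
    while p < len(mrna) and gene[p] == mrna[p]:
        p += 1
    # Longest common suffix that does not overlap the prefix.
    s = 0
    while s < len(mrna) - p and gene[-1 - s] == mrna[-1 - s]:
        s += 1
    if p + s != len(mrna):
        raise ValueError("sequences differ by more than one contiguous block")
    return p, len(gene) - s

# Toy example (hypothetical sequences, not from this record).
toy_gene = "ATGGCC" + "GTAAGTTTTCAG" + "AGTGGA"
toy_mrna = "ATGGCC" + "AGTGGA"
start, end = locate_single_intron(toy_gene, toy_mrna)
print(toy_gene[start:end])
```

When bases at the splice boundary are shared between the intron and the flanking exon, the greedy prefix match picks one of several equally valid positions, so the result is best checked against the annotated exon coordinates.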

Coding sequence (CDS)

ATGTTCTTCTTCATACTAGACCAAACCCATGAGCTAGCCGAAGCACTGTACGTAATCGCCAGCGTTACTGACATAGCCTATCTCAAATTCTCAATGGAAAAAATGTCCATAATGGGCAGTGCCTCACCAGAAACTGATTGCATCATAGCAATACAACTCTGCCCTCCATACTTCCAAAATTATACCTGCAACCATCTTCATTATTCATGGATTAACATCCACAAAATCTTCCCTCTCATGTCCGATCTCAACCGAGCCGGCTTCTCTTCCTTGTCCTTCTCTTTTACAGCCCCGAATCTTGCCAACCTCAGTGGCCGATGCGTTATTGGAGGTATTGAAGAATCGGGTTTGATTAAATATATAATCACTCTTCATCCAATGCATACTTTCTGCAATTTGGTTCGTCAAACTGAAAGGGTATGGCTATTCAAGTCCGATGAAGCTGCCAAAGGTGTAATTGCTGCCCCTCTAGGACTGCATGCTCGATTTGTGACCTATTTTCCTGATGATTGGAAAGAATGA

Protein sequence

MFFFILDQTHELAEALYVIASVTDIAYLKFSMEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFSFTAPNLANLSGRCVIGGIEESGLIKYIITLHPMHTFCNLVRQTERVWLFKSDEAAKGVIAAPLGLHARFVTYFPDDWKE
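
The protein sequence can be checked against the CDS by translating it. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used. To keep the example short it translates only the first 30 and last 15 nucleotides of the CDS shown above, and the names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTTCTTCTTCATACTAGACCAAACCCAT"  # first 30 nt of the CDS above
cds_end = "GATTGGAAAGAATGA"                   # last 15 nt of the CDS above
print(translate(cds_start), translate(cds_end))
```

The two excerpts translate to `MFFFILDQTH` and `DWKE`, matching the first ten and last four residues of the protein sequence listed above.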
Homology
BLAST of Cp4.1LG12g04290 vs. NCBI nr
Match: KAG6575236.1 (hypothetical protein SDJN03_25875, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 314 bits (804), Expect = 3.30e-106
Identity = 165/249 (66.27%), Positives = 170/249 (68.27%), Query Frame = 0

Query: 1   MFFFILDQTHELAEALYVIASVTDIAYLKFSMEKMSIMGSASPETDCIIAIQLCPPYFQN 60
           MFFFILDQTHELAEALYVIASVTDIAYLKFSME MSIMGSASPET CIIAIQLCPPYFQ 
Sbjct: 1   MFFFILDQTHELAEALYVIASVTDIAYLKFSMENMSIMGSASPETACIIAIQLCPPYFQT 60

Query: 61  YTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFSFTAPNLANL----------------- 120
           YTC+HLHYSWINIHKIFPLMSDLNRAGFSSLSFSFTAPNLANL                 
Sbjct: 61  YTCDHLHYSWINIHKIFPLMSDLNRAGFSSLSFSFTAPNLANLVFLGPSRLREADFGLNP 120

Query: 121 -----------------------------------------------------------S 173
                                                                      S
Sbjct: 121 SNGSRDIGPYDYSTFVTLESKEFINIITDFEFYHYVLVTLTSSQVKFSYTRFQTIITRES 180
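
The percentages on the HSP summary line follow from the reported counts. A minimal sketch, assuming the denominator is the alignment length including gap columns (which would explain why it exceeds the 173-residue query); the function names `parse_identity` and `percent` are illustrative.

```python
import re

def parse_identity(line: str) -> tuple[int, int]:
    """Extract (identical positions, alignment length) from an HSP summary line."""
    m = re.search(r"Identity = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no 'Identity = N/M' field found")
    return int(m.group(1)), int(m.group(2))

def percent(count: int, length: int) -> float:
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / length, 2)

matches, aln_len = parse_identity("Identity = 165/249 (66.27%), Query Frame = 0")
print(percent(matches, aln_len))  # identity for the first HSP
print(percent(170, aln_len))      # positives for the first HSP
```

Because gap columns count toward the denominator, a hit with a long insertion relative to the query reports a lower percent identity even when the aligned residues are nearly identical.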

BLAST of Cp4.1LG12g04290 vs. NCBI nr
Match: XP_023549461.1 (uncharacterized protein LOC111807815 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 271 bits (693), Expect = 9.04e-90
Identity = 142/218 (65.14%), Positives = 142/218 (65.14%), Query Frame = 0

Query: 32  MEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSL 91
           MEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSL
Sbjct: 1   MEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSL 60

Query: 92  SFSFTAPNLANL------------------------------------------------ 151
           SFSFTAPNLANL                                                
Sbjct: 61  SFSFTAPNLANLVFEGPSRLREADFELNPTNGSRDIGPYDYSTFVTLESKEFINIITDFE 120

Query: 152 ----------------------------SGRCVIGGIEESGLIKYIITLHPMHTFCNLVR 173
                                       SGRCVIGGIEESGLIKYIITLHPMHTFCNLVR
Sbjct: 121 FYRYVLVTLTSSQVKFSYTRFQTIITRESGRCVIGGIEESGLIKYIITLHPMHTFCNLVR 180

BLAST of Cp4.1LG12g04290 vs. NCBI nr
Match: XP_023006661.1 (uncharacterized protein LOC111499319 [Cucurbita maxima])

HSP 1 Score: 256 bits (653), Expect = 9.97e-84
Identity = 133/215 (61.86%), Positives = 137/215 (63.72%), Query Frame = 0

Query: 35  MSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS 94
           MSIMGSASPET CIIAIQLCPPYF+ YTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS
Sbjct: 1   MSIMGSASPETACIIAIQLCPPYFKKYTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS 60

Query: 95  FTAPNLANL--------------------------------------------------- 154
           FTAPNLANL                                                   
Sbjct: 61  FTAPNLANLVFEGPSRLREANFELNPCDGSRDIGPYDYSTFVTLESKEFINIITDFEFYR 120

Query: 155 -------------------------SGRCVIGGIEESGLIKYIITLHPMHTFCNLVRQTE 173
                                    SGRCVIGGIEESGLIKY+IT+HPMHTFCNLVRQTE
Sbjct: 121 YVLVTLTSSQVKFSYTRFQNIITRESGRCVIGGIEESGLIKYVITVHPMHTFCNLVRQTE 180

BLAST of Cp4.1LG12g04290 vs. NCBI nr
Match: XP_022959415.1 (uncharacterized protein LOC111460398 [Cucurbita moschata])

HSP 1 Score: 254 bits (650), Expect = 3.15e-83
Identity = 134/218 (61.47%), Positives = 137/218 (62.84%), Query Frame = 0

Query: 32  MEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSL 91
           ME MSIMGSASPET CIIAIQLCPPYFQ YTC+HLHYSWINIHKIFPLMSDLNRAGFSSL
Sbjct: 1   MENMSIMGSASPETACIIAIQLCPPYFQTYTCDHLHYSWINIHKIFPLMSDLNRAGFSSL 60

Query: 92  SFSFTAPNLANL------------------------------------------------ 151
           SFSFTAPNLANL                                                
Sbjct: 61  SFSFTAPNLANLVFMGPSRLREADFGLNPSNGSRDIGPYDYSTFVTLESKEFINIITDFE 120

Query: 152 ----------------------------SGRCVIGGIEESGLIKYIITLHPMHTFCNLVR 173
                                       SGRCVIGGIEES  IKYIIT+HPMHTFCNLVR
Sbjct: 121 FYRYVLVTLTSSQVKFSYTRFQTIITRESGRCVIGGIEESCFIKYIITVHPMHTFCNLVR 180

BLAST of Cp4.1LG12g04290 vs. NCBI nr
Match: XP_023549342.1 (uncharacterized protein LOC111807724 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 112 bits (279), Expect = 7.26e-27
Identity = 69/235 (29.36%), Positives = 99/235 (42.13%), Query Frame = 0

Query: 11  ELAEALYVIASVTDIAYLKFSMEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSW 70
           +L +A  V+AS  D+A +KFS E  +IM  + P    +IA+ L P +F  Y CN L  SW
Sbjct: 3   DLVDAASVVASADDMADMKFSREMFAIMADSYPALGGVIALHLWPQFFDEYVCNELLKSW 62

Query: 71  INIHKIFPLMSDLNRAGFSSLSFSFTAPNLANLS-------------------------- 130
             +  +FPLM D+  +GF+SL+F+ T P  A L                           
Sbjct: 63  TFVKNLFPLMIDMEESGFNSLTFTVTYPESAELKFQAPNGLSNDVEFELIPSLDPLEVGD 122

Query: 131 ----------------------------------------------------GRCVIGGI 167
                                                               G C+IGGI
Sbjct: 123 FDFSSFVSLESEEFVNIVTEYHMFDYVHVIVTSTRVIFSYAIMLETILTQEDGECLIGGI 182

BLAST of Cp4.1LG12g04290 vs. ExPASy TrEMBL
Match: A0A6J1KYD4 (uncharacterized protein LOC111499319 OS=Cucurbita maxima OX=3661 GN=LOC111499319 PE=4 SV=1)

HSP 1 Score: 256 bits (653), Expect = 4.83e-84
Identity = 133/215 (61.86%), Positives = 137/215 (63.72%), Query Frame = 0

Query: 35  MSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS 94
           MSIMGSASPET CIIAIQLCPPYF+ YTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS
Sbjct: 1   MSIMGSASPETACIIAIQLCPPYFKKYTCNHLHYSWINIHKIFPLMSDLNRAGFSSLSFS 60

Query: 95  FTAPNLANL--------------------------------------------------- 154
           FTAPNLANL                                                   
Sbjct: 61  FTAPNLANLVFEGPSRLREANFELNPCDGSRDIGPYDYSTFVTLESKEFINIITDFEFYR 120

Query: 155 -------------------------SGRCVIGGIEESGLIKYIITLHPMHTFCNLVRQTE 173
                                    SGRCVIGGIEESGLIKY+IT+HPMHTFCNLVRQTE
Sbjct: 121 YVLVTLTSSQVKFSYTRFQNIITRESGRCVIGGIEESGLIKYVITVHPMHTFCNLVRQTE 180

BLAST of Cp4.1LG12g04290 vs. ExPASy TrEMBL
Match: A0A6J1H678 (uncharacterized protein LOC111460398 OS=Cucurbita moschata OX=3662 GN=LOC111460398 PE=4 SV=1)

HSP 1 Score: 254 bits (650), Expect = 1.53e-83
Identity = 134/218 (61.47%), Positives = 137/218 (62.84%), Query Frame = 0

Query: 32  MEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHYSWINIHKIFPLMSDLNRAGFSSL 91
           ME MSIMGSASPET CIIAIQLCPPYFQ YTC+HLHYSWINIHKIFPLMSDLNRAGFSSL
Sbjct: 1   MENMSIMGSASPETACIIAIQLCPPYFQTYTCDHLHYSWINIHKIFPLMSDLNRAGFSSL 60

Query: 92  SFSFTAPNLANL------------------------------------------------ 151
           SFSFTAPNLANL                                                
Sbjct: 61  SFSFTAPNLANLVFMGPSRLREADFGLNPSNGSRDIGPYDYSTFVTLESKEFINIITDFE 120

Query: 152 ----------------------------SGRCVIGGIEESGLIKYIITLHPMHTFCNLVR 173
                                       SGRCVIGGIEES  IKYIIT+HPMHTFCNLVR
Sbjct: 121 FYRYVLVTLTSSQVKFSYTRFQTIITRESGRCVIGGIEESCFIKYIITVHPMHTFCNLVR 180

BLAST of Cp4.1LG12g04290 vs. ExPASy TrEMBL
Match: A0A6J1KWL1 (uncharacterized protein LOC111498888 OS=Cucurbita maxima OX=3661 GN=LOC111498888 PE=4 SV=1)

HSP 1 Score: 112 bits (281), Expect = 3.98e-27
Identity = 76/269 (28.25%), Positives = 107/269 (39.78%), Query Frame = 0

Query: 10  HELAEALYVIASV-TDIAYLKFSMEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHY 69
           H+L +A  V+A    DI  +KFS    SIM + +P + CIIA+QL P +F  Y C+ LHY
Sbjct: 2   HDLVDATSVLAETYDDIFDIKFSPTMFSIMAATTPSSRCIIALQLSPQFFNAYLCHQLHY 61

Query: 70  SWINIHKIFPLMSDLNRAGFSSLSFSFTAPN----------------------------- 129
            +I I   +  M +  R GFSSL+F+F  P+                             
Sbjct: 62  KFIYIESFYDFMHNFERKGFSSLTFTFPEPDRVDSNVALRVMSEACINVYSCTIPWRFLA 121

Query: 130 ------------------------------------------------------------ 169
                                                                       
Sbjct: 122 ATAILKFFDGSNGHFEEVELPMFPSSKVMDVGAFDFGTFVSIDSQEFINIVTCFNDFDYV 181

BLAST of Cp4.1LG12g04290 vs. ExPASy TrEMBL
Match: A0A6J1H4N0 (uncharacterized protein LOC111460009 OS=Cucurbita moschata OX=3662 GN=LOC111460009 PE=4 SV=1)

HSP 1 Score: 112 bits (281), Expect = 4.98e-27
Identity = 75/269 (27.88%), Positives = 104/269 (38.66%), Query Frame = 0

Query: 10  HELAEALYVIASV-TDIAYLKFSMEKMSIMGSASPETDCIIAIQLCPPYFQNYTCNHLHY 69
           H+L +A  V+A    DI  +KFS    SIM + +P + CII +QL P +F  Y C+ LHY
Sbjct: 2   HDLVDATSVLAETYDDIFDIKFSPTMFSIMAATTPSSHCIIELQLSPQFFNAYLCHQLHY 61

Query: 70  SWINIHKIFPLMSDLNRAGFSSLSFSFTAPN----------------------------- 129
            +I I   +  M +  R GFSSL+F+F  P+                             
Sbjct: 62  KFIYIEDFYDFMHNFERKGFSSLTFTFPEPDRVDSNVAVRVMSEACINVYSCTIPWRFLA 121

Query: 130 ------------------------------------------------------------ 169
                                                                       
Sbjct: 122 ATAILKFFHGSNGHFEEVELPMFPSCKVMDVGAFDIGTFVSIDSQEFINIVTYFNDFDYV 181

BLAST of Cp4.1LG12g04290 vs. ExPASy TrEMBL
Match: A0A6J1I815 (uncharacterized protein LOC111470836 OS=Cucurbita maxima OX=3661 GN=LOC111470836 PE=4 SV=1)

HSP 1 Score: 105 bits (262), Expect = 1.45e-24
Identity = 71/251 (28.29%), Positives = 110/251 (43.82%), Query Frame = 0

Query: 1   MFFFILDQTHELAEALYVIASVTDIAYLKFSMEKMSIMGSASPETD-CIIAIQLCPPYFQ 60
           MF F LD  HE  EA  VIA+      LKFS +  SIM +A P +   ++A+Q+ P +F+
Sbjct: 1   MFMFNLDDIHEFVEAASVIATQAPTGVLKFSPQMFSIMATALPPSPRSVLALQIRPQFFR 60

Query: 61  NYTC-NHLHYSWINIHKIFPLMSDLNRAGFSSLSFSFTAPNLANLSGR------------ 120
           +YTC + L Y+WI ++ +   +S+++  GF  L FS T P+ A+L+ R            
Sbjct: 61  SYTCTSQLEYAWIFLNDLQFTLSEMDCNGFPELLFSSTEPDCAHLTFRPPASNYNYRLYK 120

Query: 121 ------------------------------------------------------------ 168
                                                                       
Sbjct: 121 LPLCPGTGEMDMDQEFDCTTFVSIPSDFFNVVLNVFMDCFDYVLVTVTASLARFCNDAGD 180

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value     Identity  Description
KAG6575236.1    3.30e-106   66.27     hypothetical protein SDJN03_25875, partial [Cucurbita argyrosperma subsp. sorori...
XP_023549461.1  9.04e-90    65.14     uncharacterized protein LOC111807815 [Cucurbita pepo subsp. pepo]
XP_023006661.1  9.97e-84    61.86     uncharacterized protein LOC111499319 [Cucurbita maxima]
XP_022959415.1  3.15e-83    61.47     uncharacterized protein LOC111460398 [Cucurbita moschata]
XP_023549342.1  7.26e-27    29.36     uncharacterized protein LOC111807724 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name      E-value     Identity  Description
A0A6J1KYD4      4.83e-84    61.86     uncharacterized protein LOC111499319 OS=Cucurbita maxima OX=3661 GN=LOC111499319...
A0A6J1H678      1.53e-83    61.47     uncharacterized protein LOC111460398 OS=Cucurbita moschata OX=3662 GN=LOC1114603...
A0A6J1KWL1      3.98e-27    28.25     uncharacterized protein LOC111498888 OS=Cucurbita maxima OX=3661 GN=LOC111498888...
A0A6J1H4N0      4.98e-27    27.88     uncharacterized protein LOC111460009 OS=Cucurbita moschata OX=3662 GN=LOC1114600...
A0A6J1I815      1.45e-24    28.29     uncharacterized protein LOC111470836 OS=Cucurbita maxima OX=3661 GN=LOC111470836...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g04290.1  Cp4.1LG12g04290.1  mRNA