CmoCh18G006000 (gene) Cucurbita moschata (Rifu)

Name: CmoCh18G006000
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cmo_Chr18 : 5831675 .. 5832253 (-)
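
As a quick consistency check on the location above, the span length can be derived from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (common for genome annotations, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


length = feature_length(5831675, 5832253)
print(length)       # 579
print(length % 3)   # 0, so the span divides evenly into codons
print(length // 3)  # 193
```

Under that assumption the feature spans 579 bp, a multiple of three, which fits a single-exon gene whose gene, mRNA and CDS sequences coincide.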
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTAGACATCATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTTCAGTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAATTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTTCTCTAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTATGATGAAGAAGGGTAAAGGGAAAAATAAGATTCCTACTAACCGCAAACACAAGGTTCAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGAAAGAGAAACTGCCGAAAATAGCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTAAATATGATTTACTCATTGTAG

mRNA sequence

ATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTAGACATCATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTTCAGTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAATTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTTCTCTAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTATGATGAAGAAGGGTAAAGGGAAAAATAAGATTCCTACTAACCGCAAACACAAGGTTCAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGAAAGAGAAACTGCCGAAAATAGCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTAAATATGATTTACTCATTGTAG

Coding sequence (CDS)

ATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTAGACATCATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTTCAGTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAATTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTTCTCTAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTATGATGAAGAAGGGTAAAGGGAAAAATAAGATTCCTACTAACCGCAAACACAAGGTTCAAAAGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGAAAGAGAAACTGCCGAAAATAGCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTAAATATGATTTACTCATTGTAG
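
To relate the coding sequence above to the protein query used in the BLAST alignments, the CDS can be translated with the standard genetic code. The sketch below is an illustration, not the database's own pipeline: the codon table is built by hand, the helper name `translate` is hypothetical, and only the first 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_prefix = "ATGTTTGGACAACCGTCCTTCTCC"  # first 24 nt of the CDS above
print(translate(cds_prefix))  # MFGQPSFS
```

The translated prefix matches the start of the query line in the alignments (`MFGQPSFS...`). The sequence is shown in coding orientation, so no reverse complement is applied here even though the gene lies on the minus strand.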
BLAST of CmoCh18G006000 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 3.3e-41
Identity = 96/140 (68.57%), Positives = 114/140 (81.43%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLD+M+HFN+AE N   IDE +QVSFI+ 
Sbjct: 101 MFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQ 160

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISK-KLLRGSFS 120
           SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG+  EANVA++K K +RGS S
Sbjct: 161 SLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSS 220

Query: 121 KNKSGPSTSKSVMMKKGKGK 140
           KNK GP  SK+ M KKGKGK
Sbjct: 221 KNKVGP--SKAQMKKKGKGK 238
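
The percentages in an HSP summary line follow directly from its fractions: identity is the number of identical columns divided by the alignment length (gap columns included), and positives additionally count substitutions that score positively in the scoring matrix. A minimal sketch that recomputes them from a summary line is shown here; the function name `hsp_percentages` is an assumption made for illustration.

```python
import re


def hsp_percentages(stats_line: str) -> list[float]:
    """Recompute percentages from every 'n/d' fraction in an HSP summary line."""
    return [
        round(100 * int(n) / int(d), 2)
        for n, d in re.findall(r"(\d+)/(\d+)", stats_line)
    ]


line = "Identity = 96/140 (68.57%), Positives = 114/140 (81.43%)"
print(hsp_percentages(line))  # [68.57, 81.43]
```

The same function can be applied to the summary line of any other HSP on this page to cross-check the reported values.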

BLAST of CmoCh18G006000 vs. TrEMBL
Match: A5AVN4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017217 PE=4 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 1.8e-23
Identity = 77/183 (42.08%), Positives = 111/183 (60.66%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFG+PS   RHEA+K + N +MK G+SVREHVL ++ HFN AE N A IDEK+QV  I+ 
Sbjct: 24  MFGRPSEQARHEAVKAVMNSKMKNGSSVREHVLKMIHHFNKAEINGAKIDEKTQVGMILE 83

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLL-RGSFS 120
           +L  SF QFRTN +MN  + NLT LLNELQ+Y++L+ +KG  G+AN+A +  ++ + S S
Sbjct: 84  TLSPSFLQFRTNYIMNHKKCNLTELLNELQSYETLIDDKG--GKANIAEANAVVGKASSS 143

Query: 121 KNKSGPSTSKSVMMKKGKGKNKIPTNRKHKVQKQIKENVSIATKTGTERETAENSLQRRK 180
           +NK      K   ++  K K KI   ++  V+ + KE      +    +   +  L   K
Sbjct: 144 RNK------KKRNVRNQKDKKKIQKKKRKVVELKPKEKCFYCNQDDHWKRNCKKYLDELK 198

Query: 181 PKR 183
            K+
Sbjct: 204 QKK 198

BLAST of CmoCh18G006000 vs. TrEMBL
Match: W9ST61_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003432 PE=4 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 1.0e-18
Identity = 71/153 (46.41%), Positives = 93/153 (60.78%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFG PS   R +A+    N +MK+G+SV+ HVL+++ H + AE N A IDE +QV  I+ 
Sbjct: 76  MFGTPSEKARLDAVWAFMNDKMKKGSSVKAHVLNMIDHLHDAELNGARIDEATQVGIILE 135

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSFSK 120
           SL  +F QF  N VMNK + NLT L+N LQ ++S  TNK + GEANV     L+ G + K
Sbjct: 136 SLSPNFHQFVNNFVMNKKKSNLTELMNNLQNFES--TNKRRGGEANV-----LVAGGYGK 195

Query: 121 NKSGPSTSKSVMMKKG-KGKNKIPTNRKHKVQK 153
           NK      K+    KG KGKNK P N K  +QK
Sbjct: 196 NK-----RKNQNQGKGKKGKNKKPRNTKGPIQK 216

BLAST of CmoCh18G006000 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 97.4 bits (241), Expect = 2.0e-17
Identity = 66/145 (45.52%), Positives = 86/145 (59.31%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           +F + ++SLRHEA    Y  RMKEGTSV EHVLD+ ++ + AE N   IDE + VSFI+ 
Sbjct: 102 IFQKNTWSLRHEAFTKFYTKRMKEGTSVSEHVLDMAMYSSRAEVNGGPIDEANAVSFILQ 161

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSL-LTNKGQTGEANVAISKKLLRG--- 120
           SLPKS+  F  N  MNK+  +   L NELQ +Q+L L+ + +    N   +K+  R    
Sbjct: 162 SLPKSYKGFLLNASMNKMNKSPGELFNELQRFQNLTLSKEVEANMVNKVTAKRFKRNDKG 221

Query: 121 --SFSKNKSGPSTSKSVMMKKGKGK 140
               SKNK GP   K  M KKGKGK
Sbjct: 222 KKGSSKNKVGPDEIK--MKKKGKGK 244

BLAST of CmoCh18G006000 vs. TrEMBL
Match: E2GK52_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 8.2e-16
Identity = 45/62 (72.58%), Positives = 53/62 (85.48%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQPS+SLRHEAIK IY  RMKEGTSVREHVLD+M+HFN+A+ +   IDE +QVSFI+ 
Sbjct: 101 MFGQPSWSLRHEAIKRIYTKRMKEGTSVREHVLDMMMHFNIAKVHGGPIDEANQVSFILQ 160

Query: 61  SL 63
           SL
Sbjct: 161 SL 162

BLAST of CmoCh18G006000 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 176.4 bits (446), Expect = 4.8e-41
Identity = 96/140 (68.57%), Positives = 114/140 (81.43%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLD+M+HFN+AE N   IDE +QVSFI+ 
Sbjct: 101 MFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQ 160

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISK-KLLRGSFS 120
           SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG+  EANVA++K K +RGS S
Sbjct: 161 SLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSS 220

Query: 121 KNKSGPSTSKSVMMKKGKGK 140
           KNK GP  SK+ M KKGKGK
Sbjct: 221 KNKVGP--SKAQMKKKGKGK 238

BLAST of CmoCh18G006000 vs. NCBI nr
Match: gi|659113937|ref|XP_008456829.1| (PREDICTED: uncharacterized protein LOC103496667 [Cucumis melo])

HSP 1 Score: 168.7 bits (426), Expect = 9.9e-39
Identity = 92/139 (66.19%), Positives = 109/139 (78.42%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQ S+ ++H+A+KYIYN R+ EG SVREHVL++MVHFNVAE N AVIDE SQVSFI+ 
Sbjct: 1   MFGQASYQIKHDALKYIYNARINEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 60

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAIS-KKLLRGSFS 120
           SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KGQ GEANVA S +K  RGS S
Sbjct: 61  SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTS 120

Query: 121 KNKSGPSTSKSVMMKKGKG 139
             KS PS+S +   KK KG
Sbjct: 121 GTKSMPSSSGNKKWKKKKG 139

BLAST of CmoCh18G006000 vs. NCBI nr
Match: gi|659113933|ref|XP_008456826.1| (PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo])

HSP 1 Score: 153.3 bits (386), Expect = 4.3e-34
Identity = 96/194 (49.48%), Positives = 124/194 (63.92%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQ S+ ++H+A+KYIYN RM +G  VREHVL++MV+FNVAE N AVIDE +QVSFI+ 
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNDGALVREHVLNMMVYFNVAEMNGAVIDEANQVSFILE 160

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAIS-KKLLRGSFS 120
           SL +SF QFR+NVVMNKI Y LT LLNELQT++SL+  KGQ GEANVA S +K  RGS S
Sbjct: 161 SLLESFLQFRSNVVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTS 220

Query: 121 KNKSGPSTSKSVMMKKGKG----KNKIPTNRKHKVQKQIKENVSIATKTGTERETAENSL 180
             K  PS+S +   KK KG    K  +   +  K  K  K       + G  +      L
Sbjct: 221 GTKYMPSSSGNKKWKKKKGGQGNKANLAATKTSKKAKVAKGICFHCNQEGHWKRNCPKYL 280

Query: 181 QRRKPKRHNKVNMI 190
             +K  +  K +++
Sbjct: 281 AEKKKAKQGKYDLL 294

BLAST of CmoCh18G006000 vs. NCBI nr
Match: gi|659118732|ref|XP_008459275.1| (PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo])

HSP 1 Score: 121.7 bits (304), Expect = 1.4e-24
Identity = 59/95 (62.11%), Positives = 72/95 (75.79%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQ S  ++ E IKY+YN RMK+  SV++HVL+++VHFNV E N  V DEKSQVSFI+ 
Sbjct: 90  MFGQLSIQIKQETIKYVYNARMKDSQSVKKHVLNMIVHFNVVEMNVVVFDEKSQVSFILK 149

Query: 61  SLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSL 96
            LPKS  QF  N  MNKI+YN+T  LNELQT+QSL
Sbjct: 150 YLPKSSLQFNNNAEMNKIKYNMTIFLNELQTFQSL 184

BLAST of CmoCh18G006000 vs. NCBI nr
Match: gi|659082068|ref|XP_008441651.1| (PREDICTED: uncharacterized protein LOC103485734 [Cucumis melo])

HSP 1 Score: 119.4 bits (298), Expect = 6.9e-24
Identity = 56/82 (68.29%), Positives = 68/82 (82.93%), Query Frame = 1

Query: 1   MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDIMVHFNVAEENEAVIDEKSQVSFIMM 60
           MFGQPS  +R E IK++YN  M EG SV+EHVLD++V+FN+ E N AV DEKSQVSFI+ 
Sbjct: 62  MFGQPSIQMRQEDIKHVYNVHMNEGQSVKEHVLDMIVYFNIVEINGAVFDEKSQVSFILK 121

Query: 61  SLPKSFFQFRTNVVMNKIEYNL 83
           SLPKSF QFR+NV+MNKIEYN+
Sbjct: 122 SLPKSFLQFRSNVIMNKIEYNM 143

The following BLAST results are available for this feature:
BLAST vs. TrEMBL
Match Name        E-value   Identity  Description
E2GK51_BRYDI      3.3e-41   68.57     Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
A5AVN4_VITVI      1.8e-23   42.08     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017217 PE=4 SV=1
W9ST61_9ROSA      1.0e-18   46.41     Uncharacterized protein OS=Morus notabilis GN=L484_003432 PE=4 SV=1
A0A165U314_9ROSI  2.0e-17   45.52     Gag/pol protein OS=Momordica dioica PE=4 SV=1
E2GK52_BRYDI      8.2e-16   72.58     Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
BLAST vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|299474487|gb|ADJ18449.1|       4.8e-41   68.57     gag/pol protein [Bryonia dioica]
gi|659113937|ref|XP_008456829.1|  9.9e-39   66.19     PREDICTED: uncharacterized protein LOC103496667 [Cucumis melo]
gi|659113933|ref|XP_008456826.1|  4.3e-34   49.48     PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo]
gi|659118732|ref|XP_008459275.1|  1.4e-24   62.11     PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo]
gi|659082068|ref|XP_008441651.1|  6.9e-24   68.29     PREDICTED: uncharacterized protein LOC103485734 [Cucumis melo]
The following terms have been associated with this gene:
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh18G006000.1  CmoCh18G006000.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source  Source Term  Source Description  Alignment
None      No IPR available  PFAM    PF14223      Retrotran_gag_2     coord: 15..93, score: 7.3

The following gene(s) are orthologous to this gene:
Gene            Orthologue  Organism               Block
CmoCh18G006000  Cla010995   Watermelon (97103) v1  cmowmB378
The following gene(s) are paralogous to this gene:

None