CmoCh14G011790 (gene) Cucurbita moschata (Rifu)

NameCmoCh14G011790
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
Description(Retrovirus-related Pol polyprotein from transposon TNT 1-94) (3.1.13.-)
LocationCmo_Chr14 : 9730975 .. 9731720 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGCTCTTCTCGGTTCTCAACATGCATGGGAGGTGGTCAAAAAAGGTTTCGAAGAACCAAAAGATACCACGGGTTATATGGCGGCACAAAGCAAGGCGTTGAAAGAGGTGCGATCAAAGGATAAGGCGACACTATACATGTTGTTCCGAGCCATTGACGAGTCAGGCTTTGAGAAGATTACCAGTGCAACTACTTCAAAAGAAGTGTGGGACATTTTAGGAAAAGTGTTCAAAGAGGCCGACCAAGTCAAGCAAGTGCGTCTCCAAACTCTTCATTGCGAGTTGGAGAGCATGAAGATGAAGGAGTCAAAAAGTGTATTTGACTACATCACGCGTGTACTAACTGTGATAAACCAACTCAACCGAAACGGAGAAGTATTAACCGAGACGCAGTTGTGGAGAAGATTTTGAGATCATTAACCAACAATTTTGAGAACGTTGTATGCGTGATAGAAGAGTCAAAGGACCTAGCGACGTTCATGGTCGATGAGCTTGCCGGTTCTCTCGAGACACAAGAGCAACGTAAGAAAAAGAAGGAGGAAACACTCGATCAAGCACTTCCAATCAAAGATGAAAAGGTACTCTATTCTCAAAATTTTCAAGGTAGAGGTCGTGATCATGGAAGTCGTGGTCGCAGTGATCAAGGCAGCTGTGTCATGAAGGATTTTATAAGGAGAAGGGACAGTCGAGCCAAGCAAATTGGCGTGGAAGAGGACGCGGTCGAGGAAGAGGCGGCCAATTAA

mRNA sequence

ATGAAAGCTCTTCTCGGTTCTCAACATGCATGGGAGGTGGTCAAAAAAGGTTTCGAAGAACCAAAAGATACCACGGGTTATATGGCGGCACAAAGCAAGGCGTTGAAAGAGGTGCGATCAAAGGATAAGGCGACACTATACATGTTGTTCCGAGCCATTGACGAGTCAGGCTTTGAGAAGATTACCAGTGCAACTACTTCAAAAGAAGTGTGGGACATTTTAGGAAAAGTGTTCAAAGAGGCCGACCAAGTCAAGCAAGTGCGTCTCCAAACTCTTCATTGCGAGTTGGAGAGCATGAAGATGAAGGAGTCAAAAAAAGAGTCAAAGGACCTAGCGACGTTCATGGTCGATGAGCTTGCCGGTTCTCTCGAGACACAAGAGCAACGTAAGAAAAAGAAGGAGGAAACACTCGATCAAGCACTTCCAATCAAAGATGAAAAGGTACTCTATTCTCAAAATTTTCAAGGTAGAGGTCGTGATCATGGAAGTCGTGGTCGCAGTGATCAAGGCAGCTGTGTCATGAAGGATTTTATAAGGAGAAGGGACAGTCGAGCCAAGCAAATTGGCGTGGAAGAGGACGCGGTCGAGGAAGAGGCGGCCAATTAA

Coding sequence (CDS)

ATGAAAGCTCTTCTCGGTTCTCAACATGCATGGGAGGTGGTCAAAAAAGGTTTCGAAGAACCAAAAGATACCACGGGTTATATGGCGGCACAAAGCAAGGCGTTGAAAGAGGTGCGATCAAAGGATAAGGCGACACTATACATGTTGTTCCGAGCCATTGACGAGTCAGGCTTTGAGAAGATTACCAGTGCAACTACTTCAAAAGAAGTGTGGGACATTTTAGGAAAAGTGTTCAAAGAGGCCGACCAAGTCAAGCAAGTGCGTCTCCAAACTCTTCATTGCGAGTTGGAGAGCATGAAGATGAAGGAGTCAAAAAAAGAGTCAAAGGACCTAGCGACGTTCATGGTCGATGAGCTTGCCGGTTCTCTCGAGACACAAGAGCAACGTAAGAAAAAGAAGGAGGAAACACTCGATCAAGCACTTCCAATCAAAGATGAAAAGGTACTCTATTCTCAAAATTTTCAAGGTAGAGGTCGTGATCATGGAAGTCGTGGTCGCAGTGATCAAGGCAGCTGTGTCATGAAGGATTTTATAAGGAGAAGGGACAGTCGAGCCAAGCAAATTGGCGTGGAAGAGGACGCGGTCGAGGAAGAGGCGGCCAATTAA
BLAST of CmoCh14G011790 vs. TrEMBL
Match: A0A151RCL3_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_038469 PE=4 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 4.7e-38
Identity = 109/205 (53.17%), Postives = 125/205 (60.98%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ AWEVV++GFEEPK TTGY AAQ K LKE RSKDKA LY+L+RA+DESGFEK
Sbjct: 26  MKALLGSQDAWEVVQEGFEEPKTTTGYSAAQHKMLKETRSKDKAALYLLYRAVDESGFEK 85

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKESKDLATF--MVDE 120
           I  A+TSKE WDIL KV++ AD+VKQV LQTL  ELE+MKMKES+  S  +  F  MV++
Sbjct: 86  IARASTSKEAWDILAKVYRGADKVKQVCLQTLRGELENMKMKESEGVSDYIIRFQTMVNQ 145

Query: 121 LAGSLET---------------------------------------------QEQRKKKK 154
           L  + ET                                              EQRKKKK
Sbjct: 146 LNQNGETLTDVRVVEKILRSLTDSFENVICAIEESKDLTMITVDELAESLEAHEQRKKKK 205

BLAST of CmoCh14G011790 vs. TrEMBL
Match: A0A151RNB3_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_034467 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 4.9e-35
Identity = 101/181 (55.80%), Postives = 124/181 (68.51%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ AWEVV++GFEEPK+T GY+ AQ K LKE RSKDKA LY+L+RA+DESGFEK
Sbjct: 26  MKALLGSQDAWEVVQEGFEEPKNTMGYLVAQHKTLKETRSKDKAALYLLYRAVDESGFEK 85

Query: 61  ITSATTSKE-----VWDILGKVFKEADQVKQVRL--QTLHCELESMK-MKESKKESKDLA 120
           I  A+TSKE     V  +L ++ +    +  VR+  + L     S K +  + +ESKDL 
Sbjct: 86  IARASTSKEARNTRVQTLLNQLNQNGQTLTNVRVVEKILRSLTNSFKNVICAIEESKDLT 145

Query: 121 TFMVDELAGSLETQEQRKKKKEE-TLDQAL----PIKDEKVLYSQNFQGRGRDHGSRGRS 169
              V+ELA SLE  EQRKKKKEE  L+QAL     IKDEK LYSQN +GRGR  G  GR+
Sbjct: 146 MLTVNELARSLEAHEQRKKKKEEKILEQALQIKASIKDEKALYSQNIRGRGRGRGG-GRT 205

BLAST of CmoCh14G011790 vs. TrEMBL
Match: A0A059QBK0_PHAVU (Polyprotein OS=Phaseolus vulgaris PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.1e-34
Identity = 80/108 (74.07%), Postives = 93/108 (86.11%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ +WEVV++GFEEP +TTGY AAQ+KALKE+RSKDKA LYML+RA+DE+ FEK
Sbjct: 26  MKALLGSQDSWEVVEEGFEEPTNTTGYTAAQTKALKEMRSKDKAALYMLYRAVDEAIFEK 85

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKES 109
           I  A+TSKE WDIL KVFK AD+VKQVRLQTL  ELE+MKM ES+  S
Sbjct: 86  IAGASTSKEAWDILEKVFKGADRVKQVRLQTLRGELENMKMMESESVS 133

BLAST of CmoCh14G011790 vs. TrEMBL
Match: A0A151QV96_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_044903 PE=4 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 2.6e-28
Identity = 70/109 (64.22%), Postives = 85/109 (77.98%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ  W+VV+ G+EEP  T GY  AQ  ALK  R+KDKATLY+L+RA+DESGFEK
Sbjct: 1   MKALLGSQDNWDVVENGYEEPVTTEGYTNAQMNALKVARAKDKATLYLLYRAVDESGFEK 60

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKESK 110
           I +A +SKE WDIL K  K  ++VKQVRLQTL  ELE+M+MKES+  S+
Sbjct: 61  IANAKSSKEAWDILEKAKKGDERVKQVRLQTLRGELENMRMKESEGVSE 109

BLAST of CmoCh14G011790 vs. TrEMBL
Match: A0A151SYY3_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_015466 PE=4 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 1.3e-27
Identity = 84/213 (39.44%), Postives = 115/213 (53.99%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MK LLGSQ  W++V+KGF+EP++      AQ  AL++ R KDK+ LY L+  +DESGFEK
Sbjct: 1   MKTLLGSQSLWDIVEKGFQEPEEDEDQSVAQIAALEKTRVKDKSALYFLYNVVDESGFEK 60

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESK--------------- 120
           I +  +SKE W IL    +    V+Q+RLQTL  E E +KM++ +               
Sbjct: 61  IANTASSKEAWKILEVAHRGNHHVRQIRLQTLRGEFECLKMEDKEQVSEYITRGEPMPAN 120

Query: 121 ---------------------KESKDLATFMVDELAGSLETQEQRKKKKEETLDQA---- 171
                                +ESKDL+   V+ELAGSLE  EQR++K + TLDQA    
Sbjct: 121 RIVEKILRSLTDDFESIACVIEESKDLSVLSVEELAGSLEAHEQRRRKMKGTLDQALQAQ 180

BLAST of CmoCh14G011790 vs. TAIR10
Match: AT1G48720.1 (AT1G48720.1 unknown protein)

HSP 1 Score: 64.3 bits (155), Expect = 9.7e-11
Identity = 27/69 (39.13%), Postives = 45/69 (65.22%), Query Frame = 1

Query: 1  MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
          MKA+LG+   WE+V+KGF EP++       Q   L++ R +DK  L ++++ +DE  FEK
Sbjct: 25 MKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKALCLIYQGLDEDTFEK 84

Query: 61 ITSATTSKE 70
          +  AT++K+
Sbjct: 85 VVEATSAKD 93

BLAST of CmoCh14G011790 vs. TAIR10
Match: AT3G21000.1 (AT3G21000.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 48.9 bits (115), Expect = 4.2e-06
Identity = 41/127 (32.28%), Postives = 68/127 (53.54%), Query Frame = 1

Query: 2   KALLGSQHAWEVVKKGFEEPKDTTGYMAA--QSKALKEVRS---KDKATLYMLFRAIDES 61
           K+ L  Q  W+VV  G  +       +AA  Q + L + R    KD   L +L  ++ +S
Sbjct: 25  KSTLIEQGLWDVVVNGVPQDPSKNPELAATIQPEELSKWRDFVVKDAKALQILQSSLTDS 84

Query: 62  GFEKITSATTSKEVWDILGKVFKEAD--QVKQVRLQTLHCELESMKM--KESKKESKDLA 120
            F K  SA+++K+VWD+L K  ++A   +++QV ++ L  +LE +KM  KES     D A
Sbjct: 85  VFRKTLSASSAKDVWDLLRKGNEQATIRRLEQVTIRRLEKQLEDLKMVDKESGSSYLDKA 144

BLAST of CmoCh14G011790 vs. NCBI nr
Match: gi|1012328580|gb|KYP40213.1| (hypothetical protein KK1_038469 [Cajanus cajan])

HSP 1 Score: 166.0 bits (419), Expect = 6.7e-38
Identity = 109/205 (53.17%), Postives = 125/205 (60.98%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ AWEVV++GFEEPK TTGY AAQ K LKE RSKDKA LY+L+RA+DESGFEK
Sbjct: 26  MKALLGSQDAWEVVQEGFEEPKTTTGYSAAQHKMLKETRSKDKAALYLLYRAVDESGFEK 85

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKESKDLATF--MVDE 120
           I  A+TSKE WDIL KV++ AD+VKQV LQTL  ELE+MKMKES+  S  +  F  MV++
Sbjct: 86  IARASTSKEAWDILAKVYRGADKVKQVCLQTLRGELENMKMKESEGVSDYIIRFQTMVNQ 145

Query: 121 LAGSLET---------------------------------------------QEQRKKKK 154
           L  + ET                                              EQRKKKK
Sbjct: 146 LNQNGETLTDVRVVEKILRSLTDSFENVICAIEESKDLTMITVDELAESLEAHEQRKKKK 205

BLAST of CmoCh14G011790 vs. NCBI nr
Match: gi|823190427|ref|XP_012491142.1| (PREDICTED: uncharacterized protein LOC105803474 [Gossypium raimondii])

HSP 1 Score: 162.9 bits (411), Expect = 5.7e-37
Identity = 83/105 (79.05%), Postives = 92/105 (87.62%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ  WEVV++GF EPK TTGY AAQ+KALKE+RSKDKA LYMLFRA+DESGFEK
Sbjct: 25  MKALLGSQDGWEVVQEGFVEPKTTTGYTAAQNKALKEIRSKDKAVLYMLFRAVDESGFEK 84

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESK 106
           I SATTSKE WDIL KV+K AD+VKQVRLQTL  +LE MKMKES+
Sbjct: 85  IASATTSKEAWDILAKVYKGADRVKQVRLQTLRGKLEGMKMKESE 129

BLAST of CmoCh14G011790 vs. NCBI nr
Match: gi|1012332620|gb|KYP44044.1| (hypothetical protein KK1_034467 [Cajanus cajan])

HSP 1 Score: 156.0 bits (393), Expect = 7.0e-35
Identity = 101/181 (55.80%), Postives = 124/181 (68.51%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ AWEVV++GFEEPK+T GY+ AQ K LKE RSKDKA LY+L+RA+DESGFEK
Sbjct: 26  MKALLGSQDAWEVVQEGFEEPKNTMGYLVAQHKTLKETRSKDKAALYLLYRAVDESGFEK 85

Query: 61  ITSATTSKE-----VWDILGKVFKEADQVKQVRL--QTLHCELESMK-MKESKKESKDLA 120
           I  A+TSKE     V  +L ++ +    +  VR+  + L     S K +  + +ESKDL 
Sbjct: 86  IARASTSKEARNTRVQTLLNQLNQNGQTLTNVRVVEKILRSLTNSFKNVICAIEESKDLT 145

Query: 121 TFMVDELAGSLETQEQRKKKKEE-TLDQAL----PIKDEKVLYSQNFQGRGRDHGSRGRS 169
              V+ELA SLE  EQRKKKKEE  L+QAL     IKDEK LYSQN +GRGR  G  GR+
Sbjct: 146 MLTVNELARSLEAHEQRKKKKEEKILEQALQIKASIKDEKALYSQNIRGRGRGRGG-GRT 205

BLAST of CmoCh14G011790 vs. NCBI nr
Match: gi|545693870|gb|AGW47867.1| (polyprotein [Phaseolus vulgaris])

HSP 1 Score: 154.8 bits (390), Expect = 1.6e-34
Identity = 80/108 (74.07%), Postives = 93/108 (86.11%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ +WEVV++GFEEP +TTGY AAQ+KALKE+RSKDKA LYML+RA+DE+ FEK
Sbjct: 26  MKALLGSQDSWEVVEEGFEEPTNTTGYTAAQTKALKEMRSKDKAALYMLYRAVDEAIFEK 85

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKES 109
           I  A+TSKE WDIL KVFK AD+VKQVRLQTL  ELE+MKM ES+  S
Sbjct: 86  IAGASTSKEAWDILEKVFKGADRVKQVRLQTLRGELENMKMMESESVS 133

BLAST of CmoCh14G011790 vs. NCBI nr
Match: gi|1012321727|gb|KYP34173.1| (hypothetical protein KK1_044903 [Cajanus cajan])

HSP 1 Score: 133.7 bits (335), Expect = 3.7e-28
Identity = 70/109 (64.22%), Postives = 85/109 (77.98%), Query Frame = 1

Query: 1   MKALLGSQHAWEVVKKGFEEPKDTTGYMAAQSKALKEVRSKDKATLYMLFRAIDESGFEK 60
           MKALLGSQ  W+VV+ G+EEP  T GY  AQ  ALK  R+KDKATLY+L+RA+DESGFEK
Sbjct: 1   MKALLGSQDNWDVVENGYEEPVTTEGYTNAQMNALKVARAKDKATLYLLYRAVDESGFEK 60

Query: 61  ITSATTSKEVWDILGKVFKEADQVKQVRLQTLHCELESMKMKESKKESK 110
           I +A +SKE WDIL K  K  ++VKQVRLQTL  ELE+M+MKES+  S+
Sbjct: 61  IANAKSSKEAWDILEKAKKGDERVKQVRLQTLRGELENMRMKESEGVSE 109

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A151RCL3_CAJCA4.7e-3853.17Uncharacterized protein OS=Cajanus cajan GN=KK1_038469 PE=4 SV=1[more]
A0A151RNB3_CAJCA4.9e-3555.80Uncharacterized protein OS=Cajanus cajan GN=KK1_034467 PE=4 SV=1[more]
A0A059QBK0_PHAVU1.1e-3474.07Polyprotein OS=Phaseolus vulgaris PE=4 SV=1[more]
A0A151QV96_CAJCA2.6e-2864.22Uncharacterized protein OS=Cajanus cajan GN=KK1_044903 PE=4 SV=1[more]
A0A151SYY3_CAJCA1.3e-2739.44Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
Match NameE-valueIdentityDescription
AT1G48720.19.7e-1139.13 unknown protein[more]
AT3G21000.14.2e-0632.28 Gag-Pol-related retrotransposon family protein[more]
Match NameE-valueIdentityDescription
gi|1012328580|gb|KYP40213.1|6.7e-3853.17hypothetical protein KK1_038469 [Cajanus cajan][more]
gi|823190427|ref|XP_012491142.1|5.7e-3779.05PREDICTED: uncharacterized protein LOC105803474 [Gossypium raimondii][more]
gi|1012332620|gb|KYP44044.1|7.0e-3555.80hypothetical protein KK1_034467 [Cajanus cajan][more]
gi|545693870|gb|AGW47867.1|1.6e-3474.07polyprotein [Phaseolus vulgaris][more]
gi|1012321727|gb|KYP34173.1|3.7e-2864.22hypothetical protein KK1_044903 [Cajanus cajan][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh14G011790.1CmoCh14G011790.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 116..136
scor
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 40..106
score: 3.

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None