Cucsa.365710 (gene) Cucumber (Gy14) v1

NameCucsa.365710
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationscaffold03611 : 3581695 .. 3582105 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACACACATTAATTGTGAGTATATCATTTATTTGCTTGTTTTTTATTTAGTTGGTCTTGCTCAAATATCTGAAAATCTAAATTCAATCCCTAAGTTGAACAGATTGAATTTCAAGATTTGGAATGAAAGCCTAGAGATACTTCTCAAGTGTATGGATCTAAACCTTGCATTAAGAACCAATAAATCTGCTTCTAATAAGGAACAATCGAATATGGCTAATATTGAGAAGTGGGAACGGTCACATCGCATGAGTTTGATGATTATTAGGCACTCCATTCGAGAGCCATTTCGGTGTTCTATCACTAAAAGTGAAAATGCCAAAAAGTTTCTTGCCAAAATTGAAAAATATTTTGCTAAAAAAGAAAAaGGGGAAGCAAGTAGTTTTTTTACTACTTTTAACTTCCATGAG

mRNA sequence

ATGACACACATTAATTGTGAGTATATCATTTATTTGCTTGTTTTTTATTTAGTTGGTCTTGCTCAAATATCTGAAAATCTAAATTCAATCCCTAAGTTGAACAGATTGAATTTCAAGATTTGGAATGAAAGCCTAGAGATACTTCTCAAGTGTATGGATCTAAACCTTGCATTAAGAACCAATAAATCTGCTTCTAATAAGGAACAATCGAATATGGCTAATATTGAGAAGTGGGAACGGTCACATCGCATGAGTTTGATGATTATTAGGCACTCCATTCGAGAGCCATTTCGGTGTTCTATCACTAAAAGTGAAAATGCCAAAAAGTTTCTTGCCAAAATTGAAAAATATTTTGCTAAAAAAGAAAAAGGGGAAGCAAGTAGTTTTTTTACTACTTTTAACTTCCATGAG

Coding sequence (CDS)

ATGACACACATTAATTGTGAGTATATCATTTATTTGCTTGTTTTTTATTTAGTTGGTCTTGCTCAAATATCTGAAAATCTAAATTCAATCCCTAAGTTGAACAGATTGAATTTCAAGATTTGGAATGAAAGCCTAGAGATACTTCTCAAGTGTATGGATCTAAACCTTGCATTAAGAACCAATAAATCTGCTTCTAATAAGGAACAATCGAATATGGCTAATATTGAGAAGTGGGAACGGTCACATCGCATGAGTTTGATGATTATTAGGCACTCCATTCGAGAGCCATTTCGGTGTTCTATCACTAAAAGTGAAAATGCCAAAAAGTTTCTTGCCAAAATTGAAAAATATTTTGCTAAAAAAGAAAAaGGGGAAGCAAGTAGTTTTTTTACTACTTTTAACTTCCATGAG

Protein sequence

MTHINCEYIIYLLVFYLVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIEKWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASSFFTTFNFHE
BLAST of Cucsa.365710 vs. TrEMBL
Match: A0A0B2NVV3_GLYSO (Uncharacterized protein (Fragment) OS=Glycine soja GN=glysoja_008082 PE=4 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 1.7e-23
Identity = 56/112 (50.00%), Postives = 78/112 (69.64%), Query Frame = 1

Query: 17  LVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIE 76
           +  +  +S  +NSI  LN  NF++W E +EI+L CMDL+LALR  +  S  + SN   IE
Sbjct: 2   VASVVNVSAQVNSISMLNGTNFQVWKEVIEIVLDCMDLDLALRMERPTSTSKASNEVKIE 61

Query: 77  KWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASS 129
           KW+RS+ M +MI++ SI E FR SI++ ENAKKF+ +IE+YFAK EK E S+
Sbjct: 62  KWDRSNLMCIMIMKRSIPEAFRGSISEGENAKKFIDEIEQYFAKNEKAETSN 113

BLAST of Cucsa.365710 vs. TrEMBL
Match: A0A151RDF9_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan GN=KK1_038014 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 2.2e-23
Identity = 60/129 (46.51%), Postives = 81/129 (62.79%), Query Frame = 1

Query: 6   CEYIIYLLVFY------LVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALR 65
           CE++I L  F       +     +S  +N IP LN  NFK W E++EI+L CMDL+LALR
Sbjct: 2   CEHVIKLNFFLCSITVVVASAVNLSAQINCIPMLNGTNFKAWKEAVEIILGCMDLDLALR 61

Query: 66  TNKSASNKEQSNMANIEKWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFA 125
             K   N E  +   +EKWERS+RM LMI++ S+ E FR SI++S+NAK  L  +E+YF 
Sbjct: 62  AEKPTPNPENPDEDKVEKWERSNRMCLMIMKRSVPEVFRDSISESQNAKGLLDVVEQYFT 121

Query: 126 KKEKGEASS 129
             EK +ASS
Sbjct: 122 SNEKADASS 130

BLAST of Cucsa.365710 vs. TrEMBL
Match: A0A151UI88_CAJCA (Uncharacterized protein (Fragment) OS=Cajanus cajan GN=KK1_050268 PE=4 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 8.5e-23
Identity = 57/125 (45.60%), Postives = 77/125 (61.60%), Query Frame = 1

Query: 4   INCEYIIYLLVFYLVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKS 63
           I   +    +   +     +S  +N IP  N  NFK W E++EI+L CMDL+LALR  K 
Sbjct: 1   IKLNFFFCSITVAVASAVNLSAQINCIPMFNGTNFKAWKEAVEIILGCMDLDLALRAEKL 60

Query: 64  ASNKEQSNMANIEKWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEK 123
             N E  +   +EKWERS+RM LMI++ S+ E FR SI++S+NAK FL  IE+YF   EK
Sbjct: 61  TPNPENPDEDKVEKWERSNRMCLMIMKRSVPEVFRGSISESQNAKGFLDAIEQYFTSNEK 120

Query: 124 GEASS 129
            +ASS
Sbjct: 121 ADASS 125

BLAST of Cucsa.365710 vs. TrEMBL
Match: A0A151S1C3_CAJCA (Uncharacterized protein (Fragment) OS=Cajanus cajan GN=KK1_029697 PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 1.2e-21
Identity = 53/102 (51.96%), Postives = 72/102 (70.59%), Query Frame = 1

Query: 27  LNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIEKWERSHRMSL 86
           +N IP LN  NFK W E++EI+L CMDL+LALR  K   N E  +   +EKWERS+RM L
Sbjct: 3   INCIPMLNGTNFKAWKEAVEIILGCMDLDLALRAEKPTPNPENPDEDKVEKWERSNRMCL 62

Query: 87  MIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASS 129
           MI++ S+ + FR SI++++NAK FL  +E+YF   EK +ASS
Sbjct: 63  MIMKRSVPKVFRGSISENQNAKGFLDVVEQYFTSNEKADASS 104

BLAST of Cucsa.365710 vs. TrEMBL
Match: A0A151RB35_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_038971 PE=4 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 1.8e-20
Identity = 52/96 (54.17%), Postives = 68/96 (70.83%), Query Frame = 1

Query: 33  LNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIEKWERSHRMSLMIIRHS 92
           LN  NFK W E++EI+L CMDL+LALR  K   N E  +   +EKWERS+RM LMI++ S
Sbjct: 2   LNGTNFKAWKEAVEIILGCMDLDLALRAEKPTPNPENPDEDKVEKWERSNRMCLMIMKRS 61

Query: 93  IREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASS 129
           + E FR SI++S+NAK FL  +E+YF   EK +ASS
Sbjct: 62  VPEVFRGSISESKNAKGFLDAVEQYFTSNEKADASS 97

BLAST of Cucsa.365710 vs. TAIR10
Match: AT5G53670.1 (AT5G53670.1 unknown protein)

HSP 1 Score: 79.0 bits (193), Expect = 2.6e-15
Identity = 48/136 (35.29%), Postives = 77/136 (56.62%), Query Frame = 1

Query: 1   MTHINCEYIIYLL--------VFYLVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCM 60
           M +++  Y +YL         +F ++G      N++SIP L+  NF  W E L ++L  M
Sbjct: 1   MLNVSVGYKVYLTTYLTLSFSLFSVLGYTSSLSNVDSIPMLSGSNFSEWKEHLLLVLALM 60

Query: 61  DLNLALRTNKSASNKEQSNMANIEKWERSHRMSLMIIRHSIREPFRCSITKS-ENAKKFL 120
           DL+L+L T + +S KE      ++ W+RS+R+S+MI++  I + FR  +      AK FL
Sbjct: 61  DLDLSLMTERPSSPKE------LKHWDRSNRVSIMIMKIRIPQGFRGVVPDDVTTAKDFL 120

Query: 121 AKIEKYFAKKEKGEAS 128
           A +E +FAK E+ E S
Sbjct: 121 ASLENFFAKNEEAERS 130

BLAST of Cucsa.365710 vs. NCBI nr
Match: gi|659102484|ref|XP_008452157.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103493262 [Cucumis melo])

HSP 1 Score: 146.0 bits (367), Expect = 4.9e-32
Identity = 78/104 (75.00%), Postives = 83/104 (79.81%), Query Frame = 1

Query: 18  VGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIEK 77
           VGLAQIS NL+ IPKLN  NFKIW ESL ILL CMDL+LAL   K A  KEQSN  NIEK
Sbjct: 4   VGLAQISGNLSLIPKLNGSNFKIWKESLXILLGCMDLDLALSIGKPAFTKEQSNTTNIEK 63

Query: 78  WERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKK 122
           WERS+RM LMII HSI E FR SIT+SENAKKFLA+IEK FAKK
Sbjct: 64  WERSNRMCLMIIEHSILESFRGSITESENAKKFLAEIEKSFAKK 107

BLAST of Cucsa.365710 vs. NCBI nr
Match: gi|571540926|ref|XP_006601635.1| (PREDICTED: uncharacterized protein LOC102662751 [Glycine max])

HSP 1 Score: 121.7 bits (304), Expect = 9.9e-25
Identity = 58/112 (51.79%), Postives = 83/112 (74.11%), Query Frame = 1

Query: 17  LVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIE 76
           +  +A ++  +NSIP LN+ NFK+W E++EI+L CMDL+LALRT +  S  E S+   IE
Sbjct: 8   IASVANVTAQVNSIPMLNKTNFKVWKEAVEIVLGCMDLDLALRTERPISILETSSEVKIE 67

Query: 77  KWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASS 129
           KW+RS+RM LMI++ SI E FR SI++ ++ KKFL +IE+YFAK EK + S+
Sbjct: 68  KWDRSNRMCLMIMKCSISEAFRGSISEGQSVKKFLEEIEQYFAKNEKAKTSN 119

BLAST of Cucsa.365710 vs. NCBI nr
Match: gi|1012116575|ref|XP_015961265.1| (PREDICTED: uncharacterized protein LOC107485266 [Arachis duranensis])

HSP 1 Score: 119.8 bits (299), Expect = 3.8e-24
Identity = 58/115 (50.43%), Postives = 78/115 (67.83%), Query Frame = 1

Query: 17  LVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIE 76
           +     IS  ++SIP LN  NFK+W +++EI+L CMDL+ ALR  K  S  E  N   IE
Sbjct: 1   MASATNISAQISSIPMLNGSNFKVWKDTVEIVLGCMDLDTALREEKPTSTPENLNEVKIE 60

Query: 77  KWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASSFFT 132
           KWERS+RMS+MI++ SI E FR SIT+ ++AK+FL  +E +F K EK EASS  +
Sbjct: 61  KWERSNRMSIMIMKRSISEAFRGSITEDKDAKQFLKDVENFFTKNEKAEASSLLS 115

BLAST of Cucsa.365710 vs. NCBI nr
Match: gi|1012196934|ref|XP_015972117.1| (PREDICTED: uncharacterized protein LOC107495482 [Arachis duranensis])

HSP 1 Score: 118.2 bits (295), Expect = 1.1e-23
Identity = 57/115 (49.57%), Postives = 79/115 (68.70%), Query Frame = 1

Query: 17  LVGLAQISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIE 76
           +   + +S  ++SIP LN  NFK+W +++EI+L CMDL+ ALR  K  S  E  N   IE
Sbjct: 1   MASASNVSAQISSIPMLNGSNFKVWKDTVEIVLDCMDLDTALREEKPTSTPENFNEVKIE 60

Query: 77  KWERSHRMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASSFFT 132
           KWERS+RMS+MI++ SI E F  SIT+ ++AK+FL  +EK+F K EK EASS  +
Sbjct: 61  KWERSNRMSIMIMKCSIPEVFSGSITEDKDAKQFLKNVEKFFTKNEKAEASSLLS 115

BLAST of Cucsa.365710 vs. NCBI nr
Match: gi|1012204318|ref|XP_015931468.1| (PREDICTED: uncharacterized protein LOC107457801 [Arachis duranensis])

HSP 1 Score: 117.9 bits (294), Expect = 1.4e-23
Identity = 55/109 (50.46%), Postives = 79/109 (72.48%), Query Frame = 1

Query: 23  ISENLNSIPKLNRLNFKIWNESLEILLKCMDLNLALRTNKSASNKEQSNMANIEKWERSH 82
           +S  + +IP LN  NFK+W +++EI+L CMDL++ALR  K  S  +  N   IEKWERS+
Sbjct: 7   VSAQITNIPMLNGSNFKVWKDTVEIVLDCMDLDIALREEKPTSTPKNLNEVKIEKWERSN 66

Query: 83  RMSLMIIRHSIREPFRCSITKSENAKKFLAKIEKYFAKKEKGEASSFFT 132
           RMS+MI++ SI E FR SIT++++AK+FL  +EK+F K +K EASS  +
Sbjct: 67  RMSIMIMKRSISEVFRGSITENKDAKQFLKDVEKFFTKNKKAEASSLLS 115

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0B2NVV3_GLYSO1.7e-2350.00Uncharacterized protein (Fragment) OS=Glycine soja GN=glysoja_008082 PE=4 SV=1[more]
A0A151RDF9_CAJCA2.2e-2346.51Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu... [more]
A0A151UI88_CAJCA8.5e-2345.60Uncharacterized protein (Fragment) OS=Cajanus cajan GN=KK1_050268 PE=4 SV=1[more]
A0A151S1C3_CAJCA1.2e-2151.96Uncharacterized protein (Fragment) OS=Cajanus cajan GN=KK1_029697 PE=4 SV=1[more]
A0A151RB35_CAJCA1.8e-2054.17Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
Match NameE-valueIdentityDescription
AT5G53670.12.6e-1535.29 unknown protein[more]
Match NameE-valueIdentityDescription
gi|659102484|ref|XP_008452157.1|4.9e-3275.00PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103493262 [Cucumis me... [more]
gi|571540926|ref|XP_006601635.1|9.9e-2551.79PREDICTED: uncharacterized protein LOC102662751 [Glycine max][more]
gi|1012116575|ref|XP_015961265.1|3.8e-2450.43PREDICTED: uncharacterized protein LOC107485266 [Arachis duranensis][more]
gi|1012196934|ref|XP_015972117.1|1.1e-2349.57PREDICTED: uncharacterized protein LOC107495482 [Arachis duranensis][more]
gi|1012204318|ref|XP_015931468.1|1.4e-2350.46PREDICTED: uncharacterized protein LOC107457801 [Arachis duranensis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016829 lyase activity
molecular_function GO:0046872 metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.365710.1Cucsa.365710.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35317FAMILY NOT NAMEDcoord: 13..127
score: 1.1
NoneNo IPR availablePANTHERPTHR35317:SF2SUBFAMILY NOT NAMEDcoord: 13..127
score: 1.1

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None