CsaV3_3G020870 (gene) Cucumber (Chinese Long) v3

NameCsaV3_3G020870
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr3 : 17214662 .. 17223287 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCTGAAGGACTTGAAAGTGAAAATTTATCTTTTCCAAGCCATTGATCGCACAATTCTGAAGACCATTCTCAAGAAAAATACTGCAAAAGAAATATGGGATGCTATGAAGAAAAAGTATGAAGGAAATGCAAGAGTCAGGCGGTCTTATCTTCAAGCTCTTTGTAGAGAATTTGAAATTCTTGAGATGAAGTCTGGTGAAGGGGTGACAGAGTACTTCTCCAGAGTCATGATTGTGGCAAATAAGATGCGAACTTATGGTGAAGATATGCAAGATGTAAAAGTAGTTGAAAAAATCCTACGCTCCTTGACTGACAACTTTAATTATATTGTTTCTTCTATTGAAGAGTCAAAAGATCCCAACACTCTCACTATTGATGAATTACAAAGTTCTTTAATAGTACATGAACAAAAGTTTCAACGACGAGGTGGGGAGGAGCAAGCCTTGAAAGTGACAAATGATGAAGGAAGAGGTCGTGGTAGTGGCAGCTATAGAGGAAGAGGTCGGGGAACTTTCAACAAAGCCAATGTGCAGTGTTTTCGATGCCAAAAATTTGGATATTTTCAATATGAATGTTCTGAAAACAAAGAAGCAAACTATGCTGAATTTGATGAGGAAGAAGAAATGTTTTTGATGTCTTATGAGGAAAAACATGGAGTTCAAAGAGAAGATACATGGATTCTTGATTTTGGGTGTTCAAATCATATGTGTGGTGATCGATCAATGTTTAGTGATCTCAATGAAGATTTTCGACATTCAGTGAAATTGGGAAACAACACTAGAATGAATGTCATGGGCAAAGGAAATGTAAAGTTGCTCATAAATGGAGTTAATCACGTTGTTGCTGAGGTATATTACATTCCAGATTTAAGTAGCAACCTATTGAGCATAGGACAATTGCAAGAAAAAGGAATGTCGATTTTGATCAAGCGAGGAGAGTGCAAAATATTTCATCCAAAGATGGATTTGATTATTCAGATCAAGATGAGCAACAGTAGGATGTTTACTTTGCAAGCTCAAACTCAAATATCTTGGGGAATTGACAAAAATAGGCCAAAAATGGGGCAGATAAAGACTTTTAGGATTGATTGTAAAAAGGTTGACATTTAGGACAAATTGGAGGTGAAATGACGAAAATACCCTCATTTAATTTCAATTCACATTTAATTTCCTCCTTCCTCTTTTCTTCTTTTTACTTTTCTTCAACCCAAAAACAGAGAAGGCAGTTCAAAAAAAAAAAAAATGTAAACCTCCCGCATTTGAACTCTCCCGCCTCTGCAGTTACCACCTTCAACTTTGTGCCTTCGGTGAACGTTTTTCTCTTCCGTCCATTTCTTCTCCACTTCCATCCATTTCTTCTCCACGACTCTCCACTATATTTTCCGATTTCAACTTTTTCACCGAACGCCTCCATTTCTTCTGCACGAGTCTCTGTTTTAGTTTCAACTTCAACTTCGGTGAACGCTTCTCCACTTCCATCCATTTCGATTTCAGCTTTGTTTTCACCGGACCTCTCCATTTCTTCTCCACCCCTCTACGTTGGCAGTGCCATATCCATCCTCTTTGAAGTCAAGTGAGTAATAAAATCTCAGTTTTTTTTTTAAAGTTAGTCTGTTGAAGATATGTTTATCTATTATTAGAGTATCCATCTTGTTTGTTTAAGTTGTCTGTTGAGATATGATGTTTTGGGCACATTTTAGTTATATCATTAAATTTGTAGAGAAGAAATGCATGTCGTATGCACACAGACATAGAGAACCGTGGGAGCCATTATGTTTTTAGGAAGAAATACGATTTCCTTTGCAATGTACTGTAATAATAGATAATTGGGTTAAGAATGTATTATATATGTGCATGCTACAAATATATGTATATATACATACCTTGGAGTTTAGAACATGAATTCCAACATATTTGGTATCCCATCTAAACTCTGCAAAGCTACCAGTACAACAATGTTGAACAATGTTGAAAGACATTATCATAAGCTGCATTGTTGTTCTCTACTTTGTTAATGTTTTTAGTGATGTAGTTCCAATAGAATACAATCTTTGTTGCCTTGTATAACCAAGAAGCTCCCCATATTAATTTGTCCTATGTCAAACCAATTTATTTAAGATTCATATGTTAGAGAAAATTTGAAGACAAATGATTCAATTAGTTTTGTTCTTAAACATACATGTAGGCTTTCCTCCTCTATTAATCATTTTAGTAAAGGTGTTACAATATATGTTTAGCCGTGCATGCAATTTTGTTTAGTAATCTAATTTGGTTTATTGGCCTTTACTTGACCAAAATTTGAGTAGAAGTCCCAAGATGAGAGTGTTACCCTTCCTAAATCGTTTAGAATGCATGTTTAACTAGTTTACTGGGTGTTAAGGGTTTAGGGTTAGCAATTTTATTAGACTTAGCAAGTAATAGCTTCGAAATTTTCTATCTCGCACCTATCAACACCTATCTCGCACCTTTCGAAATTTGAGTAGAAGTACCAAGATGAGAGTGTTAACCTTCCTAAACCGTTTAGAATGCATGTTTAGCTAGTTTACTGGGTGTTAAGACGTTAGGGTTAGCAATTTTATTAGACTTAGCAAGTAATAGACTTGAATTTTTCCATCTCGCACCTATCACCACCTATCTCGCACCTTCCAAAATTTGAGTAGAAGTCCCAAGATGAGAGTGTTAACCTTCCTAAACCGTTTAGAATGCATGTTTAGCTAGTTTACTGGGAGTTAAGGCTTTAGGGTTAGCAATTTTATTAGGCTTAGCAAGTAATAACCTTGAATTTTCCATCTCGCACCTATCAGCAACTATCTCGCACCTTCCAAAATTTGAGTAGAAGTCCCAAGATGAGAGTGTTAACCTTCCTAAACCGTTTAGAATGCATGTTTAGCTAGTTTACTAGGAGTTAAGGCTTTAGGGTTAGCAATTTTATTAGACTTAGCAAGTAATAGCCTTGAATTTTTCCATTTCGCACCTATCAGCACCTATCTCGCACCTTCCAAAATTTGAGTAGAAGTCCCAAGATGAGAGTGTTAACCTTCCTAAACCGTGTAGAATGCATGTTTAGCTATTTTACTAGATGTTAAGGCTTTAGGGTTAGCAATTTTATTAGACTTAGCAAGTAATAGCCTTGAATTTTTCCATTTCGCACCTTCCAAAATTTGAGTAGAAGTCCCAAGATGAGAGTGTTAACCTTCCTAAACCGTGTAGAATGCATGTTTAGCTATTTTATTGGATGTTAAGGCTTTAGGGTTAGCAATTTTATTAGACTTAGCAACTTTGCCTTCAATTTTTCTATCTCACACCTATCAACACCTATCTCGCACCTACCAAAATTTGAGTAGAAGTCTCAAGATGAGAGTGCATTCATGTTTAGTAAGTCTCTTATGCAATTATATTTAATTAGTTTTTTTTATGCAGGTTTTATTATGGCTTCGACATTAACTAACGGTCCCATGTACAAGATTGACCCTGCTCATCATTTTCAGTCTATAGTAAGCAGTTTATCTCATTTAGAAAACAGTACGCGTAAAATAAAGGCTAAATTGAAACCGGATCCATTAACCTTATTTAGAAATACAAAGTTTGGCCACTTTTTGGATCTTAATATAGTCTTTAATGGGCCGCTCATCCACTACTTATTGTTAAGGGAGGTGGAAGATGAAAGGGTTGATCATATCAGTTTCAAGATAGGGGAAGTTGTGTGCAGTTTTGGTAGGAGAGAGTTTAATATGATGACGGGTCTATGGAGTTCTCAACCAGAGCCTATTGAATTGGTTGGGAATAGTAGGTTGTTGGAGAAGTTCTTCGAACAAAAAAAGTGTATTTATATAAGTGACTTAGAGGACACATTTGTGGAATACGAGGGGGATGACGATGACGATGACATAGTTAAATTAGCTCTAGTGTACTTTATAGAGCTGTCATTGTTAGGAAAAGATAGGCGGACGAAAGTGGACCGAACTTTATTTAGGATTGCAGATGATTGGACCACATTTAACAATTACGATTGGGGAGGTTTGGTTTTTGGACGTACAATTTCTGCCTTAAAACGAGCCTTGGACATGCAACATGCCAAGGGAAAGAATAAATCAACTAAGACAAAATATACTGTCATGGGATTTCCGCAAGCGTTACAGGTATGTTACTTTACTTCCTATGCACATATTTATATATACATATATCTTGGGATACTTGCTAAACTTGTTTTGTCGAACATGGAATAGGTTTGGGCATATGAGTCTATACCAACCATCACTGAATGTGGTGTACATAAAGTAAGCAACGATGCAATACCACGAATGTTGAGGTGGGTGTGCGAACTATCACCAAAGTCTCATGTCCTACAGAGCCAAGTGTTTGACTCGCCAATGGTAAGTATCAACAAACAGTAACTTACCAAATGCATGACTTTGTTCCTTTATGTTTTGACTAACTAACTACATTTCCACTTTGTAGTTCATAATTAACGTGGTAATTGAGATGATGCCTGAAGAGGAGGAGCATCTAAGAATGTCTTCAGAGGAACTTGTTGAGAAATCCCATCCATCTAACACCGTTTCTGAGAAGAATGGTGATTCAAAACGACCAGGAGAAGCTAGTAATGATGACAATGACTGCAAAAAGAGTAAGAAAAAGAAGAAGTGGAAGTCTAAGATGAAAGAAGTTGTTCGAAAACTCAAATATCGAGTAGCGGTTCTCGAGAATGAACGTGGAAGCCTAAAATCAACGCTGTCGACTATATTGAAACACCTTGAAGTTCAAAAAAAGGTTAGGAATCAAAGTTTGTTTAACATTTTCAAAGTTTGTTTAACATTTTCAAAGTGTTCCTTACCAATACATTTTGACATTTCCAAAGGGTGAAGAAGGAGACTGCACGGGAGTTGAAGGTCATGATGCCCAGACCGAAGATGTTGACAAACCCGGTACACCTTCTTGGTTGTGGATGCCAAAGGAGGATGACACAAGTGATGTGGTGAAACACGTTGAACTTCAAAAAAAGGTTAGGAATCAAAGTTGTGTAATGGGGTGAAAATCATCCTTACTAATGCTTTATCCTTAGTAATTCATTTTGACCTTCCATAGGGTGTCGAAGCAAACCGCACGGAGGATGACACAATGGACGAGTTGGATAAGAAGGTTCATATTGATTTGGAGGAGCCAATAGACGTCGTTGACAATTTCAACGAGGAAATTGGAGTAAAATGTCTTACCTATTTTGATTCAGACGTCATGGAAATAGAACCATTATCCACTAAACGACCACACGTTCGACCCGCACGTAGCAAGCGTGCAAGTCACTTGTCAACCCCCTTCACAGCTTTAGTTAAACGGTCTACAAAATCAACCACCACCACCTCTCAGTCTCAACCATCCGTCTATGGTCCTATGCACAAGATACCTGACACCCATTTAGATCGACTCAGAGCTTGGATCACAGACAAGCGTACAAAAGATGAGGTGCGTGAAACTTTTCACGGGAAAAAATCGAAGGAATTTTTTCAGAGACTTGTTCATGTGTCGTCGGTGGTTGGCGGATGAGGTAAGGACTTTTCTTATTATTTCTTATTTACAGTTTTAGAGCACTTAGTAAAACCATGTTTGTTAACTAAAAATAGTATGTTCAAGAAGTTGGTTTGATCGCATAACCGAATTATATGTTTTCTTTTGTTTCTATTAGCATTTGGATGCACTCTTTCTTCTCATTCGCTTCAACATTAAGTCAGCCATGATACCTTCTGCTCAAAATTTCACAACTGTAGACACACTCTTCATGGTTTGTATTCGCAACCATACTTGATGTACTGAATTTGCTTTTGATATTTGTCTTATTGTCTTATTGTATTTGATATATATGCATGTGTGTTTTTTTTCCTTCCAGCGACTATTAGTTGCGAAGTGGCCTGAATACCAAGAATGTATTAAAGAGAATCGACCATTTCACTTGAAGGAGGAGTATCGATTGGTTGACTATGTTGTCGGATCAAAACAAGACTTTCAAGATCCTTGGGCGAATGTTGATTACATTTACTCTCCATTCAATATCCATGGCAATCATTGGATTCTATTATGCTTGGACTTGGTACGTTGTCAAGTTAAGGTATGGGATTCGCTTCCGTCGCTTACGAGTGCCGAAGATATGAGAAGCATATTAATGTCAATTCGAGAGATGGTGCCAAATTTGCTCGATACTACTGGATTCTTTGTTAGGAGAGGCGGATCATCAACACACAAGGAACCTTGGCCACTTGTCATTGTTGACTCCATTCCACTACAACGCAACAATAGTGATTGTGGTGTATTTACAATTAAGTATTTCGAATATGAAGCTTCTGGTTTAGATGTAGCTACATTATGTCAAGAAAACATGTCATATTTTAGAAAACAATTGGCATTTCAATTATGGACCAACAATCCCATGTATTGACTTTTAGTTCCAACTTTTGAAGGAATGGTATATGTTCCAAACTATGAACATAAATGGTTTTATATAATGGAATGAATACCATTGTTGTTTTATGGTATGAAATTTTTATTTCATTCTCTTGGGAAAATTGTAAAGTGTGCAGAACTTTTATATAATGTACTGCAGTGCAGGTGAGAGCAACATTCTCATGCTCTTAGAATTAGGAAGGCCAATTTATCACACGCTGTATGAAAATTTCACCTCTTTGAATGCTAATTGTTTTTTGCAGGTGTTTTAGAAACATTTTTGGATTTCTGATGTATGGATATAGGCATTCTTGACGCTTAAGTTGATTTTCTTAATTGTGTTAGTTGCTGAAAGTCACGAAAATGGAAAGTTTGTTCTGTCAGAATCCTGCTTTTAAATTAAACATTATTCTTTACAGCTTTGGAGTTGTTCTAATGGAAGTACCAACAGGAAAGAAGCCCATTTTCTAAGACACCTGCCAGGAAAGCAACAAGAAAGCAGAATGAGAGAACTAAAAGGTAATCTGAAAGAGATGGTAGATCCCAGCATATCATAGGCTCAGGTAGAGAACGTAGTCAAAGTGGTAAGGATCGAACTTCGCTACATGGCTAAGATTCCATCTACAAGGCCCTCCATGAAAATGGTGGTTCATATGCTTGAAGAGGTTGAACCTTGTAACTTTATTGACATTGTTGTCAAGAAAGAATGTGAAAACTAAAAGTAATCAGCAAGATTTGCATCATTACTACTGTAGTAGCAAAAGGAGAGCATTTATTGAGTTCTAAAGCACACAATATTCTTTACTTCTCACGGATGGTTCAAACATTATTCTTTACAGCTTTGGAGTTGTTCTAATGGAAGTACCAACAGGAAAGAAGCCCATTTTCTAAGACACCTGCCCAGGAAAGCAACAAGAAAGCAGAATGAGAGAACTAAAAGGTAATCTGAAAGAGATGGTAGATCCCAGCATATCATAGGCTCAGGTAGAGAACGTAGTCAAAGTGGTAAGGATCGAACTTCGCTACATGGCTAAGATTCCATCTACAAGGCCCTCCATGAAAATGGTGGTTCATATGCTTGAAGAGGTTGAACCTTGTAACTTTATTGACATTGTTGTCAAGAAAGAATGTGAAAACTAAAAGTAATCAGCAAGATTTGCATCATTACTACTGTAGTAGCAAAAGGAGAGCATTTATTGAGTTCTAAAGCACACAATATTCTTTACTTCTCACGGATGGTTCAAACATTATTCTTTACAGCTTTTGGAGTTGTTCTAATGGAAGTACCAACAGGAAAGAAGCCCATTTTCTAAGACACCTGCCAGGAAAGCAACAAGAAAGCAGAATGAGAGAACTAAAAGGTAATCTGAAAGAGATGGTAGATCCCAGCATATCATAGGCTCAGGTAGAGAACGTAGTCAAAGTGGTAAGGATCGAACTTCGCTACATGGCTAAGATTCCATCTACAAGGCCCTCCATGAAAATGGTGGTTCATATGCTTGAAGAGGTTGAACCTTGTAACTTTATTGACATTGTTGTCAAGAAAGAATGTGAAAACTAAAAGTAATCAGCAAGATTTGCATCATTACTACTGTAGTAGCAAAAGGAGAGCATTTATTGAGTTCTAAAGCACACAATATTCTTTACTTCTCACGGATGGTTCAAACATTATTCTTTACAGCTTTGGAGTTGTTCTAATGGAAGTACCAACAGGAAAGAAGCCCATTTTCTAAGACACCTGCCATGAAAGCAGAATGAGATATCTAAAAGGTAATCTGAAAGAGATGGTAGATCCAAACATATCATAGGCTCAGGTGGAGAACGCAGTCAAAGTGGTAAGGATCGCACTTCGCTGCACGACTAAGATTCCATCTACAAGGCCCTCCATGCAAATGGTGGTTCATATGCTTGAAGAGGCTGAACCTTGTAACTTTATTGACATTGTTGTCAAGAAAGAATGTGAAAACTAAAAGTAATCAGCAAGATTTGCATCATTAATACTGCAGTAGCAAAAGGAGAGCATTTATGGAGTTTAGTGTTAGGTCATAATAGGATTTCAAGTCTTTCTCAAAAAGCTTGAAATGCCAAAGGAGAGCATTTATGTTAAAAACGACCTCCTGCTTTAGCTATGAAGTTCTACCTTATTCAAACTTTACAGAAAGATGAGATTAAAGTGAAAAATATTCCTTAA

mRNA sequence

ATGAAGCTGAAGGACTTGAAAGTGAAAATTTATCTTTTCCAAGCCATTGATCGCACAATTCTGAAGACCATTCTCAAGAAAAATACTGCAAAAGAAATATGGGATGCTATGAAGAAAAAGTATGAAGGAAATGCAAGAGTCAGGCGGTCTTATCTTCAAGCTCTTTGTAGAGAATTTGAAATTCTTGAGATGAAGTCTGGTGAAGGGGTGACAGAGTACTTCTCCAGAGTCATGATTGTGGCAAATAAGATGCGAACTTATGGTGAAGATATGCAAGATAAAGATGAGATTAAAGTGAAAAATATTCCTTAA

Coding sequence (CDS)

ATGAAGCTGAAGGACTTGAAAGTGAAAATTTATCTTTTCCAAGCCATTGATCGCACAATTCTGAAGACCATTCTCAAGAAAAATACTGCAAAAGAAATATGGGATGCTATGAAGAAAAAGTATGAAGGAAATGCAAGAGTCAGGCGGTCTTATCTTCAAGCTCTTTGTAGAGAATTTGAAATTCTTGAGATGAAGTCTGGTGAAGGGGTGACAGAGTACTTCTCCAGAGTCATGATTGTGGCAAATAAGATGCGAACTTATGGTGAAGATATGCAAGATAAAGATGAGATTAAAGTGAAAAATATTCCTTAA

Protein sequence

MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFEILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQDKDEIKVKNIP
BLAST of CsaV3_3G020870 vs. NCBI nr
Match: XP_011652786.1 (PREDICTED: uncharacterized protein LOC105435094 [Cucumis sativus])

HSP 1 Score: 183.7 bits (465), Expect = 3.1e-43
Identity = 93/93 (100.00%), Postives = 93/93 (100.00%), Query Frame = 0

Query: 1  MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
          MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE
Sbjct: 1  MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60

Query: 61 ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
          ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD
Sbjct: 61 ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 93

BLAST of CsaV3_3G020870 vs. NCBI nr
Match: PNX85109.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense])

HSP 1 Score: 155.6 bits (392), Expect = 9.2e-35
Identity = 77/93 (82.80%), Postives = 87/93 (93.55%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLK K YLFQAIDRT+L+TILKK+T+K+IWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 63  MKLKDLKAKNYLFQAIDRTVLETILKKDTSKDIWDAMKKKFEGNARVKRSHLQALRREFE 122

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVTEYFSRVM VANKMRTYGE+M D
Sbjct: 123 TLEMRSGEGVTEYFSRVMTVANKMRTYGEEMSD 155

BLAST of CsaV3_3G020870 vs. NCBI nr
Match: XP_021648424.1 (uncharacterized protein LOC110641129 [Hevea brasiliensis])

HSP 1 Score: 155.2 bits (391), Expect = 1.2e-34
Identity = 78/93 (83.87%), Postives = 86/93 (92.47%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLKVK YLFQAIDRT+L TILKK+TAK+IWDAMKKK+EGNA+V+RS+LQAL REFE
Sbjct: 63  MKLKDLKVKNYLFQAIDRTVLDTILKKDTAKDIWDAMKKKFEGNAKVKRSHLQALRREFE 122

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+S EGVTEYFSRVM VANKMR YGEDMQD
Sbjct: 123 TLEMRSSEGVTEYFSRVMTVANKMRLYGEDMQD 155

BLAST of CsaV3_3G020870 vs. NCBI nr
Match: XP_015387654.1 (uncharacterized protein LOC107177766 [Citrus sinensis])

HSP 1 Score: 154.5 bits (389), Expect = 2.0e-34
Identity = 77/93 (82.80%), Postives = 87/93 (93.55%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLKVK YLFQAIDRT+L TILKK+TAKEIWDA+KKK+EGNAR++RS+LQAL REFE
Sbjct: 40  MKLKDLKVKNYLFQAIDRTVLDTILKKDTAKEIWDAIKKKFEGNARLKRSHLQALRREFE 99

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVT+YFS+VM VANKMR YGEDMQD
Sbjct: 100 TLEMRSGEGVTKYFSKVMTVANKMRIYGEDMQD 132

BLAST of CsaV3_3G020870 vs. NCBI nr
Match: XP_019416461.1 (PREDICTED: uncharacterized protein LOC109327762 [Lupinus angustifolius])

HSP 1 Score: 151.4 bits (381), Expect = 1.7e-33
Identity = 76/93 (81.72%), Postives = 84/93 (90.32%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKL DLK K YLFQAIDRT+L TILKK+T+KEIWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 115 MKLMDLKTKNYLFQAIDRTVLDTILKKDTSKEIWDAMKKKFEGNARVKRSHLQALRREFE 174

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVTEYFSRVM VANKMR YGE+M D
Sbjct: 175 TLEMRSGEGVTEYFSRVMTVANKMRIYGEEMSD 207

BLAST of CsaV3_3G020870 vs. TrEMBL
Match: tr|A0A2N9GII1|A0A2N9GII1_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27255 PE=4 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 7.2e-36
Identity = 80/93 (86.02%), Postives = 87/93 (93.55%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLKVK YLFQAIDRT+L TILKK+TAK+IWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 10  MKLKDLKVKNYLFQAIDRTVLDTILKKDTAKDIWDAMKKKFEGNARVKRSHLQALRREFE 69

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVTEYFSRVM VANKMR YGEDMQD
Sbjct: 70  TLEMRSGEGVTEYFSRVMTVANKMRIYGEDMQD 102

BLAST of CsaV3_3G020870 vs. TrEMBL
Match: tr|A0A2N9H9R9|A0A2N9H9R9_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS39009 PE=4 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 2.1e-35
Identity = 79/93 (84.95%), Postives = 87/93 (93.55%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLKVK YLFQAIDRT+L TILKK+TAK+IWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 63  MKLKDLKVKNYLFQAIDRTVLDTILKKDTAKDIWDAMKKKFEGNARVKRSHLQALRREFE 122

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVTEYFSRVM VANKMR YGE+MQD
Sbjct: 123 TLEMRSGEGVTEYFSRVMTVANKMRIYGEEMQD 155

BLAST of CsaV3_3G020870 vs. TrEMBL
Match: tr|A0A2K3M2V4|A0A2K3M2V4_TRIPR (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g041175 PE=4 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 6.1e-35
Identity = 77/93 (82.80%), Postives = 87/93 (93.55%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLK K YLFQAIDRT+L+TILKK+T+K+IWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 63  MKLKDLKAKNYLFQAIDRTVLETILKKDTSKDIWDAMKKKFEGNARVKRSHLQALRREFE 122

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+SGEGVTEYFSRVM VANKMRTYGE+M D
Sbjct: 123 TLEMRSGEGVTEYFSRVMTVANKMRTYGEEMSD 155

BLAST of CsaV3_3G020870 vs. TrEMBL
Match: tr|A0A2N9G6A8|A0A2N9G6A8_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26069 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 2.3e-34
Identity = 78/93 (83.87%), Postives = 85/93 (91.40%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDLKVK YLFQAIDRT+L TILKK+TAK+IWDAMKKK+EGNARV+RS+LQAL REFE
Sbjct: 63  MKLKDLKVKNYLFQAIDRTVLDTILKKDTAKDIWDAMKKKFEGNARVKRSHLQALHREFE 122

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
            LEM+  EGVTEYFSRVM VANKMR YGEDMQD
Sbjct: 123 TLEMRFDEGVTEYFSRVMTVANKMRIYGEDMQD 155

BLAST of CsaV3_3G020870 vs. TrEMBL
Match: tr|A0A151S692|A0A151S692_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027981 PE=4 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 1.5e-33
Identity = 75/93 (80.65%), Postives = 86/93 (92.47%), Query Frame = 0

Query: 1   MKLKDLKVKIYLFQAIDRTILKTILKKNTAKEIWDAMKKKYEGNARVRRSYLQALCREFE 60
           MKLKDL+VK YLFQAIDRTIL+TIL+KNT+KEIWD+MK+KYEGNARV+RS LQ L +EFE
Sbjct: 65  MKLKDLQVKNYLFQAIDRTILETILQKNTSKEIWDSMKRKYEGNARVKRSILQGLRKEFE 124

Query: 61  ILEMKSGEGVTEYFSRVMIVANKMRTYGEDMQD 94
           ILEMKSGE +T+YFSRVM VA+KMRTYGE MQD
Sbjct: 125 ILEMKSGESITDYFSRVMSVASKMRTYGEQMQD 157

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011652786.13.1e-43100.00PREDICTED: uncharacterized protein LOC105435094 [Cucumis sativus][more]
PNX85109.19.2e-3582.80retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium ... [more]
XP_021648424.11.2e-3483.87uncharacterized protein LOC110641129 [Hevea brasiliensis][more]
XP_015387654.12.0e-3482.80uncharacterized protein LOC107177766 [Citrus sinensis][more]
XP_019416461.11.7e-3381.72PREDICTED: uncharacterized protein LOC109327762 [Lupinus angustifolius][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
tr|A0A2N9GII1|A0A2N9GII1_FAGSY7.2e-3686.02Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27255 PE=4 SV=1[more]
tr|A0A2N9H9R9|A0A2N9H9R9_FAGSY2.1e-3584.95Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS39009 PE=4 SV=1[more]
tr|A0A2K3M2V4|A0A2K3M2V4_TRIPR6.1e-3582.80Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Trifol... [more]
tr|A0A2N9G6A8|A0A2N9G6A8_FAGSY2.3e-3483.87Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26069 PE=4 SV=1[more]
tr|A0A151S692|A0A151S692_CAJCA1.5e-3380.65Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_3G020870.1CsaV3_3G020870.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 3..97
e-value: 4.5E-15
score: 55.4

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_3G020870CSPI03G20990Wild cucumber (PI 183967)cpicucB144
CsaV3_3G020870Cucsa.031190Cucumber (Gy14) v1cgycucB006
The following gene(s) are paralogous to this gene:

None