Csa4G123320 (gene) Cucumber (Chinese Long) v2

Name: Csa4G123320
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Unknown protein
Location: Chr4 : 7520653 .. 7524095 (-)
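
The Location field above can be split into chromosome, coordinates, and strand with a short script. This is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Pattern for a location string such as "Chr4 : 7520653 .. 7524095 (-)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }


loc = parse_location("Chr4 : 7520653 .. 7524095 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the locus spans 3443 bases on the minus strand of Chr4.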
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAACAAAATATTGGCAGCAAAGTTGCAGCAAAAAAATCAGGATTCTGTCGTTTCAGATCAGAGTAACACTGTTGAAACCAAAGAAATTGCATCCAAGCAAGATTTTGCTACTGCTATGGCTACTACCTGTCCCAAAGAAATCCAGGTAGATGAGACAAAGTTAACTGCTGCCAGCTCTGAGGTAATACCTTACTGGATGACTTACCATTAATTTTTCTGGACTTGTTGATATGGAAACCATCTTCATGTATGTTTGTCATAAGAACTTAAATTTTTTTTCCCACTAAGCAGCAATGCTTTTATATATAATTTTATAAAGCACTTCACTAGAAGATGAAAAGTTACAGCCAAAAAGGACTAATAGGAGTTTCAAGGAAAAAAATCATTGAACTTTGATGTTGGAGGCTTTTTTCCAAATGTAATTTGATGATTGAGAGCCTTTCTTTTTTTCATTCTTCTGTTATATACCTAGATTTATATAAGGCTAAGGGTAGTTAGTTATATAGGCAGTTAGATGGTTATAACTAGGAAGTTGGGAAGGGAAGAAAGGCATGAAGAATTTGGTGAAGATAATGGACTGCAACATTCCTTGAAAGAAAAGAAGCGCAGAGGGTTAGGGACTTTTCAATTAGGATCATAGCCTATGAAATTGCATCTAATGGCACTAGAGTCTGATCTGCCCTCGTGGTGATGAAAATTGTGAGCAGACGTGTAATGAAAAACTCTCTCGGGCATTTGGACTCATAAGTTTACTATTATGGCTTGTTTAAGTTTAGCTTTTAAGGATAGTGACGCTGACGGTTCAGGTAAGCGACGATTCGTGAGCTATGTGTAGGCCGCCCTACTGAGTATATTAATCAAAATGTGAGCTCAATGATGATCTCAAGTTTTTAAAATCATAGAAAAAAAAATGAAAAGCTCCTTTTCTTTAAAGGGAAGTTTTTAAAATCATAGAAAAAGATAAAATCCTCTTATAAACACCTATAGAGTAGTTATTTCTAATAATTGACAACCATCACATTTATGGCTAACTATCTATTTTTCAATGGATAATTTAGGAAGTAAATTTAGAAAAAACGTAATTATGAAACCATCATGAGTCGGCCTAGTGGTAACAAGGTCATTGACTAACTAAGAGGTCATGGGTTCTCGATCCATAGTGGCCACCTACCTAGGAGTTTCCTTGACATCCAAATGTTTGGGTCAGGCAGGTTGTCCCGTGAAATTAGTTGAGGTGCGTGTAAGCTAGTCCGGACACTCACAGATATACAAAAAAAACGTAATTATGCAACAAATAATATTGATTATGGTGGTATAACACATAATGCAAAAAAAGAAAACCACTAGTGCTACAACACTTGGAGGGTAATTTCTAATGTTTTCACTGTTTCACTATGTTTTGGTATTTATGTCATTGTATTTTCATTTGTTGCTAAGATTCCTTTTGTTTCATGTCTGGTACTTTGGGCATTGGACCCATTTCATTATTTAAATGAAAAGTCTTATTTCTGTTAAAAACAGAAAAGAAACCCATATGGGTCCAAGTAAATAGTAAGGGGTTAGAGGGAATGAGCACAAGCCACGGGAAGCTACCTTAGGATTTATTATTGTATGTGTACACCTTGGCAATCAAGCGTAGGAGGGTCAATTAGTTGTCTAATGAGAATAATTGAGGTGTGCCTGGATACTCATGTATGATGTATTTATATACATTTATATACTCATGTATTTAAAAGGAAGTAAAAAAAAGGAAAGAAAAATGAGCATTATTGTCATCAACATCTCCTGTTTTAAGTAATTCAAGTCTTACGATTATTGATTAAAGACTGGAGGGAACTTCAACATTGCTAATTGTGCTCTCAATTTAAAACATGCAAAATATACCCATAATCCATATGAAAGGAATTTGATGGTTAGGATTAGATCGGATGTATTGCTAATTAATAACAAAGTGCAATTAGTTCTGGAAGAAATAACACATTTCTTAAGGTTAGA
ACTGAAGATCATTCTTGAGCCTATTCTAGTTGTATTTGTTATAAAACTGTCAACATTTATATAATTATGAGTTTCAAAACAATCACAGAAGAGATATCACTTGGCTTGTGAATTTACGGTCCAAGAGATTATGATGTTTAGTTTTAGAAGAACTTATCATCTTGGAACCAAACAGTGCAGTACTTGTTATTGCATAACTTCTCTCTTTCATGGTAGAGGAAGAAAACTGACCAAATGAACTAAAAAAGGAATTAAGAACTTATCTGAAGAAAATGTCACATTTAAGAGTTATGGCATCTTTACAAAAGAACTTATTAATAAAACAACTTTGAAGAACCCCAGTGAAACTTAAAGGAATACAAAAAGACCGAATCTATCAGTAACTAACAACAAACTACATCACCTATTGAACAAATGCTGGCTACTCAGACACGGTATGCAATACAAATGGTACAAATGGACTACTGCACGAAATGACACAATCTATAAGGAATTTTATACATTGCTTAAAAAATATGATGTATTGTCCACTTTGTTAAGCTGTCCATTTCTTTTTCTGTCAACCAGTCAATTTTTATTTAATAGGGGATGGGGATACAATGTCATCATTTTGTTAGGAGGGTTATACGACCATATGAAATCACTTCTCAAGATCTGTTAAGTAATTGAGACATGGTTTTTCATGTTCTCAGTATTCTCTCTATTTGTCTTCATTCTTTCTGTTGTATTGTTTTCTGAAATTTAATGGAGGTGACTGGTGGGCACTGGGAAATGTATCTTTTACTGAAAGAGACCATCCTCTCTTTTTTATAAGACTCAGGATAATATCTCAGTTGGCGGGAGTACGGCTAACTCTGCTGCTATTGAATCTGATTTAGAGACCACAAGAAGAATCGATGTAGCCGTATTGAGCTCAAAGCATCTATTCCTAATACTGCCTGTTTTAATCCTGATTGCTGCGGTTTACTTGTCTTCACTTCAAGATTAACAAATTTACAGAACTCAGAAACAACTTCATTGGCCAAGGATAGGGTCATTTCATTTTGTATCCTTAACCTCCAAAGAAACCATTAACAGGTAGATTTCCAAATTTTTCTGGTTGTATAGAACTTTGTAACGTCCTCGCTGACACAACGGAGAATTTTCTGACTCCTGTCAATGACTGATGTTAAAGATGGGACGAAAACTGACTGAGAATGAGGAAGTGTGCTGCTAGACTAACGGTATGACTATTTAGATGGGTCTGACGGAAAGGAAGCATTCGCTTTAGCTTATACAGGGTCTAGTTTCAATCTCACTCATTTGTCTTCCCTTTAGTCACTTTTCTCTGGAAACAAGACTTGTTCAGTATTCTATAATAACGTTTCTTTGATTTCTGACTGCTGTTGGATCTATTAAATTGGCATGATGTTAATACAATATCCGATATGATCAACGTTTGGC

mRNA sequence

ATGGAAAACAAAATATTGGCAGCAAAGTTGCAGCAAAAAAATCAGGATTCTGTCGTTTCAGATCAGAGTAACACTGTTGAAACCAAAGAAATTGCATCCAAGCAAGATTTTGCTACTGCTATGGCTACTACCTGTCCCAAAGAAATCCAGGTAGATGAGACAAAGTTAACTGCTGCCAGCTCTGAGACTCAGGATAATATCTCAGTTGGCGGGAGTACGGCTAACTCTGCTGCTATTGAATCTGATTTAGAGACCACAAGAAGAATCGATGTAGCCGTATTGAGCTCAAAGCATCTATTCCTAATACTGCCTGTTTTAATCCTGATTGCTGCGGTTTACTTGTCTTCACTTCAAGATTAA

Coding sequence (CDS)

ATGGAAAACAAAATATTGGCAGCAAAGTTGCAGCAAAAAAATCAGGATTCTGTCGTTTCAGATCAGAGTAACACTGTTGAAACCAAAGAAATTGCATCCAAGCAAGATTTTGCTACTGCTATGGCTACTACCTGTCCCAAAGAAATCCAGGTAGATGAGACAAAGTTAACTGCTGCCAGCTCTGAGACTCAGGATAATATCTCAGTTGGCGGGAGTACGGCTAACTCTGCTGCTATTGAATCTGATTTAGAGACCACAAGAAGAATCGATGTAGCCGTATTGAGCTCAAAGCATCTATTCCTAATACTGCCTGTTTTAATCCTGATTGCTGCGGTTTACTTGTCTTCACTTCAAGATTAA

Protein sequence

MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAASSETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD*
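
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this, assuming the standard nuclear genetic code (an assumption; the page does not name the translation table). The helper `translate` is hypothetical, and only short fragments taken from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First nine codons of the CDS listed above.
print(translate("ATGGAAAACAAAATATTGGCAGCAAAG"))
# Last three codons of the CDS, ending in the TAA stop.
print(translate("CAAGATTAA"))
```

Passing the full CDS to `translate` should give the complete protein, with the trailing `*` coming from the final TAA stop codon.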
BLAST of Csa4G123320 vs. TrEMBL
Match: A0A0A0KY26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G123320 PE=4 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 1.9e-55
Identity = 119/119 (100.00%), Positives = 119/119 (100.00%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS
Sbjct: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60

Query: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD
Sbjct: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 119

BLAST of Csa4G123320 vs. TrEMBL
Match: A0A061EGR9_THECC (WPP domain-interacting protein 1, putative isoform 2 OS=Theobroma cacao GN=TCM_019058 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 3.0e-08
Identity = 49/122 (40.16%), Positives = 67/122 (54.92%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQD-SVVSDQSNTVETKE-IASKQDFATAMATTCPKEIQVDETKLTA 60
           +ENKIL  KL+Q ++D S++    N    KE + SKQD +TA A    +EI    TKL+A
Sbjct: 602 LENKILVVKLKQTDKDPSIIGSHENRGNVKEFLFSKQDSSTASAN---EEI----TKLSA 661

Query: 61  ASSETQDNI-SVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSL 120
             SE      SVG S        S+ E  RR D  +L+ KH+ L L +L++ AAVY S  
Sbjct: 662 DGSELDKTTESVGESEVKPTDATSEFENVRRTDARLLNFKHVSLALLILLISAAVYFSQN 716

BLAST of Csa4G123320 vs. TrEMBL
Match: A0A061ENG0_THECC (WPP domain-interacting protein 1, putative isoform 1 OS=Theobroma cacao GN=TCM_019058 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 3.0e-08
Identity = 49/122 (40.16%), Positives = 67/122 (54.92%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQD-SVVSDQSNTVETKE-IASKQDFATAMATTCPKEIQVDETKLTA 60
           +ENKIL  KL+Q ++D S++    N    KE + SKQD +TA A    +EI    TKL+A
Sbjct: 601 LENKILVVKLKQTDKDPSIIGSHENRGNVKEFLFSKQDSSTASAN---EEI----TKLSA 660

Query: 61  ASSETQDNI-SVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSL 120
             SE      SVG S        S+ E  RR D  +L+ KH+ L L +L++ AAVY S  
Sbjct: 661 DGSELDKTTESVGESEVKPTDATSEFENVRRTDARLLNFKHVSLALLILLISAAVYFSQN 715

BLAST of Csa4G123320 vs. TrEMBL
Match: F6GVT9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0083g00730 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 5.2e-08
Identity = 45/121 (37.19%), Positives = 66/121 (54.55%), Query Frame = 1

Query: 2   ENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAASS 61
           +NK+L  KL++    S+ S     V   E   K D  TA   TC KE  V++T+ +A+S 
Sbjct: 600 KNKVLVGKLKKTEDPSIASK----VTRGEFCPKDDLTTA---TCAKECIVEQTEFSASSF 659

Query: 62  ETQD---NISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQ 120
           E ++   N+SVGG  A  +   S+ ET RR+D   LS K++F  + V IL+ A YL   Q
Sbjct: 660 EMEEAPKNLSVGGIIAGPSDSVSEPETVRRLDPGQLSFKYIF--MAVFILLTAAYLFQQQ 711

BLAST of Csa4G123320 vs. TrEMBL
Match: A5B3T9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035977 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 5.2e-08
Identity = 45/121 (37.19%), Positives = 66/121 (54.55%), Query Frame = 1

Query: 2   ENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAASS 61
           +NK+L  KL++    S+ S     V   E   K D  TA   TC KE  V++T+ +A+S 
Sbjct: 623 KNKVLVGKLKKTEDPSIASK----VTRGEFCPKDDLTTA---TCAKECIVEQTEFSASSF 682

Query: 62  ETQD---NISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQ 120
           E ++   N+SVGG  A  +   S+ ET RR+D   LS K++F  + V IL+ A YL   Q
Sbjct: 683 EMEEAPKNLSVGGIIAGPSDSVSEPETVRRLDPGQLSFKYIF--MAVFILLTAAYLFQQQ 734

BLAST of Csa4G123320 vs. NCBI nr
Match: gi|700198601|gb|KGN53759.1| (hypothetical protein Csa_4G123320 [Cucumis sativus])

HSP 1 Score: 223.0 bits (567), Expect = 2.8e-55
Identity = 119/119 (100.00%), Positives = 119/119 (100.00%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS
Sbjct: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60

Query: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD
Sbjct: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 119

BLAST of Csa4G123320 vs. NCBI nr
Match: gi|778692082|ref|XP_011653404.1| (PREDICTED: WPP domain-interacting tail-anchored protein 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 223.0 bits (567), Expect = 2.8e-55
Identity = 119/119 (100.00%), Positives = 119/119 (100.00%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS
Sbjct: 507 MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 566

Query: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD
Sbjct: 567 SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 625

BLAST of Csa4G123320 vs. NCBI nr
Match: gi|778692079|ref|XP_011653403.1| (PREDICTED: WPP domain-interacting tail-anchored protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 223.0 bits (567), Expect = 2.8e-55
Identity = 119/119 (100.00%), Positives = 119/119 (100.00%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS
Sbjct: 514 MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 573

Query: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD
Sbjct: 574 SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 632

BLAST of Csa4G123320 vs. NCBI nr
Match: gi|659111085|ref|XP_008455572.1| (PREDICTED: WPP domain-interacting tail-anchored protein 1-like isoform X5 [Cucumis melo])

HSP 1 Score: 213.0 bits (541), Expect = 2.9e-52
Identity = 109/119 (91.60%), Positives = 115/119 (96.64%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKIL AKLQQKNQDSVVSDQSNTVETKEIASKQDF TAM T CPKE+QVD+TKLTAAS
Sbjct: 514 MENKILGAKLQQKNQDSVVSDQSNTVETKEIASKQDFTTAMTTACPKEVQVDQTKLTAAS 573

Query: 61  SETQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SETQDN+SVGGSTANSAAIESDLETTRRIDVAVLSSKHLFL+LP+LI+IAAVYLSSLQD
Sbjct: 574 SETQDNVSVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLVLPILIIIAAVYLSSLQD 632

BLAST of Csa4G123320 vs. NCBI nr
Match: gi|659111083|ref|XP_008455571.1| (PREDICTED: WPP domain-interacting tail-anchored protein 2-like isoform X4 [Cucumis melo])

HSP 1 Score: 208.4 bits (529), Expect = 7.1e-51
Identity = 109/120 (90.83%), Positives = 115/120 (95.83%), Query Frame = 1

Query: 1   MENKILAAKLQQKNQDSVVSDQSNTVETKEIASKQDFATAMATTCPKEIQVDETKLTAAS 60
           MENKIL AKLQQKNQDSVVSDQSNTVETKEIASKQDF TAM T CPKE+QVD+TKLTAAS
Sbjct: 514 MENKILGAKLQQKNQDSVVSDQSNTVETKEIASKQDFTTAMTTACPKEVQVDQTKLTAAS 573

Query: 61  SE-TQDNISVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLILPVLILIAAVYLSSLQD 120
           SE TQDN+SVGGSTANSAAIESDLETTRRIDVAVLSSKHLFL+LP+LI+IAAVYLSSLQD
Sbjct: 574 SEKTQDNVSVGGSTANSAAIESDLETTRRIDVAVLSSKHLFLVLPILIIIAAVYLSSLQD 633

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KY26_CUCSA  1.9e-55  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_4G123320 PE=4 SV=1
A0A061EGR9_THECC  3.0e-08  40.16         WPP domain-interacting protein 1, putative isoform 2 OS=Theobroma cacao GN=TCM_0...
A0A061ENG0_THECC  3.0e-08  40.16         WPP domain-interacting protein 1, putative isoform 1 OS=Theobroma cacao GN=TCM_0...
F6GVT9_VITVI      5.2e-08  37.19         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0083g00730 PE=4 SV=...
A5B3T9_VITVI      5.2e-08  37.19         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035977 PE=4 SV=1

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|700198601|gb|KGN53759.1|       2.8e-55  100.00        hypothetical protein Csa_4G123320 [Cucumis sativus]
gi|778692082|ref|XP_011653404.1|  2.8e-55  100.00        PREDICTED: WPP domain-interacting tail-anchored protein 1 isoform X2 [Cucumis sa...
gi|778692079|ref|XP_011653403.1|  2.8e-55  100.00        PREDICTED: WPP domain-interacting tail-anchored protein 1 isoform X1 [Cucumis sa...
gi|659111085|ref|XP_008455572.1|  2.9e-52  91.60         PREDICTED: WPP domain-interacting tail-anchored protein 1-like isoform X5 [Cucum...
gi|659111083|ref|XP_008455571.1|  7.1e-51  90.83         PREDICTED: WPP domain-interacting tail-anchored protein 2-like isoform X4 [Cucum...

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa4G123320.1  Csa4G123320.1  mRNA