Cp4.1LG01g13020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g13020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA-directed RNA polymerase subunit a
Location: Cp4.1LG01 : 8463165 .. 8465625 (-)
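The location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both endpoints count.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 8463165 .. 8465625 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG01 - 2461
```

The "(-)" strand means the feature is read from the reverse strand of the Cp4.1LG01 reference sequence.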
The following sequences are available for this feature:

Gene sequence (with intron)

GGACTCTCTAATGGGCTTTTCAAATTGACGCCACGCGAAAATTAGGGATTAGCTTAGGGGCGGGATAACCCCATCCAATGGCGTCCTACAGTTGCGTTCTGGTTTCGGCCATTGCAGTTTCTCCCGCCAAGACTACATGCTTCAACGGACTTAATTATCCAACATTTTCCCCTTCTTCTTCTTCTTCTTCCCTTGTAAAGCCCTTTGTAAATTGTACCTTAACTTCTTCCATCATATACAGAGCAACTCCGGCCACGGCCAGCTGCGTTTACCATGATCCGATACCGGAGTTTGCAGAAGTTGTAAGTTTTTTGCGACGTAGCTCCTTTGTCTGCTTAAATTGATTCCGGTTTTGTGTTATGTGATCGCGTTGGTTAATTCATTTGGTTGTTTTTCGCAGGAAACCAAGAAATTTAAGGAGCAGTTATCGAAGAAGCTAGCGAAGGATCGCGAGACGTTTGGGAACGACCTCGATTCGGTTGTGGAGGTTTGCTCAAAGGTAATCATTGTTATATAATGAAGAGGTTGTGATTTTCTAGTTTGAATCCCTGATTTCGGTTGAAGAATGTGTCTGAATCAGATGAGGCGATTGTTTGCTTAGAGAGTTAGTTTAGTTTAATCGTGCTCTTGCCTTCAAGTGTTTCCATGGCTTGAGATGAATAATTGAGACCTGAGAAAACATCAGACCTGCCTATAGAGCTACAGAACGATCACTATTTCCAAGACAGCTTCATTAATTGCGCAAAATTGTTCTGAATGTTTGACCTATGCTGCCATTATTGATATACTTCTTGCATTTTGATTTTTTCAAACCAGAAACTGAACGAATATTCCAATTTAAATTCAGATGTCCGAATTTATGTGAGCTTAACATGAATGTACTTTGCAAGTTTGAGCAGATATTTAGTGAATATTTGCATGTGGAGTACGGAGGTCCTGGGACATTATTAGTGGAGCCTTTCACTAATATGTTTATTGCTCTAAACGAGAGGAAATTACCTGGAGCGCCTTTGGCTGCAAGAACTTCGCTTCTATGGGCTCAAAACTATCTAGATCACGATTGGAACATTTGGATCTCAAAACGAGGGCTCAAATGATTTGCTTCTTGTTGTTACATTTGTGAAGATTGGGTATTTGAATCTAGATTCTGTTATGTTTTTTTCTCATGAGTTGAAGATGGAGGAACAAATGGTTGTATCAACACAAAATTACTCCAATACATTGTCAATTTCATTGATTTGATATCTTTGTTGCTATACAAATGCCTTATTTCTTGCAGTACATGAATCATTCTGGGGCTCTGGTTTCACACTGAATTGGAAACTTGGATGCCCATTTTCAAGCTCTCTCTGAATCCTCTGCCATTGCAACGAGCATCTTAAATCAAACTCCTCTATTCTCTGCCGTATGAACCACGCTTGTGCTTGTTATGATGAAAAAAAGTATGACACATAAATCAGGCGGCAGCTTCAGAGATGCTCTGCAATGATGGTCTCCAAGCCTTTGACCTCGCCGCAGTGGCATTTGCTGCTGGTGTTCGAATTCGATGTCTTTCATGTTTGACTGCCTGCAATGTTTTTTGTTTTTTCAGAGAGATTATTGGTATGATTTTATCTCGGAAAATGATTCCCAATCACAATCATGTTTGACTTTTTCACTTTTCTGTTGCAGTGATCGATTCTGATTCCAAGGGAAACAATTTGGGTTTACCTGCTGGTTCTTCTCCGTCTCTTCTACTTCACCGCCGGCAGTCCGCTTCTCCTCCGACTCCATCAATGGCTTCTTGGATTTTGCATACTTTTTAGCCCGCAACACCTTCATAACCTCTGGAGGTTCCAAACAAAACGTCGGTAAGGGCTAAACTTACTACCAAACAGAGAAGGAACAGAAAAGCTGTATCATACCCTGAGTGGTGACGAGCCGATAGGCATGGCCAAGAGCAAGAGTGTCGTTTGGCCGGAGAAGCTTAACACGAGTAAAACGCACCGTTTTGGGCTCCG
GATTGTCATCCTCAGATTGAGGTAAGGGAATTATAAGAGAAACATAGTGACCCGGATTAGTTCTCATGACCTCGCTAGCCACCACCGGCCAATACAGCCTCTCAATCCGGCCAGTTGGGTGCTGTATAACCAGAGCCGCCGCATCAACCGCCTGGCAATTCCCCATCCGAACTCTGTGATGGTAGATTGAAAGAAGACGAGGAAGAGGAGGAAAAGGAAGAAGAAGATGATGATAAATAGGAGGAAAAGAAGTCCCTTTCTATACAGCTTCTTTGCTCGCCAAATGGGTCTACTTAGAATTGCAGGCCGTCCATTCATATATCTCGCACTTCCCAACTAATTTAATGCAACGCGGGTCAAACTTATATCGAAAATTGACATTGTAACGTACGGATAATATAGTATTCAACTTCACCTTCTTTTACTTTTTATCACCTTCGGTGATAAAAAAAAAAAAAAAG

mRNA sequence

GGACTCTCTAATGGGCTTTTCAAATTGACGCCACGCGAAAATTAGGGATTAGCTTAGGGGCGGGATAACCCCATCCAATGGCGTCCTACAGTTGCGTTCTGGTTTCGGCCATTGCAGTTTCTCCCGCCAAGACTACATGCTTCAACGGACTTAATTATCCAACATTTTCCCCTTCTTCTTCTTCTTCTTCCCTTGTAAAGCCCTTTGTAAATTGTACCTTAACTTCTTCCATCATATACAGAGCAACTCCGGCCACGGCCAGCTGCGTTTACCATGATCCGATACCGGAGTTTGCAGAAGTTGAAACCAAGAAATTTAAGGAGCAGTTATCGAAGAAGCTAGCGAAGGATCGCGAGACGTTTGGGAACGACCTCGATTCGGTTGTGGAGGTTTGCTCAAAGATATTTAGTGAATATTTGCATGTGGAGTACGGAGGTCCTGGGACATTATTAGTGGAGCCTTTCACTAATATGTTTATTGCTCTAAACGAGAGGAAATTACCTGGAGCGCCTTTGGCTGCAAGAACTTCGCTTCTATGGGCTCAAAACTATCTAGATCACGATTGGAACATTTGGATCTCAAAACGAGGGCTCAAATGATTTGCTTCTTGTTGTTACATTTGTGAAGATTGGGTATTTGAATCTAGATTCTGTTATGTTTTTTTCTCATGAGTTGAAGATGGAGGAACAAATGGTTGTATCAACACAAAATTACTCCAATACATTGTCAATTTCATTGATTTGATATCTTTGTTGCTATACAAATGCCTTATTTCTTGCAGTACATGAATCATTCTGGGGCTCTGGTTTCACACTGAATTGGAAACTTGGATGCCCATTTTCAAGCTCTCTCTGAATCCTCTGCCATTGCAACGAGCATCTTAAATCAAACTCCTCTATTCTCTGCCGTATGAACCACGCTTGTGCTTGTTATGATGAAAAAAAGTATGACACATAAATCAGGCGGCAGCTTCAGAGATGCTCTGCAATGATGGTCTCCAAGCCTTTGACCTCGCCGCAGTGGCATTTGCTGCTGGTGTTCGAATTCGATGTCTTTCATGTTTGACTGCCTGCAATGTTTTTTGTTTTTTCAGAGAGATTATTGGTATGATTTTATCTCGGAAAATGATTCCCAATCACAATCATGTTTGACTTTTTCACTTTTCTGTTGCAGTGATCGATTCTGATTCCAAGGGAAACAATTTGGGTTTACCTGCTGGTTCTTCTCCGTCTCTTCTACTTCACCGCCGGCAGTCCGCTTCTCCTCCGACTCCATCAATGGCTTCTTGGATTTTGCATACTTTTTAGCCCGCAACACCTTCATAACCTCTGGAGGTTCCAAACAAAACGTCGGTAAGGGCTAAACTTACTACCAAACAGAGAAGGAACAGAAAAGCTGTATCATACCCTGAGTGGTGACGAGCCGATAGGCATGGCCAAGAGCAAGAGTGTCGTTTGGCCGGAGAAGCTTAACACGAGTAAAACGCACCGTTTTGGGCTCCGGATTGTCATCCTCAGATTGAGGTAAGGGAATTATAAGAGAAACATAGTGACCCGGATTAGTTCTCATGACCTCGCTAGCCACCACCGGCCAATACAGCCTCTCAATCCGGCCAGTTGGGTGCTGTATAACCAGAGCCGCCGCATCAACCGCCTGGCAATTCCCCATCCGAACTCTGTGATGGTAGATTGAAAGAAGACGAGGAAGAGGAGGAAAAGGAAGAAGAAGATGATGATAAATAGGAGGAAAAGAAGTCCCTTTCTATACAGCTTCTTTGCTCGCCAAATGGGTCTACTTAGAATTGCAGGCCGTCCATTCATATATCTCGCACTTCCCAACTAATTTAATGCAACGCGGGTCAAACTTATATCGAAAATTGACATTGTAACGTACGGATAATATAGTATTCAACTTCACCTTCTTTTACTTTTTATCACCTTCGGTGATAAAAAAAAAAAAAAAG

Coding sequence (CDS)

ATGGCGTCCTACAGTTGCGTTCTGGTTTCGGCCATTGCAGTTTCTCCCGCCAAGACTACATGCTTCAACGGACTTAATTATCCAACATTTTCCCCTTCTTCTTCTTCTTCTTCCCTTGTAAAGCCCTTTGTAAATTGTACCTTAACTTCTTCCATCATATACAGAGCAACTCCGGCCACGGCCAGCTGCGTTTACCATGATCCGATACCGGAGTTTGCAGAAGTTGAAACCAAGAAATTTAAGGAGCAGTTATCGAAGAAGCTAGCGAAGGATCGCGAGACGTTTGGGAACGACCTCGATTCGGTTGTGGAGGTTTGCTCAAAGATATTTAGTGAATATTTGCATGTGGAGTACGGAGGTCCTGGGACATTATTAGTGGAGCCTTTCACTAATATGTTTATTGCTCTAAACGAGAGGAAATTACCTGGAGCGCCTTTGGCTGCAAGAACTTCGCTTCTATGGGCTCAAAACTATCTAGATCACGATTGGAACATTTGGATCTCAAAACGAGGGCTCAAATGA

Protein sequence

MASYSCVLVSAIAVSPAKTTCFNGLNYPTFSPSSSSSSLVKPFVNCTLTSSIIYRATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWISKRGLK
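
The protein sequence corresponds to the coding sequence read codon by codon. The sketch below illustrates this with the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 27 nucleotides of the CDS above are used to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 27 nt of the CDS listed above.
cds_start = "ATGGCGTCCTACAGTTGCGTTCTGGTTTCG"
cds_end = "TGGATCTCAAAACGAGGGCTCAAATGA"
print(translate(cds_start))  # MASYSCVLVS
print(translate(cds_end))    # WISKRGLK
```

Both fragments agree with the start (MASYSCVLVS) and end (WISKRGLK) of the protein sequence shown above.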
BLAST of Cp4.1LG01g13020 vs. TrEMBL
Match: A0A0A0KQC6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G603300 PE=4 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 6.9e-54
Identity = 104/119 (87.39%), Positives = 110/119 (92.44%), Query Frame = 1

Query: 55  RATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYL 114
           RATP TAS VY +PIPEFAEVET+KFKEQLSKKLAKDRETFGND DSVV+VCSKIF EYL
Sbjct: 43  RATPITASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYL 102

Query: 115 HVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWISKRGLK 174
           HVEYGGPGTL+VEPFTNMFIALNERKL GAPLAARTSLLWAQN+LD+DWNIW SK G K
Sbjct: 103 HVEYGGPGTLIVEPFTNMFIALNERKLSGAPLAARTSLLWAQNHLDNDWNIWNSKGGFK 161
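
The percentages on the HSP statistics line are the ratio of matched positions to alignment length. The following sketch recomputes them from the first hit; `parse_hsp_stats` is a hypothetical helper, and the pattern is written loosely so it accepts either spelling of the positives label.

```python
import re

def parse_hsp_stats(line):
    """Extract (matched, aligned, percent) for identity and positives (hypothetical helper)."""
    stats = {}
    for key, label in (("identity", r"Identity"), ("positives", r"Pos\w+")):
        m = re.search(label + r" = (\d+)/(\d+)", line)
        if m is None:
            raise ValueError(f"no {key} field in: {line!r}")
        matched, aligned = int(m.group(1)), int(m.group(2))
        stats[key] = (matched, aligned, round(100 * matched / aligned, 2))
    return stats

line = "Identity = 104/119 (87.39%), Positives = 110/119 (92.44%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity"])   # (104, 119, 87.39)
print(stats["positives"])  # (110, 119, 92.44)
```

Here 104 of the 119 aligned positions are identical residues, and 110 are either identical or conservative substitutions (marked "+" in the alignment's middle line).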

BLAST of Cp4.1LG01g13020 vs. TrEMBL
Match: A0A0D2UY70_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G344500 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 2.1e-42
Identity = 99/182 (54.40%), Positives = 121/182 (66.48%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVS-PAKTTCFNGLNY-------------PTFSPSSSSSSLVKPFVNC 60
           MAS SC LVS    S PAK      L               P+FS  SS    VKPF++ 
Sbjct: 1   MASSSCSLVSIHPPSTPAKIPTSTILILRNAGLILNPPPPPPSFSTLSSKPISVKPFLSA 60

Query: 61  TLTSSIIYRATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVC 120
              S         +   VYHDPIP+FAE ET+KFK +L  KL+KD++ FG+DLDSV++VC
Sbjct: 61  LRASH--------SQKYVYHDPIPKFAEAETQKFKAELFNKLSKDKDKFGDDLDSVIDVC 120

Query: 121 SKIFSEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIW 169
            KIF+++LH EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQN+LDHDW +W
Sbjct: 121 VKIFNDFLHNEYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNHLDHDWEVW 174

BLAST of Cp4.1LG01g13020 vs. TrEMBL
Match: A0A0D2SDD8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G344500 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 3.9e-41
Identity = 98/179 (54.75%), Positives = 119/179 (66.48%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVS-PAKTTCFNGLNY-------------PTFSPSSSSSSLVKPFVNC 60
           MAS SC LVS    S PAK      L               P+FS  SS    VKPF++ 
Sbjct: 1   MASSSCSLVSIHPPSTPAKIPTSTILILRNAGLILNPPPPPPSFSTLSSKPISVKPFLSA 60

Query: 61  TLTSSIIYRATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVC 120
              S         +   VYHDPIP+FAE ET+KFK +L  KL+KD++ FG+DLDSV++VC
Sbjct: 61  LRASH--------SQKYVYHDPIPKFAEAETQKFKAELFNKLSKDKDKFGDDLDSVIDVC 120

Query: 121 SKIFSEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNI 166
            KIF+++LH EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQN+LDHDW I
Sbjct: 121 VKIFNDFLHNEYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNHLDHDWEI 171

BLAST of Cp4.1LG01g13020 vs. TrEMBL
Match: A0A061E7M7_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010229 PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 8.7e-41
Identity = 95/172 (55.23%), Positives = 124/172 (72.09%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVSPAKTTCFNGLNYPTFSPSSSSSSLVKPFVNCTLTSSIIYRATPAT 60
           MAS S V   A+ ++PAKT     L  P  +     S+   P  + +L+S  ++  +P  
Sbjct: 13  MASSSWVSTRAL-LTPAKTPTSTTLFLPRPACLILHST---PHPSLSLSSPKLF-LSPLP 72

Query: 61  AS----CVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHV 120
           AS     VY DPIPEFAE ET+KFK +L KKL+KD++TFG+DLD+V+EVC ++F+ +LH 
Sbjct: 73  ASPPQKYVYPDPIPEFAEAETQKFKSELFKKLSKDKDTFGDDLDAVIEVCVEVFNNFLHK 132

Query: 121 EYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWIS 169
           EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQNY+DHDW +W S
Sbjct: 133 EYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNYVDHDWEVWNS 179

BLAST of Cp4.1LG01g13020 vs. TrEMBL
Match: A0A061E6U7_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_010229 PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 8.7e-41
Identity = 95/172 (55.23%), Positives = 124/172 (72.09%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVSPAKTTCFNGLNYPTFSPSSSSSSLVKPFVNCTLTSSIIYRATPAT 60
           MAS S V   A+ ++PAKT     L  P  +     S+   P  + +L+S  ++  +P  
Sbjct: 1   MASSSWVSTRAL-LTPAKTPTSTTLFLPRPACLILHST---PHPSLSLSSPKLF-LSPLP 60

Query: 61  AS----CVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHV 120
           AS     VY DPIPEFAE ET+KFK +L KKL+KD++TFG+DLD+V+EVC ++F+ +LH 
Sbjct: 61  ASPPQKYVYPDPIPEFAEAETQKFKSELFKKLSKDKDTFGDDLDAVIEVCVEVFNNFLHK 120

Query: 121 EYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWIS 169
           EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQNY+DHDW +W S
Sbjct: 121 EYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNYVDHDWEVWNS 167

BLAST of Cp4.1LG01g13020 vs. TAIR10
Match: AT1G10522.1 (AT1G10522.1 unknown protein)

HSP 1 Score: 122.9 bits (307), Expect = 2.0e-28
Identity = 55/102 (53.92%), Positives = 75/102 (73.53%), Query Frame = 1

Query: 65  YHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHVEYGGPGTL 124
           + DPIPEFAE ET+KF++ +  KL+K R+ F + +D +V VC++IF  +L  EYGGPGTL
Sbjct: 75  FPDPIPEFAEAETEKFRDHMLNKLSK-RDLFEDSVDEIVGVCTEIFETFLRSEYGGPGTL 134

Query: 125 LVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIW 167
           LV PF +M   LNER+LPG P AAR ++ WAQ+++D DW  W
Sbjct: 135 LVIPFIDMADTLNERELPGGPQAARAAIKWAQDHVDKDWKEW 175

BLAST of Cp4.1LG01g13020 vs. NCBI nr
Match: gi|659090864|ref|XP_008446243.1| (PREDICTED: uncharacterized protein LOC103489034 isoform X1 [Cucumis melo])

HSP 1 Score: 287.7 bits (735), Expect = 1.3e-74
Identity = 144/173 (83.24%), Positives = 151/173 (87.28%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVSPAKTTCFNGLNYPTFSPSSSSSSLVKPFVNCTLTSSIIYRATPAT 60
           MA Y C LVSAI VSPAK T FNGL+YPTFS      S VKP VNCTLTSS+IYRATP T
Sbjct: 1   MAPYGCALVSAIEVSPAKITSFNGLHYPTFS---LFPSFVKPLVNCTLTSSVIYRATPIT 60

Query: 61  ASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHVEYGG 120
           AS VY +PIPEFAEVET+KFKEQLSKKLAKDRETFGND DSVV+VCSKIF EYLHVEYGG
Sbjct: 61  ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 120

Query: 121 PGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWISKRGLK 174
           PGTL+VEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLD+DWNIW SK G K
Sbjct: 121 PGTLIVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDNDWNIWNSKGGFK 170

BLAST of Cp4.1LG01g13020 vs. NCBI nr
Match: gi|449434893|ref|XP_004135230.1| (PREDICTED: uncharacterized protein LOC101211725 isoform X1 [Cucumis sativus])

HSP 1 Score: 272.7 bits (696), Expect = 4.4e-70
Identity = 138/173 (79.77%), Positives = 147/173 (84.97%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVSPAKTTCFNGLNYPTFSPSSSSSSLVKPFVNCTLTSSIIYRATPAT 60
           MA Y   LVSA+ VSPAK   FNGL+YPTF       S VKP VNCTLTSS+IYRATP T
Sbjct: 1   MARYGGALVSAVEVSPAKIAGFNGLHYPTFF---LFPSFVKPLVNCTLTSSVIYRATPIT 60

Query: 61  ASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYLHVEYGG 120
           AS VY +PIPEFAEVET+KFKEQLSKKLAKDRETFGND DSVV+VCSKIF EYLHVEYGG
Sbjct: 61  ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 120

Query: 121 PGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWISKRGLK 174
           PGTL+VEPFTNMFIALNERKL GAPLAARTSLLWAQN+LD+DWNIW SK G K
Sbjct: 121 PGTLIVEPFTNMFIALNERKLSGAPLAARTSLLWAQNHLDNDWNIWNSKGGFK 170

BLAST of Cp4.1LG01g13020 vs. NCBI nr
Match: gi|700196656|gb|KGN51833.1| (hypothetical protein Csa_5G603300 [Cucumis sativus])

HSP 1 Score: 218.4 bits (555), Expect = 9.8e-54
Identity = 104/119 (87.39%), Positives = 110/119 (92.44%), Query Frame = 1

Query: 55  RATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVCSKIFSEYL 114
           RATP TAS VY +PIPEFAEVET+KFKEQLSKKLAKDRETFGND DSVV+VCSKIF EYL
Sbjct: 43  RATPITASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYL 102

Query: 115 HVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIWISKRGLK 174
           HVEYGGPGTL+VEPFTNMFIALNERKL GAPLAARTSLLWAQN+LD+DWNIW SK G K
Sbjct: 103 HVEYGGPGTLIVEPFTNMFIALNERKLSGAPLAARTSLLWAQNHLDNDWNIWNSKGGFK 161

BLAST of Cp4.1LG01g13020 vs. NCBI nr
Match: gi|763794184|gb|KJB61180.1| (hypothetical protein B456_009G344500 [Gossypium raimondii])

HSP 1 Score: 180.3 bits (456), Expect = 3.0e-42
Identity = 99/182 (54.40%), Positives = 121/182 (66.48%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVS-PAKTTCFNGLNY-------------PTFSPSSSSSSLVKPFVNC 60
           MAS SC LVS    S PAK      L               P+FS  SS    VKPF++ 
Sbjct: 1   MASSSCSLVSIHPPSTPAKIPTSTILILRNAGLILNPPPPPPSFSTLSSKPISVKPFLSA 60

Query: 61  TLTSSIIYRATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVC 120
              S         +   VYHDPIP+FAE ET+KFK +L  KL+KD++ FG+DLDSV++VC
Sbjct: 61  LRASH--------SQKYVYHDPIPKFAEAETQKFKAELFNKLSKDKDKFGDDLDSVIDVC 120

Query: 121 SKIFSEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNIW 169
            KIF+++LH EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQN+LDHDW +W
Sbjct: 121 VKIFNDFLHNEYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNHLDHDWEVW 174

BLAST of Cp4.1LG01g13020 vs. NCBI nr
Match: gi|763794182|gb|KJB61178.1| (hypothetical protein B456_009G344500 [Gossypium raimondii])

HSP 1 Score: 176.0 bits (445), Expect = 5.6e-41
Identity = 98/179 (54.75%), Positives = 119/179 (66.48%), Query Frame = 1

Query: 1   MASYSCVLVSAIAVS-PAKTTCFNGLNY-------------PTFSPSSSSSSLVKPFVNC 60
           MAS SC LVS    S PAK      L               P+FS  SS    VKPF++ 
Sbjct: 1   MASSSCSLVSIHPPSTPAKIPTSTILILRNAGLILNPPPPPPSFSTLSSKPISVKPFLSA 60

Query: 61  TLTSSIIYRATPATASCVYHDPIPEFAEVETKKFKEQLSKKLAKDRETFGNDLDSVVEVC 120
              S         +   VYHDPIP+FAE ET+KFK +L  KL+KD++ FG+DLDSV++VC
Sbjct: 61  LRASH--------SQKYVYHDPIPKFAEAETQKFKAELFNKLSKDKDKFGDDLDSVIDVC 120

Query: 121 SKIFSEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNYLDHDWNI 166
            KIF+++LH EYGGPGTLLVEPFT+MF+AL E+KLPGAP+AAR SLLWAQN+LDHDW I
Sbjct: 121 VKIFNDFLHNEYGGPGTLLVEPFTDMFVALKEKKLPGAPVAARASLLWAQNHLDHDWEI 171

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KQC6_CUCSA | 6.9e-54 | 87.39 | Uncharacterized protein OS=Cucumis sativus GN=Csa_5G603300 PE=4 SV=1
A0A0D2UY70_GOSRA | 2.1e-42 | 54.40 | Uncharacterized protein OS=Gossypium raimondii GN=B456_009G344500 PE=4 SV=1
A0A0D2SDD8_GOSRA | 3.9e-41 | 54.75 | Uncharacterized protein OS=Gossypium raimondii GN=B456_009G344500 PE=4 SV=1
A0A061E7M7_THECC | 8.7e-41 | 55.23 | Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010229 PE=4 SV=1
A0A061E6U7_THECC | 8.7e-41 | 55.23 | Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_010229 PE=4 SV=1

BLAST vs. TAIR10
Match Name | E-value | Identity (%) | Description
AT1G10522.1 | 2.0e-28 | 53.92 | unknown protein

BLAST vs. NCBI nr
Match Name | E-value | Identity (%) | Description
gi|659090864|ref|XP_008446243.1| | 1.3e-74 | 83.24 | PREDICTED: uncharacterized protein LOC103489034 isoform X1 [Cucumis melo]
gi|449434893|ref|XP_004135230.1| | 4.4e-70 | 79.77 | PREDICTED: uncharacterized protein LOC101211725 isoform X1 [Cucumis sativus]
gi|700196656|gb|KGN51833.1| | 9.8e-54 | 87.39 | hypothetical protein Csa_5G603300 [Cucumis sativus]
gi|763794184|gb|KJB61180.1| | 3.0e-42 | 54.40 | hypothetical protein B456_009G344500 [Gossypium raimondii]
gi|763794182|gb|KJB61178.1| | 5.6e-41 | 54.75 | hypothetical protein B456_009G344500 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG01g13020.1 | Cp4.1LG01g13020.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PANTHER | PTHR35987 | FAMILY NOT NAMED | coord: 64..173, score: 8.2
None | No IPR available | PANTHER | PTHR35987:SF1 | SUBFAMILY NOT NAMED | coord: 64..173, score: 8.2

The following gene(s) are paralogous to this gene:
Gene | Paralogue | Organism | Block
Cp4.1LG01g13020 | Cp4.1LG01g04040 | Cucurbita pepo (Zucchini) | cpecpeB375