Cp4.1LG20g01980.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g01980.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionIntegral membrane HPP family protein
LocationCp4.1LG20: 1176145 .. 1178800 (-)
Sequence length1594
RNA-Seq ExpressionCp4.1LG20g01980.1
SyntenyCp4.1LG20g01980.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTCCTCGACTGTCATTTTACTTTCAAGAAAGTCAAACCGGTAACAGGAATTGACAATCTCCTGACCCATTTGAAATGGTGATATAAAAGCAGGTGGACTTTAAATCCCATATCACAGCTTGGTTGGAGGGTTAAGGTTGTTTTTAGTCATTGCAGATAAGGACAAGCACGATGGGACAGAGAGAAACAGAGCGAAACAGAGCGAAACAGAGAGGCTGCCTGCTGCTATTCTTTATTCATTTTTTCCATATGGTTTAGCCTTTCGGGAATGGCCTTTGGACCTCTTCAAAGGCCAACTCCGCAACACAGCTTCGAGTTTTTATCTGTTTTTTTCCTTTTCAAGCCCTCTGGCGTTCTTCACAGCTTTTCCCAAATCCCATACCCCTCCAGATGGTGTCTTTGTCAGCCTCACCTTTCAATATATTATTTCTCAAACTTTGTTTTGCGGCTTTTACCATACTTTCTTTCTTTGTCTTCATCTTCTCCCCTGTTTTGTCTCCAAACAATCGAAGCTTGGAGGTGGGTCGAGGGTAAAATCCAGTATGAGTCTGCAATTGAAGCCAATTCACCACCGTGGCCAGCAGCCGTATCAACCCAGTTTCCGTGTAAACCATTCATTCATTTCTTTGCTGCCCAATTGCCATTTATTGAATGGAAAACGAGGGGTTTCAATAGATGGGTCTGTTAGGCCGTTGGGATTATTACTCAACGATCGGAGGAGAAGACGAAATGGGGGCGGCGGCGGTGGACTCAGTTACAGGAGTATTGTGGCGTCCGGCATTGCTGGTGCACCGATTTCAGATGGGTCGAAACCAGACAAAGGCTTTGTTTCTCCTCCCCTCAGTGATATCCTTTGGCCTTCTGCAGGTTAGAGAAGGTCCCTTATGAATCATAATCTATGAGAATTTCGACCGTGATGAAGAACTGCGAGATGGGTAACGTTGTTTTGTCATGAATGAAACAGGGGCATTTGCAGCGATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAAGGGCTTTCTATGACAATTGCGCCATTAGGAGCCGTCTGTGCTATCCTGTTCGCAGCGCCCTCATCCCCTGCTGCTCGAGTACTCGCCACATCTCTGACAGCTCAATTGCTTTGAACCAAGAACTAAATTTAGGTTTCTTTTGGTGCAGAAGTACAATATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGCGTTTTGGCGTTTACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTCATGATCTATACTGGTTCGACGCACCCACCAGGTTAAATTGAAGCCCCACTCAATTTGTGGTCTGTGTGTTTGGGAATATGGATTGAATTTGATTTTTTATTGTAATGTTAACAGCTGCAAGTTTGCCGCTAATGTTCATCGATGGAGCTAAGATGCAGCATCTCAATTTCTGGTATGCTCTGTTTCCCGGTGCCGCTGGTTGTGTTCTCCTTTGCTTCATAGTGAGTTCTCTCTGTTTCACTATTAATATATTTCTTTCTTTTCTTCATATTTTAAAATCCGATCATAATTGGTAGGCGAGATCTTAATCTTAGATCCATGACTAAGCAATAATAGGACGTTGAATGCTGATTCATCATGAATTTATAAACAAGAAATATTATCTTCATTAGAACGAGGCCTTTTATGAAAGCCCAAATCAAAGCCATGAGCACTTATGCGAACAATATCATATCATTGTGGAGAGTCGTTCATCTAAATTAACTAGATGAATATAGAAGCTCGGTTAGAAAATTTTAGACCCATTGTGGTTAGTGAGTTAGTTAGTGACCCAAATAACTTGAATAATGTTCCGAATCCAACCCAATTCAACACTTTGTCGGGATTGAAAAAAACTCTTCCAATTATTTAAAAACTTTCAAACTCACTACTATGTCCTCTTTGGATATTTTCGAGATTCTTCTCAAGATCTTTAAAATGTGTCTAGCAGTGAGAGGTTTCCACACCCTAATAAACAATATTTCGTTCTCACCCTCGACCGACGTGAGATCTCACAATAAGTCACAATGTACCATTGACATTTATTACCTATGTCAAATGTGTCGTTGGGTTGAGATGTGAACGTTATTTTTGGATTGGCTAAGCCAAAAAGTTCCTAACATATGAACTAAATTGAAAATTTTAACCAACCCAACATAACCCTAACCGTTTAAGTTGGATAGTTTGAATTTTTCGAATCACATGAACGTTGTTTGCTGTTAATAACTATATGAACTAGACACAAATTCGAAAGTGATTATTATTATTTTGTGATGCAGCAAGAGATAGTGGTGTACTTGAAGGAGAAGTTCAAATTTTGAGCAGGTTTTGGAGTGTCATACATGGATCAGCCATTATTGAAGGTGTCCCAAAGCTCATTTTTGTGAGTTTTGATTCAATGGATGATAGCCTTGGAGTGTCACACATGAAACGATGATATGATACGATATGATACGATACGATACGATACGTATTTTTCTTTTTTCTTTTTCATAAATCAAATGGTTATCGAAATCGTCAAACTTAGAATTAGATGTAAATAAGCATGATCTCGTGTAATCATTTACAATAGGTTAGTGTAAGTATCTAAATAATTTCATCAATATGCTAGAG

mRNA sequence

TTTCCTCGACTGTCATTTTACTTTCAAGAAAGTCAAACCGGTAACAGGAATTGACAATCTCCTGACCCATTTGAAATGGTGATATAAAAGCAGGTGGACTTTAAATCCCATATCACAGCTTGGTTGGAGGGTTAAGGTTGTTTTTAGTCATTGCAGATAAGGACAAGCACGATGGGACAGAGAGAAACAGAGCGAAACAGAGCGAAACAGAGAGGCTGCCTGCTGCTATTCTTTATTCATTTTTTCCATATGGTTTAGCCTTTCGGGAATGGCCTTTGGACCTCTTCAAAGGCCAACTCCGCAACACAGCTTCGAGTTTTTATCTGTTTTTTTCCTTTTCAAGCCCTCTGGCGTTCTTCACAGCTTTTCCCAAATCCCATACCCCTCCAGATGGTGTCTTTGTCAGCCTCACCTTTCAATATATTATTTCTCAAACTTTGTTTTGCGGCTTTTACCATACTTTCTTTCTTTGTCTTCATCTTCTCCCCTGTTTTGTCTCCAAACAATCGAAGCTTGGAGGTGGGTCGAGGGTAAAATCCAGTATGAGTCTGCAATTGAAGCCAATTCACCACCGTGGCCAGCAGCCGTATCAACCCAGTTTCCGTGTAAACCATTCATTCATTTCTTTGCTGCCCAATTGCCATTTATTGAATGGAAAACGAGGGGTTTCAATAGATGGGTCTGTTAGGCCGTTGGGATTATTACTCAACGATCGGAGGAGAAGACGAAATGGGGGCGGCGGCGGTGGACTCAGTTACAGGAGTATTGTGGCGTCCGGCATTGCTGGTGCACCGATTTCAGATGGGTCGAAACCAGACAAAGGCTTTGTTTCTCCTCCCCTCAGTGATATCCTTTGGCCTTCTGCAGGGGCATTTGCAGCGATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAAGGGCTTTCTATGACAATTGCGCCATTAGGAGCCGTCTGTGCTATCCTGTTCGCAGCGCCCTCATCCCCTGCTGCTCGAAAGTACAATATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGCGTTTTGGCGTTTACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTCATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGCTAATGTTCATCGATGGAGCTAAGATGCAGCATCTCAATTTCTGGTATGCTCTGTTTCCCGGTGCCGCTGGTTGTGTTCTCCTTTGCTTCATAGTGACAAGAGATAGTGGTGTACTTGAAGGAGAAGTTCAAATTTTGAGCAGGTTTTGGAGTGTCATACATGGATCAGCCATTATTGAAGGTGTCCCAAAGCTCATTTTTGTGAGTTTTGATTCAATGGATGATAGCCTTGGAGTGTCACACATGAAACGATGATATGATACGATATGATACGATACGATACGATACGTATTTTTCTTTTTTCTTTTTCATAAATCAAATGGTTATCGAAATCGTCAAACTTAGAATTAGATGTAAATAAGCATGATCTCGTGTAATCATTTACAATAGGTTAGTGTAAGTATCTAAATAATTTCATCAATATGCTAGAG

Coding sequence (CDS)

ATGAGTCTGCAATTGAAGCCAATTCACCACCGTGGCCAGCAGCCGTATCAACCCAGTTTCCGTGTAAACCATTCATTCATTTCTTTGCTGCCCAATTGCCATTTATTGAATGGAAAACGAGGGGTTTCAATAGATGGGTCTGTTAGGCCGTTGGGATTATTACTCAACGATCGGAGGAGAAGACGAAATGGGGGCGGCGGCGGTGGACTCAGTTACAGGAGTATTGTGGCGTCCGGCATTGCTGGTGCACCGATTTCAGATGGGTCGAAACCAGACAAAGGCTTTGTTTCTCCTCCCCTCAGTGATATCCTTTGGCCTTCTGCAGGGGCATTTGCAGCGATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAAGGGCTTTCTATGACAATTGCGCCATTAGGAGCCGTCTGTGCTATCCTGTTCGCAGCGCCCTCATCCCCTGCTGCTCGAAAGTACAATATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGCGTTTTGGCGTTTACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTCATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGCTAATGTTCATCGATGGAGCTAAGATGCAGCATCTCAATTTCTGGTATGCTCTGTTTCCCGGTGCCGCTGGTTGTGTTCTCCTTTGCTTCATAGTGACAAGAGATAGTGGTGTACTTGAAGGAGAAGTTCAAATTTTGAGCAGGTTTTGGAGTGTCATACATGGATCAGCCATTATTGAAGGTGTCCCAAAGCTCATTTTTGTGAGTTTTGATTCAATGGATGATAGCCTTGGAGTGTCACACATGAAACGATGA

Protein sequence

MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRRRRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFIVTRDSGVLEGEVQILSRFWSVIHGSAIIEGVPKLIFVSFDSMDDSLGVSHMKR
Homology
BLAST of Cp4.1LG20g01980.1 vs. NCBI nr
Match: XP_023519271.1 (uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 475 bits (1223), Expect = 5.99e-168
Identity = 238/238 (100.00%), Postives = 238/238 (100.00%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238

BLAST of Cp4.1LG20g01980.1 vs. NCBI nr
Match: XP_023519272.1 (uncharacterized protein LOC111782708 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 468 bits (1205), Expect = 3.20e-165
Identity = 237/238 (99.58%), Postives = 237/238 (99.58%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR YNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR-YNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 237

BLAST of Cp4.1LG20g01980.1 vs. NCBI nr
Match: XP_022924027.1 (uncharacterized protein LOC111431576 isoform X1 [Cucurbita moschata] >KAG6584258.1 hypothetical protein SDJN03_20190, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 468 bits (1204), Expect = 4.71e-165
Identity = 234/238 (98.32%), Postives = 235/238 (98.74%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGG  YRSIVASGIA APISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGFGYRSIVASGIARAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGC+LLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCLLLCFI 238

BLAST of Cp4.1LG20g01980.1 vs. NCBI nr
Match: XP_023001510.1 (uncharacterized protein LOC111495629 [Cucurbita maxima])

HSP 1 Score: 462 bits (1189), Expect = 9.42e-163
Identity = 232/239 (97.07%), Postives = 235/239 (98.33%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           M+LQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNG RG SIDGSVRPLGLLLNDRRR
Sbjct: 1   MNLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGNRGFSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGG-GLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK 120
           RRNGGGGG G+ YRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK
Sbjct: 61  RRNGGGGGDGIGYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK 120

Query: 121 MDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW 180
           MDQ+LAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
Sbjct: 121 MDQMLAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW 180

Query: 181 LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI
Sbjct: 181 LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 239

BLAST of Cp4.1LG20g01980.1 vs. NCBI nr
Match: XP_022924028.1 (uncharacterized protein LOC111431576 isoform X2 [Cucurbita moschata])

HSP 1 Score: 461 bits (1186), Expect = 2.51e-162
Identity = 233/238 (97.90%), Postives = 234/238 (98.32%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGG  YRSIVASGIA APISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGFGYRSIVASGIARAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR YNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR-YNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGC+LLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCLLLCFI 237

BLAST of Cp4.1LG20g01980.1 vs. ExPASy TrEMBL
Match: A0A6J1E7R0 (uncharacterized protein LOC111431576 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431576 PE=4 SV=1)

HSP 1 Score: 468 bits (1204), Expect = 2.28e-165
Identity = 234/238 (98.32%), Postives = 235/238 (98.74%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGG  YRSIVASGIA APISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGFGYRSIVASGIARAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGC+LLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCLLLCFI 238

BLAST of Cp4.1LG20g01980.1 vs. ExPASy TrEMBL
Match: A0A6J1KLD7 (uncharacterized protein LOC111495629 OS=Cucurbita maxima OX=3661 GN=LOC111495629 PE=4 SV=1)

HSP 1 Score: 462 bits (1189), Expect = 4.56e-163
Identity = 232/239 (97.07%), Postives = 235/239 (98.33%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           M+LQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNG RG SIDGSVRPLGLLLNDRRR
Sbjct: 1   MNLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGNRGFSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGG-GLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK 120
           RRNGGGGG G+ YRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK
Sbjct: 61  RRNGGGGGDGIGYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK 120

Query: 121 MDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW 180
           MDQ+LAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
Sbjct: 121 MDQMLAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW 180

Query: 181 LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI
Sbjct: 181 LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 239

BLAST of Cp4.1LG20g01980.1 vs. ExPASy TrEMBL
Match: A0A6J1EB70 (uncharacterized protein LOC111431576 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431576 PE=4 SV=1)

HSP 1 Score: 461 bits (1186), Expect = 1.21e-162
Identity = 233/238 (97.90%), Postives = 234/238 (98.32%), Query Frame = 0

Query: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60
           MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR
Sbjct: 1   MSLQLKPIHHRGQQPYQPSFRVNHSFISLLPNCHLLNGKRGVSIDGSVRPLGLLLNDRRR 60

Query: 61  RRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120
           RRNGGGGGG  YRSIVASGIA APISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM
Sbjct: 61  RRNGGGGGGFGYRSIVASGIARAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGKM 120

Query: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL 180
           DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR YNMFMAQIGCAAIGVLAFTLLGPGWL
Sbjct: 121 DQILAPKGLSMTIAPLGAVCAILFAAPSSPAAR-YNMFMAQIGCAAIGVLAFTLLGPGWL 180

Query: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 238
           ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGC+LLCFI
Sbjct: 181 ARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCLLLCFI 237

BLAST of Cp4.1LG20g01980.1 vs. ExPASy TrEMBL
Match: A0A0A0LR37 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045560 PE=4 SV=1)

HSP 1 Score: 373 bits (958), Expect = 1.07e-127
Identity = 196/258 (75.97%), Postives = 214/258 (82.95%), Query Frame = 0

Query: 1   MSLQLKPIHHR----------GQQPYQPSF----------RVNHSFISLLPNCHLLNGKR 60
           MSLQLKPIHH             +PYQPS+           +NHSF+SLLP+CHLLNGKR
Sbjct: 1   MSLQLKPIHHHLHHYGGRPCHNNEPYQPSYIENIQVPSGCMLNHSFVSLLPSCHLLNGKR 60

Query: 61  GVSIDGSVRPLGLLLNDRRRRRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPL 120
           G+S     R LGL  ND RRRRN G    + +RSIVAS IAG P+SDGSKP+KGFVSPPL
Sbjct: 61  GIS----ARSLGLF-NDWRRRRNRGSDR-IGHRSIVASSIAGTPVSDGSKPEKGFVSPPL 120

Query: 121 SDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMA 180
           SDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+F+A
Sbjct: 121 SDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFLA 180

Query: 181 QIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLN 238
           QIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LN
Sbjct: 181 QIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLN 240

BLAST of Cp4.1LG20g01980.1 vs. ExPASy TrEMBL
Match: A0A1S3AUM8 (uncharacterized protein LOC103482985 OS=Cucumis melo OX=3656 GN=LOC103482985 PE=4 SV=1)

HSP 1 Score: 373 bits (957), Expect = 1.41e-127
Identity = 197/256 (76.95%), Postives = 214/256 (83.59%), Query Frame = 0

Query: 1   MSLQLKPIHHR--------GQQPYQPSFR----------VNHSFISLLPNCHLLNGKRGV 60
           MSLQLKPIHH           +PYQPS+R          +NHS +SLLP CHLLNGKRG+
Sbjct: 1   MSLQLKPIHHHLHHHGGRHCHKPYQPSYREKIQAPSACMLNHSLVSLLPICHLLNGKRGI 60

Query: 61  SIDGSVRPLGLLLNDRRRRRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSD 120
                VR LGL  ND RRRR+ G  G + +RSIVAS IAG P+SDGSKP+KGFVSPPLSD
Sbjct: 61  P----VRSLGLF-NDWRRRRSRGSDG-IGHRSIVASSIAGTPVSDGSKPEKGFVSPPLSD 120

Query: 121 ILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQI 180
           ILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQI
Sbjct: 121 ILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQI 180

Query: 181 GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFW 238
           GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFW
Sbjct: 181 GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFW 240

BLAST of Cp4.1LG20g01980.1 vs. TAIR 10
Match: AT3G47980.1 (Integral membrane HPP family protein )

HSP 1 Score: 235.3 bits (599), Expect = 6.0e-62
Identity = 130/239 (54.39%), Postives = 167/239 (69.87%), Query Frame = 0

Query: 2   SLQLKPIHHRGQQPYQPSFRVNHSFISL-LPNCHLLN-GKRGVSIDGSVRPLGLLLNDRR 61
           SL +KP+     Q +  +  +  S +++     H L     G+ ID SVR +  L +   
Sbjct: 3   SLPVKPLPSGHLQLHSRNLIIPPSMVTVGFKRHHFLGVSSYGLCIDESVRHMRSLRSSSN 62

Query: 62  RRRNGGGGGGLSYRSIVASGIAGAPISDGSKPDKGFVSPPLSDILWPSAGAFAAMAMLGK 121
           RRR     G      + +S    A   +  KP+K  V+P LSD++WP+AGAFAAMA++G+
Sbjct: 63  RRRVSKSAG--VSMPVASSDDFPAVSWESWKPEKTTVAPSLSDVIWPAAGAFAAMAIMGR 122

Query: 122 MDQILAPKGLSMTIAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW 181
           +DQ+L PKG+SM++APLGAV AILF  PS+PAARKYNMF AQIGCAAIGVLAF+  GP W
Sbjct: 123 IDQMLNPKGISMSVAPLGAVSAILFTTPSAPAARKYNMFTAQIGCAAIGVLAFSAFGPSW 182

Query: 182 LARSSALAASMAFMIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 239
           LARS+ALAAS+AFM+ T + HPPAASLPL+FIDGAK+  LNFWYALFPGAA C+LLCF+
Sbjct: 183 LARSTALAASIAFMVITRANHPPAASLPLLFIDGAKLHKLNFWYALFPGAAACILLCFL 239

BLAST of Cp4.1LG20g01980.1 vs. TAIR 10
Match: AT5G62720.1 (Integral membrane HPP family protein )

HSP 1 Score: 230.7 bits (587), Expect = 1.5e-60
Identity = 114/166 (68.67%), Postives = 139/166 (83.73%), Query Frame = 0

Query: 75  IVASGIAGAPISDGSKPDKGFVSPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMT 134
           + ++G   AP  D  KPDK   +    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM+
Sbjct: 65  VASAGNLTAPSWDSWKPDKTAAATALLLSDVIWPAAGAFAAMALLGRMDQMLSPKGISMS 124

Query: 135 IAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAF 194
           +APLGAV AILF  PS+PAARKYN+F+AQIGCAAIGV+AF++ GPGWLARS ALAAS+AF
Sbjct: 125 VAPLGAVSAILFITPSAPAARKYNIFLAQIGCAAIGVVAFSVFGPGWLARSVALAASIAF 184

Query: 195 MIYTGSTHPPAASLPLMFIDGAKMQHLNFWYALFPGAAGCVLLCFI 239
           M+ T + HPPAASLPLMFIDGAK  HLNFWYALFPGAA CV+LC +
Sbjct: 185 MVITRANHPPAASLPLMFIDGAKFHHLNFWYALFPGAAACVILCLL 230

BLAST of Cp4.1LG20g01980.1 vs. TAIR 10
Match: AT5G62720.2 (Integral membrane HPP family protein )

HSP 1 Score: 166.4 bits (420), Expect = 3.4e-41
Identity = 86/136 (63.24%), Postives = 109/136 (80.15%), Query Frame = 0

Query: 75  IVASGIAGAPISDGSKPDKGFVSPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMT 134
           + ++G   AP  D  KPDK   +    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM+
Sbjct: 65  VASAGNLTAPSWDSWKPDKTAAATALLLSDVIWPAAGAFAAMALLGRMDQMLSPKGISMS 124

Query: 135 IAPLGAVCAILFAAPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAF 194
           +APLGAV AILF  PS+PAARKYN+F+AQIGCAAIGV+AF++ GPGWLARS ALAAS+AF
Sbjct: 125 VAPLGAVSAILFITPSAPAARKYNIFLAQIGCAAIGVVAFSVFGPGWLARSVALAASIAF 184

Query: 195 MIYTGSTHPPAASLPL 209
           M+ T + HPP   L L
Sbjct: 185 MVITRANHPPGKYLLL 200

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023519271.15.99e-168100.00uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_023519272.13.20e-16599.58uncharacterized protein LOC111782708 isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_022924027.14.71e-16598.32uncharacterized protein LOC111431576 isoform X1 [Cucurbita moschata] >KAG6584258... [more]
XP_023001510.19.42e-16397.07uncharacterized protein LOC111495629 [Cucurbita maxima][more]
XP_022924028.12.51e-16297.90uncharacterized protein LOC111431576 isoform X2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1E7R02.28e-16598.32uncharacterized protein LOC111431576 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1KLD74.56e-16397.07uncharacterized protein LOC111495629 OS=Cucurbita maxima OX=3661 GN=LOC111495629... [more]
A0A6J1EB701.21e-16297.90uncharacterized protein LOC111431576 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A0A0LR371.07e-12775.97Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045560 PE=4 SV=1[more]
A0A1S3AUM81.41e-12776.95uncharacterized protein LOC103482985 OS=Cucumis melo OX=3656 GN=LOC103482985 PE=... [more]
Match NameE-valueIdentityDescription
AT3G47980.16.0e-6254.39Integral membrane HPP family protein [more]
AT5G62720.11.5e-6068.67Integral membrane HPP family protein [more]
AT5G62720.23.4e-4163.24Integral membrane HPP family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007065HPPPFAMPF04982HPPcoord: 133..238
e-value: 2.0E-21
score: 76.4
NoneNo IPR availablePANTHERPTHR33741:SF3BNAA06G16970D PROTEINcoord: 40..239
NoneNo IPR availablePANTHERPTHR33741TRANSMEMBRANE PROTEIN DDB_G0269096-RELATEDcoord: 40..239

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG20g01980Cp4.1LG20g01980gene


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG20g01980.1:three_prime_utr:001Cp4.1LG20g01980.1:three_prime_utr:001three_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG20g01980.1:exon:005Cp4.1LG20g01980.1:exon:005exon
Cp4.1LG20g01980.1:exon:004Cp4.1LG20g01980.1:exon:004exon
Cp4.1LG20g01980.1:exon:003Cp4.1LG20g01980.1:exon:003exon
Cp4.1LG20g01980.1:exon:002Cp4.1LG20g01980.1:exon:002exon
Cp4.1LG20g01980.1:exon:001Cp4.1LG20g01980.1:exon:001exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG20g01980.1:cds:001Cp4.1LG20g01980.1:cds:001CDS
Cp4.1LG20g01980.1:cds:002Cp4.1LG20g01980.1:cds:002CDS
Cp4.1LG20g01980.1:cds:003Cp4.1LG20g01980.1:cds:003CDS
Cp4.1LG20g01980.1:cds:004Cp4.1LG20g01980.1:cds:004CDS
Cp4.1LG20g01980.1:cds:005Cp4.1LG20g01980.1:cds:005CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG20g01980.1:five_prime_utr:001Cp4.1LG20g01980.1:five_prime_utr:001five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG20g01980.1Cp4.1LG20g01980.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane