Cp4.1LG01g04020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g04020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Structural polyprotein
Location: Cp4.1LG01:1562873..1567134 (+)
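
The location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Cp4.1LG01 : 1562873 .. 1567134 (+)'
    into (sequence id, start, end, strand). Whitespace around ':' and '..'
    is optional."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(text):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end, _ = parse_location(text)
    return end - start + 1

print(parse_location("Cp4.1LG01 : 1562873 .. 1567134 (+)"))
print(span_length("Cp4.1LG01 : 1562873 .. 1567134 (+)"))
```

Under that assumption the gene spans 4262 bases on the plus strand of Cp4.1LG01.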
The following sequences are available for this feature:

Gene sequence (with intron)

AGGGGGCGGTTTGGCTTGATCGGTTAACAACTTCCCGAAACTTCAATAAAATATCGAGTATGCATACACATTTAATTCGATATCTAATTTGCAGATTTTGGTAAGAGAGGTTCGCTCAGAGGTGAATAATGTGTTGGAATTTTTGGTTGCATCATTTTCTTCTCCTTCTCTTCTGTTTCTCTGCTTCACTTGACGCTCAATTGCAGAATTTGTAAGCTCTAATTTTCAACGTTATGTGTTCTTATTGATCATTGTGTTATTTATTGATCTGATCTATTGGAATTCCATGGCGATTTGTTGCGTTCGTTCTTAATGTCAGAGGCAGAATTGGGAGAAACGAAGAAACTGAGATAGGGTTTGGGCGCAGAGCTCTGTTGAGTTTCAAAGAGTCTCCTCATGGAAGTAACGTTACCTTCGATTGCTCTCGTTCCGGGCCCTGTGTTGCGTGCCTCCTCTCCGAAAAGGTATCTGTTCGTTAGGTCTTTAAATCGAGATAAACTAAATACTGGTGCTCTTATTCATTCCTGTTCAGTTGGCATGCGTTGTTTTTTTGGGTTGAACTATCTGGAAATTATGATCTAGCTTGTTGGAGTTCTGAAACAGTGAAGCTTTTCGATATCATTTTTGTTATAGCGAGTAGCAACGAGATATTGCAAATGCATGTCAGATTTTTCATCATGCAGCAATGAATTTCCAGTTTTCAATGCATTAGAATTTGATAATGATAGGCCCCTTGCGTTCTCGTTCTCTTTGACCAAAATTTTATGTTGTGCTTTCTTTGAAAATTTGTTCACGCTGGGTATACTTTGTTTAATTTGTAACAAATGCACTTACCCAGTTCACAAAACAGAAAAATAATAATACTAAAAAGACAAAAGATCGACACCAAAGTTTCCTGTCTTTGGATCCTCTTAACCTATTTTCCCATATCCTTTCTGAAAATCCTTCTCTTTTTGGTTAGTTGTAGGGGATTTGAACATGGATACTGTGAGATCCCACATCGGTTGAAGAGGGAAGCAAAGCATTCTTTATAAGGGTGTAGAAATCTCTTCCTAGTAGATGCGTTTTAAAACCTTGAGGGGAAGCCCAAAGAAGACAATATTTACTAGCGGTAGACTTGAAATGTTACAAATGGTATCAAAGCTAGACACAGGGCGGTATGCCAACGGGGATGATGGGCCTCCAAGGGGGTGGATTGTGAGATCACACATTAGATGCGTTTTAAAACCTTGAGAGTAAGTCCAAAAAAGTATTTGCAGGCGGACTGTTTGGAAGGTTACAGATACCACGATTTTATAGTCACAGGGAGGGACAGACAAGGTTGACTCTGTAGTTAATATTTGGATAGTGCTAGGCTTTCAGTTCTTAATGTGTTAGGATTGGGAATGGTAGTCCGTTGCATATTTTCCATCTTTAAATCCAATGTTTCATGTTTAATGGTAGCTTTTGCTTTGCTTTTTGGTATAAAATTTAAAGTAGAAGTAATTTTTCATTATCTTCTTCTTTTTGGTCACATAGAAGGATGAAAATTATGGCTGCAGTGAGACTGGGTATCGTATTCCCCTGAAATGTGTGGAAGTCAAAGACACATCAAAAGTTTCCAATGAGAAAAAATCTCATGATGGTCGCTCAACACTTGAAATGTCTTATGAGCACAAGCTTGTGAACCATTTGCAGAGTGATGCAAGCGATCATGCTTCATCAGTTGCACACAGACATCAACGGGATGGTTCTATCTCTGGAACAGTTGGTTCCCAGGATTACATCATTTACAGAAGCTGTATTCCATCAGTTAACGATGAAAAGTTGTCTGTACTTGGTTTTGAGGTATGCCATTTATCTTAGCTGCTCTATTTCAAAGGGTTTAGTTTACTGCAATTAATGGTTTAATGGCCGTTGCATGCCTCATTATCTTCTCTCAAATTTCGATATATTGCTTTACTCTGTATAATTGATTTTGAATCCGAAAGTAGTCATCAACTAAATAGCTTTGCAGTCAT
GCTACTATGGTTACTGCTGTTCTTATTTATTTTAATTGTTGTCTGTAACTTTTACTTGCCACAAGATCCTATTTACTTCTTAAGTAGAGAATGCATCAATGGGAGGTGGACAGTTAAGGCACCTCATGAGACCTGTTCATCTTTTACAGAACCAAAATTCGATGGCTAATCAGTGTAATCTATGGATTTTCTTCAGCAGTTCTGTTGTTAAGATGATAGCTAACATCATATCTAGTCAGATTTGCAGTACTAAGGAGGTTCATTCTTTGTGCTACTTTGTTGTCAATCTATCTTTTTTCTTCTTCACATCTTGTATTTTGTAAAGTTAGATGGAAGAATAATTGTTCCTCTTCATATTAAAATCAAACTATTGTTAATTAAGTCAATAATGAATTGGGCTTAAAATACCGTCTTCTATTTTTCAAAAGATCCCACATCAGTTGGTGTGGAAACGTCTCACCAACAGACGCGTACGCGTTTTAAAACCTTGAGGGGAAGCCTGGAAGGGAAAGTCCAAAGAGGACAATATTTGCTAGCAGTGGGCTTGGGCTGTTACAAATGGTCTCAGAGTTAGGCATTGGGCAGTGTGCTAGCGAGGACGCTGGCTCCCGAGGGGTGGATTGTGAGATCCCATTTTGGTTAGAGAGGGGAGCGAAGTAGTCCTCATAAAGGTTGGAAACCTCTCCCTATTAGACACGTTGTAAAACCTTGAGGAAAAGTCCGGACAGGAAAGCCTAAAGAGAATAATATGTGCTGGTAGTGGACTTGGGCTTTTACATTCGAAATAGTGGCGAACAGAACAGGTGAAAACTATTCTCCACTTCCTTTCGAGATGGAAAATGATCATTGAGCTGTTGGTATTTGGGAATTTTGAAGTATTGCAATATGTGAAGAGGAGATATCCTTATCTAACTGGTTGTCTAATAGAGCATGCTACTTTTATCTTGATTGGATTCATTTCATTTTGTGTACTGTTGTGGACAGGGCATTGTGCTAGGCTTGCTGCTTATAAGCGGTTCAGTTGTGTACTTCAGAAGGAAGCGAAGTGTTTCAACGGCTGGTTTTGCATCTGGCAGGGTGCAGTCAAATTCCAGGTTCTAGTACATCACTCATTTTTTCTTACAATTATCCACCAATTCTACACATTCAAATTATACAAAGAATTAGATTCATTTGAACAACTATTTCCTTTCCTTTTTCTCTGTTCTAAAAGCTATACCAAAATTGCCCATCTATGAAAANAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAGCCCAAGCCCATATATATACACATAGAGAGAGGGAGATATGTACAATCCAGATGTTTTTTATTTTGGATTATTGAAATTAGAATACAAATAAGAGGCATGTTGTTTGAAGTTAATTAACATTAAAGGGTAACGTTGAGAAGAACCAAGAGACATGAGTTGCTCTCAAACTGGAGGAAGCCTAAGCCTAAGCCTAAGCCTAAGCCTAAGGTCAGGCGGGTGGCCGTCCTTTTACTCCCGTTCGCCTGCAAACCAAACCCAACAAATATTAACTCGCTTAAAATCAATATATATGGTAGGAAAAAAATTTCACACCCTCATAAAAAATGTTTCATTCTCTTCTCAAAATGACGTGGGATCTTACAAAAGTCTAATAAGAAATTTGAACACAACCCGATATCATTGTCATGTTAAAGTTGATCTTCGAAAAAAATCACCTGTTATGAATTGGGTCGGGTCCTTTAGGGACTTTAACTTTGCTCATGAAGTTAAGCTCCTTGGCTTGCTTCATATGGCTTCCCTCAAACTCAGACCTAGCTGCCGGCCAGCTTGTTGTTTTTGCTACCGACCGTTCTCTGCTGTCGGGCATGCCAACGATTAAAAGCCATACAAACATCCCAACTGCCGCAGCTCCAAGCAAGACCTTGAAGTAAGAAAACGAGCAACAACTACCTTCACTTCTCATAAATATATATATATACACTTC
TGTCTATGTACGTCGAAAGAGATTAGTTAACTTTACGACTCGAAGAGGCAAGGACGTCAAGCAATGAGAAGAAGAAGAGAGGAAGAGGAAGAAGAAGGAGAGGAGTTTGTTTAGAAAAGGGGAGTGCATGTTTAATTAGGGAGTTGTCTTTTTTTTCTTTATAGGGTTGGTGGATTCTGTTGCACTTTTCTCAAACCCTGCCTTGCCTGTTTGTTTATAATTGACACACACACACACCATTATATTCTAACATATCATATTA

mRNA sequence

AGGGGGCGGTTTGGCTTGATCGGTTAACAACTTCCCGAAACTTCAATAAAATATCGAGTATGCATACACATTTAATTCGATATCTAATTTGCAGATTTTGGTAAGAGAGGTTCGCTCAGAGGTGAATAATGTGTTGGAATTTTTGGTTGCATCATTTTCTTCTCCTTCTCTTCTGTTTCTCTGCTTCACTTGACGCTCAATTGCAGAATTTAGGCAGAATTGGGAGAAACGAAGAAACTGAGATAGGGTTTGGGCGCAGAGCTCTGTTGAGTTTCAAAGAGTCTCCTCATGGAAGTAACGTTACCTTCGATTGCTCTCGTTCCGGGCCCTGTGTTGCGTGCCTCCTCTCCGAAAAGAAGGATGAAAATTATGGCTGCAGTGAGACTGGGTATCGTATTCCCCTGAAATGTGTGGAAGTCAAAGACACATCAAAAGTTTCCAATGAGAAAAAATCTCATGATGGTCGCTCAACACTTGAAATGTCTTATGAGCACAAGCTTGTGAACCATTTGCAGAGTGATGCAAGCGATCATGCTTCATCAGTTGCACACAGACATCAACGGGATGGTTCTATCTCTGGAACAGTTGGTTCCCAGGATTACATCATTTACAGAAGCTGTATTCCATCAGTTAACGATGAAAAGTTGTCTGTACTTGGTTTTGAGGGCATTGTGCTAGGCTTGCTGCTTATAAGCGGTTCAGTTGTGTACTTCAGAAGGAAGCGAAGTGTTTCAACGGCTGGTTTTGCATCTGGCAGGGTGCAGTCAAATTCCAGGTTCTAGTACATCACTCATTTTTTCTTACAATTATCCACCAATTCTACACATTCAAATTATACAAAGAATTAGATTCATTTGAACAACTATTTCCTTTCCTTTTTCTCTGTTCTAAAAGCTATACCAAAATTGCCCATCTATGAAAANAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAGCCCAAGCCCATATATATACACATAGAGAGAGGGAGATATGTACAATCCAGATGTTTTTTATTTTGGATTATTGAAATTAGAATACAAATAAGAGGCATGTTGTTTGAAGTTAATTAACATTAAAGGGTAACGTTGAGAAGAACCAAGAGACATGAGTTGCTCTCAAACTGGAGGAAGCCTAAGCCTAAGCCTAAGCCTAAGCCTAAGGTCAGGCGGGTGGCCGTCCTTTTACTCCCGTTCGCCTGCAAACCAAACCCAACAAATATTAACTCGCTTAAAATCAATATATATGGTAGGAAAAAAATTTCACACCCTCATAAAAAATGTTTCATTCTCTTCTCAAAATGACGTGGGATCTTACAAAAGTCTAATAAGAAATTTGAACACAACCCGATATCATTGTCATGTTAAAGTTGATCTTCGAAAAAAATCACCTGTTATGAATTGGGTCGGGTCCTTTAGGGACTTTAACTTTGCTCATGAAGTTAAGCTCCTTGGCTTGCTTCATATGGCTTCCCTCAAACTCAGACCTAGCTGCCGGCCAGCTTGTTGTTTTTGCTACCGACCGTTCTCTGCTGTCGGGCATGCCAACGATTAAAAGCCATACAAACATCCCAACTGCCGCAGCTCCAAGCAAGACCTTGAAGTAAGAAAACGAGCAACAACTACCTTCACTTCTCATAAATATATATATATACACTTCTGTCTATGTACGTCGAAAGAGATTAGTTAACTTTACGACTCGAAGAGGCAAGGACGTCAAGCAATGAGAAGAAGAAGAGAGGAAGAGGAAGAAGAAGGAGAGGAGTTTGTTTAGAAAAGGGGAGTGCATGTTTAATTAGGGAGTTGTCTTTTTTTTCTTTATAGGGTTGGTGGATTCTGTTGCACTTTTCTCAAACCCTGCCTTGCCTGTTTGTTTATAATTGACACACACACACACCATTATATTCTAACATATCATATTA

Coding sequence (CDS)

ATGTGTTGGAATTTTTGGTTGCATCATTTTCTTCTCCTTCTCTTCTGTTTCTCTGCTTCACTTGACGCTCAATTGCAGAATTTAGGCAGAATTGGGAGAAACGAAGAAACTGAGATAGGGTTTGGGCGCAGAGCTCTGTTGAGTTTCAAAGAGTCTCCTCATGGAAGTAACGTTACCTTCGATTGCTCTCGTTCCGGGCCCTGTGTTGCGTGCCTCCTCTCCGAAAAGAAGGATGAAAATTATGGCTGCAGTGAGACTGGGTATCGTATTCCCCTGAAATGTGTGGAAGTCAAAGACACATCAAAAGTTTCCAATGAGAAAAAATCTCATGATGGTCGCTCAACACTTGAAATGTCTTATGAGCACAAGCTTGTGAACCATTTGCAGAGTGATGCAAGCGATCATGCTTCATCAGTTGCACACAGACATCAACGGGATGGTTCTATCTCTGGAACAGTTGGTTCCCAGGATTACATCATTTACAGAAGCTGTATTCCATCAGTTAACGATGAAAAGTTGTCTGTACTTGGTTTTGAGGGCATTGTGCTAGGCTTGCTGCTTATAAGCGGTTCAGTTGTGTACTTCAGAAGGAAGCGAAGTGTTTCAACGGCTGGTTTTGCATCTGGCAGGGTGCAGTCAAATTCCAGGTTCTAG

Protein sequence

MCWNFWLHHFLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQSDASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISGSVVYFRRKRSVSTAGFASGRVQSNSRF
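
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (the record does not say which table was used), the name `translate` is hypothetical, and only the first 30 and last 18 bases of the CDS are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumed here; the record does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.
    Codons containing ambiguous bases are reported as 'X'."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 18 bases of the CDS listed above.
print(translate("ATGTGTTGGAATTTTTGGTTGCATCATTTT"))  # start of the protein
print(translate("TCAAATTCCAGGTTCTAG"))              # end of the protein
```

The two fragments translate to MCWNFWLHHF and SNSRF, matching the first ten and last five residues of the protein sequence.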
BLAST of Cp4.1LG01g04020 vs. TrEMBL
Match: A0A0A0KVR8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604010 PE=4 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 6.5e-86
Identity = 168/208 (80.77%), Positives = 182/208 (87.50%), Query Frame = 1

Query: 10  FLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCV 69
           FLLLLF FSASL AQ QN  R+G +EE EIGFGRRALLSFKE+P GSNVTF+CSRSGPCV
Sbjct: 10  FLLLLFTFSASLHAQSQNTDRVGGDEEIEIGFGRRALLSFKETPQGSNVTFECSRSGPCV 69

Query: 70  ACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQ 129
           ACL SEK DE Y CSETGYRIPLKCVE+KDTSKVSN KKSH+GRS L++SYEHK+V    
Sbjct: 70  ACLYSEKNDEKYRCSETGYRIPLKCVEIKDTSKVSNGKKSHNGRSMLDISYEHKVV---P 129

Query: 130 SDASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLIS 189
            DAS +ASS+AHR+ RDGS+S T GSQ YI YRSCIPSVN+EKLSVLGFEGIVL LLLIS
Sbjct: 130 DDASGNASSIAHRNLRDGSVSSTDGSQSYITYRSCIPSVNEEKLSVLGFEGIVLCLLLIS 189

Query: 190 GSVVYFRRKRSVSTAGFASGRVQSNSRF 218
           GSVVYFRRKRSVSTAGFASGRVQSNSRF
Sbjct: 190 GSVVYFRRKRSVSTAGFASGRVQSNSRF 214
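
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch that recomputes them from the summary line of the hit above; the function name `recompute_percentages` is hypothetical and the regular expression is an assumption about this page's line layout, not a general BLAST parser.

```python
import re

def recompute_percentages(summary_line):
    """Find every 'Label = matches/length' pair in an HSP summary line
    and return {label: percentage rounded to two decimals}."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 168/208 (80.77%), Positives = 182/208 (87.50%), Query Frame = 1"
print(recompute_percentages(line))
```

For this hit, 168 identical positions over an alignment length of 208 give 80.77%, and 182 positive positions give 87.5%, in agreement with the reported values.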

BLAST of Cp4.1LG01g04020 vs. TrEMBL
Match: M5XNF3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011511mg PE=4 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 3.4e-50
Identity = 119/206 (57.77%), Positives = 139/206 (67.48%), Query Frame = 1

Query: 12  LLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVAC 71
           LLLF F A     L ++G      +   G GRRALLS KE+P GSN TF+CS +GPCV C
Sbjct: 9   LLLFFFVAFA---LLSVGHATIQGDESEGIGRRALLSLKETPRGSNTTFECSPAGPCVPC 68

Query: 72  LLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQSD 131
           L SEKKDE Y CSETGYRIPLKCVE K + K    K S   RSTLE+ Y +    H   +
Sbjct: 69  LYSEKKDEKYRCSETGYRIPLKCVETKRSLKDEKAKGSQKSRSTLEI-YHNDAELH---N 128

Query: 132 ASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISGS 191
           A +  +SV HR   D S +   G Q YI YRSCIP+V++EKLSVLGFE IVL  LLISGS
Sbjct: 129 AEELGTSVKHRSLLDDSATLEDGPQAYITYRSCIPAVSEEKLSVLGFEMIVLFFLLISGS 188

Query: 192 VVYFRRKRSVSTAGFASGRVQSNSRF 218
           VVYFRRK++VS  GF +GR+QSNSRF
Sbjct: 189 VVYFRRKQTVSMTGFGAGRIQSNSRF 207

BLAST of Cp4.1LG01g04020 vs. TrEMBL
Match: A0A067JBW4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21738 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 1.9e-48
Identity = 115/207 (55.56%), Positives = 138/207 (66.67%), Query Frame = 1

Query: 12  LLLFCFSASLDAQLQNLGRIGRNEETEIG-FGRRALLSFKESPHGSNVTFDCSRSGPCVA 71
           L+LF  SA+L     +L  I  +++T  G  G R LL FKE P GSN+TFDCS SG CV 
Sbjct: 12  LVLFLVSAAL----LSLSSINASDQTTNGGSGSRVLLVFKEKPEGSNLTFDCSPSGACVP 71

Query: 72  CLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQS 131
           C  SEK DE Y CSETGYRIPLKC+E KD  K  N+KKS   RS +E+S E+   + +  
Sbjct: 72  CQYSEKTDEKYRCSETGYRIPLKCIETKDNIKNGNDKKSQKTRSVIEISNENANPHAMLH 131

Query: 132 DASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISG 191
           D    A+S   R   D S +   GSQ YI YRSCIP VN+EKLSVLGFEGI+L L LISG
Sbjct: 132 D----AASNEQRSLLDDSSTLEDGSQAYITYRSCIPPVNEEKLSVLGFEGIILCLFLISG 191

Query: 192 SVVYFRRKRSVSTAGFASGRVQSNSRF 218
           SVVYFRRK++V+ +G   GR+Q NSRF
Sbjct: 192 SVVYFRRKQTVTMSGVGGGRIQMNSRF 210

BLAST of Cp4.1LG01g04020 vs. TrEMBL
Match: A0A067GST4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g028438mg PE=4 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 2.4e-48
Identity = 111/208 (53.37%), Positives = 140/208 (67.31%), Query Frame = 1

Query: 10  FLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCV 69
           F+ LLF  +  L +  Q       ++  +   G+R  LSFKE+P GSN TF+CS SGPCV
Sbjct: 6   FIFLLFIIT-QLSSPHQASTYQPESKSADRSVGQRHFLSFKETPSGSNRTFECSPSGPCV 65

Query: 70  ACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQ 129
            CL SEK DE Y CSETGYRIPLKCVE +D S V NEKK+   R  LE+S + +  + + 
Sbjct: 66  RCLYSEKIDEKYRCSETGYRIPLKCVESEDVSNVENEKKT---RGALEISTDSEKPHVIL 125

Query: 130 SDASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLIS 189
            DA +  +S+ HR   + S +    SQ YI YRSCIP+VN+EKLSVLGFEGIV  LLLIS
Sbjct: 126 HDAVELTTSIKHRSLLEDSSTSKGKSQAYITYRSCIPAVNEEKLSVLGFEGIVFCLLLIS 185

Query: 190 GSVVYFRRKRSVSTAGFASGRVQSNSRF 218
           GS VY RRKR+V+ +G  +GR+Q+NSRF
Sbjct: 186 GSAVYLRRKRTVTMSGIGTGRIQANSRF 209

BLAST of Cp4.1LG01g04020 vs. TrEMBL
Match: M5Y576_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011511mg PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 3.2e-48
Identity = 118/206 (57.28%), Positives = 138/206 (66.99%), Query Frame = 1

Query: 12  LLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVAC 71
           LLLF F A     L ++G      +   G GRRALLS KE+P GSN TF+CS +GPCV C
Sbjct: 9   LLLFFFVAFA---LLSVGHATIQGDESEGIGRRALLSLKETPRGSNTTFECSPAGPCVPC 68

Query: 72  LLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQSD 131
           L SEK DE Y CSETGYRIPLKCVE K + K    K S   RSTLE+ Y +    H   +
Sbjct: 69  LYSEK-DEKYRCSETGYRIPLKCVETKRSLKDEKAKGSQKSRSTLEI-YHNDAELH---N 128

Query: 132 ASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISGS 191
           A +  +SV HR   D S +   G Q YI YRSCIP+V++EKLSVLGFE IVL  LLISGS
Sbjct: 129 AEELGTSVKHRSLLDDSATLEDGPQAYITYRSCIPAVSEEKLSVLGFEMIVLFFLLISGS 188

Query: 192 VVYFRRKRSVSTAGFASGRVQSNSRF 218
           VVYFRRK++VS  GF +GR+QSNSRF
Sbjct: 189 VVYFRRKQTVSMTGFGAGRIQSNSRF 206

BLAST of Cp4.1LG01g04020 vs. TAIR10
Match: AT1G69980.1 (AT1G69980.1 unknown protein)

HSP 1 Score: 157.5 bits (397), Expect = 9.1e-39
Identity = 90/210 (42.86%), Positives = 126/210 (60.00%), Query Frame = 1

Query: 11  LLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVA 70
           +  +F +++S+ + +         +E     GRR LL FKE+P GSN+TF CS SGPCV+
Sbjct: 12  IFFIFIYASSIPSSILARAHEHDGDEEIRSVGRRFLLGFKETPKGSNITFACSPSGPCVS 71

Query: 71  CLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQS 130
           C  SEK+ E Y CSETGYRIP KC EV++  +V   KK+ +  +              Q+
Sbjct: 72  CNSSEKRKEKYRCSETGYRIPFKCKEVRE--EVDAHKKNGEEET--------------QN 131

Query: 131 DASDHASSVAHRHQR--DGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLI 190
           D S++    A       D S +  V SQ Y  YRSC+PS ++EKLSVLGFE I+LGL L+
Sbjct: 132 DQSNNNDEEAKTRNLLDDSSPATKVKSQSYKTYRSCVPSADEEKLSVLGFESIMLGLFLL 191

Query: 191 SGSVVYFRRKRSVSTAGF-ASGRVQSNSRF 218
           SG+ +Y R++++V   G  +SGR+Q NSRF
Sbjct: 192 SGAAIYIRKRQTVPMLGVSSSGRLQGNSRF 205

BLAST of Cp4.1LG01g04020 vs. NCBI nr
Match: gi|449434859|ref|XP_004135213.1| (PREDICTED: uncharacterized protein LOC101207563 [Cucumis sativus])

HSP 1 Score: 325.1 bits (832), Expect = 9.4e-86
Identity = 168/208 (80.77%), Positives = 182/208 (87.50%), Query Frame = 1

Query: 10  FLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCV 69
           FLLLLF FSASL AQ QN  R+G +EE EIGFGRRALLSFKE+P GSNVTF+CSRSGPCV
Sbjct: 10  FLLLLFTFSASLHAQSQNTDRVGGDEEIEIGFGRRALLSFKETPQGSNVTFECSRSGPCV 69

Query: 70  ACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQ 129
           ACL SEK DE Y CSETGYRIPLKCVE+KDTSKVSN KKSH+GRS L++SYEHK+V    
Sbjct: 70  ACLYSEKNDEKYRCSETGYRIPLKCVEIKDTSKVSNGKKSHNGRSMLDISYEHKVV---P 129

Query: 130 SDASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLIS 189
            DAS +ASS+AHR+ RDGS+S T GSQ YI YRSCIPSVN+EKLSVLGFEGIVL LLLIS
Sbjct: 130 DDASGNASSIAHRNLRDGSVSSTDGSQSYITYRSCIPSVNEEKLSVLGFEGIVLCLLLIS 189

Query: 190 GSVVYFRRKRSVSTAGFASGRVQSNSRF 218
           GSVVYFRRKRSVSTAGFASGRVQSNSRF
Sbjct: 190 GSVVYFRRKRSVSTAGFASGRVQSNSRF 214

BLAST of Cp4.1LG01g04020 vs. NCBI nr
Match: gi|659090936|ref|XP_008446282.1| (PREDICTED: uncharacterized protein LOC103489059 [Cucumis melo])

HSP 1 Score: 319.7 bits (818), Expect = 3.9e-84
Identity = 167/217 (76.96%), Positives = 180/217 (82.95%), Query Frame = 1

Query: 1   MCWNFWLHHFLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTF 60
           M W F    FLLLLF FSASL AQ  N  R+  +EE  IGFGRRALLSFKE+P GSNVTF
Sbjct: 1   MYWKFSFLPFLLLLFTFSASLHAQSLNTDRVSGDEEIGIGFGRRALLSFKETPQGSNVTF 60

Query: 61  DCSRSGPCVACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSY 120
           +CSRSGPCVACL SEK DE Y CSETGYRIPLKCVE+KDTSKVSNEKKSH+GRS L++SY
Sbjct: 61  ECSRSGPCVACLYSEKNDEKYRCSETGYRIPLKCVEIKDTSKVSNEKKSHNGRSMLDISY 120

Query: 121 EHKLVNHLQSDASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEG 180
           EHK+V     DAS HASS AHR+ RDGS+S T G Q YI YRSCIPSVN+EKLSVLGFEG
Sbjct: 121 EHKVV---PDDASGHASSTAHRNLRDGSVSSTDGPQSYITYRSCIPSVNEEKLSVLGFEG 180

Query: 181 IVLGLLLISGSVVYFRRKRSVSTAGFASGRVQSNSRF 218
           IVL LLLISGSVVYFRRKRS+S AGFASGRVQSNSRF
Sbjct: 181 IVLCLLLISGSVVYFRRKRSISMAGFASGRVQSNSRF 214

BLAST of Cp4.1LG01g04020 vs. NCBI nr
Match: gi|596287730|ref|XP_007225822.1| (hypothetical protein PRUPE_ppa011511mg [Prunus persica])

HSP 1 Score: 206.5 bits (524), Expect = 4.9e-50
Identity = 119/206 (57.77%), Positives = 139/206 (67.48%), Query Frame = 1

Query: 12  LLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVAC 71
           LLLF F A     L ++G      +   G GRRALLS KE+P GSN TF+CS +GPCV C
Sbjct: 9   LLLFFFVAFA---LLSVGHATIQGDESEGIGRRALLSLKETPRGSNTTFECSPAGPCVPC 68

Query: 72  LLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQSD 131
           L SEKKDE Y CSETGYRIPLKCVE K + K    K S   RSTLE+ Y +    H   +
Sbjct: 69  LYSEKKDEKYRCSETGYRIPLKCVETKRSLKDEKAKGSQKSRSTLEI-YHNDAELH---N 128

Query: 132 ASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISGS 191
           A +  +SV HR   D S +   G Q YI YRSCIP+V++EKLSVLGFE IVL  LLISGS
Sbjct: 129 AEELGTSVKHRSLLDDSATLEDGPQAYITYRSCIPAVSEEKLSVLGFEMIVLFFLLISGS 188

Query: 192 VVYFRRKRSVSTAGFASGRVQSNSRF 218
           VVYFRRK++VS  GF +GR+QSNSRF
Sbjct: 189 VVYFRRKQTVSMTGFGAGRIQSNSRF 207

BLAST of Cp4.1LG01g04020 vs. NCBI nr
Match: gi|645229489|ref|XP_008221490.1| (PREDICTED: uncharacterized protein LOC103321462 [Prunus mume])

HSP 1 Score: 206.1 bits (523), Expect = 6.3e-50
Identity = 117/206 (56.80%), Positives = 137/206 (66.50%), Query Frame = 1

Query: 12  LLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCVAC 71
           LLLF F A     L ++G      +   G GRRALLS KE+P GSN TF+CS +GPCV C
Sbjct: 9   LLLFFFLAFA---LLSVGHATIQSDESEGIGRRALLSLKETPRGSNTTFECSPAGPCVPC 68

Query: 72  LLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQSD 131
           L SEKKDE Y CSETGYRIPLKCVE K +SK    K S   RSTL + +         +D
Sbjct: 69  LYSEKKDEKYRCSETGYRIPLKCVETKRSSKDEKAKGSQKSRSTLGIYH---------ND 128

Query: 132 ASDHASSVAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLISGS 191
           A +  +SV HR   D S +   G Q YI YRSCIP+V++EKLSVLGFE IVL  LLISGS
Sbjct: 129 AEELGTSVKHRSLLDDSATLEDGPQAYITYRSCIPAVSEEKLSVLGFEVIVLFFLLISGS 188

Query: 192 VVYFRRKRSVSTAGFASGRVQSNSRF 218
           VVY RRK++VS  GF +GR+QSNSRF
Sbjct: 189 VVYLRRKQTVSMTGFGAGRIQSNSRF 202

BLAST of Cp4.1LG01g04020 vs. NCBI nr
Match: gi|658009503|ref|XP_008339965.1| (PREDICTED: uncharacterized protein LOC103402955 [Malus domestica])

HSP 1 Score: 202.2 bits (513), Expect = 9.2e-49
Identity = 121/209 (57.89%), Positives = 145/209 (69.38%), Query Frame = 1

Query: 10  FLLLLFCFSASLDAQLQNLGRIGRNEETEIGFGRRALLSFKESPHGSNVTFDCSRSGPCV 69
           FLLLLF  + +L +       IG +EE+  G GRRALLS KE+PHGSN T++CS +GPCV
Sbjct: 8   FLLLLFSIAFNLLSLAH--ATIG-SEESGGGIGRRALLSLKETPHGSNATYECSPAGPCV 67

Query: 70  ACLLSEKKDENYGCSETGYRIPLKCVEVKDTSKVSNEKKSHDGRSTLEMSYEHKLVNHLQ 129
            CL  EKKDE Y CSETGYRIPLKCVE+K  SK +  K +   RS LE+S+     N  +
Sbjct: 68  PCLYPEKKDEKYRCSETGYRIPLKCVEMKHGSKEAKMKGAQKSRSALEISH-----NIEE 127

Query: 130 SDASDHASS-VAHRHQRDGSISGTVGSQDYIIYRSCIPSVNDEKLSVLGFEGIVLGLLLI 189
           SD +D   + V HR   D   S T   Q YI YRSCIP+VN+EKLSVLGFE IVL LLLI
Sbjct: 128 SDKADBLGTYVKHRSLLDD--SSTQEPQAYITYRSCIPAVNEEKLSVLGFEVIVLFLLLI 187

Query: 190 SGSVVYFRRKRSVSTAGFASGRVQSNSRF 218
           SGS+VYFRRK++ STAGF   R+Q+NSRF
Sbjct: 188 SGSLVYFRRKQTTSTAGFV--RIQNNSRF 204

The following BLAST results are available for this feature:

TrEMBL
Match Name          E-value    Identity    Description
A0A0A0KVR8_CUCSA    6.5e-86    80.77       Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604010 PE=4 SV=1
M5XNF3_PRUPE        3.4e-50    57.77       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011511mg PE=4 SV=1
A0A067JBW4_JATCU    1.9e-48    55.56       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21738 PE=4 SV=1
A0A067GST4_CITSI    2.4e-48    53.37       Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g028438mg PE=4 SV=1
M5Y576_PRUPE        3.2e-48    57.28       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011511mg PE=4 SV=1

TAIR10
Match Name          E-value    Identity    Description
AT1G69980.1         9.1e-39    42.86       unknown protein

NCBI nr
Match Name                          E-value    Identity    Description
gi|449434859|ref|XP_004135213.1|    9.4e-86    80.77       PREDICTED: uncharacterized protein LOC101207563 [Cucumis sativus]
gi|659090936|ref|XP_008446282.1|    3.9e-84    76.96       PREDICTED: uncharacterized protein LOC103489059 [Cucumis melo]
gi|596287730|ref|XP_007225822.1|    4.9e-50    57.77       hypothetical protein PRUPE_ppa011511mg [Prunus persica]
gi|645229489|ref|XP_008221490.1|    6.3e-50    56.80       PREDICTED: uncharacterized protein LOC103321462 [Prunus mume]
gi|658009503|ref|XP_008339965.1|    9.2e-49    57.89       PREDICTED: uncharacterized protein LOC103402955 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0016021        integral component of membrane
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG01g04020.1    Cp4.1LG01g04020.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description     Source     Source Term      Source Description     Alignment
None        No IPR available    PANTHER    PTHR36336        FAMILY NOT NAMED       coord: 6..217; score: 7.2
None        No IPR available    PANTHER    PTHR36336:SF1    SUBFAMILY NOT NAMED    coord: 6..217; score: 7.2