CmaCh18G004700 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh18G004700
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Binding protein
Location: Cma_Chr18: 2637315 .. 2641398 (+)
RNA-Seq Expression: CmaCh18G004700
Synteny: CmaCh18G004700
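
The Location field can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the original record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr18: 2637315 .. 2641398 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 4084 bp on the plus strand of Cma_Chr18.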
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AACCAACCACCACCCACCGTCATGAACTCCACCGCCCTCCTCCACCGTGGAATCGCCCCAACACTGCCACTCCCGTTGTCGTTCCGACGCAACATGTCTTCTTCCGTTCACCTCGTCCCCAACGAGACCGTTAACAAGCACGACGCTCGGAGGTGTAGTAATAATGGCATCAAAATAGTGTGTCACAGATATAACCGAATGGAGTCGTCCGGTTCTCAGAACGATGGGTTTCCTTTTTTAAACACCAAAGTGGCTATGCAGTCGCTGCTGTGCTTCTCTTTGGATGCGGCGGCGGAGTTCGAAACCAAAGACATAAGTGCTCAGAAGGTGGGCTTTAGTTTATACTTTCAACTATGTTTAGGTTTGATTGTAAGTTTGAGGGTTTATTGTTTGTGGCAGAGAAAAGCTTTACAGGCATTGTTGGCTAATAATCCTGCAGAGGCTGAGAGGATAATGAAGAAGGTGCTCCAAAAATACAAAAATGACAAAGTTGAGATTAAATATGAAGCTACCTTGGCAATGGTTGCAATTCTCATCCACACGGTATTTTCTCCGGGGTATACATGAGCTAGATTGGGTTGGGTTAGTCGGGTTAAAATTATTTAGGAATTTCACTTATAGATACCTATTAGGTTGATAATTAGTTTAATTATTGAACATTTATCGATTGTTTTTAAAAGTAATTTAGAATTGGATTTCAAGTTGAATTTTGTTGTTATTCTGTTTTTGAGTTGGTCGGATTGAACTAAAATTTGAATTTTTGGTTTTTCGAAAATTTAATTATGAGAGCCAAAGGTCGATTGGAGAGGAGAATGAGTGCATTATTTGTAAGGGTGTGAAAACTTCTATCTAGGAAACGCGTTTTAAAAGTTTTGAGGGGAAACCCAAAAGGAAAAGCCTAGAGAGGAAAAATATTTGGTAGCGGTGGGCTTGGACTCTTACAAATGGTATCAGAGCTAGACACTTGGTGATGTGTCAATGAGGACGCTGAGCCTTGTAGAGGGGAAGACACCAGGCGGTGTGCCAATGAGGATGTTGAGCCTCAGAGGAGGGTAGACACTGCCATTGAGCCCCGGAGGGGGGTGGACACCGCTACGATGTGCCAGCGAGGACGCTGAGCCCCTTGAGAGAGTGGACACTGGGTAGTGTGCTAGCGAGGACGTTAAGCCCCTTAGGAAGGTGGACACTAAGCAGTGTGCCAGCAAGGATGCTGAGCTTCATTGGGGGTGGACACTAGGCGGTGTGCCAACAAGGACATTGAGCCTCGAAGGGGGTGGACACCAGGCGGTGTGCCAGCGAGGATGCTAAGGCTCGTAGGGAATGGACACCAGGCAGTGTGCCAGTGAAGATTCTAAGGCTCGTAGGGGGTGGACAACGAGCGGTGTACTAGCGAGGACATTGAGCCTCGAAAGGAGTGAACACTACCATTGAGCCCTAAAAGGGGTGGACACTAAGCGGTGTACTAGTGAGGACGCTAAGCTCCGTAGGGGGTGAACACCGGGCAGTGTGCCAGCATGGACGTTGAGCCACGTGGGGGGTGGACATCAGGTAGTGTGTCAGCGAGGACATTGTGCTACATAGGGAGTGGACATCAGACAGTGTGTTAGCGAGGACATTGAGACACATAAGAAGTGAACATCAGACAATGTGCAGCAAGGACGTTGAGCCACGTAGAGGATGAACATCGAATAGTGTACTAGTAAGGATGCACGGCCTCAAAGAAAGTCAAGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTAACTCAACCCAACAATTTTTCACTCACCTTTATAAGGACTAATATATAATTTATTCCATAAATAATGTGTTTGTTATAATTGGGAAAGAAATGCTTTATTTTTCAAATGATTTTGTATAAACACTCTTTGAAAAAAAAAAAAAATCAATAATTTAATGTTAATTTATAATAGTTTTATTTATTTATTTGTTCATT
TAGATGACAAAAGTGGGTTTTGAGAAAATATTCATAAAAAAAGTAAAAAAAATAATAAATAATAAATTTATCATTAATATAACAATAAAAAATATTTTAAATACTTAAAAAAAAAATGTAACTTATAATAATATTTTTATTGAAAATCTATATATATATATATCGAGTTGACTTAGTGATGAAAAGAAGCTTAATAAAATAAATCAAAATGTGTTCGAGACTACTTATTTTAAATTTAAAAATATTATTATTTTAATATAGGGACAAAGAAAAAGCTTGAGTGATGCTATGCTCTATCTAAACGACATAGAAGCGTGGAAGGCTAAGCCAAGTGACATAAAACGTATCCTTTATAGGGTAAGTTATTTGATCTTAATTAATTTTTGTTTCTATTTAAATAATATTTTGATTTTGAAGTTTCTTATTTTAAAATATCTTTTTTAGGTCAATATATTTTTTATTTTCAAACTATCATTTAATCTTTATTTATAAAACTTTAATTGTAGTAATAAAAATAGGTTAAAATAAACTTTTGGCAAGAAATTTATTTAATCTCATAATTGTAAAAATAAAAATAAAAAAATTATAAAGAGAACAAAATAAACATAAAAAAAAAAAAATTAAAATAGGTCTGGGTAAGAAAATCCAAGTTTAACCTTTAATTTTTTTAATAAATTATAATTTTTTTTTTAAGAATAAAATTGAAATTTTAAATAGACTATTTTTGAAATGACCATTATAGGCTTCTCATATATTTTTTTTTTTCTATAATAAATTTATAAAAAATGTTAAATTATAAGTTCAAACTTTAAATTTTTAAAGTTGTGTTTAAATTGGTTTGAATTTTTTTAATAAATGTCTAATAGATTTGAACTAAAATGACCCTTTTAACCTTTTATTTATTTATTTATTTATTTTGTTATTATAGCAATAAATTACAACTTTAACTTTTTGAATTTTTTTTAATAAATATCTAATATTTTCCATTAATAATAAAATTTAAAATTTTAAAAATATATTCCACAAGTGAAATGACAATTATAGTCTTCTCATTTTCTTACTTCTATCATAAATTTATAAATTACAAATTCAAACCTTAAATTTTTAAAGTGGTGTGTTTGACTGAGTTCCAATTTTTTTAATAAATGTCTAATAAATTCTAGTAAGAATAATACTAAAAGTTTAAAAGGTATATTTGATAAGTAAAATGACAATTTTAACCTTTTATTTTTCGTTATTATAGCAAAATTACAAGTTTAACCTTTTGAATTTTTTTAATAAATATCTAATATTTTCCATTAAGAATAAAATTTAAAATTTTAAAACTATATTTCATACGTGAAATTGACAATTATAGCCTTCTCATTTTCTTCTTTCTATTCTTTCTATAATAAATTTATTAAAAAAAACAGTTAAATTACAAGTTCAAACCTTAAATTTTTAAAATTGTGCCGTTTGAAATTTTTTAATAAATGTCTAATAGATTCTAATACGAATAATAACCCGTATATTTCATAAATAAAATGACAATTTTAACTTTTTATTTTTCATTATTACAACTACTAATTAAAATATAATAAATTACAAGTTTAACCTTTAAATTTAACCCGATTAAATTTTTTAATAAATATCTAATATTTTCCATTATGAATAAAATTGAAAATTTAAAAAGTATATTTCATAAGTGAAAGGACAATTATAGCCTTTTCCATTAGTTTTTTAATAAATGTCTAATAAATTCTAATACAATAATACCCGAAGTATATTTCATAAGTAAAATGACAATTTTAACCTTTTATTATTAGTTATTATAACAACTAATTAAAATATTAAATTTTTTAATAAATGTGTAATAAATTTAAAAAGTATATTTGATAAGTAAAATGACAATTTTAGCCTTATATTTTTCTTATTATCCATAATAATTTTTTAAAAAAGGCCACTAATTAAGTAATAATTTCATGCAGGCTGTTATATATACCTTATTGGAGAGCAGTATT
GATGCTAAAGATAATTGGAAAATTTTCTCAGACAACATAGGCAGTGGCCCAAACATGATTTGAGTACCATCCTTTTTTATGGGG

mRNA sequence

AACCAACCACCACCCACCGTCATGAACTCCACCGCCCTCCTCCACCGTGGAATCGCCCCAACACTGCCACTCCCGTTGTCGTTCCGACGCAACATGTCTTCTTCCGTTCACCTCGTCCCCAACGAGACCGTTAACAAGCACGACGCTCGGAGGTGTAGTAATAATGGCATCAAAATAGTGTGTCACAGATATAACCGAATGGAGTCGTCCGGTTCTCAGAACGATGGGTTTCCTTTTTTAAACACCAAAGTGGCTATGCAGTCGCTGCTGTGCTTCTCTTTGGATGCGGCGGCGGAGTTCGAAACCAAAGACATAAGTGCTCAGAAGAGAAAAGCTTTACAGGCATTGTTGGCTAATAATCCTGCAGAGGCTGAGAGGATAATGAAGAAGGTGCTCCAAAAATACAAAAATGACAAAGTTGAGATTAAATATGAAGCTACCTTGGCAATGGTTGCAATTCTCATCCACACGGGACAAAGAAAAAGCTTGAGTGATGCTATGCTCTATCTAAACGACATAGAAGCGTGGAAGGCTAAGCCAAGTGACATAAAACGTATCCTTTATAGGGCTGTTATATATACCTTATTGGAGAGCAGTATTGATGCTAAAGATAATTGGAAAATTTTCTCAGACAACATAGGCAGTGGCCCAAACATGATTTGAGTACCATCCTTTTTTATGGGG

Coding sequence (CDS)

ATGAACTCCACCGCCCTCCTCCACCGTGGAATCGCCCCAACACTGCCACTCCCGTTGTCGTTCCGACGCAACATGTCTTCTTCCGTTCACCTCGTCCCCAACGAGACCGTTAACAAGCACGACGCTCGGAGGTGTAGTAATAATGGCATCAAAATAGTGTGTCACAGATATAACCGAATGGAGTCGTCCGGTTCTCAGAACGATGGGTTTCCTTTTTTAAACACCAAAGTGGCTATGCAGTCGCTGCTGTGCTTCTCTTTGGATGCGGCGGCGGAGTTCGAAACCAAAGACATAAGTGCTCAGAAGAGAAAAGCTTTACAGGCATTGTTGGCTAATAATCCTGCAGAGGCTGAGAGGATAATGAAGAAGGTGCTCCAAAAATACAAAAATGACAAAGTTGAGATTAAATATGAAGCTACCTTGGCAATGGTTGCAATTCTCATCCACACGGGACAAAGAAAAAGCTTGAGTGATGCTATGCTCTATCTAAACGACATAGAAGCGTGGAAGGCTAAGCCAAGTGACATAAAACGTATCCTTTATAGGGCTGTTATATATACCTTATTGGAGAGCAGTATTGATGCTAAAGATAATTGGAAAATTTTCTCAGACAACATAGGCAGTGGCCCAAACATGATTTGA

Protein sequence

MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRMESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERIMKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRILYRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI
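
The protein sequence above is the conceptual translation of the CDS. The following sketch shows how such a translation can be reproduced; it assumes the standard nuclear genetic code (NCBI translation table 1), and the function name `translate` is hypothetical. Only the first five codons of the CDS plus a stop codon are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First five codons of the CDS above, followed by a stop codon.
print(translate("ATGAACTCCACCGCCTGA"))
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly against the protein sequence listed above.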
Homology
BLAST of CmaCh18G004700 vs. ExPASy TrEMBL
Match: A0A6J1K2L3 (uncharacterized protein LOC111490483 OS=Cucurbita maxima OX=3661 GN=LOC111490483 PE=4 SV=1)

HSP 1 Score: 419.1 bits (1076), Expect = 1.1e-113
Identity = 213/213 (100.00%), Positives = 213/213 (100.00%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM
Sbjct: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI
Sbjct: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL
Sbjct: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI 214
           YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI
Sbjct: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI 213
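
The percentage on each HSP statistics line is the number of identical positions divided by the alignment length. A small sketch for reading that line and rechecking the percentage follows; the function name `parse_identity` is hypothetical, and the regular expression assumes the exact "Identity = n/m (p%)" layout used on this page.

```python
import re

def parse_identity(line):
    """Return (matches, alignment_length, reported_percent) from an HSP line."""
    m = re.search(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)", line)
    if m is None:
        raise ValueError(f"no identity field in: {line!r}")
    return int(m.group(1)), int(m.group(2)), float(m.group(3))

line = "Identity = 159/210 (75.71%)"
matches, length, reported = parse_identity(line)
recomputed = 100.0 * matches / length
print(f"{recomputed:.2f}", reported)
```

Alignment length includes gap columns, which is why hits against more distant homologs can report a denominator larger than the 213-residue query.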

BLAST of CmaCh18G004700 vs. ExPASy TrEMBL
Match: A0A6J1GTH1 (uncharacterized protein LOC111457324 OS=Cucurbita moschata OX=3662 GN=LOC111457324 PE=4 SV=1)

HSP 1 Score: 312.0 bits (798), Expect = 1.9e-81
Identity = 159/210 (75.71%), Positives = 173/210 (82.38%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPT PLPLSFRRN+SSSVHLVPNE VNKHDARRC+NNGIKIVCH +N  
Sbjct: 1   MNSTALLHRGIAPTPPLPLSFRRNVSSSVHLVPNEAVNKHDARRCTNNGIKIVCHVFNPK 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           E S S NDGFP LNTK AMQSLLC  LDAAAE ETKDISA KRKALQALLANNP+ AE+I
Sbjct: 61  EPSSSHNDGFPLLNTKEAMQSLLCLPLDAAAEGETKDISANKRKALQALLANNPSGAEKI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MK VLQ Y+ D ++ KYEA LA V ILIHTG R+SL DA+ +LN IE W  KPSD+KRIL
Sbjct: 121 MKNVLQTYEKDNMQTKYEAILATVVILIHTGGRESLGDAIHHLNKIEEWDNKPSDVKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
           YRAVIYTLL+   +AK NWK FSD IG GP
Sbjct: 181 YRAVIYTLLDCDSEAKTNWKTFSDQIGKGP 210

BLAST of CmaCh18G004700 vs. ExPASy TrEMBL
Match: A0A0A0LXK7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G441330 PE=4 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 3.0e-26
Identity = 92/234 (39.32%), Positives = 132/234 (56.41%), Query Frame = 0

Query: 1   MNSTALLHRGIA--PTLPLPLSF---------------RRNMSSSVHLVPNETVNKHDAR 60
           M+STA+LHRG +  P  PLP +                  N  SSVHL+  ++ + +++R
Sbjct: 1   MDSTAVLHRGFSAPPRPPLPTTTPSSVLRPQLALFTFPSNNALSSVHLLLKKSNDNYNSR 60

Query: 61  RCS----NNGIKIVCHRYNRMESSGSQNDG--FPFLNTKVAMQSLLCFSLDAAAEFETKD 120
             S    NN I I C   +    SGS NDG  FP  N ++A++SLLCFS  +     T  
Sbjct: 61  YGSLNKNNNVIDIQCTNLSTSVPSGS-NDGRNFPVANARLALKSLLCFSYSSKKADHTNF 120

Query: 121 ISAQKRKALQALLANNPAEAERIMKKVLQKYKND-KVEIKYEATLAMVAILIHTGQRKSL 180
           ++ QKRKAL ALL  NP EAE+I+K V  KY+N+   +I+YEA +A++ ILIH G  +SL
Sbjct: 121 LNNQKRKALLALLDQNPKEAEKIIKIVESKYRNNSNKQIQYEAKMAIIQILIHMGTPESL 180

Query: 181 SDAMLYLNDIEAWKAKPSDIKRILYRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
             A+   ++I   + +PSD    LY AVI TL+  + +  D+WK + + I S P
Sbjct: 181 RWAVKKYSEITNMEERPSDATIFLYNAVIRTLIGDNKNIADHWKAYVNVITSDP 233

BLAST of CmaCh18G004700 vs. ExPASy TrEMBL
Match: A0A6J1CHE7 (uncharacterized protein LOC111010906 OS=Momordica charantia OX=3673 GN=LOC111010906 PE=4 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 3.0e-26
Identity = 87/212 (41.04%), Positives = 124/212 (58.49%), Query Frame = 0

Query: 7   LHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRMESSGS- 66
           L RGIAPT PLPL+  RN+S      P ++  KH   R +     I C R  R   SGS 
Sbjct: 10  LRRGIAPTPPLPLAAFRNVS------PIQSNPKH-RNRLNRVETIIRCGRVTRPRRSGSD 69

Query: 67  QNDGFPFLNTKVAMQSLLCFSLD----AAAEFETKDISAQKRKALQALLANNPAEAERIM 126
           ++ GFP  NT  A+++LLCF++     AA       I+ +K++ALQ L+A N  EAE IM
Sbjct: 70  RSGGFPLPNTTTALKTLLCFTVGTDGWAAPAEHLTAINRKKKEALQKLMAENCVEAENIM 129

Query: 127 KKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAK---PSDIKR 186
            +V ++YK D  + +Y+A LA+V  LIH G  +S   A+ +LND+E    K   PSD K 
Sbjct: 130 MRVYEEYKWDNPQTRYDAALALVEFLIHRGTNESWEKAVGHLNDLEHMTMKENLPSDAKL 189

Query: 187 ILYRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
            LY A++ TLL+   +AK +W  ++ +IG+GP
Sbjct: 190 PLYWAILLTLLDDR-EAKKSWSYYTSHIGTGP 213

BLAST of CmaCh18G004700 vs. ExPASy TrEMBL
Match: A0A1S3BA19 (uncharacterized protein LOC103487676 OS=Cucumis melo OX=3656 GN=LOC103487676 PE=4 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 1.5e-20
Identity = 92/240 (38.33%), Positives = 133/240 (55.42%), Query Frame = 0

Query: 1   MNSTALLHRGIA----PTLP-----------LP-LSFRRNMSSSVHLVPNET----VNKH 60
           M+STA+LHRG +    P LP           LP L+FR N+ SSVHL+  ++    +N++
Sbjct: 1   MDSTAVLHRGFSAAPQPRLPTTRPSVAGRQLLPLLTFRCNV-SSVHLLVKKSNDNCINRY 60

Query: 61  DAR---RCSNNG-IKIVCHRYNRMESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETK 120
            +R   + +NNG I I C R + +      +  FP  N ++A+QSLLCFS     + +T 
Sbjct: 61  GSRLNKKNNNNGIIDIQCGRESEIPWGDKDDANFPSPNARLALQSLLCFS--PKDDTDTN 120

Query: 121 DISAQKRKALQALLANNPAE----AERIMKKVLQKYKNDKVE-IKYEATLAMVAILIHTG 180
            I+  K  ALQALL   P E    A  IM KV +KY+ND+ + I+YEA +A + ILIH G
Sbjct: 121 TIAKAKISALQALLNKRPNEKRDRATEIMNKVYEKYRNDRNKYIQYEAKMAFIQILIHVG 180

Query: 181 QRKSLSDAMLYLNDIEAWKAKPSDIKRILYRAVIYTLLESSIDAKDNWKIFSDNIGSGPN 212
             KS S A   L ++E  + +PSD    LY+AVI TLL +    K +W  +++     P+
Sbjct: 181 TNKSWSRAHEVLREVEK-QERPSDATFHLYKAVISTLLRAQ-SPKVHWDNYTNLFDDDPS 235

BLAST of CmaCh18G004700 vs. NCBI nr
Match: XP_022994895.1 (uncharacterized protein LOC111490483 [Cucurbita maxima])

HSP 1 Score: 419.1 bits (1076), Expect = 2.3e-113
Identity = 213/213 (100.00%), Positives = 213/213 (100.00%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM
Sbjct: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI
Sbjct: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL
Sbjct: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI 214
           YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI
Sbjct: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGPNMI 213

BLAST of CmaCh18G004700 vs. NCBI nr
Match: XP_023542455.1 (uncharacterized protein LOC111802357 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 327.0 bits (837), Expect = 1.2e-85
Identity = 166/210 (79.05%), Positives = 181/210 (86.19%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPT PLPLSFRRN+SSSVHLVPNETVNKHDARRC+NNGIKIVCH +N  
Sbjct: 1   MNSTALLHRGIAPTPPLPLSFRRNVSSSVHLVPNETVNKHDARRCTNNGIKIVCHVFNPK 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           E S S N GFP LNTKVAMQSLLC  LDAAAE ETKDISA KRKALQ LLANNP+ AE+I
Sbjct: 61  EPSSSHNHGFPLLNTKVAMQSLLCLPLDAAAEGETKDISANKRKALQELLANNPSGAEKI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MKK+LQ+Y+ND ++ KYEATLAMVAILIHTG  +SL +A+ YLN++EAW  KPSD KRIL
Sbjct: 121 MKKLLQRYENDNMQTKYEATLAMVAILIHTGGVESLRNAIRYLNELEAWNNKPSDAKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
           YRAVIYTLLES IDAK NWK FSD IG GP
Sbjct: 181 YRAVIYTLLESQIDAKTNWKTFSDQIGKGP 210

BLAST of CmaCh18G004700 vs. NCBI nr
Match: KAG6573362.1 (hypothetical protein SDJN03_27249, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 315.1 bits (806), Expect = 4.7e-82
Identity = 160/210 (76.19%), Positives = 178/210 (84.76%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPT PLPLSFRRN+SSSVHLVPN+TVNKHDARRC+NNGIKIVCH +N  
Sbjct: 1   MNSTALLHRGIAPTPPLPLSFRRNVSSSVHLVPNKTVNKHDARRCTNNGIKIVCHVFNPK 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           E S S N+GFP L+TKVAMQSLLC  LDAAAE ETKDISA KR+AL+ALLANNP+ AE I
Sbjct: 61  EPSSSHNNGFPLLDTKVAMQSLLCLPLDAAAEGETKDISANKREALRALLANNPSRAENI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MKKVLQ Y+N  ++ KY+ATLAMVAILIHTG R+SL  A+ +LN IEAW  KPSD+KRIL
Sbjct: 121 MKKVLQTYENGNMQTKYDATLAMVAILIHTGGRESLGKAIEHLNTIEAWDNKPSDVKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
           YRAVIYTLLE   +AK NWK FSD IG GP
Sbjct: 181 YRAVIYTLLERDTEAKTNWKTFSDQIGKGP 210

BLAST of CmaCh18G004700 vs. NCBI nr
Match: XP_022955331.1 (uncharacterized protein LOC111457324 [Cucurbita moschata])

HSP 1 Score: 312.0 bits (798), Expect = 4.0e-81
Identity = 159/210 (75.71%), Positives = 173/210 (82.38%), Query Frame = 0

Query: 1   MNSTALLHRGIAPTLPLPLSFRRNMSSSVHLVPNETVNKHDARRCSNNGIKIVCHRYNRM 60
           MNSTALLHRGIAPT PLPLSFRRN+SSSVHLVPNE VNKHDARRC+NNGIKIVCH +N  
Sbjct: 1   MNSTALLHRGIAPTPPLPLSFRRNVSSSVHLVPNEAVNKHDARRCTNNGIKIVCHVFNPK 60

Query: 61  ESSGSQNDGFPFLNTKVAMQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERI 120
           E S S NDGFP LNTK AMQSLLC  LDAAAE ETKDISA KRKALQALLANNP+ AE+I
Sbjct: 61  EPSSSHNDGFPLLNTKEAMQSLLCLPLDAAAEGETKDISANKRKALQALLANNPSGAEKI 120

Query: 121 MKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRIL 180
           MK VLQ Y+ D ++ KYEA LA V ILIHTG R+SL DA+ +LN IE W  KPSD+KRIL
Sbjct: 121 MKNVLQTYEKDNMQTKYEAILATVVILIHTGGRESLGDAIHHLNKIEEWDNKPSDVKRIL 180

Query: 181 YRAVIYTLLESSIDAKDNWKIFSDNIGSGP 211
           YRAVIYTLL+   +AK NWK FSD IG GP
Sbjct: 181 YRAVIYTLLDCDSEAKTNWKTFSDQIGKGP 210

BLAST of CmaCh18G004700 vs. NCBI nr
Match: KAG7012526.1 (hypothetical protein SDJN02_25278 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 183.7 bits (465), Expect = 1.6e-42
Identity = 95/132 (71.97%), Positives = 107/132 (81.06%), Query Frame = 0

Query: 79  MQSLLCFSLDAAAEFETKDISAQKRKALQALLANNPAEAERIMKKVLQKYKNDKVEIKYE 138
           MQSLLC  LDAAAE ETKDISA KR+AL+ALLANNP+ AE IMKKVLQ Y+N  ++ KY+
Sbjct: 1   MQSLLCLPLDAAAEGETKDISANKREALRALLANNPSRAENIMKKVLQTYENGNMQTKYD 60

Query: 139 ATLAMVAILIHTGQRKSLSDAMLYLNDIEAWKAKPSDIKRILYRAVIYTLLESSIDAKDN 198
           ATLAMVAILIH G R+SL  A+ +LN IEAW  KPSD+KRILYRAVIYTLLE   +AK N
Sbjct: 61  ATLAMVAILIHMGGRESLGKAIEHLNTIEAWDNKPSDVKRILYRAVIYTLLERDTEAKTN 120

Query: 199 WKIFSDNIGSGP 211
           WK FSD IG GP
Sbjct: 121 WKTFSDQIGKGP 132

BLAST of CmaCh18G004700 vs. TAIR 10
Match: AT2G34540.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G34530.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 53.1 bits (126), Expect = 3.1e-07
Identity = 35/113 (30.97%), Positives = 63/113 (55.75%), Query Frame = 0

Query: 97  DISAQKRKALQALLANNPAEAERIMKKVLQKYKNDKVEIKYEATLAMVAILIHTGQRKSL 156
           DI + K +A++ +      EA ++++    +Y+N+  E  +   +A+V ILI   + +  
Sbjct: 173 DIDSIKMEAVRKMKEGKCEEAVQLLRDANMRYRNEP-EANFNVQMALVEILILLERYQEA 232

Query: 157 SDAMLYLNDIEAWKAKPSDIKRILYRAVIYTLLESSIDAKDNWKIFSDNIGSG 210
           ++    LND     A+ SD++  LY+A+IYT+L+   +AK  WK F  +IG G
Sbjct: 233 AEYSC-LND---ENAQISDVRIPLYKAIIYTMLDKDTEAKQCWKEFRKSIGEG 280

The following BLAST results are available for this feature:
Match Name       E-value    Identity  Description
A0A6J1K2L3       1.1e-113   100.00    uncharacterized protein LOC111490483 OS=Cucurbita maxima OX=3661 GN=LOC111490483...
A0A6J1GTH1       1.9e-81    75.71     uncharacterized protein LOC111457324 OS=Cucurbita moschata OX=3662 GN=LOC1114573...
A0A0A0LXK7       3.0e-26    39.32     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G441330 PE=4 SV=1
A0A6J1CHE7       3.0e-26    41.04     uncharacterized protein LOC111010906 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A1S3BA19       1.5e-20    38.33     uncharacterized protein LOC103487676 OS=Cucumis melo OX=3656 GN=LOC103487676 PE=...

Match Name       E-value    Identity  Description
XP_022994895.1   2.3e-113   100.00    uncharacterized protein LOC111490483 [Cucurbita maxima]
XP_023542455.1   1.2e-85    79.05     uncharacterized protein LOC111802357 [Cucurbita pepo subsp. pepo]
KAG6573362.1     4.7e-82    76.19     hypothetical protein SDJN03_27249, partial [Cucurbita argyrosperma subsp. sorori...
XP_022955331.1   4.0e-81    75.71     uncharacterized protein LOC111457324 [Cucurbita moschata]
KAG7012526.1     1.6e-42    71.97     hypothetical protein SDJN02_25278 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name       E-value    Identity  Description
AT2G34540.2      3.1e-07    30.97     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description         Alignment
None      No IPR available  PANTHER  PTHR36350:SF2  PROTEIN, PUTATIVE-RELATED  coord: 58..208
None      No IPR available  PANTHER  PTHR36350      TRANSMEMBRANE PROTEIN      coord: 58..208

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh18G004700.1  CmaCh18G004700.1  mRNA