CmaCh04G007380 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G007380
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Mucin-2
Location: Cma_Chr04: 3751321 .. 3753868 (-)
RNA-Seq Expression: CmaCh04G007380
Synteny: CmaCh04G007380
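The Location field packs the chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be split into parts and the span length derived. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cma_Chr04: 3751321 .. 3753868 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.match(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr04: 3751321 .. 3753868 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2548 bases on the minus strand of Cma_Chr04.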
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCTCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTTCTATGAACACTATCAGCGAATCCCTGGCGTCGATCAATGAGAGAGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTACGGCCGCCGATGTGATCGCCACCGTTGATCATCGGTTTCCTCGGGATACTGCCGTCCAGGTATTCGTCATATCACTTCACCTTTTAATCAATCATTTGGAATTTTAGGTGTTAGTGTTAGTTACCTATGGATTGAGATACTGGCCTTGTTTTTGGAAATTATTGCTCGTTCTTGATTCCGTTAGGTATTAGGATGTAGTTGATTTTGGAGGGATGATATATAGAATATAGCGGAATGGAATCTGTGTTGTTTGTTGTGGATTGCGGAATTCTATATAAGGCTTTTAATTTCCCCCTTTTTCTGCTACTTCTGTTTGTTTGCTTTGAAACTGTGAATCGTCACATGGATTCAAGCGAAAAAGGGAATCACGAGTTTCGTTTTCCCTCTTTTTCTCCTTTTGGATTTCTTCTCGCATATGATTATTAGATTTTTAAGGTTTGGAAAGTACCAAGATTAGTGTATTGGATTGGACGACTGTGTCCGGAAGCAATGCGCTGTTCCCAACCAATTTTGTTTGTCTTTTTGTAATATAACACTGCCCTTTTTCTTATCAGAAAGGCATCTCTCCATTCTTCTTTTTATCAAAAAGAGATGCAGAGAGGTCTTTTTTTTTTAAAATGTTCTTCTATATAGAGGAATAGTAATGCTTTTTATATAAAGATTCCAACAAATAGGACTCTCATTTGCCCAAAATCTTTGCACTGTTCTGTACTGATGTCATTATCACTTTCCTCCTTTTCTGTTTGTTTGATTACCCCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAGACAGAGGAAACGAATTGGGCATGCCGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTATGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCGAACCTTCCTAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCAGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGATTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAATCATAGATTCTCATTTGAGTTATCTGACGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATAGTATAGAAGAAAAGGCAGCAGACGGTGA
AGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGCAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCATGTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATTGCTAGAGGACTGGTGAGCTTTGAAGGTAAAAAAGAGGACAAATCATGAAAAGAGAAAAACCAGAAGCCATATTATTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAATCTGTAGTCGACAATGGGCCCTATTAACAAACAGTAGTGGCTCCTCACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGAAAGTGTGTAAATATGGTAATAAAAATTGTTTTTATCTTTTGACAGC

mRNA sequence

TCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCTCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTTCTATGAACACTATCAGCGAATCCCTGGCGTCGATCAATGAGAGAGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTACGGCCGCCGATGTGATCGCCACCGTTGATCATCGGTTTCCTCGGGATACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAGACAGAGGAAACGAATTGGGCATGCCGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTATGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCGAACCTTCCTAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCAGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGATTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAATCATAGATTCTCATTTGAGTTATCTGACGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATAGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGCAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCATGTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATTGCTAGAGGACTGGTGAGCTTTGAAGGTAAAAAAGAGGACAAATCATGAAAAGAGAAAAACCAGAAGCCATATTATTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAATCTGTAGTCGACAATGGGCCCTATTAACAAACAGTAGTGGCTCCTCACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGAAAGTGTGTAAATATGGTAATAAAAATTGTTTTTATCTTTTGACAGC

Coding sequence (CDS)

ATGAGAGAGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTACGGCCGCCGATGTGATCGCCACCGTTGATCATCGGTTTCCTCGGGATACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAGACAGAGGAAACGAATTGGGCATGCCGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTATGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCGAACCTTCCTAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCAGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGATTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAATCATAGATTCTCATTTGAGTTATCTGACGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATAGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGCAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA

Protein sequence

MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
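The protein sequence above corresponds to the coding sequence read codon by codon. As a rough sketch, assuming the standard genetic code applies (the page does not name the translation table), the relationship can be checked on short fragments. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
# Assumption: this gene is translated with the standard table.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last six codons of the CDS listed above.
print(translate("ATGAGAGAGATGAGGCGGCGT"))
print(translate("ATGGCGCAGCAAAGATGA"))
```

The two fragments give `MREMRRR` and `MAQQR`, matching the start and end of the listed protein sequence.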
Homology
BLAST of CmaCh04G007380 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 9.9e-32
Identity = 128/316 (40.51%), Positives = 157/316 (49.68%), Query Frame = 0

Query: 49  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVL 108
           Q++RWG C  ++ CF S +  KRI  A  +PE      S    AHQ    N+  +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 109 PFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGP-SSIFAIGPFAHETQLV 168
              APPSSPASF  S  PS TQSP+    + SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 169 SPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDD 228
           SPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 229 FQSYQFYPGSPVSNLISPRSAISLSGASSPLP--------------DLDFASSASQFSNF 288
             +Y  YPGSP S L SP S  S  G  SP                D +  S+  Q SNF
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESNF 246

Query: 289 SLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM 340
                 A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Sbjct: 247 FCPETFAKFYLDH-----DPSVPQNGGRLSVSKDSDVYP-TNGYGNGNQNRQNRSPKQDM 306
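Each HSP summary line reports counts as matches over alignment length, with the percentage in parentheses. The sketch below recomputes those percentages from the counts; the function name `hsp_percentages` is hypothetical, and the parser only looks for `label = N/M` pairs rather than implementing a full BLAST report parser.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'label = matches/length' pairs on an HSP line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", summary_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 128/316 (40.51%), Positives = 157/316 (49.68%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Swiss-Prot hit above, 128 identical positions over a 316-column alignment gives 40.51%, and 157 positive-scoring positions gives 49.68%, in agreement with the reported values.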

BLAST of CmaCh04G007380 vs. ExPASy TrEMBL
Match: A0A6J1IUL0 (uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC111480076 PE=4 SV=1)

HSP 1 Score: 892.9 bits (2306), Expect = 5.7e-256
Identity = 457/457 (100.00%), Positives = 457/457 (100.00%), Query Frame = 0

Query: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60
           MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY
Sbjct: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60

Query: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120
           WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT
Sbjct: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120

Query: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180
           QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE
Sbjct: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180

Query: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240
           SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS
Sbjct: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240

Query: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300
           LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Sbjct: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300

Query: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360
           NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS
Sbjct: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360

Query: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420
           SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS
Sbjct: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420

Query: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
Sbjct: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 457

BLAST of CmaCh04G007380 vs. ExPASy TrEMBL
Match: A0A6J1FSP7 (uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111446946 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 5.9e-245
Identity = 441/464 (95.04%), Positives = 446/464 (96.12%), Query Frame = 0

Query: 1   MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRW 60
           MR MRRRADADADAD       ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRW
Sbjct: 1   MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRW 60

Query: 61  GSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQ 120
           GSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSP SFLQ
Sbjct: 61  GSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQ 120

Query: 121 SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPST 180
           SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPST
Sbjct: 121 SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST 180

Query: 181 PSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI 240
           PSFTPPESIHLTTPSSPEVPFAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI
Sbjct: 181 PSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI 240

Query: 241 SPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS 300
           SPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Sbjct: 241 SPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS 300

Query: 301 NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLE 360
           NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLE
Sbjct: 301 NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLE 360

Query: 361 SNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFN 420
           SNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFN
Sbjct: 361 SNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFN 420

Query: 421 FDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           FDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Sbjct: 421 FDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 464

BLAST of CmaCh04G007380 vs. ExPASy TrEMBL
Match: A0A6J1FP20 (uncharacterized protein At1g76660-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111446946 PE=4 SV=1)

HSP 1 Score: 852.4 bits (2201), Expect = 8.5e-244
Identity = 438/457 (95.84%), Positives = 443/457 (96.94%), Query Frame = 0

Query: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60
           MR MRRRADADA   ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIY
Sbjct: 1   MRAMRRRADADA---ADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIY 60

Query: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120
           WCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSP SFLQSEPPSAT
Sbjct: 61  WCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSAT 120

Query: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180
           QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPE
Sbjct: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 180

Query: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240
           SIHLTTPSSPEVPFAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS
Sbjct: 181 SIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240

Query: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300
           LSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Sbjct: 241 LSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300

Query: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360
           +PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVAS
Sbjct: 301 DPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVAS 360

Query: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420
           SPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS
Sbjct: 361 SPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420

Query: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           NALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Sbjct: 421 NALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 454

BLAST of CmaCh04G007380 vs. ExPASy TrEMBL
Match: A0A6J1C828 (uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC111008285 PE=4 SV=1)

HSP 1 Score: 666.4 bits (1718), Expect = 8.6e-188
Identity = 361/470 (76.81%), Positives = 395/470 (84.04%), Query Frame = 0

Query: 4   MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCF 63
           MRRR   DADADADL P+NNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCF
Sbjct: 1   MRRR--PDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCF 60

Query: 64  GSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSP 123
           GSL+QRKRIGHAVLVPEPSPS E  +N+LQSPDIVLPFAAPPSSP SFLQSEPPSATQSP
Sbjct: 61  GSLKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSP 120

Query: 124 SNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIH 183
           + ILSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPP+NFST+TT+PST  FTPPESIH
Sbjct: 121 TAILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIH 180

Query: 184 LTTPSSPEVPFAQFLQPNLPKAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLS 243
           LTTPSSPEVPFAQ+LQP+  K ESD QY   PNDDFQSYQFYPGSPVS+LISPRS IS S
Sbjct: 181 LTTPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRS 240

Query: 244 GASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSND 303
           GASSPLPD DF  S S FSNF ++VPP LLNLD       R  QSSDSCTQNSVG+KS+ 
Sbjct: 241 GASSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSS- 300

Query: 304 DDFDLNPRTSDSM------NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVES 363
           +DF LNP+TS+S+      NE  NIQIL DGSQ +E    NHRFSFELSDED+LL++VE+
Sbjct: 301 NDFVLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVEN 360

Query: 364 KPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEE-KAADGEEANQHQE-HHHSTTL 423
           KPLESN +AVASSP+HE  ETAKETS  GGH+SN  EE + ADGEE + HQE  HHS TL
Sbjct: 361 KPLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTL 420

Query: 424 GSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQ 457
           G+V EFNFDNGNG + LKPNINS WWAN KDAET+GTTTGAWSFFP+ QQ
Sbjct: 421 GTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 467

BLAST of CmaCh04G007380 vs. ExPASy TrEMBL
Match: A0A5D3CYQ2 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 SV=1)

HSP 1 Score: 645.2 bits (1663), Expect = 2.1e-181
Identity = 353/471 (74.95%), Positives = 383/471 (81.32%), Query Frame = 0

Query: 4   MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCF 63
           MRRR D D     D RP+NNTFQTIT AAD IATVDHRFPR TAVQKRRWGSC SIYWCF
Sbjct: 1   MRRRTDTD-----DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCF 60

Query: 64  GSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSP 123
           GSL+QRKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSP S LQSEPPSA QSP
Sbjct: 61  GSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSP 120

Query: 124 SNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIH 183
           + ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPP+NFSTLTTEPSTP FTPPESIH
Sbjct: 121 TALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIH 180

Query: 184 LTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSG 243
           LTTPSSPEVPFAQF+ P+L K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SG
Sbjct: 181 LTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSG 240

Query: 244 ASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDD 303
           ASSPLPD DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+ FKS+ +
Sbjct: 241 ASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSS-N 300

Query: 304 DFDLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVE 363
           DF LNP TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D L ++V 
Sbjct: 301 DFVLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVG 360

Query: 364 SKPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEEKA-ADGEEANQHQEHHHSTTL 423
           SKPLESN + V SSP+HE FET KE S  G H+SN IEEK  ADG+EA+QHQE HHS  L
Sbjct: 361 SKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQE-HHSVAL 420

Query: 424 GSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           GSV EFNFDN NGS+   P INSDWW NAKD  T+GTTTGAWSFFP  QQR
Sbjct: 421 GSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of CmaCh04G007380 vs. NCBI nr
Match: XP_022980796.1 (uncharacterized protein At1g76660-like [Cucurbita maxima])

HSP 1 Score: 892.9 bits (2306), Expect = 1.2e-255
Identity = 457/457 (100.00%), Positives = 457/457 (100.00%), Query Frame = 0

Query: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60
           MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY
Sbjct: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60

Query: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120
           WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT
Sbjct: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120

Query: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180
           QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE
Sbjct: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180

Query: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240
           SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS
Sbjct: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240

Query: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300
           LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Sbjct: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300

Query: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360
           NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS
Sbjct: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360

Query: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420
           SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS
Sbjct: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420

Query: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
Sbjct: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 457

BLAST of CmaCh04G007380 vs. NCBI nr
Match: XP_023529207.1 (uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023529208.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 865.1 bits (2234), Expect = 2.6e-247
Identity = 445/457 (97.37%), Positives = 447/457 (97.81%), Query Frame = 0

Query: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60
           MR MRRRADADADA ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIY
Sbjct: 1   MRAMRRRADADADA-ADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIY 60

Query: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120
           WCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSP SFLQSEPPSAT
Sbjct: 61  WCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSAT 120

Query: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180
           QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPE
Sbjct: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 180

Query: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240
           SIHLTTPSSPEVPFAQFLQP L KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS
Sbjct: 181 SIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240

Query: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300
           LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Sbjct: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300

Query: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360
           NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS
Sbjct: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360

Query: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420
           SPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS
Sbjct: 361 SPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420

Query: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           NALKPNINSDWWANAKD ETKGTTTGAWSFFPMAQQR
Sbjct: 421 NALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 456

BLAST of CmaCh04G007380 vs. NCBI nr
Match: XP_023522163.1 (uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo] >XP_023529173.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 860.9 bits (2223), Expect = 4.9e-246
Identity = 443/457 (96.94%), Positives = 445/457 (97.37%), Query Frame = 0

Query: 1   MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIY 60
           MR MRRRADADA   ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIY
Sbjct: 1   MRAMRRRADADA---ADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIY 60

Query: 61  WCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSAT 120
           WCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSP SFLQSEPPSAT
Sbjct: 61  WCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSAT 120

Query: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPE 180
           QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPE
Sbjct: 121 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 180

Query: 181 SIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240
           SIHLTTPSSPEVPFAQFLQP L KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS
Sbjct: 181 SIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 240

Query: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300
           LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Sbjct: 241 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL 300

Query: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360
           NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS
Sbjct: 301 NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVAS 360

Query: 361 SPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420
           SPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS
Sbjct: 361 SPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGS 420

Query: 421 NALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           NALKPNINSDWWANAKD ETKGTTTGAWSFFPMAQQR
Sbjct: 421 NALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 454

BLAST of CmaCh04G007380 vs. NCBI nr
Match: XP_022941648.1 (uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 856.3 bits (2211), Expect = 1.2e-244
Identity = 441/464 (95.04%), Positives = 446/464 (96.12%), Query Frame = 0

Query: 1   MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRW 60
           MR MRRRADADADAD       ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRW
Sbjct: 1   MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRW 60

Query: 61  GSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQ 120
           GSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSP SFLQ
Sbjct: 61  GSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQ 120

Query: 121 SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPST 180
           SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPST
Sbjct: 121 SEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST 180

Query: 181 PSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI 240
           PSFTPPESIHLTTPSSPEVPFAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI
Sbjct: 181 PSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLI 240

Query: 241 SPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS 300
           SPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Sbjct: 241 SPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS 300

Query: 301 NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLE 360
           NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLE
Sbjct: 301 NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLE 360

Query: 361 SNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFN 420
           SNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFN
Sbjct: 361 SNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFN 420

Query: 421 FDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           FDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Sbjct: 421 FDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 464

BLAST of CmaCh04G007380 vs. NCBI nr
Match: KAG6600562.1 (hypothetical protein SDJN03_05795, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 856.3 bits (2211), Expect = 1.2e-244
Identity = 440/458 (96.07%), Positives = 443/458 (96.72%), Query Frame = 0

Query: 1   MREMRRRADADADAD-ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSI 60
           MR MRRRADADADAD ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSI
Sbjct: 1   MRAMRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSI 60

Query: 61  YWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSA 120
           YWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPD VLPFAAPPSSP SFLQSEPPS 
Sbjct: 61  YWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFAAPPSSPVSFLQSEPPSV 120

Query: 121 TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP 180
           TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPP
Sbjct: 121 TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP 180

Query: 181 ESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAI 240
           ESIHLTTPSSPEVPFAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAI
Sbjct: 181 ESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAI 240

Query: 241 SLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFD 300
           SLSGASSPLPDLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD D
Sbjct: 241 SLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLD 300

Query: 301 LNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVA 360
           LNPRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVA
Sbjct: 301 LNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVA 360

Query: 361 SSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNG 420
           SSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNG
Sbjct: 361 SSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNG 420

Query: 421 SNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR 458
           SNALKPNINSDWWANAKD ETKGTT GAWSFFPMAQQR
Sbjct: 421 SNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR 458

BLAST of CmaCh04G007380 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 218.8 bits (556), Expect = 9.2e-57
Identity = 175/463 (37.80%), Positives = 233/463 (50.32%), Query Frame = 0

Query: 21  MNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE 80
           +NN+ +T+  AA  I T + R  + ++ QK RWG CWS+Y CFG+ +  KRIG+AVLVPE
Sbjct: 5   VNNSVETVNAAATAIVTAESRV-QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPE 64

Query: 81  PSPS---PEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMY 140
           P  S       QNS  S  +VLPF APPSSPASFLQS+P S + SP   L   SLT+N +
Sbjct: 65  PVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL---SLTSNTF 124

Query: 141 SPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFA 200
           SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFA
Sbjct: 125 SPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 201 QFLQPNLPKAESD------DQYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPL 260
           Q L  +L     D       ++S  + +F+S Q  PGSP   NLISP S IS SG SSP 
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 261 PDLDFASSASQFSNFSLDVPPALLNLD---------RQGQSSDSCTQNSVGFKSN----- 320
           P        S    F +  PP  L  +         R G  S +   +  G  S      
Sbjct: 245 P------GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPN 304

Query: 321 -----DDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVES 380
                  +   N  T    N+   +  L +     E  V +HR SFEL+ ED + R + S
Sbjct: 305 GPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELTGED-VARCLAS 364

Query: 381 KPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSV 440
           K   S+  + ++   ET E      S       +IE+++ D E      +   S+++GS 
Sbjct: 365 KLNRSHDRMNNNDRIETEE------SSSTDIRRNIEKRSGDRENEQHRIQKLSSSSIGSS 424

Query: 441 NEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFP 453
            EF FD                  N KD   +     +WSFFP
Sbjct: 425 KEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of CmaCh04G007380 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 198.7 bits (504), Expect = 9.8e-51
Identity = 138/291 (47.42%), Positives = 170/291 (58.42%), Query Frame = 0

Query: 22  NNTFQTITTAADVIATVDHRFPRDTAV-QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE 81
           NN F TI  AA  IA+ D R  + + + +KR+W + WS+  CFGS RQRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 82  P----SPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANM 141
           P    S +     +  +S    LPF APPSSPASF QSEPPSATQSP  ILSF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 142 YSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPE 201
                  SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 202 VPFAQFLQPNLPKAESDDQYSCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPD 261
           VPFAQ    N        ++   +  +FQ YQ  PGSP+  LISP      SG +SP PD
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPSPG---SGPTSPFPD 247

Query: 262 LDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLN 302
                  S F +F +  PP LL+    G ++  C +  +        FDL+
Sbjct: 248 ----GETSLFPHFQVSDPPKLLSPKTAGVTT-PCKEQKIVRPHKPVSFDLD 284

BLAST of CmaCh04G007380 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 182.2 bits (461), Expect = 9.5e-46
Identity = 165/465 (35.48%), Positives = 225/465 (48.39%), Query Frame = 0

Query: 22  NNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEP 81
           N++  T+  AA  I + + R  + ++VQK+R GS WS+YWCFGS +  KRIGHAVLVPEP
Sbjct: 6   NSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEP 65

Query: 82  SPSPEA----HQNSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMY 141
           + S  A      +S  S  I +PF APPSSPASFL S PPSA+ +P   L   SLT N  
Sbjct: 66  AASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLTVN-- 125

Query: 142 SPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQF 201
               P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ 
Sbjct: 126 ---EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVPFAQL 185

Query: 202 LQPNLPKAE------SDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDL 261
           L  +L +A        + ++S  + +F+S Q YPGSP  NLISP      SG SSP P  
Sbjct: 186 LTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SGTSSPYP-- 245

Query: 262 DFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLNP---R 321
                      F +  PP  L  +     + G    S +    G  S      L P   +
Sbjct: 246 ----GKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGSK 305

Query: 322 TSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPM 381
            +  +      + +I  S      +       ++S+  SL  +       ++ A+   P 
Sbjct: 306 LTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDEALV-VPH 365

Query: 382 HETFE---------TAKETSSGGGHSSNSIEEK-----AADGEEANQHQEHHHSTTLGSV 441
             +FE          A + +  G H   S E          GE  ++  +   S + GS 
Sbjct: 366 RVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCCKTSGETESEQSQKLRSFSTGSN 425

Query: 442 NEFNFDNGNGSNALKPNINSDWWANAKDA-ETKGTTTGAWSFFPM 454
            EF FD+ N    +   I S+WWAN K A +   +   +W+FFP+
Sbjct: 426 KEFKFDSTN--EEMIEKIRSEWWANEKVAGKGDHSPRNSWTFFPV 443

BLAST of CmaCh04G007380 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 139.4 bits (350), Expect = 7.1e-33
Identity = 128/316 (40.51%), Positives = 157/316 (49.68%), Query Frame = 0

Query: 49  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVL 108
           Q++RWG C  ++ CF S +  KRI  A  +PE      S    AHQ    N+  +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 109 PFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGP-SSIFAIGPFAHETQLV 168
              APPSSPASF  S  PS TQSP+    + SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 169 SPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDD 228
           SPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 229 FQSYQFYPGSPVSNLISPRSAISLSGASSPLP--------------DLDFASSASQFSNF 288
             +Y  YPGSP S L SP S  S  G  SP                D +  S+  Q SNF
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESNF 246

Query: 289 SLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM 340
                 A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Sbjct: 247 FCPETFAKFYLDH-----DPSVPQNGGRLSVSKDSDVYP-TNGYGNGNQNRQNRSPKQDM 306
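
The percentages on each HSP summary line follow directly from the match counts and the alignment length (for example, 138 identical positions over 291 aligned columns gives 47.42%). A minimal Python sketch for reading the two summary lines and recomputing those percentages is shown here. The function name `parse_hsp_summary` and the returned keys are hypothetical and chosen only for illustration; the regular expressions assume the exact line layout printed in this report.

```python
import re

# Patterns for the two HSP summary lines as they are printed in this report.
# "Posi?tives" accepts the label with or without its second "i".
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(score_line, identity_line):
    """Parse one HSP's score line and identity line into a dict."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("lines do not look like an HSP summary")

    identical = int(ident.group(1))
    aligned_length = int(ident.group(2))
    positives = int(ident.group(4))

    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "aligned_length": aligned_length,
        # Recomputed from the counts, rounded as in the report.
        "identity_pct": round(100.0 * identical / aligned_length, 2),
        "positives_pct": round(100.0 * positives / aligned_length, 2),
        # Values as printed, kept so the two can be compared.
        "reported_identity_pct": float(ident.group(3)),
        "reported_positives_pct": float(ident.group(6)),
    }


if __name__ == "__main__":
    hsp = parse_hsp_summary(
        "HSP 1 Score: 198.7 bits (504), Expect = 9.8e-51",
        "Identity = 138/291 (47.42%), Positives = 170/291 (58.42%), Query Frame = 0",
    )
    print(hsp)
```

Comparing `identity_pct` with `reported_identity_pct` is a cheap consistency check when these blocks are copied out of the page by hand.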

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9SRE5          9.9e-32   40.51         Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P...

Match Name      E-value   Identity (%)  Description
A0A6J1IUL0      5.7e-256  100.00        uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC1114800...
A0A6J1FSP7      5.9e-245  95.04         uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 ...
A0A6J1FP20      8.5e-244  95.84         uncharacterized protein At1g76660-like isoform X2 OS=Cucurbita moschata OX=3662 ...
A0A6J1C828      8.6e-188  76.81         uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC1110...
A0A5D3CYQ2      2.1e-181  74.95         Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 S...

Match Name      E-value   Identity (%)  Description
XP_022980796.1  1.2e-255  100.00        uncharacterized protein At1g76660-like [Cucurbita maxima]
XP_023529207.1  2.6e-247  97.37         uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] >...
XP_023522163.1  4.9e-246  96.94         uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo] >XP_02352917...
XP_022941648.1  1.2e-244  95.04         uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata]
KAG6600562.1    1.2e-244  96.07         hypothetical protein SDJN03_05795, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity (%)  Description
AT5G52430.1     9.2e-57   37.80         hydroxyproline-rich glycoprotein family protein
AT1G63720.1     9.8e-51   47.42         BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam...
AT4G25620.1     9.5e-46   35.48         hydroxyproline-rich glycoprotein family protein
AT1G76660.1     7.1e-33   40.51         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term    Source Description                              Alignment
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 274..309
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 165..190
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 274..294
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 401..421
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 385..400
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 366..421
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 366..382
None       No IPR available                        PANTHER      PTHR31798:SF2  HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN  coord: 17..455
IPR040420  Uncharacterized protein At1g76660-like  PANTHER      PTHR31798      HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE            coord: 17..455
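
Several of the mobidb-lite disorder predictions listed above overlap or nest inside one another (for example, 274..294 lies within 274..309). A minimal Python sketch for collapsing them into non-overlapping regions follows. The helper names `parse_coord` and `merge_ranges` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the table itself does not state.

```python
import re

# Matches the "coord: START..END" text used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinate pair found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def merge_ranges(ranges):
    """Merge overlapping or nested (start, end) ranges.

    Ranges that merely touch (e.g. 385..400 and 401..421) are NOT
    joined here; only ranges that share at least one position are.
    """
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    # The seven mobidb-lite rows from the table, in listed order.
    rows = [
        "coord: 274..309", "coord: 165..190", "coord: 274..294",
        "coord: 401..421", "coord: 385..400", "coord: 366..421",
        "coord: 366..382",
    ]
    regions = merge_ranges(parse_coord(r) for r in rows)
    # Assumes inclusive coordinates when counting residues.
    total = sum(end - start + 1 for start, end in regions)
    print(regions, total)
```

Sorting first means each range only has to be compared with the most recently emitted region, so the merge is a single pass after the sort.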

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G007380.1  CmaCh04G007380.1  mRNA